2022 Volume 2 Issue 3 Pages rev-1-rev-17
Oxidative folding is an essential process for polypeptide chains containing cysteine (Cys) residues to form a bioactive three-dimensional structure. During this process, the folded state with the correct disulfide (SS) combination, which is found in the native state, cannot be obtained at 100% yield because various byproducts, such as misfolded states with mispaired SS bonds, oligomers, and aggregates are simultaneously produced. The formation of misfolded states in vivo has been suggested to cause critical human diseases such as neurodegenerative disorders. Therefore, the development of methods to promote the correct structural maturation of polypeptides, including Cys residues, both in vivo and in vitro, is a challenging task in protein synthesis, medicine, and drug discovery. To rapidly form correctly folded proteins at high yields, two potential strategies are available. First, called the outside strategy, is to control oxidative folding from the outside of proteins using artificial small molecules as catalyst and a reagent that mimics the function of protein disulfide isomerase, which catalyzes SS-related reactions during oxidative folding in cells. Second, called the inside strategy, is to insert mimics of SS linkage(s) into the inside of polypeptides to form a rigid covalent bond chemoselectively, thereby avoiding the formation of a misfolded state having mispaired SS bonds. In this review, recent developments and trends based on the unique redox properties of selenols and diselenides, which are selenium analogs of thiols and disulfides, respectively, are outlined, and their future prospects are discussed.
Proteins are first synthesized as polypeptide chains through transcription from genetic information in DNA into mRNA and subsequent translation, during which amino acids corresponding to three specific bases (codons) in mRNA are sequentially elongated (i.e., central dogma, Fig. 1). For the synthesized polypeptide chains to exert their physiological functions as proteins, they must gain a unique three-dimensional structure through protein folding. In classical protein science, polypeptide chains spontaneously fold into their most stable conformation in an aqueous solution under properly controlled reaction conditions (Fig. 1, in vitro folding) [1]. However, only a few proteins strictly follow the basic principle of biophysics, known as the Anfinsen’s dogma. In cells with high concentrations of biomolecules, intermolecular interactions and the resulting irreversible aggregation are inevitable; thus, correct folding is inhibited. Therefore, in modern protein science, it is important to understand the structures and functions of enzymes that promote correct folding [2].
Biosynthesis of polypeptides following the central dogma and subsequent oxidative protein folding controlled by protein disulfide isomerase family members (PDIs). For oxidative folding, two-dimensional energy landscapes (folding funnels) for in vivo and in vitro folding are shown, which represent the relationship of energy and entropy (i.e., diversity of peptidyl conformation).
Oxidative cross-linking between cysteinyl thiol (SH) groups to form disulfide (SS) bond(s) is a typical and important posttranslational modification, particularly for secreted proteins; folding coupled with SS formation is commonly termed oxidative folding [3]. Synthesized nascent polypeptides undergo oxidative folding in the endoplasmic reticulum (ER), where the oxidative environment is maintained. Protein disulfide isomerase (PDI), a typical ER-resident SS-based oxidoreductase, cooperatively controls proteostasis together with more than 20 PDI family enzymes (PDIs) in the ER (Fig. 1) [4]. PDI has four functional cysteine (Cys) residues that are important for its enzymatic activity (see below). The generation and accumulation of reactive nitrogen species (RNS) due to excessive oxidative stress in the ER leads to undesired oxidative modifications of Cys residues in PDI and significantly decreases enzymatic activity [5]. Dysfunction of PDI leads to disruption of proteostasis in the ER, resulting in misfolding, oligomerization, aggregation, and amyloidogenesis of proteins (Fig. 1), eventually triggering various fatal human diseases such as cancer, neurodegenerative disorders, diabetes, and atherothrombotic stroke [6–12].
Oxidative folding is also an essential process for chemically synthesized polypeptides to exert their physiological functions. The solid-phase peptide synthesis (SPPS) method pioneered by Merrifield enabled the synthesis of polypeptides with arbitrary amino acid sequences [13]. Consequently, rare proteins, non-natural proteins, and peptide-based formulations have been easily synthesized, and related fields such as structural biology, medicine, and drug discovery have progressed remarkably [14]. Generally, oxidative folding of synthetic polypeptides can be completed within a reasonable reaction time (hours to days) under mild oxidative conditions. However, considerable effort is often required to determine the optimal reaction conditions. While the use of PDI as a catalyst is effective for efficient folding, it is costly, and separation of the folded protein from PDI is usually difficult. In addition, peptide chains containing three or more Cys residues generally convert to isomeric species with non-native SS bonds (scrambled species) in addition to the native state during oxidative folding. Because the SS bonding pattern depends on the thermodynamic stability of the peptide loop, it is impossible to achieve 100% folding yield, even under precisely optimized conditions. Therefore, purification of the target protein by high-performance liquid chromatography (HPLC) is usually required after oxidative folding, thus reducing cost and time efficiency of the overall protein synthesis.
Consequently, developing tools for efficient oxidative folding, which has troublesome aspects both in vivo and in vitro, should be valuable in medicinal science and drug discovery where protein misfolding is a concern. Currently, two chemical strategies are available for this purpose:
In this review, the outside and inside strategies utilizing the redox properties of organoselenium molecules and peptides are discussed. First, the latest research trends and important achievements, in addition to some significant pioneering studies for each strategy, are outlined, and future perspectives of these strategies for artificial manipulation of in vivo and in vitro oxidative folding are discussed.
(a) Control of oxidative folding using artificial reagents and catalysts (outside strategy). (b) Regioselective covariant bond formation in peptides and proteins by substituting native SS bonds with its representative mimics (inside strategy). Disulfide (SS) mimic E (diselenide bridge) was discussed in this review. For SS mimics A, B, C, and D, see the reports in Refs. [15], [16], [17], and [18], respectively.
(a) Structural maturation coupled with SS-formation (phase 1) and subsequent SS-isomerization (phase 2). (b) SS-formation promoted by PDIox via an intermolecular bond exchange reaction. (c) SS-isomerization catalyzed by PDIred via a temporal SS-cleavage in substrate proteins.
Generally, during in vitro oxidative folding, two chemical reaction phases, SS-formation (phase 1) and SS-isomerization (phase 2), are observed (Fig. 3a). In phase 1, a reduced protein (R) loses its chain entropy with the hydrophobic collapse of the polypeptide, accompanied by the formation of intramolecular non-native SS bonds. In phase 2, the SS intermediates (scrambled species) generated in the first phase undergo conformational folding accompanied by rearrangement of SS bonds to search for correctly paired SS bonding patterns found in the native state. PDI accelerates the overall rate of oxidative folding by exhaustively catalyzing SS-related reactions in these phases (Fig. 3b, c) [19–22].
(a) Molecular structure of PDI composed of four tandem thioredoxin (Trx)-like domains. (b) Reversible redox state of CGHC motif as the redox active sequence of PDI.
PDI is composed of four tandem thioredoxin (Trx)-like domains, in which the N- and C-terminal domains (a and a', respectively) contain a redox active sequence, namely Cys-Gly-His-Cys (CGHC) (Fig. 4a). Under physiological conditions, CGHC sequences are in equilibrium between the two redox states, disulfide (PDIox) and dithiol (PDIred) states (Fig. 4b). During phase 1, PDIox rapidly introduces SS bonds to substrate proteins via an intermolecular bond exchange reaction (Fig. 3b). On the other hand, PDIred catalyzes SS-isomerization during phase 2 by temporarily cleaving the miscrossed SS bonds due to the superior nucleophilic capability of the SH groups in CGHC (Fig. 3c). CGHC in PDIox and PDIred have two crucial physicochemical properties so that PDI effectively catalyzes these reactions. First, the SS reduction potential (E°') of PDIox (‒180 mV) is considerably higher than that of the CXXC motifs in other PDI family members except for ERdj5 [23]. This indicates that PDIox possesses a strong oxidizing capability, which is an ideal property for promoting phase 1. Furthermore, the pKa values (ca. 6.7) of the SH groups in CGHC are substantially lower than those of general cysteinyl SH groups in proteins (ca. 8.3) due to the effect of the neighboring basic His residue. Thus, for phase 2, which is initiated by nucleophilic attack from the SH group in CGHC, PDI also has ideal reactivity. Moreover, the high E°' indicates that PDIred is less likely to be ring-closed (oxidized) to PDIox, indicating that PDIred promotes SS-isomerization rather than SS-reduction (Fig. 3c).
Since organothiols (RSH) exist in redox equilibrium states with coexisting disulfide (R'SSR') compounds in an aqueous solution (Eq. 1), they can promote SS-related reactions via a PDI-like catalytic activity during oxidative folding.
In addition to monothiol glutathione (GSH) and its oxidized form (GSSG), which have been conventionally used as oxidative folding reagents, a number of sulfur-based compounds that mimic the physicochemical properties and catalytic functions of PDI active centers have been reported [24–29]. However, their catalytic ability is remarkably lower than that of PDI, and compounds must be added at a molar ratio of 10 to 100-fold with respect to the substrate proteins to achieve reasonable folding yields and rates.
Values are from Refs. a [30], b [31], c [32], d [33], e [34], f [35], and g [36]. h No data.
Selenium, an essential micronutrient in living organisms, is incorporated into selenocysteine (Sec). Sec is an analog of Cys, in which the SH group in cysteine is replaced by a selenol (SeH) group. Sec is a building block in proteins that functions as an active center in various reductases to reduce various biological substrates, due to the excellent two-electron-donating ability of the SeH groups. Various organoselenols (RSeH), including Sec, can also be in redox equilibrium with coexisting disulfide compounds (R'SS R'), as RSH (Eq. 2).
In general, the pKa value of RSeH is considerably lower than that of RSH. Although there are few reports on the pKa of a SeH group because the isolation and purification of organoselenols are generally difficult due to their enormously high reactivity, the pKa values of Cys and Sec residues have been reported to be 8.22 and ca. 5.2, respectively (Table 1). Thus, the SeH group almost exists as a deprotonated state (selenolate; Se-) under physiological conditions and functions as a superior nucleophile. The acidity of the SeH group exceeds that of Cys residues in the active centers of PDI, indicating that organoselenols have more ideal chemical properties as reagents for promoting SS-isomerization (phase 2, Fig. 3c). On the other hand, the reduction potential of the diselenide (SeSe) bond (E°'SeSe) is generally lower than that of the SS bond (Table 1), and the oxidation of SH groups to SS bonds with SeSe compounds (eq. 2, forward reaction) is thermodynamically unfavorable [37]. Therefore, molecular design to increase the E°'SeSe values is important for improving the function of organodiselenides as PDI mimics.
4.1. Selenoglutathione: The first selenocompound as an oxidative folding catalyst
Selenoglutathione (GSeSeG, Fig. 5), in which Cys residues in glutathione disulfide (GSSG) are replaced by Sec residues, was first reported as a water-soluble diselenide-based reagent used in oxidative folding [31]. Hilvert et al. used GSeSeG as an oxidant to the oxidative folding of the reduced state of ribonuclease A (RNase A) with no SS bond, and found that GSeSeG promoted oxidative folding at a higher rate than GSSG. They further demonstrated that in the oxidative folding of the reduced state of bovine pancreatic trypsin inhibitor (BPTI), GSeSeG promoted early SS-formation events (phase 1, Fig. 3b) more effectively and provided a higher final folding yield than GSSG. GSeSeG, despite having a considerably lower reduction potential (E°'SeSe = ‒407 mV) than GSSG (E°'SS = ‒256 mV), exerts a higher oxidizing ability for SS-formation than GSSG. The nucleophilic attack of an SH group on the SeSe bond was previously shown to be up to four orders of magnitude faster than the attack on the SS bond, indicating that the kinetics of organodiselenides make them more favorable SS-forming reagents [38,39]. In addition, monoselenol GSeH, which is produced as a byproduct during SS-formation (GSeSeG + Protein[SH,SH] → 2GSe- + 2H+ + Protein[S-S]), promotes SS-isomerization reactions (phase 2, Fig. 3c) more effectively than GSH because of its higher nucleophilicity, which accelerates the overall folding rate.
In the oxidative folding of BPTI, the accumulation of two kinetically trapped SS-intermediates decelerates the overall folding rate and reduces the yield of the folded state (see section 5-2). Metanis and Hirvert et al. previously showed that GSeH promotes SS-isomerization of SS-intermediates on unproductive pathways, thus improving folding efficiency [40]. Furthermore, the addition of catalytic amounts of GSeSeG can substantially promote oxidative folding under aerobic conditions because the GSeH generated during the reaction is readily reoxidized by molecular oxygen to GSeSeG (i.e., 4GSeH + O2 → 2GSeSeG + 2H2O) as an SS-forming agent [41]. GSeSeG was applied to the oxidative folding of various reduced proteins, and its wide substrate scope has been demonstrated [42].
Recently, Iwaoka et al. reported an effective synthetic route for preparing GSeSeG using a liquid-phase peptide synthesis method [43]. Furthermore, they showed that scrambled RNase A with four non-native SS bonds can be quickly repaired to the native state by treatment with NADPH and glutathione reductase (GR) in the presence of catalytic amounts of GSeSeG (1 mol%) (Fig. 6). This is because GSeSeG is reduced to GSeH, which catalyzes SS-isomerization, with NADPH in the presence GR that can recognize both GSSG and GSeSeG as substrates [44].
4.2. Modification of chemical structures
Metanis et al. synthesized small-molecule diselenides 1‒3 as second-generation dimeric-type diselenide compounds to replace GSeSeG (Fig. 7) [45]. These compounds can be synthesized more easily and at a higher yield than peptide-based compounds through a short synthetic route using inexpensive starting materials. Furthermore, the compound is smaller than GSeSeG, improving the probability of contact with free cysteinyl SH groups buried in kinetically trapped SS-intermediates, and consequently accelerating the overall velocity of oxidative folding. Indeed, in the oxidative folding of BPTI, selenol states generated from corresponding diselenide compounds smoothly mediated SS-isomerization of the kinetically trapped SS-intermediates generated during the folding, and remarkably improved the oxidative folding rate and yield more than GSSG. Notably, the capability of compounds 2 and 3 as folding promoters was comparable to or slightly higher than that of GSeSeG.
4.3. Cyclic diselenides
Not only dimeric diselenides as shown above but also cyclic diselenide compounds are frequently used to manipulate the structural maturation of denatured and reduced proteins. Raines et al. reported that the diselenolate form (4a) produced by the reduction of (S)-1,2-diselenan-4-amine (4) can rapidly reduce protein SS bonds [46] (Fig. 8). Arai et al. extended the potential application of compound 4 to oxidative folding [36]. In addition to 4, analogs 5 and 6, which differ in ring size and functional groups, respectively, were also synthesized, and the ability of the oxidative folding catalyst was compared by folding experiments with reduced RNase A. Five-membered ring diselenide 5 was found to have the highest SS-formation and SS-isomerization abilities [36]. Among the six-membered ring diselenides (4 and 6) (Fig. 8), the E°'SeSe value of 4, which has an amino group, is higher than that of 6, which has two hydroxy groups. This may be due to the ammonium (-NH3+) group in the aqueous solution that stabilizes the chain-opened (diselenol) form (4a) by electrostatic interactions with the Se- group in the molecule. Moreover, compound 5, which has a stronger ring strain, had a significantly higher E°'SeSe value than compound 4. These results suggest that the E°'SeSe value, and thus the folding catalytic activity of the cyclic diselenides, can be modulated by changing the ring size and functional groups.
However, such structural modifications of simple Se-containing aliphatic compounds have not been readily achieved in practical organic syntheses [47]. Meanwhile, we also reported that the E°'SeSe values could be easily increased by conjugating His via an amide bond with compound 4 [48] (Fig. 8). This result may be attributed to the stabilization of the reduced form due to the formation of NH•••Se hydrogen bonds between the selenium atom in the diselenol state and the imidazole ring of His in 7a (Fig. 8) [49]. Consequently, in the oxidative folding of hen egg-white lysozyme (HEL), compound 7 showed higher catalytic activity than the parent compound (4) for both oxidative folding of the reduced state without SS bonds and repair of the misfolded state (scrambled species) with non-native SS bonds. Importantly, the addition of compound 7 (0.3–0.5 mM) also efficiently inhibited the aggregation of HEL (1.4 mg/mL) induced by thermal denaturation. In the oxidative folding of high concentrations of reduced HEL (0.7 mg/mL), correct folding of the monomeric protein was also accompanied with undesired aggregation and oligomerization in a redox buffer solution containing common glutathione (GSH/GSSG, 1.0 mM/0.2 mM), finally recovering only 25% yield of biologically active state. In contrast, the coexistence of compound 7 (0.2 mM) instead of GSSG inhibited intermolecular contacts of proteins, accelerated the oxidative folding rate, and improved the folding yield to 55% [48]. Thus, these results strongly suggest that the conjugation of His with compound 4 simultaneously enhances its redox reactivity as a PDI-like catalyst and chaperone-like capability to suppress protein aggregation.
More recently, compound 8, in which Pro is inserted as a spacer amino acid between compound 4 and His, had a significantly higher E°'SeSe value than 7 and 9 without or with Gly spacer instead of Pro residue, respectively (Fig. 8) [50,51]. Detailed structural analyses of the compounds by circular dichroism (CD) and 2-dimensional nuclear magnetic resonance (NMR) spectroscopy clearly showed that reductive cleavage of the SeSe bond in 8 induces transformation of the peptide backbone into a rigid γ-turn structure. Therefore, the SeH and imidazole groups are spatially close to each other in the reduced state, forming NH•••Se hydrogen bonds (or salt bridges) that thermodynamically stabilize the diselenol moiety. Thus, compound 8 functions as an excellent oxidative folding catalyst. Moreover, aliphatic cyclic diselenols are promising regulators of intracellular redox balance in cells because they function as good two-electron donors and are effective not only in SS-related reactions during folding, but also in the reduction of reactive oxygen/nitrogen species, such as peroxides and nitrosothiols, which cause protein misfolding [51,52].
4.4. Enhancement of oxidative folding in cells by selenocystamine
Hilvert et al. previously reported that selenocystamine ([SeCA]2), an oxidized state of SeCA (Table 1) and a smaller diselenide compound than GSeSeG, also significantly accelerates the oxidative folding of RNase A. However, its catalytic ability is slightly lower than that of GSeSeG. [41]. They further reported that low concentrations (1‒10 μM) of [SeCA]2 administered to Escherichia coli lacking disulfide bond isomerase A (DsbA), a typical CXXC motif-containing folding enzyme in the periplasm of gram-negative bacteria, promoted oxidative protein folding instead of DsbA [53].
Recently, eukaryotic green algae have attracted attention as platforms for the production of recombinant proteins. The rate-limiting step in the accumulation of recombinant proteins is the structural maturation of proteins coupled with SS-formation and SS-isomerization in chloroplasts, which contain protein-folding enzymes, such as chaperones, peptidyl prolyl isomerases, and PDIs. Interestingly, the addition of [SeCA]2 was found to improve the accumulation of SS-containing proteins in the chloroplasts, suggesting that the selenocompound would be promising for effective protein production by genetic engineering [54].
5.1. Folding pathway kinetically controlled by insertion of Sec residues
Because SS-boding patterns in proteins with multiple Cys residues are essentially governed by the thermodynamics of polypeptide chains, it is impossible to form SS bonds at arbitrary positions at 100% yield without direct modification of peptide chains in the synthetic process. Moloder et al. pioneered a method to kinetically govern oxidative protein folding by internalizing the physicochemical properties of the SeH group into the polypeptide itself [55]. Endothelin-1 (ET1; 21 amino acids), a vasoconstrictive peptide, is a mini protein stabilized by two intramolecular SS bonds (Cys1‒Cys15 and Cys3‒Cys11). Under optimal conditions, oxidation of reduced ET1 without SS bond by O2 provides a regioisomer with two non-native SS bonds (isomer A) and native ET1 (isomer B) in a 1:3 ratio, depending on the relative thermodynamic stability of the peptide loops (Fig. 9a) [56]. Moroder et al. synthesized an ET1 analog ([C3U,C11U]-ET1), in which the two Cys residues (Cys3 and Cys11) that form SS bonds in the native state were substituted by Sec residues to selectively yield correctly folded ET1. Because the pKa value of the SeH groups in the reduced state is significantly lower than that of the cysteinyl SH groups (Table 1), SeSe bond formation proceeds preferentially even when multiple Cys residues coexist in the peptide chain. In addition, the SeSe bond in the peptide chain is more thermodynamically stable than the SS bond [37], and thus bond isomerization via SH-SeSe exchange is thermodynamically less likely to proceed. Consequently, the generation of a misfolded state with incorrect bonding pairs was inhibited during the oxidative folding of [C3U,C11U]-ET1 (Fig. 9a). Notably, replacing SS with SeSe did not affect the biological activity and folded structure of ET1.
(a) Predominant oxidation of Sec residues and selective production of folded endothelin-1 (ET1) analog ([C3U,C11U]-ET1). (b) Regioselective diselenide- (SeSe) and SS-formation to produce possible apamin isomers.
Apamin, a bee toxin protein, has the same SS bond topology as ET1. Moroder et al. selectively synthesized three possible isomers, including the native state, by utilizing the Sec-substituting strategy (Fig. 9b) [57,58]. The results strongly suggest that beyond the structural information encoded in the primary sequence, the kinetic and thermodynamic properties of the SeH groups and SeSe bonds, respectively, contribute more strongly to the folding pathway and SS-bonding pattern in the final product. Consequently, Sec was also utilized to generate misfolded forms in a purposeful manner [55]. Since then, this method has been applied to structural analysis of protein folding intermediates, elucidation of the folding pathway, and effective production of peptide formulations, as described below.
5.2. Enhancement of the foldability of conotoxins
SS-rich bioactive peptides, such as neurotoxins, plant cyclotides, antimicrobial peptides, and protease inhibitors, are potential therapeutic agents for analgesics, antihypertensives, antiarrhythmics, antitumor medicines, antivirals, and antibiotics. SS-rich conotoxins, known as neurotoxins from Conus, are peptide drug candidates that have been explored to enhance the foldability and structural stability of their native states by the Sec-substituting strategy. Multiple SS bonds in conotoxins are readily reduced or isomerized in the presence of an SH-based reductant, such as GSH, thioredoxin, and albumin, thus easily degrading them in the blood. Alewood et al. synthesized α-selenoconotoxins, in which one or two SS bonds in wild-type α-conotoxin were replaced by SeSe bonds [59]. Due to regioselective SeSe bonding, the folded state of α-selenoconotoxins is effectively produced at high yields, and all analogs exhibited remarkable stability against SH-based biomolecules [59,60]. Similarly, for pharmacologically relevant conotoxins containing three SS bonds, such as μ-conotoxin [61], ω-selenoconotoxin GVIA [62], μ-conotoxin KIIIA [63], and μO-conotoxin MrVIB [64], the folding yield was improved by replacing one SS bond with a SeSe bond. In both cases, the substitution of SeSe bonds did not alter the 3D structure and reduce the biological activity. Furthermore, Bulaj et al. compared the overall folding rates of seleno-analogs of ω-conotoxin GVIA and μ-conotoxin SIIIA with wild types. Both selenopeptides, in which one of the three SS bonds was replaced by SeSe bond, promoted SS-formation and SS-isomerization during oxidative folding at a higher rate than the wild-type peptides, even in the absence of chemical redox reagents. This is due to the autocatalytic functions of inserted Sec residues in the peptides (Fig. 10). [65].
5.3. Steering of the folding pathway by inserting Sec residues
A detailed oxidative folding pathway has been reported for wild-type BPTI (58 amino acids), which has long been employed as a benchmark protein in folding studies (Fig. 11a) [66–68]. In the key intermediates, N* and N', which have native-like structures, free SH groups are buried inside the molecules. Generation of these kinetically trapped intermediates consequently decelerates the overall folding rate of BPTI. Conversely, if the formation of N* and N' could be avoided, reduced BPTI (R) could lead to rapid and high-yield formation of the native state (N).
(a) Major oxidative folding pathways of BPTI and its Sec-substituted analogs. (b) Modification of energy landscape for oxidative folding of BPTI by insertion of Sec, instead of cysteine (Cys) residues.
Based on this concept, Hilvert et al. proposed a new strategy to efficiently yield N by replacing Cys5 and Cys14 of BPTI with Sec residues, preferentially undergoing a pathway in which N* and N' are not involved (Fig. 11a, pathways highlighted in pink) [69]. Indeed, the seleno-analog, [C5U,C14U]-BPTI, folded into N at 70‒80% yield at a higher rate than wild-type BPTI under weakly basic and aerobic conditions, whereas oxidative folding of wild-type BPTI produced N* and N' as the major products (80%) with N at a low yield. This result indicates that [C5U,C14U]-BPTI can selectively undergo a reaction pathway with smaller activation barriers than those in wild-type BPTI by preferentially forming the non-native crosslink via the SeSe bond at the early folding phase (Fig. 11b, middle funnel). On the other hand, for [C14U,C38U]-BPTI, intermolecular contacts preferentially form aggregates because of the formation of an unfoldable intermediate with a SeSe bond at the non-native position (Fig. 11b, right funnel) [70]. In the case of [C5U]-BPIT, in which only one Cys residue (Cys5) is replaced by Sec, the folding pathway is essentially unaltered (Fig. 11a, pathways highlighted in blue), but N* can be quickly converted to N', which is accessible to the precursor (
5.4. Simplification of the oxidative folding pathway involving diverse SS-intermediates
A detailed oxidative folding pathway of hirudin (65 amino acid residues), a thrombin inhibitor with three SS bonds (Cys6‒Cys14, Cys16‒Cys28, and Cys22‒Cys39), was reported in the 1990s by Chang et al. [75–77]. While hirudin has almost the same number of amino acid residues as BPTI with three SS bonds, their oxidative folding phenomena are completely different [78]. BPTI achieves oxidative folding through a limited number of SS-intermediates with native-like structures (Fig. 11a). In contrast, the oxidative folding of hirudin involves SS-intermediates (scrambled species), 1SS, 2SS, and 3SS, which have one, two, and three non-native SS bonds in the molecule, respectively. These intermediates are sequentially generated from the reduced state (R) (i.e., R→1SS→2SS→3SS), accompanied by the hydrophobic collapse of the polypeptide. 3SS subsequently undergoes SS-isomerization with conformational folding to obtain the native SS-bonding pattern and folded structure (Fig. 12).
(a) Oxidative folding pathway of hirudin and seleno-hirudin via scrambled SS-intermediates. Heterogeneous ensembles with one (1SS), two (2SS), and three (3SS) SS bonds in the molecule are shown. (b) Energy landscape for oxidative folding of wild-type hirudin (gray) and seleno-hirudin (blue).
Metanis et al. prepared seleno-hirudins ([C6U,C14U], [C16U,C28U] and [C22U,C39U]), in which one of the three native SS bonds was replaced by SeSe bond, and investigated their folding behavior [79]. All analogs folded into the native state at a higher rate than the wild type. During the early oxidative folding event (SS-formation) of wild-type hirudin, heterogeneous 1SS ensembles, including various isomers, are generated as scrambled species, whereas for seleno-hirudin, the components in the scrambled species are biased, decreasing the heterogeneity. Particularly, [C6U,C14U]-hirudin with the SeSe bond directly gained an additional SS bond without conversion into the scrambled species via the SH-SeSe exchange reaction (Fig. 12a). In contrast, [C6U,C16U]-hirudin with the SeSe bond at the non-native position rapidly isomerized to a heterogeneous ensemble. These results indicate that the introduction of the SeSe bond in the appropriate position significantly reduces the diversity (i.e., peptidyl entropy) of the folding intermediates (Fig. 12b) and that the folding rate is accelerated by undergoing a simpler pathway.
5.5. Potent tool for interchain cross-coupling
Insulin, an important peptide-based drug, is widely known as a hypoglycemic agent for diabetic patients. Its native structure is stabilized by two interchain SS bonds (Cys7A‒Cys7B and Cys20A‒Cys19B) between the A-chain (21 amino acid residues) and the B-chain (30 amino acid residues), in addition to the intrachain SS bond (Cys6A‒Cys11A) in the A-chain. Direct coupling of the unprotected A-chain and B-chain via SS linkages in an oxidative environment to obtain the native form is the simplest and most rational synthetic method. However, intramolecular SS cross-linking in each peptide chain is usually preferred, producing a folding yield of less than 5% [80–82]. Although several efforts have been directed toward developing innovative synthetic methodologies, a technology that involves a simple synthetic process and exhibits high product yield is yet to be developed [83].
Arai et al. predicted that the A-chain and B-chain could be coupled directly and effectively by utilizing the Sec-substituting strategy, which has been applied to single-chain polypeptide folding (see above). Prior to the trial of this strategy, an exhaustive mechanistic investigation on oxidative folding of the unprotected A-chain and B-chain, namely native chain assembly (NCA), was performed to determine the appropriate positions for inserting Sec residues [84]. The results revealed that the NCA pathway of insulin includes several pathways, all of which involve a common metastable precursor (2SS*) that lacks one interchain SS bond (Cys7A‒Cys7B) (Fig. 13). Furthermore, Cys7A‒Cys7B, which is solvent-exposed in the native structure, was found to be the most kinetically and thermodynamically unstable among the three native SS bonds.
Consequently, Arai et al. synthesized the [C7U]-A-chain and [C7U]-B-chain, which are seleno-analogs of the component peptides of bovine pancreatic insulin (BPIns) [85]. The A-chain and B-chain were isolated as 2-pyridylsulfanyl- (Pys) protected derivatives with one SeS bond and an oxidized form including one SeS bond, respectively (Fig. 14a). The peptide chains were activated to reactive species containing SH and SeH groups under weakly reductive conditions, and the chains gradually coupled with each other by O2 as an oxidant to form the [C7UA,C7UB] variant of BPIns, namely selenoinsulin (SeIns), at up to 27% isolated yield. SeIns has almost the same biological activity and 3D structure as the wild type. More notably, SeIns exhibited much higher resistance against insulin-degrading enzyme (IDE), which is found in the liver and pancreas in mammals, than wild-type insulin (i.e., τ1/2 ≈ 8 h vs. ≈ 1 h for BPIns), and was found to have potential as a long-acting formulation that could circulate in the body for a long time. X-ray crystallographic analysis of SeIns suggests that the SeSe bond (Sec7A‒Sec7B) has enhanced native interactions in the N-terminal region of the B-chain, resulting in increased stability of the monomeric state, which may explain the high IDE resistances.
(a) [C7UA,C7UB]-bovine pancreatic insulin (SeIns) with external SeSe bond reported by Arai et al. (b) Human selenoinsulin with internal SeSe bond reported by Metanis et al.
The second selenoinsulin [C6UA,C11UA] variant of human insulin (HIns) (Fig. 14b) was synthesized by Metanis et al. [86]. They focused on Cys6A‒Cys11A, which is formed as a major SS component in 1SSA during the early NCA event (Fig. 13), and prepared [C6UA,C11UA]-A-chain through SPPS. Interestingly, the A-chain analog, which has an intrachain SeSe bond, coupled effectively with the wild-type B-chain to produce [C6UA,C11UA]-HIns with a correctly folded structure at 31% isolation yield. Importantly, the replacement of internal SS bond, which is buried in the hydrophobic core of the molecule, enhanced thermodynamic stability against guanidinium chloride (GdmCl) and improved resistance to peptide hydrolysis by Glu-V8 protease and reductive unfolding by GSH. These results suggest that replacement of SS bonds with SeSe bonds in proteins enable effective oxidative folding and improve the pharmacological effects of peptide formulations.
In vivo folding studies over the past decades have revealed that the oxidative folding pathway of proteins can be flexibly changed to non-native pathways that are not encoded in the primary amino acid sequence by regulating the environment, such as the additives, pH, temperature, and peptide modifications [87]. As described above, the unique redox properties of the SeSe bond and SeH groups can also modify the energy landscape of oxidative folding and accelerate folding velocity by avoiding a pathway involving a kinetically trapped intermediate, or by altering the unproductive pathway to a productive pathway with lower energetic barriers. The addition of a catalytic amount of small molecule diselenide to the oxidative folding of wild-type peptides greatly improves both the rate and yield despite its ease of use; however, its catalytic activity is still lower than that of PDI. One reason for the enormously high catalytic activity of PDI is that it aggressively incorporates structurally immature proteins, which have exposed hydrophobic regions in the molecules, into its own hydrophobic cavity [88]. To further enhance the catalytic activity of SeSe-based catalysts, it is necessary to improve their redox properties and design novel molecules to selectively recognize undesired species that prohibit oxidative folding, such as misfolded states, kinetically trapped intermediates, and oligomers, and finally lead them to productive pathways.
In addition, although diselenide compounds may be promising in regulating protein quality control in cells, there are only a few reports on their biological applications (see Section 4.4). This is probably because SeSe-based compounds are highly reactive, and therefore, often highly toxic to cells. To develop diselenide-based therapeutics for protein misfolding diseases, it is also necessary to design molecules that can exert appropriate catalytic activity at the target organelles. Furthermore, disulfides/thiols redox reactions are involved in a variety of biological phenomena related to protein quality control as well as oxidative folding in the ER. For example, inositol-requiring enzyme 1 (IRE1), which is an ER transmembrane protein, can detect misfolded proteins, and subsequently assembles with other IRE1 molecules via intermolecular SS bonds, eventually activating the unfolded protein response (UPR) mechanisms [89]. To avoid excessive accumulation and prolonged retention of IRE1-oligomers that cause undesired UPR-associated apoptosis, PDI family A member 6 (PDIA6 or P5) cleaves the intermolecular SS bonds in the oligomers to reproduce monomeric IRE1 [90,91]. Organoselenol compounds are also well known to function as a potent reductant for protein SS bonds due to their high nucleophilic potency [36,46,50], and thus could also be expected to possess the capability to chemically mimic the SS-reductase activity of PDIA6, which attenuates the IRE1 activity.
Meanwhile, the replacement of SS bonds with SeSe bonds in peptides provides various advantages, such as the promotion of oxidative folding, enhancement of structural stability, improvement of intrinsic bioactivity, and introduction of a novel biological function. However, although many SeSe-containing peptides have been reported as potential candidates for practical formulations, their clinical application has not yet been achieved. For drug discovery, further knowledge of the toxicity and pharmacokinetics, as well as the chemical phenomena of the SeSe bond in vivo, is necessary.
This work was supported by the Promotion and Mutual Aid Corporation (PMAC) for Private Schools of Japan [The Science Research Promotion Fund] and the Research and Study Project of Tokai University, Educational System General Research Organization.
There are no conflicts of interest to declare.