Circulation Journal
Online ISSN : 1347-4820
Print ISSN : 1346-9843
ISSN-L : 1346-9843
Reviews
Current Trends and Future Perspectives of State-of-the-Art Proteomics Technologies Applied to Cardiovascular Disease Research
Sasha A. SinghElena AikawaMasanori Aikawa
Author information
JOURNAL FREE ACCESS FULL-TEXT HTML

2016 Volume 80 Issue 8 Pages 1674-1683

Details
Abstract

The use of mass spectrometry (MS)-dependent protein research is increasing in the cardiovascular sciences. A major reason for this is the versatility of and ability for MS technologies to accommodate a variety of biological questions such as those pertaining to basic research and clinical applications. In addition, mass spectrometers are becoming easier to operate, and require less expertise to run standard proteomics experiments. Nonetheless, despite the increasing interest in proteomics, many non-expert end users may not be as familiar with the variety of mass spectrometric tools and workflows available to them. We therefore review the major strategies used in unbiased and targeted MS, while providing specific applications in cardiovascular research. Because MS technologies are developing rapidly, it is important to understand the core concepts, strengths and weaknesses. Most importantly, we hope to inspire the further integration of this exciting technology into everyday research in the cardiovascular sciences. (Circ J 2016; 80: 1674–1683)

Many developments in the field of proteomics have come from the yeast model system13 and from cancer research (Figure 1A). In addition, immortalized cells (eg, HeLa, HEK293T, RAW264.7, THP-1) have provided mass spectrometrists ample protein with limited variability across biological replicates, factors that are important to establish technical reproducibility when novel workflows are being developed. The cardiovascular sciences have also influenced and benefited from developments in unbiased and targeted mass spectrometry (MS) approaches (Figure 1B). Research pertaining to the heart and its metabolism,4,5 extracellular matrix remodeling6,7 posttranslational modification in vascular development.8 lipoprotein content and metabolism,911 and platelet function12,13 have thrived as a result of the application of MS technologies. Recently, cardiovascular disease research has taken advantage of the readily accessible plasma pool for biomarker- and target-based studies. Tissues affected by heart failure, atherosclerosis and aortic aneurysms are in direct contact with the circulatory system, which increases the likelihood that proteins either causal or consequential to these pathologies can be detected in plasma.14

Figure 1.

An overview of proteomics and the dynamic range issue. (A) Bar graph depicting PUBMED results for the various research fields + the term, “proteomics”. Cancer comprises the largest number of incidents (67%); the other fields range from <1% (neurobiology/neurology) to 9% (heart). (B) Summary of the types of mass spectrometric acquisition methods for unbiased and targeted proteomics studies. Methods depicted in blue are detailed in this review. (C) The dynamic range of a mass spectrometer (MS, blue) does not match that of protein abundance in tissues/cells (green). Depletion methods remove the few but highly abundant and interfering proteins such as albumin. Enrichment methods increase the signal of low-abundance proteins such as receptors. In each case the dynamic range of the MS is usually constant.

Addressing the Large Dynamic Range in Protein Concentrations

The ongoing development of MS is in part influenced by the drive to identify and quantify as many proteins as possible in a given sample.1,1517 A major challenge in proteomics, however, is that the dynamic range of a typical deep sequencing analysis (5–6 orders in magnitude) does not reflect the more extensive dynamic range of protein abundance that is as extensive as 12 orders in plasma.1820 In addition, cytoskeletal and extracellular matrix proteins dominate samples, masking the less abundant transcription factors and signaling molecules that are usually of interest.1,18,19 Depletion of these highly abundant proteins, such as albumin from plasma, is required to sequence deep into the proteome (Figure 1C).2123 Also, the less abundant proteins can be enriched by cellular fractionation methods such as those that separate cellular compartments or those that use immunopurification strategies (Figure 1C).24,25 The dynamic range capabilities of MS continually improve, but the identification and quantification of low-abundance proteins still depend on sample preparation procedures that shift these low signals into the detection range of the instrument either by depletion and/or enrichment approaches.

Protein and Peptide Separation Methods

Protein isolation needs to take into consideration the sample source, cell culture or tissue.26 Often, the proteomes of 2 or more sets of tissues or samples are compared (Figure 2A), for example, healthy vs. diseased tissues. A typical proteomics experiment does not measure the protein itself, but its peptide representatives that are generated through digestion by a known proteolytic enzyme.27 The proteins are thus inferred by their detected peptides. Trypsin and Lys-C are the most commonly used proteolytic enzymes for proteomics. However, in order to increase protein coverage, additional proteases, such as chymotrypsin and Asp-N, can be used in parallel samples preparations.28,29

Figure 2.

Proteomics isolation and separation methods. (A) Proteomics aims to compare the protein profiles across different samples, cell culture or whole tissue extracts. (B) Protein separation using gel electrophoresis. Spots from a 2D gel, or bands from a 1D gel, are excised for proteolysis. (C) Greater peptide yields are achieved by in-solution proteolysis. (D) Fractionation and enrichment strategies used for common peptides aim to reduce peptide complexity for mass spectrometric analysis.

Before proteolysis, the proteome can be fractionated in-gel to reduce sample complexity (Figure 2B).30 The advantage of in-gel digestion is that the molecular mass reference is available; thus protein isoforms that share peptides but have distinct molecular masses can be differentiated from each other. Initially, 2D electrophoresis technology was a common separation method for subsequent MS analysis. In this method, proteins are separated based on charge and then by molecular weight31 (Figure 2B). 2D gel electrophoresis was used in an investigation into the effects of nafamostat mesylate, a serine protease inhibitor of inflammation, post myocardial ischemia-reperfusion (MI/R) in rabbits.32 The authors analyzed the proteomes of myocardial tissue in 3 groups: sham, MI/R and MI/R plus mafamostat mesylate/inhibitor. A total of 10 spots with differential staining were excised and analyzed by MS; 2 of the 10 proteins were identified as superoxide dismutase and αB-crystallin. Each protein decreased in MI/R but was restored after MI/R+inhibitor treatment.32

1D gel electrophoresis is now more commonly used because less protein input is required when compared with 2D gels, and multiple samples can be run simultaneously on a single gel (Figure 2B). Consequently, thousands of proteins can be compared across conditions. The alternative to in-gel proteolysis is the in-solution procedure (Figure 2C), with its major advantage of an increase in peptide yield because the protein substrates and peptide products are not intercalated in the gel matrix. The disadvantage is that all protein isoforms are intermixed so it is very challenging to rationalize from which isoform specific peptides are derived. Nonetheless, in-solution proteolysis is often preferred because the high peptide yield is conducive to downstream peptide fractionation methods such as phosphopeptide enrichment, isoelectric focusing, cation/anion exchange and reverse phase separation technologies (Figure 2D).5,33,34

Global Quantitative Proteomics Approaches for Basic Science Research

Quantitative methods fall into 1 of 2 categories: label (the use of stable isotopes) or label-free (Figures 3A–C). The major advantage of labeling strategies is that individual protein/peptide pools are differentially tagged so that they can be combined/multiplexed for subsequent fractionation and MS analysis (Figures 2D,3D).

Figure 3.

Label-free vs. label proteomics. (A) Label-free proteomics requires separate injections of the peptide pools to be compared by mass spectrometry (MS). (B) Stable isotope labeling of the cell proteome permits combining the different cells/conditions, for a single peptide pool preparation and MS analysis. (C) Stable isotope labeling of distinct peptide pools permits combining of various samples for MS analysis. (D) Fractionation of labeled peptide pools is often done to reduce the complexity of the sample introduced into the mass spectrometer.

Label-Free

In this approach, the peptides from 2 or more distinct samples are injected into the mass spectrometer separately (Figure 3A). Relative changes in the abundance of each protein are estimated by either the area under the curve (AUC) of their peptides’ extracted ion chromatograms, or by the number of times the proteins’ peptides have been sequenced (spectral counting).35 In general, the more abundant the protein, the more likely its peptides will predominate the signal and sequencing output (Figure 4A).

Figure 4.

Examples of label-free and SILAC proteomics. (A) Label-free proteomics is done by either the area under the curve of the peptide’s chromatogram or by the number of times a peptide has been sequenced (spectral counting). (B) Inspired by Hofsteen et al.36 A summary of a study that used the label-free approach to monitor the proteome during the differentiation of human embryonic stem cells (hESCs) to cardiomycytes (CMs). DAB2 was identified as a protein that associated with the intermediate cardioprogenitor cells (CPCs). (C) Stable isotopic labeling by amino acids in cell culture (SILAC) quantification is enabled by the area under the curve of the unlabeled and labeled peptide peaks. (D) Inspired by Kruger et al,40 a study that used a triple SILAC stitching strategy plus phospho-enrichment to identify early signaling events in insulin signaling.

A recent study monitored the changes in the proteome during the transition from human embryonic stem cells (hESCs) to cardioprogenitor cells (CPCs) to cardiomyocytes (CMs) using the AUC label-free approach.36 The researchers performed control experiments to ensure that differentiation into CMs was successful, including fluorescence-activated cell sorting for cardiomyocyte troponin to verify that the majority of cells had acquired the CM phenotype. Reverse-transcriptase polymerase chain reaction verified the expression of hallmark genes such as SOX2 (hESCs), BRYT (CPCs) and MYH6 (CMs) during differentiation. Label-free quantitative proteomics confirmed SOX2 in hESCs whereas MYH7 was most abundant in the CMs (Figure 4B). A more global data analysis using gene ontology terms confirmed the enrichment of WNT-related pathways expected for the CPC phenotype, and those related to muscle function for the CMs. Analysis of positive controls in the proteomics data validated the quality of the experiment. As a result, candidate novel regulators of CM differentiation, for example DAB2, a negative regulator of WNT signaling, are more likely to be true hits.36

SILAC

Cells can be cultured in conditions under which they are dependent on exogenous supplies of amino acids that are either unlabeled (12C and 14N, light) or stable isotope-labeled (13C, 15N and sometimes 2H/deuterium) (Figure 4C).3739 Stable isotopic labeling by amino acids in cell culture (SILAC) requires conversion of a cell line into its labeled form, a process that takes approximately 5–6 passages to complete. The differentially labeled cells can be combined early in the workflow and processed simultaneously so that oxidation of peptides or sample loss during peptide preparation will be consistent in both samples; thus differences in peptide abundance are from differences in the biological state rather than those pertaining to sample handling. Labeling of lysine with 13C and 15N incurs no discernible changes in the physicochemical properties of the peptide/protein, so both peptides co-elute and enter the mass spectrometer simultaneously. However, their differences in mass are revealed by the MS1 scan (Figure 4C).

A double (light and heavy) and triple (light, medium and heavy labels) SILAC approach used for mouse brown adipocytes was combined with anti-tyrosine phosphorylation immuno-purification to identify proteins with increased phosphorylation post insulin treatment.40 In order to catch the very early phosphorylation signaling events, the samples were analyzed between 0 and 20 min using 2 sets of triple SILAC. This kinetics analysis revealed that phosphorylation occurred as early as 1–5 min (Figure 4D). Insulin receptor and insulin receptor substrate 1 phosphorylation were induced within 1 min, whereas ERK-1/2 peaked at 5 min. The ability to resolve these changes in phosphorylation over intervals of minutes emphasizes the power of quantitative proteomics to detect early signaling cascade events.

Tandem Mass Tagging

Although the SILAC approach is very useful, a major disadvantage is the requirement for cells to replicate. For samples such as primary cells or tissues, the more practical approach is isobaric tandem mass tagging of peptides using, for instance, TMT (tandem mass tags) or iTRAQ (isobaric tags for relative and absolute quantification) reagents.4144 This method uses isotope-labeled amine-reactive chemical tags to label peptides derived from distinct samples (Figure 5A). Most laboratories analyze up to 6 samples; however, recent advancements permit up to 10 samples.45 The net mass of each chemical tag is the same for each sample’s labeled peptide. Consequentially, all labeled peptides appear as a single peak in the MS1 (Figure 5A) and are thereby selected simultaneously for fragmentation. In the MS/MS scan (MS2), the tags fragment to produce 1 of 6 reporter ions with an incremental mass of 1 Da, through incorporation of an increasing number of 13C and 15N isotopes (increasing from 126 to 131 Da, Figure 5A). Their relative intensities are proportional to the relative abundance of their respective peptides.

Figure 5.

Example of tandem mass tagging. (A, Left) Detailed view of the tandem mass tagging (TMT) method. Peptides from each sample are differentially labeled with a stable isotope-labeled chemical tag, designed to fragment in the mass spectrometer in a manner that produces a reporter ion unique to the tag (colored numbers, 127–131). (A, Right) Reporter ions are measured during MS2, providing information about the relative abundance of the proteins respective peptides in the original sample. (B) Based on Itou et al.48 TMT used to monitor changes in the proteome during the macrophage to osteoclast transition. Acp5 (increase) and Psb3 (no change) were controls. Cth/CSE was identified as increasing in the transition. Subsequent studies demonstrated that the production of hydrogen sulfide via CSE regulates the transition between the 2 cell phenotypes. Propargylglycine inhibits CSE; GYY4137 is a hydrogen sulfide donor.

The iTRAQ method was used, for example, to monitor the myocardial proteome in response to the hypothermic cardiopulmonary bypass procedure,46 and to evaluate the potential energy metabolism status of atrial tissues, primarily the left atrium, from patients with valvular disease with and without atrial fibrillation.47 In the latter study, the left atrial tissues of atrial fibrillation patients exhibited a decrease in several proteins related to energy metabolism, such as electron transport and ATP metabolism, demonstrating that the proteome can be a readout for the electrical status of the atrium. Itou et al from our laboratory investigated the processes that regulate osteoclastogenesis using the TMT method.48 The aim was to identify novel regulators of osteoclastogenesis. In that study a TMT 6-plex endpoint analysis was performed on osteoclasts derived from mouse macrophage-like RAW264.7 cells. In order to evaluate the reproducibility of the cell culture conditions and the TMT approach, 2 TMT 6-plex analyses comprising 3 differentiated osteoclasts induced by receptor activator of nuclear factor kappa-B ligand (RANKL) vs. 3 control (no RANKL) cultures were performed. Consistent data within and between each TMT experiment was demonstrated, as confirmed by known osteoclast proteins such as tissue alkaline phosphatase (Acp5). These reproducible results identified the novel osteoclast-induced protein cystathionine γ-lyase (Cth) (Figure 5B). Further analysis determined that Cth activity, through the production of hydrogen sulfide (H2S), is required for osteoclastogenesis. Whether Cth activity was reduced by either siRNA or the inhibitor propargylglycine, osteoclast formation decreased as marked by the reduction in multinuclear cells positive for tartrate-resistant acid phosphatase (Figure 5B). On the other hand, supplementation of excess H2S in the form of a precursor form, GYY4137, increased the number of forming osteoclasts (Figure 5B),48 thus establishing that Cth is a novel protein involved in functional osteoclastogenesis.

Targeted Proteomics for In Vivo and Clinical Applications

Targeted proteomics pertains to MS strategies that target specific proteins. Its applications include follow-up on proteins identified in global/unbiased proteomics screening, conducting functional studies such as posttranslational modification analysis, or when the proteins of interest are candidate or validated biomarkers of a disease. These applications are enabled by peptide libraries acquired during global proteomics experiments or by peptides standards.10,19,4955 A peptide library provides specificity by including the precursor/peptide and fragment ion masses, and the peptide’s retention time. When the chromatography and MS settings are fixed, a peptide will behave the same during MS acquisition, irrespective of the source of peptide. Targeted proteomics is a powerful method for biomarker detection when an immuno-based method is not available.

Selected Reaction Monitoring (SRM) and Parallel Reaction Monitoring (PRM)

SRM, sometimes referred to as multiple reaction monitoring, and PRM are targeted MS methods. The overall principle behind SRM and PRM is similar; however, SRM selects a single precursor followed by 1 fragment ion per scan (Figure 6A), whereas PRM selects a single precursor followed by all fragment ions per scan (Figure 6B). There are scenarios for which each method can be tweaked to expand further its scan strategies, but those details are discussed elsewhere.10,56,57

Figure 6.

Targeted proteomics approaches. (A) An overview of selected reaction monitoring (SRM). (B) An overview of parallel reaction monitoring (PRM). (C,D) Goettsch et al7 combined peptide standards, immunopurification (IP) and PRM to detect low levels of sortilin phosphorylation in human and mouse plaques. (C) Adapted with permission from Goettsch C, et al.7 Samples used for PRM. The synthetic sortilin peptide was treated with the Fam20c kinase to generate the phosphopeptide version as a reference. Plaque tissues were isolated for IP against sortilin protein. (D) The PRM spectra of the phosphopeptide from the standard as compared that which was enriched from plaque samples. The plaque samples also contain other contaminating peptide signals; however, high resolution/accurate mass scans (inset) show that the major fragment ions can be differentiated from the background.

Recent research from our laboratory into the mechanisms of vascular calcification, a prominent feature of cardiovascular diseases, demonstrated that sortilin is a key regulator of smooth muscle cell (SMC)-mediated calcification.7 MS analysis of cell culture-derived immunopurified sortilin determined that a C-terminal serine phosphorylation (pSer825) was sufficient to promote osteogenic reprogramming of SMCs. As exciting as this phosphorylation finding was, there was and still is no commercially available antibody against sortilin pSer825, which could have complicated our efforts further for in vivo studies. We instead developed a PRM-based method to quantify pSer825 in vivo: a sortilin peptide library was generated using a stable isotope-labeled cell-free synthesized full-length sortilin standard that included the C-terminal pSer825 peptide.7,54 SMC culture experiments revealed that C-terminal phosphorylation increased during osteogenic reprogramming and that phosphorylation was mediated in part by the kinase, Fam20C.7,58 The sortilin standard peptides were then used to develop a PRM assay that monitored the phosphorylation of the C-terminus in mouse and human atheroma (Figures 6C,D), and confirmed that calcified regions contained more phosphorylated sortilin than did non-calcified regions.7 This study is a good example of how unbiased and targeted MS methods enabled the transition from in vitro finding to in vivo significance.

Dyslipidemia is the most studied factor affecting atherosclerosis and cardiovascular risk and has seen several applications of targeted proteomics in both basic research and in the clinic. Clinical trials aimed to examine whether increasing the level of high-density lipoprotein cholesterol (HDL-C) by cholesterol ester transfer protein (CETP) inhibitors or decreasing the level of low-density lipoprotein cholesterol (LDL-C) by proprotein convertase subtilisin-kexin type 9 (PCSK9) inhibitors decreases cardiovascular risk are underway or completed. The outcome of the phase III clinical trial of the CETP inhibitor anacetrapib is highly anticipated because other compounds in the same class yielded either no benefit on clinical outcomes or adverse effects.5961 Clinical trials demonstrated that PCSK9 monoclonal antibodies, evolocumab and alirocumab, decreased LDL-C by more than 50% in patients with hypercholesterolemia.62,63 Despite the promising results of the PCSK9 biologics, however, the high costs of antibody-based therapies64 has driven the development of PCSK9 small molecule inhibitors. K-312 is a novel dual CETP/PCSK9 inhibitor currently in phase I clinical trial, and the PCSK9 transcription-dependent mechanism in a rabbit model of atherosclerosis was recently reported by our laboratory.25 K-312 decreased PCSK9 mRNA in the liver. Moreover, rabbits treated with K-312 exhibited smaller atherosclerotic plaques and lower levels of LDL-C. However, because of the lack of a reliable anti-rabbit PCSK9 antibody, confirmation of a decrease in plasma PCSK9 would have been impossible if not for the availability of targeted proteomics. We generated an in vitro synthesized form of rabbit PCSK9 and used PRM to develop a rabbit PCSK9 spectral library and confirmed that K-312 did in fact lower circulating PCSK9 by >50%.10,25

Targeted proteomics has also been applied to in vivo metabolism studies of mostly LDL and HDL apolipoproteins during lipid-lowering clinical trials. Subjects are injected with amino acid tracer (D3-Leu), which is taken up by primarily the liver and incorporated into nascent apolipoproteins that are detected in circulating lipoproteins.10,6567 The rate of appearance or clearance of a protein on a lipoprotein in response to a therapy provides researchers with potential mechanisms of action of the therapy. ApoA-I and apoB metabolism mark HDL and VLDL/LDL levels/metabolism, respectively, and as a consequence have been used as surrogates for the respective lipoproteins in preclinical studies and clinical trials/studies.6571 Other proteins whose metabolism has been monitored include apoA-II, PCSK9 and CETP.25,67

Most of these clinical metabolism studies have depended on SRM/MRM to detect very low levels of D3-Leu tracer incorporation into apolipoproteins. The substitution of 3 hydrogens with deuterium atoms causes a 3-Da shift in the mass of Leu-containing peptides, superimposing the deuterium isotope (2H M3) with the natural M3 isotope (comprising 13C and 15N), making the 2 peaks indistinguishable from each if the D3-Leu tracer is very low (Figure 7A). As a consequence of this detection constraint, most apolipoprotein metabolism studies have been limited to either the quickly turning over apoB (~10% enrichment) or to the total HDL plasma pool (up to 2% peak enrichment depending on the labeling strategy).68 Singh et al from our laboratory, however, demonstrated the ability of PRM to measure <1% tracer enrichment in not only apoA-I from 5 HDL size fractions, but also 6 additional HDL apolipoproteins.72 Figure 7B demonstrates a PRM scan of the apoA-I peptide, THLAPYSDELR. A magnification of a fragment ion (b 4, THLA) shows its isotopes, M0–M3. Further magnification of the M3 isotope shows that the 2H M3 tracer ion is distinguishable from the M3 isotopolog (Figure 7B). Figure 7C contrasts the enrichment curves of 4 HDL apolipoproteins, apoA-I, apoA-II, apoE and apoM, against the decay curve of the of the plasma D3-Leu pool. This detailed view of HDL apolipoprotein metabolism, which was enabled by PRM technology, yielded completely novel findings about HDL metabolism (Figure 7D): the HDL size expansion model is not the predominant route for HDL metabolism; instead, the various HDL particles appear in circulation through their simultaneous release from the liver. These findings mark a pivotal change in HDL metabolism theory, and also highlight the power and versatility of PRM for clinical applications.

Figure 7.

PRM applied to in vivo metabolism studies. (A) Low incorporation of D3-Leu in plasma proteins makes the 2H M3 ion difficult to resolve from the M3 ion in a traditional MS1 scan. (B) PRM scan of an apoA-I peptide with very low D3-Leu tracer enrichment (<0.5%). The 2H M3 labeled ion is resolved from its sister M3 peak at the indicated resolution (R). (C) Enrichment curves for 4 apolipoproteins on the alpha3 size fraction of high-density lipoprotein. The plasma D3-Leu tracer curve is also plotted. All curves are for samples from the same subject. (C,D) Adapted with permission from Singh SA, et al.72

Summary and Outlook

We have highlighted the most common MS technologies with the emphasis on applications in the cardiovascular sciences. Cell culture experiments have enabled the development and validation of reliable and reproducible MS technologies; as a consequence, these technologies are now poised to take on the challenges associated with clinical studies including limited sample size, protein dynamic range and donor-to-donor variability. We project that the next major frontier in cardiovascular proteomics is accessing and extracting meaningful information from the circulatory system itself. Biomarker and drug target discovery and validation will rely on the continual development of technologies such as PRM, while still relying on the readily accessible and well-established methods such as label-free quantification.

Disclosures

This work is supported by a research grant from Kowa Company, Ltd (Nagoya, Japan, to M.A.).

References
 
© 2016 THE JAPANESE CIRCULATION SOCIETY
feedback
Top