The Tohoku Journal of Experimental Medicine
Online ISSN : 1349-3329
Print ISSN : 0040-8727
ISSN-L : 0040-8727
Regular Contribution
Targeted Next-Generation Sequencing Newly Identifies Mutations in Exostosin-1 and Exostosin-2 Genes of Patients with Multiple Osteochondromas
Xiaoyan GuoMingrui LinTengfei ShiWei YanWenxu Chen
Author information
JOURNAL FREE ACCESS FULL-TEXT HTML

2017 Volume 242 Issue 3 Pages 173-181

Details
Abstract

Multiple osteochondromas (MO) is one of the most common benign bone tumors in humans with an autosomal dominant hereditary mode. MO is a genetic heterogeneity disease with variable number and size of osteochondromas, as well as changeable number and location of diseased bones. Mutations in Exostosin-1/Exostosin-2 (EXT1/EXT2) genes are the main molecular basis of MO. EXT1 and EXT2 genes encode exostosin 1 and exostosin 2, respectively, both of which are transmembrane glycosyltransferases that elongate the chains of heparin sulfate (HS) at HS proteoglycans (HSPGs). HSPGs are considered to be involved in regulating the proliferation and differentiation of chondrocytes. Owing to large size of EXT1/EXT2 genes and lack of mutation hotspots, molecular diagnosis of MO is challenging. Here, we applied targeted next-generation sequencing (t-NGS) in mutation screening of EXT1/EXT2 genes for 10 MO patients. The results were compared and validated with Sanger sequencing. Overall, nine mutations identified by t-NGS were confirmed with Sanger sequencing, excluding two variants of false positive, suggesting the reliability of mutation screening by t-NGS. The nine mutations identified by t-NGS include two missense mutations (EXT1: c.1088G>A and c.2120C>T), one splicing mutation (EXT2: c.744-1G>T), and six nonsense mutations (EXT1: c.351C>G, c.1121G>A, and c.1843_1846dup; EXT2: c.67C>T, c.561delG, and c.575T>A). In summary, our paper provides the primary data of the application of t-NGS in MO molecular diagnosis, including six newly identified mutations (EXT1: c.1843_1846dup, c.1088G>A, c.351C>G, and c.2120C>T and EXT2: c.744-1G>T and c.575T>A), which further enrich the mutation database of MO from the Chinese population.

Introduction

Multiple osteochondromas (MO) is a commonly benign skeletal disease characterized by multiple cartilage-capped bony protuberances in the growth plate of long and tubular bones. MO is an autosomal dominant disease with an estimation of 1/50,000 in Caucasian population (Schmale et al. 1994). MO is a genetic heterogeneity disease with variable number and size of osteochondromas, as well as changeable number and location of diseased bones. Furthermore, complications are usually encountered in the patients, such as deformities of involved joints, functional limitations of joints and limbs, compression of nerves and blood vessels due to protruding masses, and short stature (Jennes et al. 2012; Bozzola et al. 2015; Aouini et al. 2015). However, peripheral chondrosarcoma is deemed to be the most serious complication with an incidence of 0.5-2% (Wicklund et al. 1995).

MO mainly results from mutations in two tumor-suppressor genes: Exostosin-1 (EXT1) (MIM*608177) (Ahn et al. 1995) and Exostosin-2 (EXT2) (MIM*608210) (Stickens et al. 1996; Wuyts et al. 1996). Nevertheless, no obvious mutations are detected from EXT1 or EXT2 gene in some families or sporadic patients with a rate of 10-15%, suggesting that some special types of mutations (inversions, translocations and somatic mosaicism) provoke the disease (Vink et al. 2005; Szuhai et al. 2011). Most mutations (80%) in EXT1/EXT2 genes are nonsense, splice-site, and small frameshift mutations and often introduce a premature stop codon that silences the functional genes (Jennes et al. 2009). Others are missense mutations (Cheung et al. 2001) and single-/multi-exon deletions/insertions (Jennes et al. 2008, 2011). Approximately 90% of MO patients have an affected parent, while about 10% of individuals develop MO owing to a de novo mutation (Jennes et al. 2009). So far, 436 variants from EXT1 gene and 223 variants from EXT2 gene have been described in Multiple Osteochondroma Mutation Database (http://medgen.ua.ac.be/LOVDv.2.0/home.php).

EXT1 and EXT2 genes encode exostosin 1 and exostosin 2, respectively, both of which act as transmembrane glycosyltransferases in the endoplasmic reticulum and Golgi apparatus. Exostosin 1 and exostosin 2 form a hetero-oligomeric complex that plays a role in the elongation of heparan sulfate (HS); HS is associated with the transduction of signal, which regulates the differentiation, ossification, and apoptosis of the chondrocyte (Senay et al. 2000; Esko and Lindahl 2001). Molecular diagnosis of MO is relatively complex and challenging due to the big size of the two genes and the absence of mutation hotspots. Generally speaking, Sanger sequencing combined with multiplex ligation-dependent probe amplification (MLPA) (White et al. 2004) and array comparative genomic hybridization (array-CGH) (Szuhai et al. 2011) is able to cover almost all types of mutations in EXT1/EXT2 genes, including point mutations, small fragment insertions/deletions, single-/multi-exon deletions/insertions and some inversions, translocations or somatic mosaicism. However, time-consuming and multi-step procedures, as well as high cost are the main problems from these methods. Whole genome sequencing or exome sequencing is an alternative technique for screening mutations in MO with a broad spectrum (Zhang et al. 2013; Liu et al. 2015), but they are also costly and need long checking-time. Second-generation sequencing, targeted next-generation sequencing (t-NGS), has already been applied in molecular diagnosis for neurofibromatosis type 1 (NF1) and Duchenne and Becker muscular dystrophies (DMD) with excellent accuracy and high efficiency (Pasmant et al. 2015; Okubo et al. 2016). T-NGS specially aims to targeted genes and has successfully detected almost all subtle mutations (point mutation, small deletions/insertions, mosaic mutation) and large deletions/insertions for a large cohort in a few days. Compared to the whole genome sequencing or exome sequencing, t-NGS is a cost-effective method for some genetic diseases with known pathogenic genes. In this paper, we introduced and validated a t-NGS approach through screening mutations for 5 EXT-mutated MO patients and prospectively analyzing 5 other MO patients, thereby developed a new diagnostic program for MO.

Materials and Methods

MO patients and samples

All MO cases were inpatients from the Second Hospital of Fuzhou Affiliated to Xiamen University, they were definitely diagnosed based on clinical manifestations, imaging and histopathological examinations. After the signing of informed consent, peripheral blood samples were obtained from 10 MO families, and 10 probands from these families were picked out to test and verify the clinical utility of t-NGS. Among them, 5 probands had been identified with mutations by Sanger sequencing previously, the remaining 5 probands were directly screened for mutations by t-NGS. Normal controls were healthy people with normal physical examinations in the hospital. This study was approved by the Ethics Committee of the Second Hospital of Fuzhou Affiliated to Xiamen University.

T-NGS of EXT1 and EXT2 genes

Genomic DNA was extracted from peripheral blood lymphocytes of MO probands and normal controls according to the procedures of SE Blood DNA Kit (Omega Bio-Tek, CA, USA), DNA purification was processed by Ampure Beads, and the purified DNA was tested by t-NGS with minimum amount of 30 ng for each sample. Samples were divided into two groups which were tested on an Ion Personal Genome Machine (PGM) sequencer (Thermo Fisher Scientific, MA, USA) one after another in different times: group 1 included five MO subjects with known variants which had been identified by Sanger sequencing; group 2 contained five newly gathered MO subjects without any information of gene mutation (Table 1). The design of multiplex primer pools targeting EXT1/EXT2 genes was based on Ion AmpliSeq Designer software (Thermo Fisher Scientific) (Table 2). The targeted regions covered 100% of EXT1/EXT2 genes, which mainly involved 100% of the coding regions, part of intronic regions from exon–intron boundary (10 base pairs), the 5′- and 3′-untranslated regions (UTRs). Theoretically the entire coverage could reach 99.2%. The EXT1 gene (reference sequence NM_000127.2) was theoretically covered at 100%, the EXT2 gene (reference sequence NM_207122.1) was theoretically covered at 98.4% (64 base pairs in the 5′ UTR of EXT2 gene were not covered). Targeted fragments were sequenced using an Ion AmpliSeq Library Kit 2.0 (Thermo Fisher Scientific) and performed on the Ion PGM sequencer utilizing 318 Chip (Thermo Fisher Scientific) based on the manufacturer’s protocol.

Table 1.

Clinical phenotype and genotype of MO probands in the study.

*+, joint deformity; ++, joint deformity, nerve compression or injury, or shortened limb; −, normal in joints and nerves.

Table 2.

Multiplex primers designed for EXT1 and EXT2 genes.

T-NGS bioinformatics analysis: variant identification

Ion Torrent Suite software 4.0.1 (Life Technologies) was used to extract sequence data and align with sequences of EXT1 and EXT2 genes in human. Variants such as single nucleotide variants (SNVs), small insertions/deletions were picked out through the Variant Caller plugin. The major variant screening parameters were set as follows: (1) For SNVs and small insertions/deletions, minimum allele frequency (MAF) was set for ≥ 10%. (2) For SNVs, minimum sequencing depth was set for ≥ 6 ×, and ≥ 15 × for small insertions/deletions. (3) For small insertions/deletions, minimum sequencing depth on either strand was set for ≥ 5 ×. The 10% MAF cutoff filter was used to avoid any possible false-negative event. Potential pathogenic variants selected from this filter were further analyzed based on two algorithms: variant score and variant frequency. Variant score was calculated through the NextGENe software (Softgenetics), while variant frequency referred to the 137 SNP database.

Variant validation through Sanger sequencing and pathogenicity analysis for the variants

EXT1/EXT2 variants filtered from the t-NGS approach were validated separately by Sanger sequencing of the corresponding mutated fragments. Sequence alignments and analysis were carried out respectively through DANMAN8.0 (Lynnon Biosoft, USA) and Chromas2 (Technelysium Pty Ltd, Tewantin, qLD, Australia). Two currently published genomic DNA sequences (NC_000008.11 and NC_000011.10) were chosen as reference sequences. Exons of EXT1 gene complied with the NCBI nomenclature (numbered 1-11), wheras exons of EXT2 gene were numbered 1-15. The primers, procedures and annealing temperatures of PCR before Sanger sequencing referred to two literatures (Wuyts et al. 2005; Tian et al. 2014).

The exostosin sequences of 18 species were downloaded from NCBI (http://www.ncbi.nlm.nih.gov/) to analyze the conservation of mutation sites. Three bioinformatics softwares—PolyPhen (Ramensky et al. 2002), SIFT (Ng and Henikoff 2003) and PROVEAN (Choi and Chan 2015) were selected to predict the functional effect of amino acid changes. The CRYP-SKIP server (http://cryp-skip.img.cas.cz/), MaxEntScan (http://genes.mit.edu/burgelab/maxent/Xmaxentscan_scoreseq_acc.html) and Human Splicing Finder (HSF) (http://www.umd.be/HSF/) were used to analyze the splice mutation. The corresponding fragments which mutated in MO patients from 100 normal controls (200 alleles) were sequenced for polymorphism analysis.

Results

Clinical, imaging, histopathological examinations and pedigree investigation

From all subjects invited in this study we obtained detailed physical, imaging examinations and pedigree investigation. The diagnosis was all confirmed by histopathological examination finally. The affected parts mainly involved long bones, adjacent joints, nerve and their function (Table 1).

Identification of EXT1/EXT2 variants by t-NGS

Five MO patients in group 1 (Subjects 1-5) with probable causative variants through Sanger sequencing were detected by t-NGS on a 318 chip successfully, and normal control (NC) was set for assessing the efficiency of capturing the targeted area. A total of 78.3 MB data was obtained, and the coverage of the targeted area was close to or exceeded 95% except subject 5 (Table 3). Finally, five variants in subjects 1-5 were all discovered by t-NGS, in which two variants were identified in EXT1 gene and three in EXT2 gene (Fig. 1, Table 1). Then, five newly gathered MO patients with unknown variants (Subjects 6-10 in group 2) were detected by t-NGS; 56 MB data were achieved and the coverage of the targeted area was 93.55% (20×) (Table 3). However, only four suspicious pathogenic variants were separately detected in subjects 6-9 by t-NGS, and the variants were validated by Sanger sequencing (Fig. 1, Table 1). No pathogenic variant was detected in subject 10 by t-NGS, or by Sanger sequencing.

In addition, three variants (EXT1: c.1065C>T and c.1761G>A; and EXT2: c.28 C>A) were detected by t-NGS and confirmed by Sanger sequencing. However, they were all synonymous mutations in which c.1065C>T was found in subject 3, c.1761G>A in subjects 4-6 and 9, and c.28 C>A in subjects 3, 4, 6 and 10.

Fig. 1.

Variants detected in EXT1/EXT2 genes from 9 MO subjects.

①-⑤: Five variants identified by Sanger sequencing previously were also detected by t-NGS (EXT1: ① c.1843_1846dup, ② c.1088G>A; EXT2: ③ c.744-1G>T, ④ c.575T>A, ⑤c.561delG). ⑥-⑨: Four variants detected by t-NGS were validated by Sanger sequencing (EXT1: ⑥ c.351C>G, ⑦ c.1121G>A, ⑧ c.2120C>T; EXT2: ⑨ c.67C>T). The top of the figure was a partial view from t-NGS. Blue bar indicated reverse sequencing, pink bar indicated forward sequencing, each bar was a read. The lower part was the corresponding results of Sanger sequencing, the red arrows indicated the positions of the variants (MT: mutant type, WT: wild type).

Table 3.

Summary reports for 10 MO patients from t-NGS.

Exclusion of false positive and pathogenicity analysis of three non-synonymous SNVs

Subjects 5, 6, and 10 were all found with a point mutation of one thymine (c.59T>G, p.Leu20Trp) and a deletion of one thymine (c.64delT, Tyr22Ilefs*114) by t-NGS, both of which were discovered from a homopolymeric stretch in exon 1 of EXT1 gene; however, the two variants were verified to be false positive by Sanger sequencing, ultimately.

EXT1 and EXT2 genes are composed of 11 exons and 15 exons, respectively (Fig. 2A). C.1088G>A/p.Gly363Glu, c.2120C>T/p.Thr707Met and c.744-1G>T were three newly discovered non-synonymous SNVs. PolyPhen, SIFT and PROVEAN predicted that c.1088G>A was probably a pathogenic mutation while c.2120C>T was possibly damaging (Table 4). Multiple sequence alignment of exostosin 1 showed that both amino acids in the sites of 363 and 707 were highly conserved in 18 different species (see Fig. 2C). Moreover, both exon 3 and exon 11 of EXT1 gene in 100 normal controls were sequenced without related mutations. The CRYP-SKIP server predicted that the probability of cryptic splice site activation (PCR-E) was 0.7, while exon 5 skipping was 0.3 (1-PCR-E) due to c.744-1G>T in EXT2 gene (Fig. 2B). HSF indicated that the altered acceptor site (atTC) was probably affecting splicing and its consensus value (0-100) dropped to 55.77 (agTC: 84.72). HSF also provided a potential acceptor site (agGT) at 31 base pairs downstream the wild-type acceptor site (agTC) with a consensus value of 90.89, MaxEntScan illustrated that the mutated-type acceptor site had a lower MAXENT (1.96) while the MAXENT of wild-type acceptor site was 10.56.

Fig. 2.

Mutation spectrum of the EXT1/EXT2 genes and pathogenicity analysis of three nonsynonymous mutations.

A: *Newly identified mutations in the MO probands in the present study, black boxes indicate the constitutive exons of EXT1 and EXT2 genes, gray boxes indicate the alternative exons of EXT2 gene; B: Prediction for c.744-1G>T in EXT2 gene from CRYP-SKIP; C: Part results from multiple sequence alignment of exostosin 1 among eighteen species.

Table 4.

Prediction of the pathogenicity for two missense mutations by bioinformatics.

Discussion

Molecular diagnosis of MO is challenging. In previous studies, Sanger sequencing or denaturing high performance liquid chromatography (DHPLC) combined with Sanger sequencing of EXT genes was usually preferred. If there were no subtle mutations from Sanger sequencing, MLPA was performed to detect exon deletion/duplication, or array-CGH was used to search for some rare types of mutations, such as unbalanced translocations or mosaic deletions/duplications. As t-NGS is able to detect almost all types of mutations in MO except some unbalanced translocations, t-NGS is helpful to simplify the diagnostic process of MO. In our study, t-NGS approach was successfully applied in mutation screening for 10 MO patients; it is probably an alternative technique in routine molecular diagnosis for MO in the future.

After definite diagnosis, 10 MO patients were chosen for our study, nine of them were males and one female, their average onset age was close to 9 years old. Three patients had at least one affected parent, while seven were sporadic patients. The first group with known mutations was selected to evaluate the sensitivity and specificity of the t-NGS strategy compared to Sanger sequencing. Finally, the five known mutations were also detected by t-NGS, which illustrated the accuracy and sensitivity of t-NGS. Among these mutations, c.1088G>A and c.1843_1846dup were identified in EXT1 gene; c.1088G>A is a missense mutation which results in glutamic acid instead of glycine at the 363 amino acid of exostosin 1, and c.1843_1846dup is a frameshift mutation which is expected to create a premature stop codon at the 616 amino acid of exostosin 1. C.561delG, c.575T>A and c.744-1G>T were found in EXT2 gene; c.561delG is a frameshift mutation which could create a premature stop codon at the 269 amino acid of exostosin 2, c.575T>A could also introduce a premature stop codon at the 192 amino acid of exostosin 2, and c.744-1G>T is a splice mutation that could cause skipping of exon 5 of EXT2 gene or abnormal splicing due to the mutation (Fig. 2A, Table 1).

The second group was prospectively sequenced on PGM sequencer, and four probable pathogenic mutations were picked out, in which c.351C>G, c.1121G>A and c.2120C>T were from EXT1 gene and c.67C>T from EXT2 gene. All of them were point mutations and confirmed by Sanger sequencing. C.351C>G and c.1121G>A were both nonsense mutations which led to early appearance of stop codons at the 117 amino acid and the 374 amino acid of exostosin 1, respectively. c.2120C>T is a missense mutation which could replace threonine by methionine at the 707 amino acid of exostosin 1, and c.67C>T in EXT2 gene is a nonsense mutation which could truncate exostosin 2 at the 23 amino acid. One patient (Subject 10) with negative result from t-NGS was also confirmed by Sanger sequencing (Fig. 1, Table 1). Meanwhile, three synonymous mutations were identified by t-NGS in exonic regions of EXT1/EXT2 genes from six MO patients, and the results were consistent with Sanger sequencing. Considering that all the variants identified by Sanger sequencing were also detected by t-NGS in our study, t-NGS may be a better alternative mutation screening method in molecular diagnosis of MO.

However, two suspicious pathogenic mutations (c.59T>G, c.64delT) were detected in EXT1 gene by t-NGS from three MO probands (Subjects 5, 6, 10), but they were determined to be false positive by Sanger sequencing. Thus, Ion Torrent PGM had limited accuracy especially in case of homopolymers of any length (Loman et al. 2012). Subject 10 was discovered with no pathogenic mutation by t-NGS, and the subsequent DNA Sanger sequencing confirmed the negative result. The unsuccessful detection may exist in both methods, especially if there is exon deletion/duplication, unbalanced translocation, or mosaic mutation in subject 10. We were regretful that we had not verified the result through cDNA Sanger sequencing, MLPA, or array-CGH as recommended (Szuhai et al. 2011; Sabbagh et al. 2013). However, the negative patient shows a sporadic form of moderate MO phenotype (Table 1) (Francannet et al. 2001); there may be a low-abundance mosaic mutation in the patient, which may explain the undetectable somatic EXT mutation from peripheral blood. However, t-NGS is helpful to detect mosaic mutation (Pasmant et al. 2015).

Meanwhile, 6 mutations were newly identified in our study (EXT1: c.1843_1846dup, c.1088G>A, c.351C>G, and c.2120C>T; and EXT2: c.744-1G>T and c.575T>A), while the remaining 3 mutations had been reported previously (Multiple Osteochondroma Mutation Database) (Fig. 2A). C.1843_1846dup, c.351C>G and c.575T>A are all nonsense mutations which probably cause the transcripts with early stop codons to activate the mechanism of RNA surveillance and nonsense-mediated mRNA decay (Gerards et al. 2010). Thus, the functional exostosin 1 and exostosin 2 would be reduced. C.1088G>A and c.2120C>T are missense mutations, prediction of computer and bioinformatics, as well as polymorphism analysis of both mutation sites suggest that the two new mutations were probably damaging and responsible for the occurrence of MO (Fig. 2C, Table 4). C.744-1G>T in EXT2 gene was a splice mutation, both HSF and MaxEntScan indicated that the splicing probability of 3′ splice site of exon 5 would decrease due to the mutation, and a potential splice site could be activated according to the prediction of HSF and the CRYP-SKIP server, moreover, if the potential acceptor site (agGT) at the 31 base pairs downstream the wild-type acceptor site is activated, exon 5 of EXT2 gene will be aberrantly spliced and a premature termination codon is to appear at 31 base pairs downstream the potential acceptor site of the abnormally spliced exon 5, then exostosin 2 is truncated and functional protein is also reduced.

In short, we applied t-NGS in molecular diagnosis of MO for the first time, and we successfully identified 9 mutations in EXT1/EXT2 genes from 10 MO subjects. Among the nine mutations, six mutations were newly discovered, which further enriched the mutation database of MO from the Chinese population.

Acknowledgments

Supported by the Science and Technology Project of Fuzhou (2015-S-141-14) and Youth Research Project of Health and Family Planning in Fujian Province (2016-2-41).

Conflict of Interest

The authors declare no conflict of interest.

References
 
© 2017 Tohoku University Medical Press
feedback
Top