2016 Volume 5 Issue 1 Pages A0045
A workflow based on liquid chromatography/high-resolution mass spectrometry (LC/HR-MS) was applied for the identification of compounds in urban environments. Substances extracted by solid-phase extraction from river water were wholly analyzed by LC/HR-MS without any purification. Fragmentation in collision-induced dissociation was manually studied for the 20 most intense ions in positive- and negative-ion electrospray ionization with accurate mass determination at a resolution of 100,000. Sixteen anthropogenic compounds in the extract were identified and confirmed using standard reference reagents. These compounds consisted of pharmaceuticals, surfactants, flame retardants, and industrial intermediates. The majority of the compounds are common in our daily life. In the identification process, two automated methods, MAGMa and MetFrag/MetFusion, for reading fragmentation were evaluated for the sixteen compounds. Although automated methods could be used to retrieve the correct molecular structures in most cases, they could not always be promoted to the top rank. Automated methods have yet to be a complete solution for identifying chemical compounds, but will considerably reduce the burden for humans in reading fragmentation.
Mass spectrometry (MS) is a key technology for the identification and structural elucidation of small molecules. The molecular formulas of small molecules can be determined from accurate mass values measured by high-resolution (HR) MS because relative atomic masses other than carbon are not strictly integer quantities. Furthermore, there is a close relationship between molecular structure and reactivity of ions in MS, so information on molecular structure can be obtained by interpretation of their fragment ions in the mass spectra. HRMS can make interpretation of mass spectra fairly certain due to its high accuracy and precision. Although a mass spectral library of reference standards is quite helpful in the process of interpreting mass spectra, existing libraries are insufficient for reading all observed data. Of course, attempts to enrich the libraries are continuing, but the situation is not improving rapidly. In addition, ionization and fragmentation are sometimes inherent to instrumental conditions particularly in liquid chromatography/mass spectrometry, which is often applied to polar compounds. Therefore, manual interpretation of the resulting data continues to be time consuming and cumbersome.
During the last five years, several promising automated methods for interpreting accurate mass spectral data have appeared on the scene. One of the methods is the utilization of public molecular structure databases such as KEGG (Kyoto Encyclopedia of Genes and Genomes), PubChem, and ChemSpider, which are more comprehensive than are mass spectral libraries. Insufficiency in mass spectral libraries is considered serious even in the field of environmental research because the number of unregulated emerging micropollutants has increased over the last few decades. The environmental risk caused by unregulated pollutants is of great concern, and environmental researchers are always finding new targets. Many were studied and some of them have been introduced into regulation. However, there is no method by which environmental researchers can evaluate all emerging pollutants and their transformation products. Regulation has been implemented in order of priority level, but potential threats to ecosystems and human health by overlooked compounds remain.
During the last decade, new methodology coupled with HR-MS, referred as to suspect screening and nontarget analysis, has been introduced in environmental analysis.1–5) Hug et al.6) and Schymanski et al.7) found novel micropollutants and transformation products in their nontarget strategy. In this study, we used two automated methods, MAGMa8) and MetFrag/MetFusion,9,10) to identify compounds in an urban aquatic environment and evaluate the two methods using the results. The methods are available on the Internet and they performed well in a public contest, CASMI2013 (Critical Assessment of Small Molecule Identification).11)
In February 2015, 500 mL of river water was collected from Neyagawa River, Osaka City, Japan, and passed through a solid phase extraction cartridge, Presep-C Agri Short (Divinylbenzene polymethacrylate copolymer sorbent, 200 mg, Wako Pure Chemical Industries, Ltd.) without pH adjustment. After drying at reduced pressure by a vacuum manifold assembly, substances that had absorbed on the cartridge were eluted by 5 mL of acetonitrile. The eluate was analyzed by LC/MS without further cleanup. A blank experiment using ultrapure water, which was generated by TORAYPURE LV-10T (Toray, Tokyo, Japan), was also carried out.
InstrumentalFor nontarget analyses, an LC/MS system consisting of an Ultimate 3000 (Dionex, Sunnyvale, CA, USA) and an Exactive Orbitrap Mass Spectrometer (Thermo Fisher Scientific Inc., Waltham, MA, USA) was used. Both positive- and negative-ion electrospray ionization (ESI) were applied. Mass spectra were acquired from m/z 100 to 2,000 with a resolution of 100,000 and a mass accuracy of 5 ppm at m/z 200. Mass calibration was carried out with a standard calibration mix including Ultramark 1621, sodium dodecyl sulfate, caffeine, and peptide MRFA (Thermo). For observation of fragment ions, all ion fragmentation (AIF) mode was applied at 25 eV. In the AIF mode, multiple dissociation techniques such as in-source collision-induced dissociation and higher-energy collisional dissociation cell were employed. Data were acquired both under MS mode and under AIF mode. Determination of elemental composition from accurate mass measurement was carried out with Thermo Fisher Xcalibur Software. The LC column was an ODS-100S column (2.0 mm×150 mm, 5 μm; Tosoh Corp., Tokyo, Japan). A gradient condition consisting of 2 mM ammonium bicarbonate in ultrapure water (A) and methanol (B) was adopted. The gradient, expressed as changes in mobile phase B, was as follows: 0–2 min, hold at 30% B, 2–15 min, a linear increase from 30% to 100% B; 15–20 min, hold at 100% B; 20–25 min, equilibration at 30% B. Another ODS-100S column was installed before the autosampler to delay elution of substances from the LC system. Confirmation of substances with reference reagents was conducted using a triple quadrupole mass spectrometer. A system consisting of Agilent 1100 (Agilent Technologies, Palo Alto, CA, USA) and an API 2000 (Sciex, Foster City, CA, USA) was used. The condition of selected reaction monitoring in ESI was optimized for each substance.
Approaches to explain the mass spectra examined in this studyMAGMaRidder et al. introduced an automated method for the interpretation of accurate mass spectra in 2012.8) The method is based on an algorithm for candidate substructure annotation of multistage accurate mass spectral trees without relying on a spectral library. First, candidate structures are retrieved from a compound database by querying on monoisotopic mass. Next, fragmentation of structural skeletons (i.e., non-hydrogen atoms) is performed by removing each non-hydrogen atom sequentially and collecting substructures. During the fragmentation, a simple penalty score that depends on the type of bond is taken into account. Then in silico generated substructures are assigned to accurate m/z values in a mass spectrum. Referred molecular structures are retrieved from chemical compound databases such as PubChem, Human Metabolite Database (HMDB), and KEGG. An online version, MAGMa, has a user-friendly web interface, http://www.emetabolomics.org/magma/. Use of this method was selected as the best automated tool of the international contest, CASMI2013, which tested the ability to explain blind mass spectral data.11) In the present study, PubChem was selected as the compound database.
MetFrag/MetFusionMetFrag was developed in 2010 by Wolf et al. MetFrag also simulates fragmentation to explain ions in a mass spectrum.9) Before bond disconnection of candidate structures, a small set of rules for molecular rearrangements are applied. Then all bonds to be disconnected are labeled linear or ring. A redundancy check is performed at every disconnection process. The in silico fragments are matched against the query peak list. Referred molecular structures are retrieved from a chemical compound database such as PubChem, KEGG, or ChemSpider. A web application is available at the following URL: http://msbi.ipb-halle.de/MetFrag/. MetFusion is the successor of MetFrag and was released by Gerlich et al. in 2013.10) MetFusion incorporates the similarity using a mass spectral library in MetFrag scoring. Its spectral library refers to MassBank, HMDB, NIST’11, and METLIN. As with MAGMa, PubChem was selected as the compound database.
Preparation of queries for automated methodsInitially, obvious noise and substances detected in the blank experiment were excluded from the examination for compound identification. The intense ions in the mass spectra were extracted from each LC/MS analysis according to intensity. In ESI, without advance information, we usually cannot know whether these ions indicate which kind of ionic form, such as molecular ions [M]+, protonated molecules [M+H]+, sodiated molecules [M+Na]+, or fragments of unstable precursors. Relevant ions with identical chromatographic peak shapes were extracted in order to assign specific ionic forms to them. The difference in m/z values between the ions offers good criteria for annotation. For example, the m/z differences of [M+Na]+ from [M+NH4]+ and [M+H]+ are 4.955 and 21.982, respectively. Another LC/MS analysis in opposite polarity also can be used as criteria. Ions with an m/z difference of 2.011 with identical chromatographic peak shapes in both polarities is evidence of the presence of [M+H]+ and [M−H]−. Ions related to isotopologues and adduct ions were grouped as a single feature. The 20 most intense features were extracted. After the assignment of precursor ions, analyses were carried out in AIF mode. Ions with the same chromatographic peak shapes as precursor ions were extracted to find fragment ions, just as the aforementioned assignment of a feature. Ions with great differences in the first decimal place of the m/z values from that of precursor ions were excluded. A list of accurate masses of precursor and fragment ions was consolidated to obtain a virtual precursor ion and product ion relationship for each feature. This process is quite laborious but necessary because our Exactive is not a multistage mass spectrometer. The list was converted to appropriate formats depending on the requirements of each automated method. All m/z and peak area values used for evaluating automated methods are summarized in the electronic supplementary material (Table S7).
Table 1 is a list of the 20 most intense features in positive- and negative-ion ESI and m/z values of ions with highest intensity in each feature. In positive ion mode, feature P1 with the highest intensity among the features showed an m/z value of 240.1497. P1 included a weak ion that had an m/z of 479.2950, the value of twofold minus 1.005. This relationship could be considered that of [M+H]+ and [2M+H]+. Therefore, the ion of m/z 240.1497 could be annotated as [M+H]+. The m/z values of the examples of the major fragment ions were 106.0654, 108.0811, 133.0763, and 223.1232. Typical chromatograms are shown in Fig. 1. Table 2 is a list of m/z value, peak area, and its isotopologue composition of suspected fragment ions of P1. The isotopologue composition indicated the absence of elements that had a specificity at X+2 such as chlorine, bromine, and sulfur, and the presence of roughly 15 carbon atoms. In addition, the presence of C7H7+ (P1AIF1 in Table 2) suggested a benzyl or tolyl substructure. The candidate formula of fragment ion P1AIF7 corresponded to the neutral loss of fragment ions P1AIF3 or P1AIF4 in Table 2. The values of ring and double bond equivalents (RDB) of P1AIF3, 4, and 7 were fairly consistent with the inclusion of a tolyl or benzyl substructure. Therefore a highly symmetric structure was suspected. Although the m/z value, 240.1497, has several candidate formulas such as C13H23NOP, C15H18N3, and C17H20O, only C15H18N3 is consistent with the elemental compositions of the fragment ions of P1 because the precursor ion must include nitrogen atoms. Given that feature P1 is composed of two identical substructures, the consistent substructures were considered to be aminobenzyl, anilinomethyl, benzylamino, or toluidino groups. Each substructure contributes 4 of RDB. Hence, the residual structure must consist of one atom each of carbon, hydrogen, nitrogen, and one double bond, and connect two substructures at the carbon atom. Features with substructures of the benzylamino and toluidino groups can be considered having a guanidine structure. Figure 2 shows the probable structures for P1, which correspond to the toluidino (A), benzylamino (B), anilinomethyl (C), and aminobenzyl (D) substructures. The cleavage positions of the bonds to generate the fragment ions listed in Table 2 are also shown in Fig. 2. As shown in Fig. 2, substance A could explain the generation of most fragment ions by simple scission. Consequently, substance A, tolylguanidine, was most likely but the methyl position on the phenyl ring could not be determined from interpretation of the fragmentation.
Positive ion mode | Negative ion mode | ||||||
---|---|---|---|---|---|---|---|
No. | Accurate mass | RT*/min | Peak area | No. | Accurate mass | RT/min | Peak area |
P1 | 240.1497 | 6.5 | 21795672 | N1 | 325.1834 | 20.1 | 15039200 |
P2 | 212.1184 | 4.4 | 11645483 | N2 | 311.1676 | 18.9 | 14519946 |
P3 | 399.2509 | 21.3 | 4832006 | N3 | 339.1990 | 21.6 | 10083557 |
P4 | 368.4253 | 23.0 | 4224291 | N4 | 297.1523 | 17.7 | 5756181 |
P5 | 235.1806 | 4.8 | 3697944 | N5 | 421.2256 | 20.4 | 1298573 |
P6 | 298.2226 | 4.1 | 2799889 | N6 | 180.9721 | 2.5 | 1071585 |
P7 | 204.1385 | 16.5 | 2521985 | N7 | 248.9598 | 2.5 | 573028 |
P8 | 312.3626 | 22.1 | 2312859 | N8 | 396.2415 | 16.2 | 564249 |
P9 | 342.2489 | 4.1 | 1517764 | N9 | 313.1112 | 3.0 | 465045 |
P10 | 391.2303 | 13.7 | 1315704 | N10 | 253.2164 | 24.3 | 417956 |
P11 | 230.2480 | 17.5 | 1282643 | N11 | 651.3744 | 20.1 | 356371 |
P12 | 515.2446 | 17.0 | 1255801 | N12 | 623.3431 | 18.9 | 341778 |
P13 | 252.2171 | 14.5 | 1071096 | N13 | 316.9471 | 2.5 | 269727 |
P14 | 369.3842 | 22.0 | 1070924 | N14 | 513.2285 | 17.0 | 262597 |
P15 | 326.3783 | 21.3 | 812233 | N15 | 128.9585 | 2.5 | 215132 |
P16 | 248.2375 | 15.0 | 795483 | N16 | 283.1367 | 16.6 | 162474 |
P17 | 294.1550 | 13.9 | 786037 | N17 | 281.0849 | 3.1 | 137739 |
P18 | 748.4850 | 13.7 | 757304 | N18 | 264.9338 | 2.5 | 129963 |
P19 | 284.3313 | 21.3 | 749116 | N19 | 543.0455 | 16.5 | 125226 |
P20 | 310.2590 | 16.5 | 735019 | N20 | 269.0369 | 17.8 | 109648 |
*RT, retention time.
No. | Accurate mass | Peak area | Isotope pattern | Candidate formula | Exact mass | Error (ppm) | RDB | Counterpart to [M+H]+ | Corresponding scission* | ||
---|---|---|---|---|---|---|---|---|---|---|---|
X | X+1 | X+2 | |||||||||
P1AIF1 | 91.0546 | 187745 | 100 | 7.1 | — | C7H7+ | 91.0542 | 4.4 | 4.5 | C8H11N3 | iii or ii,’ vi (+1) |
P1AIF2 | 93.0576 | 207917 | 100 | 4.3 | — | C6H7N+ | 93.0573 | 3.2 | 4 | C9H11N2 | ii,’ iv (+2) or v |
P1AIF3 | 106.066 | 1770545 | 100 | 7.3 | — | C7H8N+ | 106.0651 | 2.8 | 4.5 | C8H10N2 | ii’ |
P1AIF4 | 108.0811 | 3758787 | 100 | 7.6 | — | C7H10N+ | 108.0808 | 2.8 | 3.5 | C8H8N2 | ii’(+2) |
P1AIF5 | 116.0497 | 254521 | 100 | — | — | C8H6N+ | 116.0495 | 1.7 | 6.5 | C7H12N2 | i, ii(−1) |
P1AIF6 | 118.0654 | 75726 | 100 | — | — | C8H8N+ | 118.0651 | 2.5 | 5.5 | C7H10N2 | i, ii(+1) |
P1AIF7 | 133.0763 | 4445934 | 100 | 8.6 | — | C8H9N2+ | 133.076 | 2.3 | 5.5 | C7H9N | ii |
P1AIF8 | 208.0998 | 79498 | 100 | 10.9 | — | C14H12N2+ | 208.0995 | 1.4 | 10 | CH6N | i, iv |
P1AIF9 | 223.1232 | 1394438 | 100 | 12.2 | — | C15H15N2+ | 223.123 | 0.9 | 9.5 | NH3 | i |
P1AIF10 | 240.1497 | 3500331 | 100 | 16.2 | 1.1 | C15H18N3+ | 240.1495 | 0.8 | 8.5 |
* The numbers of scission positions in this column are illustrated in Fig. 2. The number in parentheses indicates mass shift in nominal mass.
Queries for MAGMa and MetFrag were prepared using ions having a relative intensity of over 1% of the base peak according to their format and submitted. Where the submitted fragment ions begin and end is difficult to define because one is sometimes redundant and another is sometimes essential. We never know the judgment criteria. First, intensity criteria were determined to be 1%. The number of retrieved candidates within 3 ppm accuracy tolerance by MAGMa and MetFrag were 2,708 and 2,738, respectively. The top rank consisted of not just one compound but of five compounds for MAGMa and three compounds for MetFrag. MAGMa has a second parameter called refscore besides the fragment ion is based on a candidate score. Refscore is estimated from the number of PubChem records for one compound. The top rank of refscore was 1,3-di-o-tolylguanidine, which was one of the 14 compounds having the next best score. The identification was confirmed by LC/MS/MS with an authenticated reagent purchased from Sigma-Aldrich (St. Louis, MO, USA). Therefore, 1,3-di-o-tolylguanidine was determined to be the correct answer. In the case of MetFrag, the answer ranked 100th with five incorrect compounds that had the same score as the answer. MetFusion could not improve the result. Then, queries with all detected fragment ions were submitted. The MAGMa answer ranked 16th with seven incorrect compounds; the MetFrag answer ranked 17th with five incorrect compounds. In so far as rank goes, the rank of MAGMa worsened and the rank of MetFrag improved by the submission of all detected fragment ions. In the case of P1, P1 had a high number of candidates and many candidates concentrated with close scores. Therefore, subtle differences in scoring resulted in large differences in rank.
Around the peak of 1,3-di-o-tolylguanidine, another peak was observed with an m/z difference of 28.0312 corresponding to C2H4 and the second highest intensity. The m/z value was 212.1185. The ion closely resembled 1,3-di-o-tolylguanidine in its fragmentation pattern. Therefore, diphenylguanidine could easily be inferred. It was purchased from Tokyo Chemical Industry (Tokyo, Japan) to confirm the identification. After submitting queries of diphenylguanidine to both MAGMa and MetFrag, the correct answer ranked 3rd in 1,442 candidates and 88th in 1,499 candidates, respectively.
In a similar way, fragmentation data were sequentially interpreted to determine compounds. All identification processes were completed by using authentic standard reagents. Identified compounds and their results with MAGMa and MetFrag are listed in Table 3. Detailed interpretation of each feature is provided in the electronic supplementary material. Identification could be completed for more than half of the features. Automated methods were able to retrieve the correct answers but not always promote them to be top rank.
Positive ion mode | |||||
---|---|---|---|---|---|
No. | Substance | Structure | Submitted ions | MAGMa | MetFrag |
P1 | 1,3-Di-o-tolylguanidine (C15H17N3, CAS RN 97-39-2) | ![]() | 11 | 6(14)/2708* | 100(6)/2738 |
P2 | 1,3-Diphenylguanidine (C13H13N3, CAS RN 102-06-7) | ![]() | 5 | 3(26)/1442 | 88(10)/1499 |
P3 | Tris(butoxylethyl)phosphate (C18H39O7P, CAS RN 78-51-3) | ![]() | 5 | 1(5)/785 | 8(16)/1249 |
P7 | Crotamiton (C13H17NO, CAS RN 483-63-6) | ![]() | 12 | 13(10)/6116 | 13(10)/6409 |
P9 | Ethylene oxide | ||||
P10 | 2,4,6-Tris[bis(methoxylmethyl)amino]-1,3,5-triazine (C15H30N6O6, CAS RN 3089-11-0) | ![]() | 8 | 18(5)/119 | 59(3)/390 |
P11 | Dimethyldodecylamine oxide (C14H31NO, CAS RN 1643-20-5) | ![]() | 2 | 1(1109)/1109 | 1(1137)/1137 |
P12 | Telmisartan (C33H30N4O2, CAS RN 144701-48-4) | ![]() | 3 | 1(26)/424 | 2(27)/1029 |
P14 | Stearamidopropyl dimethylamine (C23H48N2O, CAS RN 7651-02-7) | ![]() | 4 | 1(11)/114 | 2(10)/103 |
P16 | Benzalkonium chloride (C17H30N+, CAS RN 46917-11-7) | ![]() | 4 | 1(36)/2821 | 1(10)/113 |
P18 | Clarithromycin (C38H69NO13, CAS RN 81103-11-9) | ![]() | 5 | 3(11)/48 | 2(10)/47 |
* The denominators, numerators, and parentheses, respectively, denote the number of retrieved substances, the order of the correct answer, and the number of substances with the same score as the correct answer.
In the case of negative ion mode, the top four features had a constant interval of m/z 14.015 corresponding to CH2. Their chromatograms are shown in Fig. 3 and indicate the presence of many isomers. Furthermore, in their fragmentation, these substances resembled a principal fragment ion, m/z 183.0110. Neutral losses of the precursor ions corresponded to the values of CnH2n+2. This indicates the presence of an identical substructure and an alkyl chain with a different chain length and different substitution position. The m/z values of the common fragment ions were 183.0110 and 119.0491. The isotopologue composition of these fragment ions was different in X+2. The mass defect of these fragment ions indicated a loss of an element with a large negative mass defect such as sulfur. The accurate mass difference 63.9619 between the two fragment ions corresponded exactly to sulfur dioxide. The elemental composition candidates of the fragment ion 119.0491 included C6H5N3 and C8H7O. C8H7O was more likely because the RDB of C6H5N3 was integral and it was not consistent with its anionic form. Its RDB was 5.5 and suggested the presence of a phenyl ring. It was possible that the precursors had amphipathic structures such as sulfonate-type surfactants. The loss of CnH2n+2 is typical fragmentation in anionic surfactants.12) In Japan, linear alkyl benzenesulfonate (LAS) is a leading detergent with an annual manufactured and imported quantity of 48,160 t in FY 2013.13) Its monoisotopic masses of anionic forms are 297.1530 (4-decan-x-ylbenzenesulfonate, x=2–5, C10LAS), 311.1686 (4-undecan-x-ylbenzenesulfonate, x=2–6, C11LAS), 325.1843 (4-dodecan-x-ylbenzenesulfonate, x=2–6, C12LAS), and 339.1999 (4-tridecan-x-ylbenzenesulfonate, x=2–7, C13LAS). These values coincided with the values observed (N1–N4 in Table 1). The monoisotopic masses of their dimers also coincided (N11 and N12). The product ion, m/z 183 (in nominal mass), is well known in typical LAS analyses by LC/MS/MS.14) Eventually, the identification of LAS was confirmed by using reference standards (Wako Pure Chemical Industries, Ltd., Osaka, Japan). These m/z values were submitted to both MetFrag and MAGMa. MAGMa and MetFrag retrieved 219 and 446 candidates for C10LAS, respectively. The correct answer ranked 1st in MAGMa and 4th in MetFrag, respectively. However, numerous candidates ranked at the same position as the correct answer. Although the retrieved candidates included LAS, neither tool could explain these fragment ions. In the fragmentation of sulfonate-type surfactants, a product ion, SO3−, is conceivable. The present collision energy, however, was not enough to generate SO3−, which would improve the results. Identified compounds and their results of MAGMa and MetFrag are listed in Table 4. Detailed interpretation of each compound is provided in electronic supplementary material.
Negative ion mode | |||||
---|---|---|---|---|---|
No. | Substance | Structure | Submitted ions | MAGMa | MetFrag |
N1 | C12-LAS (C18H29O3NaS, CAS RN 25155-30-0) | ![]() | 3 | 1(184)/184* | 8(89)/515 |
N2 | C11-LAS (C17H27O3NaS, CAS RN 27636-75-5) | ![]() | 3 | 1(170)/170 | 16(52)/579 |
N3 | C13-LAS (C19H31O3NaS, CAS RN 26248-24-8) | ![]() | 4 | 2(153)/154 | 6(32)/124 |
N4 | C10-LAS (C16H25O3NaS, CAS RN 1322-98-1) | ![]() | 3 | 1(219)/219 | 4(442)/446 |
N5 | Aerosol OT (C20H37O7SNa, CAS RN 577-11-7) | ![]() | 5 | 1(42)/4160 | 35(30)/300 |
N11 | Dimer of C12-LAS | ||||
N12 | Dimer of C11-LAS | ||||
N14 | Telmisartan (C33H30N4O2, CAS RN 144701-48-4) | ![]() | 4 | 364(144)/497 | 4(18)/739 |
N19 | Bicalutamide** (C18H14N2O4F4S, CAS RN 90357-06-5) | ![]() | 5 | 25(6)/1071 | 3(5)/1190 |
* The denominators, numerators, and parentheses, respectively, denote the number of retrieved substances, the order of the correct answer, and the number of substances with the same score as the correct answer. ** The query was submitted under the condition of 5 ppm tolerance and halogen inclusion.
Schymanski et al. has proposed a level system to facilitate the communication of compound identification confidence in the field of environmental research.15) According to the level system, the 16 present compounds reached Level 1 confidence (Confirmed structure). These compounds consisted of pharmaceutical compounds, surfactants, a flame retardant, and other industrial materials. Osaka City has a population of about 2.6 million and is situated at the mouth of the Yodo River on Osaka Bay. There are many streams and watercourses, which often become stagnant or even flow backwards. Hence, the aquatic environment is considerably influenced by human activity.16,17) The compounds found in the present study also reflected this situation. Crotamiton, telmisartan, and clarithromycin are common pharmaceutical compounds that are often targeted in the context of environmental research.18,19) In Japan, more than 6 million patients suffer from high blood pressure.20) The unequivocal detection of telmisartan appears quite conceivable. However, several compounds such as bicalutamide and 1,3-di-o-tolylguanidine have rarely been targeted. 1,3-Di-o-tolylguanidine is known not only as a vulcanization accelerator in rubber production but also as a ligand of the σ receptor in the central nervous system21) and designated in Monitoring Chemical Substances found in Japan’s Act on the Evaluation of Chemical Substances and Regulation of Their Manufacture, etc. Its annual manufactured and imported quantity in Japan was over 100 t from FY 2007 to FY 2009.22) Subsequent environmental research on these compounds is expected.
In Schymanski’s level system, several features remained at Level 5 (Exact mass). Several of them had a relationship like a constant m/z difference. Figure 4 shows an example of two homologous series ions with an m/z difference of 28.031 corresponding to CH2. Features P4, P8, P15, and P19 were included in these series. Although the m/z differences of 14.015 were found between the series, it seemed the two series had different structures because the retention time was not consecutive. These features were remarkable but it is unusual to keep a record of such features in scientific communication because their exact identification could not be ascertained. Even though their formulas could not be assigned, we believe that reciprocation of such data under assignment Level 5 is still meaningful for a better understanding of chemical substances in the environment.
In this study, nontarget analysis was conducted to identify anthropogenic compounds in an urban aquatic environment and two automated methods were evaluated using the results. Both MAGMa and MetFrag retrieved correct answers in most queries. They also usually retrieved many incorrect answers with the same score as the correct answer. Increasing the number of submitted fragment ions has not always improved results. Both tools appeared to be equal in their results in the present sample, but MAGMa offered a great advantage in terms of processing time.
To explain the fragment ions in a query, MAGMa and MetFrag use disconnecting bonds of numerous compounds in molecular structure databases. This procedure is impossible to conduct manually, and considerably reduces the burden on humans in reading fragmentation. Of course, that reason is not enough to recommend the procedure and it should be complemented from broader perspectives such as chemical reaction probability, isotope pattern, consistency between precursor structure and product structure, and specific neutral loss on substituent group or element, as described in the electronic supplementary material. In addition to the tools used here, many automated tools that include different perspectives are available. To complement legacy target analysis for environmental risk evaluation of chemical compounds, new methodology is anticipated. Such future development would also be a breakthrough in the field of environmental research.
This work was financially supported by JSPS KAKENHI Grant Number 26241026 and 26505011.