An Effective Approach to Mass Spectrometry Imaging Data Partitioning Using UMAP and k-Means Clustering

Shinichi Yamaguchi; Masaya Ikegawa

doi:10.5702/massspectrometry.A0174

Abstract

In this study, we propose an effective summarization method for mass spectrometry imaging (MSI) data and demonstrate its efficacy. The MSI data used in this study were obtained from thoracic tissue sections of mice, including the thymus. The thymus is a multi-lobed organ composed of cortical and medullary areas, playing a crucial role in T-cell differentiation. By applying MSI to the thoracic region, including the thymus, this study aims to comprehensively visualize changes in molecular localization and metabolic patterns across thoracic organs. MSI data are highly information-rich, making effective summarization and organization challenging. Therefore, we explored a method to organize and visualize the data based on either spatial or m/z values. Specifically, we employed Uniform Manifold Approximation and Projection (UMAP) to project m/z data into 3-dimensional space, followed by k-means clustering to divide it into multiple clusters. This approach enables detailed and comprehensive representation of diverse features. The objective of this study is to identify molecular localizations and patterns that conventional methods may overlook. Furthermore, experimental results demonstrated that the pseudo-color images generated using UMAP highlighted specific m/z values that significantly influence image characteristics. When focusing on thoracic data, spatial segmentation resulted in clearer color differentiation; however, molecular localizations corresponding to blood vessels were not observed. This finding confirms that m/z segmentation is more effective than spatial segmentation in discovering new molecular localizations.

1. INTRODUCTION

In recent years, mass spectrometry imaging (MSI) has emerged as a powerful technique for detailed visualization of molecular distributions within tissues and has been widely applied in various fields such as biomedicine, pharmacology, and materials science.^1–3) Since Geladi’s pioneering work prior to 1990 on multivariate image analysis, numerous approaches have been developed to handle the complex data generated by spectroscopic imaging techniques.^4–10) In the specific context of MSI, early applications using secondary ion mass spectrometry (MS),^11,12) laser ablation inductively coupled plasma MS (LA-ICP-MS),^13–15) and matrix-assisted laser desorption/ionization (MALDI)^16–18) have demonstrated the technique’s versatility across different sample types and analytical questions. The MSI technology possesses high spatial resolution and molecular specificity, allowing the simultaneous detection of various molecules within tissues. However, a single sample typically generates thousands of molecular signals, making the effective visualization of this vast amount of information a significant challenge.

Fonville et al. have proposed pseudo-color techniques for visualizing MSI data.¹⁹⁾ Additionally, dimensionality reduction methods such as t-distributed stochastic neighbor embedding (t-SNE)²⁰⁾ and uniform manifold approximation and projection (UMAP)²¹⁾ have been shown to be effective means of extracting important information from complex datasets.^22–25) Various clustering methods have also been proposed as summarization techniques for MSI data.^26–28) Despite these advancements, comprehensively visualizing MSI data as a single image remains challenging, and many important features may be overlooked. Therefore, more effective data analysis methods are needed.

In this study, we propose a new approach for analyzing MSI data. We use UMAP and k-means clustering to partition the MSI data in the m/z information direction and generate pseudo-color images for each partition. This method aims to improve the efficiency of MSI data analysis, allowing for detailed and comprehensive representation of diverse features and enabling more accurate analysis.

The MSI data used in this study were obtained from thoracic tissue sections of mice, including the thymus.²⁹⁾ The thymus is a multi-lobed organ composed of cortical and medullary areas, playing a crucial role in T-cell differentiation. During T-cell development, thymocytes traverse specific thymic compartments and interact with the microenvironments of the cortex and medulla.

This significance of this lies in proposing a new method for summarizing and visualizing MSI data, demonstrating its effectiveness with actual biological samples. By doing so, we aim to uncover new molecular localizations that conventional methods may overlook.

2. EXPERIMENTAL PROCEDURES

Four-week-old female ICR mice were administered 5 mg/kg of dexamethasone intraperitoneally. The experimental data from these mice have been detailed in a previous publication.²⁹⁾ In this study, we reused the MSI data obtained from these experiments to validate the proposed method.

For the acquisition of MSI data in negative ion mode, the tissue sections were coated with a 9-aminoacridine matrix. MSI was performed using an atmospheric pressure MALDI-quadrupole ion trap-time-of-flight-MS (Shimadzu Corporation, Kyoto, Japan).

The acquired MSI data were normalized using the total ion count, and 3000 peaks were extracted in order of intensity from the average spectrum to create a data matrix. IMAGEREVEAL MS Ver.1.30 (Shimadzu, Kyoto, Japan) was used for processing. This data matrix was analyzed using Python (3.10) (Supplementary Table 1).

2.1 Data processing and analysis

The overall workflow for MSI data processing and analysis is depicted in Fig. 1. IMAGEREVEAL MS Ver.1.30 and Python (3.10) were combined to meet the advanced analysis requirements of this study. IMAGEREVEAL MS Ver.1.30 provides numerous useful features for processing MSI data, such as creating data matrices, spatial projection of pixels, and generating pseudo-color images. However, it lacks the capability to project m/z data into 3 dimensions using UMAP and to apply k-means clustering for data partitioning. Therefore, Python (3.10), which offers these algorithms, was integrated into the analysis process.

Fig. 1. Data analysis pipeline. This figure illustrates the complete analytical workflow used in this study. The process begins with (1) data acquisition and MSI data preparation, where tissue sections are processed and mass spectrometry imaging data are collected. Next, (2) dimensionality reduction is performed using UMAP to project the high-dimensional m/z data into a 3-dimensional space, which is then visualized as a pseudo-color image (shown in the middle). The workflow continues with (3) m/z partitioning through a 3-stage binary division process, creating a hierarchical structure of 15 distinct m/z groups as shown in the tree diagram. These partitioned m/z values are then used for (4) image generation, producing 15 pseudo-color images that represent different molecular patterns within the tissue. Finally, (5) key discoveries are highlighted in the results section, where specific molecular localizations (such as blood vessels in the thymus and specific structures in the esophagus) are identified, as shown in the bottom images. The optical image (top) shows the H&E-stained tissue section for anatomical reference. H&E, hematoxylin and eosin; MSI, mass spectrometry imaging; UMAP, Uniform Manifold Approximation and Projection.

Specifically, the data matrix created using IMAGEREVEAL MS Ver.1.30 was transferred to the Python environment, where UMAP was used to project the m/z data into 3-dimensional (3D) space. Subsequently, k-means clustering was applied to the resulting 3D data to classify the data. This process enabled the identification of molecular localizations and patterns that were difficult to detect using conventional methods.

The parameters used for these algorithms are detailed in Supplementary Table 1. For UMAP, we used the umap-learn library (version 0.5.6) with 3 components for dimensionality reduction, 15 neighbors to balance local and global structure preservation, and a minimum distance of 0.1 to prevent excessive crowding of points. The Euclidean distance metric was selected as it is appropriate for the continuous nature of our intensity data. For k-means clustering, the scikit-learn library (version 1.4.2) was employed with 50 clusters for the pixel-direction analysis to capture the diversity of molecular patterns without over-partitioning. We used the k-means++ initialization method to ensure stable and reproducible results, with 10 initializations and a maximum of 300 iterations to guarantee convergence.

For pseudo-color image generation, the 3-dimensional UMAP coordinates were directly mapped to RGB values (0–255), and images were constructed with each pixel displaying the color of its corresponding cluster centroid. The PIL library was used to generate the final visualization.

The number of clusters (50) in this study merely determines the number of colors for visual representation and is not itself a subject of biological significance or analysis. This value was selected based on visual evaluation, as too few clusters result in insufficient color differentiation, while too many lead to difficulty in distinguishing similar colors. The value of 50 was empirically determined as the optimal number of colors for visual identification of spatial distribution patterns.

For the comparative principal component analysis (PCA) analysis, IMAGEREVEAL MS Ver.1.30 was used with 2 principal components and the Pareto scaling method. By combining the visual analysis capabilities of IMAGEREVEAL MS Ver.1.30 with the advanced numerical processing and statistical modeling provided by Python, we achieved efficient and precise analysis.

Python (3.10) was chosen for its stability and compatibility. By combining the visual analysis capabilities of IMAGEREVEAL MS Ver.1.30 with the advanced numerical processing and statistical modeling provided by Python, we achieved efficient and precise analysis.

2.2 Data visualization

For the visualization of data in the pixel direction, the dimensionality in the m/z direction was reduced to 3 dimensions using UMAP, and the position of each pixel in the 3D space was converted to RGB color coding to represent the pixel’s color. ^22,23) This conversion was performed by directly mapping the 3 UMAP dimensions to the R, G, and B color channels, respectively. Specifically, the 1st UMAP dimension (dim_1) was mapped to the red channel, the 2nd dimension (dim_2) to the green channel, and the 3rd dimension (dim_3) to the blue channel. Each dimension was scaled to the 0–255 range for standard RGB color representation using the formula: RGB_value = 255 × (UMAP_value − min_value)/(max_value − min_value), with higher values in each dimension resulting in more intense red, green, or blue components in the final color. This direct mapping approach preserves the relative distances between points in the UMAP space, allowing similar molecular profiles to appear as similar colors in the visualization.

Subsequently, k-means clustering was used to divide the pixels into 50 clusters, and the color of each cluster was assigned based on the centroid’s color to construct a pseudo-color image. In the m/z direction partitioning, we implemented a three-stage binary division process.

We adopted a 3-stage binary division method for m/z data segmentation. We chose this approach primarily because dividing into multiple clusters at once could potentially mix clusters that could not be distinguished in the overall UMAP projection. The binary division method allows us to separate m/z values into 2 groups based on the most significant differences at each stage, providing stable clustering results even without clear objective indicators for multiple divisions. Furthermore, the 3-stage hierarchical process enabled us to observe the molecular pattern separation process step by step, allowing detailed tracking of how each division reveals different molecular localizations.

As an example of spatial partitioning, the thymus was isolated, and the m/z data were reduced to 3 dimensions using UMAP to create a pseudo-color image. Additionally, for comparison, regions of interest (ROI) were set for each part, and PCA analysis was performed based on the average spectrum of these ROIs. All experiments were run on an Intel Xeon CPU W-2123 3.60 GHz machine with 64 GB RAM.

3. RESULTS

A series of pseudo-color images were obtained by segmenting the m/z data according to the hierarchical structure shown in Fig. 2. Figure 3A–3O shows the 15 pseudo-color images corresponding to each segment. Additionally, Fig. 4 shows the results of projecting the data into 3 dimensions using UMAP. In panels A–O of Fig. 4, the position of each measurement point (pixel) is represented by RGB values, while panel P displays the distribution of m/z values themselves, color-coded according to the 8 final segments from the 3rd division stage. Furthermore, a pseudo-color image limited to the thymus region was also created (Fig. 5). Note that the 1-mm scale bar shown in Fig. 3A applies to Figs. 3 and 4A–4O, but Fig. 5 is displayed at a different scale.

Fig. 2. Schematic diagram illustrating the 2-way division of m/z conducted in 3 stages. The 1st number in each pair indicates the number of divisions performed, while the 2nd number represents the sequential number of images created at each division stage. A value of 0 signifies the entire dataset. For example, “1-2” refers to the 2nd image generated from a single division of m/z, and “3-7” denotes the 7th image produced after 3 rounds of division.

Fig. 3. False color images. False-color images before and after division in the m/z direction. (A) to (O) correspond to the pairs of values shown in Fig. 2. (A) represents the unsegmented complete data (segment “0”), showing the overall molecular distribution with clear differentiation between peripheral tissues (pink/orange) and internal organs (green/blue). (B) (segment “1-1”) reveals unique white spots within the thymus (dotted box) that were identified as blood vessels through histological examination, a feature not visible in the original image (A). (C) (segment “1-2”) displays a pattern highly similar to the original unsegmented image (A), suggesting that the m/z values in this segment significantly contribute to the overall image characteristics. (D) (segment “2-1”) derives from segment “1-1” and similarly emphasizes internal structure details but with darker tones. (E) (segment “2-2”) notably shows the bladder clearly distinguished in cream color, while the green regions surrounding the thymus represent the thymic cortex. This distinct coloration prominently displays the molecular composition differences between various tissue types. (F) (segment “2-3”) closely resembles segment “1-2” and the original unsegmented image, confirming that these m/z values determine the major features of the image. (G) (segment “2-4”) shows lymph nodes and lung tissues with excellent contrast, clearly depicting the distribution patterns of molecules associated with these specific tissues. (H) (segment “3-1”) is a further division of segment “2-1” and shows characteristic magenta spots near the center, corresponding to specific molecular localization within the esophagus, later identified as associated with m/z 474.34. (I) (segment “3-2”) derives from segment “2-1” and shows similar coloration and patterns to (D), with subtle color differences in certain tissue regions. (J) (segment “3-3”) derives from segment “2-2” and displays a characteristic pattern with scattered yellow spots, suggesting localization of specific molecules. (K) (segment “3-4”) is a further division of segment “2-2” with predominantly blue tones emphasizing internal structures. (L) (segment “3-5”) derives from segment “"2-3” and maintains the features of the original image while showing color tone changes particularly in the thymus region. (M) (segment “3-6”) is a further division of segment “2-3” but shows remarkable similarity to the original unsegmented image (A) despite containing only 218 out of 3000 m/z values. This is a significant finding indicating that a small number of specific m/z values dominate the overall characteristics of the image. (N) (segment “3-7”) and (O) (segment “3-8”) derive from segment “2-4” and show different molecular distribution patterns while maintaining the basic features of their parent segment. Through this hierarchical division process, new molecular localization patterns are revealed at each stage, with (B) and (H) in particular visualizing fine structures that could not be detected by conventional methods. The scale bar in the upper left corner of (A) represents 1 mm.

Fig. 4. Data projected in 3 dimensions using UMAP. Panels A to O correspond to Figs. 2 and 3, showing each measurement point (pixel) projected into 3 dimensions with RGB assigned to each axis. (A) displays the complete dataset (segment “0”), showing a continuous distribution of diverse colors. (B) and (C) represent the 1st division results, with (B) (segment “1-1”) forming a concentrated cluster predominantly blue, containing small isolated points corresponding to the vascular structures in Fig. 3B, while (C) (segment “1-2”) shows a broad distribution similar to (A), explaining why Fig. 3C resembles the original image. In the 2nd division stage, (D) and (E) derive from (B), with (D) (segment “2-1”) characterized by an elongated shape, and (E) (segment “2-2”) showing an isolated cluster corresponding to the bladder in Fig. 3E. (F) and (G) result from the division of (C), with (F) (segment “2-3”) remarkably similar to (A), retaining the primary characteristics of the original data. In the final division stage, (H) (segment “3-1”) derives from (D) and displays a distinctive isolated magenta point separated from the main cluster, corresponding to the specific molecular localization in the esophagus in Fig. 3H. (I) (segment “3-2”) also derives from (D), showing an elongated distribution similar to its parent segment, explaining the visual similarity between Fig. 3I and 3D. (J) (segment “3-3”) and (K) (segment “3-4”) result from the division of (E), with (J) showing a distribution of mixed tones, and (K) characterized by a predominantly blue cluster with an isolated beige cluster corresponding to the cream-colored region in Fig. 3K. (L) (segment “3-5”) and (M) (segment “3-6”) derive from (F), with (L) showing its own distribution pattern, while (M) has a distribution very similar to (A) and (F), demonstrating that the few m/z values (218) in this segment determine the major characteristics of the original image. (N) (segment “3-7”) and (O) (segment “3-8”) derive from (G) and show distributions in different color tones while maintaining the basic spread of their parent segment. Panel P displays how the m/z values themselves are distributed in 3-dimensional UMAP space, color-coded according to the 8 segments from the 3rd division stage: 3-1 (gray), 3-2 (magenta), 3-3 (pink), 3-4 (orange), 3-5 (purple), 3-6 (red), 3-7 (green), and 3-8 (blue). The red segment (3-6) contains only 218 out of the total 3000 m/z values, yet it strongly contributes to the characteristics of the original image, clearly demonstrating that MSI data are dominated by a small number of characteristic m/z values. MSI, mass spectrometry imaging; UMAP, Uniform Manifold Approximation and Projection.

Fig. 5. False-color image limited to the thymus space. This figure shows a pseudo-color image generated by limiting the spatial data to the thymus region without partitioning the m/z data. Despite enhanced color differentiation within the thymus, the blood vessels visible in Fig. 3B remain undetected. The scale bar in the lower right corner represents 1 mm.

When visualizing the data without segmenting the m/z values (Fig. 3A, corresponding to segment “0” in Fig. 2), we confirmed that peripheral areas such as skin and muscle were clearly distinguishable from internal organs. After segmenting the m/z data in 3 stages, certain pseudo-color images (Fig. 3C, 3F, and 3M, corresponding to segments “1-2,” “2-3,” and “3-6” in Fig. 2, respectively) were observed to be similar to the original unsegmented image.

Segment “3-6” (Fig. 3M) contains only 218 m/z values (Supplementary Table 2) and is represented in red in Fig. 4P. The visual similarity between Figs. 3M and 3A suggests that the m/z values included in segment “3-6” strongly contribute to the characteristics of the original image.

The pseudo-color images obtained after segmentation revealed interesting localizations that were not observed before segmentation. For example, white spots were observed in the thymus in Fig. 3B (segment “1-1”), which were confirmed to be blood vessels by histological examination shown in Fig. 3P. Additionally, magenta spots were observed near the center in Fig. 3H (segment “3-1”).

To further investigate these magenta spots, we utilized the similar image extraction function of IMAGEREVEAL MS shown in Supplementary Fig. 4. By using the magenta region in segment “3-1” as a teacher image for partial least squares regression analysis, we identified m/z 474.34 as strongly associated with localization within the esophagus. Preliminary identification by METLIN³⁰⁾ suggested that this m/z value corresponds to the [M–H₂O–H] ion of C₂₅H₅₂NO₆P.

In the pseudo-color image where the spatial data were limited to the thymus without segmenting the m/z data (Fig. 5), the color differentiation became clearer compared to Fig. 3A; however, the localization corresponding to blood vessels observed in Fig. 3B was not detected.

4. DISCUSSION

4.1 Significance of m/z segmentation for feature discovery

This study demonstrates that pseudo-color images generated using UMAP are determined by a small number of characteristic m/z values. Specifically, we confirmed that 218 m/z values (approximately 7% of the total 3000) belonging to segment “3-6” determine the main features of the original unsegmented image.

The PCA loading plot analysis in Supplementary Fig. 2B further supports this observation. The m/z value group corresponding to segment “3-6” (shown in red in Fig. 4P) exhibits large variance in the PCA analysis, confirming its strong influence on the overall data characteristics. Consequently, conventional visualization methods may systematically overlook other important molecular information due to the influence of these dominant characteristic m/z values.

The generation of multiple similar images in our analysis (as seen in Figs. 3A, 3C, 3F, and 3M) is a direct consequence of this data characteristic. These similarities arise because certain m/z value clusters share dominant influence on the overall molecular distribution pattern. This repetition of similar visualization patterns across different segmentation steps further validates our finding that a small subset of m/z values strongly determines the image characteristics.

In our approach, after dimensionality reduction using UMAP, we employed k-means clustering to divide the m/z values into multiple clusters. The essence of this method lies in grouping m/z values that exhibit similar behavior and visualizing each group independently. The k-means segmentation allows us to separate other m/z value groups from the influence of the few m/z values that dominate the overall data characteristics, thereby revealing the unique molecular distribution patterns of each group.

4.1.1 Mathematical considerations for minor peak detection

From a mathematical perspective, UMAP alone struggles to highlight minor ion peaks because it preserves both the local and global structure of the high-dimensional data. When projecting the entire dataset, m/z values with high variance and abundance naturally dominate the projection, while minor peaks with limited spatial distribution become effectively “hidden” in the dimensionality reduction process, despite being potentially biologically significant.

K-means clustering proves effective for selecting these minor peaks because it partitions the m/z space based on similarities in spatial distribution patterns rather than absolute intensity. Within each cluster, m/z values share similar distribution patterns regardless of their absolute abundance. This separation enables visualization of molecular distributions that would otherwise be overshadowed by dominant m/z values. Mathematically, this works because k-means minimizes within-cluster variance while maximizing between-cluster variance, effectively isolating groups of m/z values with distinct spatial behaviors even when they represent a small fraction of the total signal.

The fact that blood vessels within the thymus and specific localizations within the esophagus were detectable only after m/z segmentation concretely demonstrates the effectiveness of this approach. These structures were clearly present in the data yet completely invisible in the pre-segmentation visualization.

4.1.2 Guidelines for selecting informative images

When selecting the most informative images from multiple k-means clustering results, we recommend the following guidelines:

1. Novelty assessment: Compare each segmented image with the original unsegmented image. Images that reveal structures or patterns not visible in the original image (such as Figs. 3B and 3H in our study) should be prioritized for further analysis.
2. Biological relevance: Evaluate segmented images in the context of known biological structures. Images showing distinct patterns that align with anatomical features deserve particular attention.
3. Cluster size consideration: Examine images generated from smaller clusters, as these often reveal minor but biologically significant molecular distributions that would otherwise be masked by dominant signals.
4. Complementary information: Select images that provide complementary rather than redundant information. For instance, in our study, Fig. 3M, 3F, and 3A show similar patterns and may be considered redundant, while Fig. 3B, 3E, and 3H each provide unique information.

4.2 Advantages over spatial segmentation

Our results clearly demonstrate that m/z segmentation is more effective than spatial segmentation for discovering new molecular localizations. When we limited the analysis to the thymus region (spatial segmentation), although the overall color differentiation improved, the blood vessel localization detected in the m/z-segmented image was not observed.

This phenomenon can be explained by our observation that pseudo-color images are determined by a small number of dominant m/z values. In spatial segmentation, the relative influence relationships of m/z values within the selected region remain unchanged, thus limiting the ability to reveal subtle molecular patterns.

On the other hand, m/z segmentation using the k-means method classifies the 3000 m/z values in the dataset into multiple groups based on similarity and visualizes the spatial distribution of each group independently. This reveals molecular patterns that were previously obscured by the few dominant m/z values, enabling the visualization of microregions such as blood vessels.

4.3 Molecular identification limitations

Regarding the localization observed in the esophagus (Fig. 3H), preliminary identification based on m/z value 474.34 suggested C₂₅H₅₂NO₆P. However, definitive molecular identification would require confirmation by tandem MS. As this study utilized archived MSI data from previous experiments, and the original samples are no longer available for additional analysis, our identification remains tentative and based solely on accurate mass matching.

4.4 Applications and future directions

The m/z segmentation approach proposed in this study has potential applications in various fields:

1. Biomedical research: More effective detection of subtle molecular differences could improve understanding of disease mechanisms and progression.
2. Pharmaceutical studies: Better visualization of drug distribution and metabolism in tissues could aid drug development and efficacy assessment.
3. Pathology: Detection of molecular changes that precede morphological changes could enable earlier disease diagnosis.

Future work should focus on:

1. Optimization of segmentation parameters: Determining the optimal UMAP dimensionality reduction parameters and k-means cluster numbers for different tissue types and experimental conditions.
2. Integration with other analytical techniques: Combining our MSI visualization approach with complementary techniques such as liquid chromatography-tandem MS for more definitive molecular identification.
3. Development of automated analysis pipelines: Creating user-friendly software tools that implement our approach to make it accessible to a broader range of researchers.

5. CONCLUSION

This study demonstrates that pseudo-color images generated using UMAP are predominantly determined by a small number of characteristic m/z values. Our analysis revealed that merely 218 m/z values (approximately 7% of the total 3000) strongly influence the characteristics of unsegmented MSI data visualizations. This finding has important implications for MSI data analysis, suggesting that conventional visualization methods may systematically overlook significant molecular features due to the dominance of these few m/z values.

Additionally, the pseudo-color images obtained through m/z segmentation using k-means clustering revealed molecular localizations that were completely overlooked by conventional methods. Specifically, blood vessels within the thymus and specific localizations within the esophagus were detected only after segmentation, despite being clearly present in the data. The identification of m/z 474.34 as a marker for esophageal localization through our targeted analysis further demonstrates the value of exploring segmented data.

On the other hand, when spatial data were limited to the thymus, although the color differentiation became clearer, the blood vessel localization observed in the m/z-segmented image was not detected. This observation confirms that spatial segmentation is less effective than m/z segmentation in discovering new molecular localizations, as it does not alter the relative influence of dominant m/z values within the selected region.

From these results, we conclude that m/z segmentation using k-means clustering after UMAP dimensionality reduction is essential for extracting more detailed and meaningful information from MSI data. By separating the influence of dominant m/z values and independently analyzing similar m/z groups, this approach enables the visualization of molecular patterns that would otherwise remain hidden. The importance of this method is emphasized for future research in various fields including biomedical research, pharmaceutical studies, and pathology, providing a foundation for deeper biological understanding and potential clinical applications.

ACKNOWLEDGMENTS

We deeply appreciate Assistant Professor Yudai Tsuji from Fujita Health University for his assistance in preparing the samples and obtaining the data.

Notes

Mass Spectrom (Tokyo) 2025; 14(1): A0174

REFERENCES

1) I. Marković, I. Lukić, M. Lukić, E. Pavičić, S. Mrđenović, A. Kotris, B. Dmitrović, Ž Debeljak. Differentiation of the chronic lymphocytic leukemia response to ibrutinib and acalabrutinib treatment by single-cell MALDI-TOF MS imaging. J. Pharm. Biomed. Anal. 255: 116664, 2025.
2) P. Chen, B. Chen, M. He, Y. Zhou, L. Lei, J. Han, B. Zhou, L. Hu, B. Hu. Nanoplastics and nano-ZnO facilitate Cd accumulation in zebrafish larvae via a distinct pathway: Revelation by LA-ICP-MS imaging. Chin. Chem. Lett. 36: 109908, 2025.
3) J. Doungchawee, L. J. Castellanos-García, K. N. Sikora, X. Zhang, Y. Liu, D. K. Agrohia, T. Jantarat, J. D. Lauterbach, V. M. Rotello, R. W. Vachet. Mass spectrometry imaging reveals that gold nanomaterial surface chemistry affects transport and excretion in mice over time. Chemical & Biomedical Imaging: 4c0082, 2025.
4) P. Geladi, B. Sethson, J. Nyström, T. Lillhonga, T. Lestander, J. Burger. Chemometrics in spectroscopy: Part 2. Examples. Spectrochim. Acta B At. Spectrosc. 59: 1347–1357, 2004.
5) P. Geladi. Chemometrics in spectroscopy. Part 1. Classical chemometrics. Spectrochim. Acta B At. Spectrosc. 58: 767–782, 2003.
6) P. Geladi, K. Esbensen. Can image analysis provide information useful in chemistry? J. Chemometr. 3: 419–429, 1989.
7) P. Geladi, H. Grahn. in Multivariate Analysis in the Pharmaceutical Industry, Elsevier, 2018, pp. 13–34.
8) P. Geladi, H. Grahn, K. Esbensen, E. Bengtsson. Image analysis in chemistry II. Multivariate image analysis. Trends Analyt. Chem. 11: 121–130, 1992.
9) P. Geladi, H. F. Grahn. Multivariate Image Analysis, R. A. Meyers, John Wiley & Sons Ltd, Hoboken, New Jersey, USA, 2006.
10) P. Geladi, H. Grahn, K. H. Esbensen. in Partial Least Squares Path Modeling (Ed: H. Latan, R. Noonan), Springer International Publishing, Cham, 2017, pp. 339–365.
11) P. Fragu, J. Clerc, C. Briançon, C. Fourré, J. Jeusset, S. Halpern. Recent developments in medical applications of SIMS microscopy. Micron 25: 361–370, 1994.
12) P. Fragu, C. Briançon, C. Fourré, J. Clerc, O. Casiraghi, J. Jeusset, F. Omri, S. Halpern. SIMS microscopy in the biomedical field. Biol. Cell 74: 5–18, 1992.
13) J. S. Becker, M. V. Zoriy, C. Pickhardt, N. Palomero-Gallagher, K. Zilles. Imaging of copper, zinc, and other elements in thin section of human brain samples (hippocampus) by laser ablation inductively coupled plasma mass spectrometry. Anal. Chem. 77: 3208–3216, 2005.
14) J. S. Becker, M. V. Zoriy, M. Dehnhardt, C. Pickhardt, K. Zilles. Copper, zinc, phosphorus and sulfur distribution in thin section of rat brain tissues measured by laser ablation inductively coupled plasma mass spectrometry: Possibility for small-size tumor analysis. J. Anal. At. Spectrom. 20: 912–917, 2005.
15) M. V. Zoriy, M. Dehnhardt, G. Reifenberger, K. Zilles, J. S. Becker. Imaging of Cu, Zn, Pb and U in human brain tumor resections by laser ablation inductively coupled plasma mass spectrometry. Int. J. Mass Spectrom. 257: 27–33, 2006.
16) R. M. Caprioli, T. B. Farmer, J. Gile. Molecular imaging of biological samples: Localization of peptides and proteins using MALDI-TOF MS. Anal. Chem. 69: 4751–4760, 1997.
17) R. W. Garden, J. V. Sweedler. Heterogeneity within MALDI samples as revealed by mass spectrometric imaging. Anal. Chem. 72: 30–36, 2000.
18) S. A. Schwartz, R. J. Weil, M. D. Johnson, S. A. Toms, R. M. Caprioli. Protein profiling in brain tumors using mass spectrometry: Feasibility of a new technique for the analysis of protein expression. Clin. Cancer Res. 10: 981–987, 2004.
19) J. M. Fonville, C. L. Carter, L. Pizarro, R. T. Steven, A. D. Palmer, R. L. Griffiths, P. F. Lalor, J. C. Lindon, J. K. Nicholson, E. Holmes, J. Bunch. Hyperspectral visualization of mass spectrometry imaging data. Anal. Chem. 85: 1415–1423, 2013.
20) L. van der Maaten, G. Hinton. Visualizing data using t-SNE. J. Mach. Learn. Res. 9: 2579–2605, 2008
21) L. McInnes, J. Healy, J. Melville. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. 2018.
22) T. Smets, T. De Keyser, T. Tousseyn, E. Waelkens, B. De Moor. Correspondence-aware manifold learning for microscopic and spatial omics imaging: A novel data fusion method bringing mass spectrometry imaging to a cellular resolution. Anal. Chem. 93: 3452–3460, 2021.
23) T. Smets, N. Verbeeck, M. Claesen, A. Asperger, G. Griffioen, T. Tousseyn, W. Waelput, E. Waelkens, B. De Moor. Evaluation of distance metrics and spatial autocorrelation in uniform manifold approximation and projection applied to mass spectrometry imaging data. Anal. Chem. 91: 5706–5714, 2019.
24) T. Smets, E. Waelkens, B. De Moor. Prioritization of m/z-values in mass spectrometry imaging profiles obtained using uniform manifold approximation and projection for dimensionality reduction. Anal. Chem. 92: 5240–5248, 2020.
25) W. Wu, J. Hou, Z. Zhang, F. Li, R. Zhang, L. Gao, H. Ni, T. Zhang, H. Long, M. Lei, B. Shen, J. Yan, R. Huang, Z. Zeng, W. Wu. Information entropy-based strategy for the quantitative evaluation of extensive hyperspectral images to better unveil spatial heterogeneity in mass spectrometry imaging. Anal. Chem. 94: 10355–10366, 2022.
26) L. Guo, X. Liu, C. Zhao, Z. Hu, X. Xu, K. K. Cheng, P. Zhou, Y. Xiao, M. Shah, J. Xu, J. Dong, Z. Cai. iSegMSI: An interactive strategy to improve spatial segmentation of mass spectrometry imaging data. Anal. Chem. 94: 14522–14529, 2022.
27) T. Alexandrov. MALDI imaging mass spectrometry: Statistical data analysis and current computational challenges. BMC Bioinformatics 13(Suppl 16): S11, 2012.
28) H. Hu, R. Yin, H. M. Brown, J. Laskin. Spatial segmentation of mass spectrometry imaging data by combining multivariate clustering and univariate thresholding. Anal. Chem. 93: 3477–3485, 2021.
29) Y. Tsuji, S. Yamaguchi, T. Nakamura, M. Ikegawa. Mass spectrometry imaging (MSI) delineates thymus-centric metabolism in vivo as an effect of systemic administration of dexamethasone. Appl. Sci. (Basel) 11: 11038, 2021.
30) C. Guijas, J. R. Montenegro-Burke, X. Domingo-Almenara, A. Palermo, B. Warth, G. Hermann, G. Koellensperger, T. Huan, W. Uritboonthai, A. E. Aisporna, D. W. Wolan, M. E. Spilker, H. P. Benton, G. Siuzdak. METLIN: A technology platform for identifying knowns and unknowns. Anal. Chem. 90: 3156–3164, 2018.

Corresponding author

Register with J-STAGE for free!