Article ID: CJ-24-0865
Recent advances in traditional “-omics” technologies have provided deeper insights into cardiovascular diseases through comprehensive molecular profiling. Accordingly, digitalomics has emerged as a novel transdisciplinary concept that integrates multimodal information with digitized physiological data, medical imaging, environmental data, electronic health records, and biometric data from wearables. This digitalomics-driven augmented multiomics approach can provide more precise personalized health risk assessments and optimization when combined with conventional multiomics approaches. Artificial intelligence and machine learning (AI/ML) technologies, alongside statistical methods, serve as key analytical tools in realizing this comprehensive framework. This review focuses on two promising AI/ML applications in cardiovascular medicine: digital phonocardiography (PCG) and AI text generators. Digital PCG uses AI/ML models to objectively analyze heart sounds and predict clinical parameters, potentially surpassing traditional auscultation capabilities. In addition, large language models, such as the generative pretrained transformer, have demonstrated remarkable performance in assessing medical knowledge, achieving accuracy rates exceeding 80% in medical licensing examinations, although issues remain regarding knowledge accuracy and safety. Current challenges to the implementation of these technologies include maintaining up-to-date medical knowledge and ensuring consistent accuracy of outputs, but ongoing developments in fine-tuning and retrieval-augmented generation show promise in addressing these challenges. Integration of AI/ML technologies into clinical practice, guided by appropriate validation and implementation strategies, may notably advance precision cardiovascular medicine through the digitalomics framework.
In recent years, traditional “-omics” technologies have transformed our understanding of cardiovascular disease mechanisms through comprehensive molecular profiling across multiple biological layers.1 Through integration of genome, transcriptome, proteome, metabolome, and microbiome data, researchers have elucidated complex pathophysiological pathways and identified novel therapeutic targets in cardiovascular medicine.2,3 Recent advances in single-cell multiomics approaches have unveiled previously unknown cellular heterogeneity in cardiac tissues and provided unprecedented insights into disease progression mechanisms, particularly in heart failure and atherosclerosis.4,5 Furthermore, the integration of epigenomics with genomics has enhanced our understanding of gene regulation in cardiovascular diseases, revealing crucial mechanisms in cardiac remodeling and vascular dysfunction.6
In addition to these conventional multiomics approaches, digitalomics has emerged as an innovative transdisciplinary concept that integrates multimodal digital data streams.7 These data streams include digitized physiological data, medical imaging, environmental data, electronic health records (e.g., clinical information from daily charts, laboratory findings, and annual health checkups), and biometric data from wearables, particularly mobile health (mHealth) and extended reality (XR) devices (Figure 1). The augmented multiomics approach aims to integrate these diverse digital elements with traditional multiomics approaches to provide more precise personalized health risk assessments and customized recommendations for health optimization. This digitalomics-driven approach requires incorporation of cutting-edge digital technologies, with artificial intelligence (AI) and machine learning (ML) serving as pivotal analytical tools along with conventional statistical methods (Figure 1).8 Notably, application of AI/ML to conventional multiomics datasets and/or wearable-derived biometric data can result in the development of more precise predictive models for cardiovascular outcomes and personalized therapeutic strategies, improving patient stratification and treatment selection.9,10 Of these digital technologies, this review focuses on AI/ML, which can prove invaluable in identifying novel biomarkers for early disease detection, monitoring treatment responses in various cardiovascular conditions, and achieving precision cardiovascular medicine.
Framework of digitalomics-driven augmented multiomics. AI, artificial intelligence; EHR, electronic health records; ML, machine learning; XR, extended reality.
AI can make accurate and advanced inferences from a vast amount of knowledge data.11 However, no single definition unequivocally captures this concept in its entirety. Currently, ML represents the forefront of AI used in healthcare.12–15 In particular, recent developments in computational resources and easy access to big data have considerably advanced the use and clinical application of AI/ML, including its subfield, deep learning.12 In medicine, the main objectives of AI are inference (e.g., prediction or classification) and generation. In this review, we selected two examples of applying AI/ML in medical settings: digital phonocardiography (PCG) and AI text generators.
Ever since René Laennec’s invention of the stethoscope in 1816, cardiac auscultation has remained a cornerstone of the physical examination in clinical medicine.16 However, auscultatory findings are highly subjective and influenced by the examiner’s expertise and experience.17,18 Digital PCG has emerged as a solution to objectively record acoustic data and graphically represent heart sound characteristics.19–21 Furthermore, simultaneous acquisition of electrocardiogram (ECG) data enables a more precise assessment of cardiac and valvular function.22 Despite PCG’s capability to objectively capture heart sounds, many clinicians still find it challenging to interpret the complex wavelet data and numerous features necessary for the accurate identification of murmurs and cardiac pathologies.23 Therefore, beyond objective recording capabilities, an automatic and reproducible PCG interpretation pipeline is essential to increase the clinical utility of this technology.
AI/ML approaches have emerged as promising solutions to these challenges, leveraging conventional heart sound features alongside PCG-derived waveform data.17 ML models can successfully identify patients with diastolic dysfunction,24 detect low left ventricular ejection fraction,25 and diagnose valvular heart diseases.26 Herein, we present an advanced PCG system incorporating an AI/ML model for prediction of B-type natriuretic peptide (BNP) concentrations and classification of valvular heart disease severity.
The digital PCG system, marketed as the “Super StethoScope” (AMI-SSS01 series) by AMI, Inc. (Kagoshima, Japan), has been developed to automatically predict BNP levels and classify valvular disease severity using auscultation sounds and ECG through a specialized AI/ML model.27 The system’s fundamental configuration integrates a bipolar ECG with 4 frequency band-classified phonocardiograms. In typical clinical practice, healthcare providers obtain waveforms from 1 to 5 chest locations, recording data for only 8 seconds from each site to evaluate features indicative of heart failure, valvular diseases, or other cardiac conditions. Figure 2A shows normal ECG/PCG waveforms, whereas Figure 2B shows waveform patterns characteristic of moderate aortic stenosis. The device maintains signal integrity across a frequency range of 20–600 Hz within 6 dB, minimizing attenuation and thereby facilitating detection of extra heart sounds, particularly the low-frequency S3 and S4 components.27
Wavelet data obtained from a digital phonocardiogram. (A) Normal control. (B) Representative case of aortic stenosis. ECG, electrocardiogram; PCG, phonocardiogram.
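To illustrate the kind of front-end preprocessing such a pass band implies, a zero-phase band-pass filter restricted to the cited 20–600 Hz range can be sketched in a few lines. This is a minimal sketch only: the device's actual filter design is proprietary, and the sampling rate, filter order, and synthetic test signal below are assumptions chosen purely for illustration.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt

def bandpass_heart_sounds(pcg, fs, low=20.0, high=600.0, order=4):
    """Zero-phase Butterworth band-pass filter for a PCG signal.

    Hypothetical preprocessing step; 20-600 Hz matches the pass band
    cited in the text, but all design choices here are illustrative.
    """
    sos = butter(order, [low, high], btype="bandpass", fs=fs, output="sos")
    # filtfilt (forward-backward) filtering avoids phase distortion,
    # preserving the timing of S1/S2 components
    return sosfiltfilt(sos, pcg)

# Synthetic example: an in-band 50 Hz tone (S3/S4-like low frequency)
# plus out-of-band 2 kHz noise
fs = 8000.0
t = np.arange(0, 1.0, 1.0 / fs)
signal = np.sin(2 * np.pi * 50 * t) + 0.5 * np.sin(2 * np.pi * 2000 * t)
filtered = bandpass_heart_sounds(signal, fs=fs)
```

After filtering, the low-frequency component survives while the 2 kHz noise is strongly attenuated, mirroring why a flat 20–600 Hz response matters for detecting faint S3 and S4 sounds.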
The wavelet analyzer, AMI’s proprietary ECG/PCG waveform analysis system, is specifically designed for heart sound analysis and visualization (Figure 3). In the frequency analysis, in addition to the raw ECG and PCG waveforms, the system uses both short-time Fourier transform spectrograms and continuous wavelet transform scalograms. This arrangement of heart sound waveforms in the time–frequency domain enables visual identification of acoustic features specific to various cardiac and valvular conditions. Furthermore, by generating and combining these acoustic features, AMI has been developing ML models to estimate a cardiac stress index (analogous to blood BNP levels) and predict valvular disease severity. The models primarily use 1-dimensional convolutional neural network (CNN) architectures, incorporating supervised learning and transfer learning, achieving high inference accuracy (Figure 4). A secure cloud-based service has already been launched, wherein input ECG and PCG data are processed through these AI/ML models to infer cardiac load indices and valvular disease severity, with the analyzed results then being reviewed by specialist physicians before being returned as reports on the wavelet analyzer platform. Currently, regulatory approval procedures are underway for the AI/ML diagnostic support functionality as a medical device intended for direct use by clinicians. The clinical performance of this PCG system has demonstrated the potential to surpass the traditional auscultation skills of physicians.27 We are also evaluating the ability of this AI medical device to predict “future” cardiovascular events (i.e., before their onset) based solely on auscultation sounds.
Example of acoustic sound visualization for frequency analysis by a wavelet analyzer. In the frequency analysis, in addition to the raw electrocardiogram (ECG) and phonocardiogram (PCG) waveforms, the system uses short-time Fourier transform (STFT) spectrograms and continuous wavelet transform (CWT) scalograms with appropriate transformation (logarithmic or linear scaling).
Prediction of the cardiac stress index (IndexCS) and severity of valvular dysfunction via digital phonocardiography and machine learning. (A) Overall prediction pipeline. (B) Prediction algorithm for the IndexCS. (C) Prediction algorithm for the aortic stenosis severity index (IndexAS). CNN, convolutional neural network; CWT, continuous wavelet transform; ECG, electrocardiogram; PCG, phonocardiogram; STFT, short-time Fourier transform; W.N.L., within normal limit.
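The time–frequency mapping underlying the spectrogram view can be sketched as follows. This is a minimal NumPy illustration of the general STFT idea only; the commercial analyzer's window, overlap, and scaling parameters are not public, and the short synthetic "heart-sound-like" burst below is purely hypothetical.

```python
import numpy as np

def stft_spectrogram(x, fs, nperseg=256, noverlap=128):
    """Short-time Fourier transform magnitude spectrogram (Hann window).

    A minimal sketch of the time-frequency representation the wavelet
    analyzer visualizes; parameters here are illustrative assumptions.
    """
    step = nperseg - noverlap
    window = np.hanning(nperseg)
    # Slide a tapered window along the signal and take the FFT of each frame
    frames = [x[i:i + nperseg] * window
              for i in range(0, len(x) - nperseg + 1, step)]
    spec = np.abs(np.fft.rfft(np.array(frames), axis=1)).T  # freq x time
    freqs = np.fft.rfftfreq(nperseg, 1.0 / fs)
    times = np.arange(len(frames)) * step / fs
    return freqs, times, spec

# Synthetic burst: a 50 Hz wave packet between 0.4 and 0.5 s,
# loosely mimicking a transient low-frequency heart sound
fs = 2000
t = np.arange(0, 1.0, 1.0 / fs)
x = np.where((t > 0.4) & (t < 0.5), np.sin(2 * np.pi * 50 * t), 0.0)
freqs, times, spec = stft_spectrogram(x, fs)
```

Plotting `spec` against `times` and `freqs` would localize the burst in both time and frequency, which is exactly what makes murmurs and extra heart sounds visually separable in such displays.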
Generative AI is a type of AI capable of creating new images, movies, text, information, or music using trained ML algorithms.28 It has become widespread through user-friendly web applications that process input and output in natural language, such as Midjourney for image generation and ChatGPT for text generation.29 In particular, ChatGPT, a generative pretrained transformer (GPT)-based chat-style web application launched in November 2022, has seen rapid and widespread adoption. In addition, various health services using large language models (LLM) have been emerging. However, this rapid development has occurred without sufficient time to discuss their application in medicine.30
At the core of these advances lies natural language processing (NLP), which encompasses technologies that enable computers to process, understand, and generate human language without requiring users to have specialized knowledge of programming languages such as Java or Python.31 The history of NLP dates back to 1966 with the chatbot ELIZA.32 However, a notable milestone was achieved in 2011, when IBM Watson, equipped with statistical analysis-based NLP capabilities, defeated human champions in the US quiz show “Jeopardy!”.33 NLP excels in various applications, including text correction, language translation, chatbot-based dialog, question-answering systems, and sentiment analysis.31
The ML algorithms historically used for NLP tasks include CNNs, which have notably increased accuracy in image analysis (including ECG interpretation), and recurrent neural networks (RNNs), which have been particularly used to increase machine translation accuracy.34 However, these neural networks struggled with long-distance dependencies (the ability to maintain contextual relationships between distant words), resulting in decreased accuracy in generating longer translations.35 They also experienced computational challenges with large input datasets because of the extensive time required for token probability calculations. The introduction of the attention mechanism marked a notable breakthrough, substantially enhancing the accuracy of long-text translation.36 The transformer architecture, introduced in 2017, further advanced this technology by incorporating self-attention, multihead attention, positional encoding, residual connections, and layer normalization.37 This innovation addressed the long-distance dependency limitations of RNNs while enabling parallel processing for reduced computation time, hence establishing itself as the foundation architecture for contemporary NLP language models.37 The transformer architecture spawned several derivative language models, notably Google’s bidirectional encoder representations from transformers (BERT),38 Text-to-Text Transfer Transformer (T5),39 and OpenAI’s GPT.
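The core operation introduced by the transformer, scaled dot-product attention, can be expressed compactly. The following minimal NumPy sketch follows the published formulation; the toy token embeddings are placeholder inputs, and real models add multihead projections, positional encoding, and learned weights on top of this primitive.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention (the transformer's core primitive).

    Every query attends to all keys simultaneously, which is what lets
    transformers capture long-distance dependencies in parallel, unlike
    the step-by-step sequential processing of RNNs.
    """
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    # Softmax over keys turns scores into attention weights per query
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V, weights

# Toy self-attention: 4 token embeddings of dimension 8, with Q = K = V
rng = np.random.default_rng(0)
tokens = rng.normal(size=(4, 8))
output, weights = scaled_dot_product_attention(tokens, tokens, tokens)
```

Because each row of `weights` spans every token position at once, the distance between two words no longer degrades their interaction, addressing the long-distance dependency limitation described above.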
GPT, developed by OpenAI, is an enhanced language model based on the transformer architecture. Although the original GPT delivered state-of-the-art performance in natural language inference, question answering, and text classification, it required task-specific fine-tuning with supervised data.40 Subsequent iterations (GPT-2 and GPT-3) eliminated this requirement.41,42 GPT-3, in particular, leveraged the scaling law43 by correlating model performance with parameter count, training data volume, and computational power, incorporating 175 billion parameters and training on approximately 1 trillion words using extensive GPU resources.41 It introduced in-context learning, allowing for task adaptation through example demonstrations without parameter updates, and formalized the concept of prompting to elicit desired responses. Despite these advances, challenges persisted, including inconsistency in long-text generation, physical law violations in responses, bias control in pretrained outputs, and management of harmful or hallucinated content.
InstructGPT (also known as GPT-3.5) was developed to address these challenges, focusing on content control issues.44 This model incorporated reinforcement learning from human feedback, using extensive human feedback for model training to generate outputs aligned with human intent and instructions. It successfully reduced harmful and hallucinated content while maintaining task performance accuracy.44 InstructGPT served as the foundation for ChatGPT’s initial release in November 2022. Subsequently, GPT-4, released in March 2023, expanded upon GPT-3.5 with an estimated 1.76 trillion parameters in its mixture-of-experts architecture.45 Currently, the model continues to evolve through iterations including GPT-4o and o1, with ongoing performance enhancements.46
ChatGPT has exhibited remarkable capabilities in appropriately responding to questions across various specialized fields without requiring task-specific fine-tuning, prompting extensive research into whether it can surpass human intellectual capability. To address this, studies focused on comparing LLM performance with that of human physicians, specifically using actual medical licensing examinations as benchmark datasets.
In this context, the pioneering study by Kung et al. evaluated ChatGPT’s (GPT-3.5) performance on the United States Medical Licensing Examination (USMLE Step 1, Step 2CK, and Step 3).47 They analyzed ChatGPT’s accuracy in answering 350 questions from the 2022 USMLE, excluding items containing images or tables that were unprocessable by the then-available version of ChatGPT. The question formats included free-response and multiple-choice questions. Although ChatGPT failed to achieve the passing threshold (∼60% accuracy) for Step 1, it achieved or approached passing scores for Step 2CK and Step 3.47 This achievement garnered considerable attention, because previous attempts at accurately answering medical questions typically required medicine-specific pretraining or fine-tuning. The fact that a general-purpose LLM without medical task-specific optimization could achieve such high accuracy was noteworthy.
Our research group also investigated the medical applications of GPT by evaluating ChatGPT’s (GPT-3.5 and GPT-4) performance on Japan’s National Medical Licensing Examination, analyzing answer accuracy and reasoning capabilities. For instance, when tested against Japan’s National Medical Licensing Examination, the ChatGPT-4 model achieved remarkable accuracy rates of 82.7% for essential questions and 77.2% for basic and clinical questions, surpassing the minimum passing requirements for human candidates.48 These performance metrics were comparable to those reported in contemporaneous studies.49,50 However, analysis of the incorrect responses revealed limitations in comprehensive medical knowledge, understanding of Japan-specific healthcare systems, and mathematical problem-solving capabilities.48 More concerning were instances wherein responses included contraindicated medical recommendations, highlighting potential clinical risks. These findings suggest that, at the time of our study, reliance on GPT-generated medical content without critical evaluation was hazardous for users who lacked sufficient medical knowledge or the ability to verify medical information.
Nevertheless, ever since the publication of our ChatGPT research, various approaches, such as fine-tuning and retrieval-augmented generation, have been developed and implemented to address these known limitations.51 Furthermore, GPT-4’s image recognition capability (GPT-4V) has demonstrated potential for application in image-based examinations and clinical diagnosis.52,53 These advances are steadily resolving the issues identified as AI text generators continue to rapidly evolve towards clinical implementation.
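The retrieval-augmented generation idea can be illustrated with a deliberately simplified sketch: documents from a trusted corpus are embedded, the most similar ones to a query are retrieved, and they are prepended to the prompt so the model answers from supplied context rather than parametric memory alone. Real systems use learned dense embeddings and an actual LLM; the bag-of-words embedding and the two-sentence knowledge base below are toy assumptions.

```python
import numpy as np

def embed(text, vocab):
    """Toy bag-of-words embedding; production RAG uses learned dense vectors."""
    words = text.lower().split()
    return np.array([words.count(w) for w in vocab], dtype=float)

def retrieve(query, documents, vocab, k=1):
    """Return the k documents most similar to the query (cosine similarity)."""
    q = embed(query, vocab)
    def cos(d):
        v = embed(d, vocab)
        denom = np.linalg.norm(q) * np.linalg.norm(v)
        return (q @ v) / denom if denom else 0.0
    return sorted(documents, key=cos, reverse=True)[:k]

# Hypothetical mini knowledge base; in practice this would be a curated,
# up-to-date medical corpus, and the prompt would be sent to an LLM
docs = [
    "aortic stenosis causes a systolic ejection murmur",
    "mitral regurgitation causes a holosystolic murmur",
]
vocab = sorted({w for d in docs for w in d.lower().split()})
query = "what murmur does aortic stenosis cause"
context = retrieve(query, docs, vocab, k=1)
prompt = f"Context: {context[0]}\nQuestion: {query}"
```

Grounding each answer in retrieved, maintainable source text is precisely how RAG mitigates the outdated-knowledge and hallucination issues noted above, because the corpus can be updated without retraining the model.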
In this article, we introduce promising applications of AI/ML technology in cardiovascular medicine, focusing on digital PCG and AI text generators. Such AI/ML models, including LLM, are not only capable of answering questions meant for human candidates but may also become invaluable partners in addressing unmet medical needs in healthcare.54 However, challenges currently persist in maintaining up-to-date specialized medical knowledge and ensuring the consistent accuracy of outputs. Despite these challenges, it is expected that these AI/ML technologies will soon demonstrate their clinical utility and safety through scientific validation, potentially becoming one of the foundation models for the proposed generalist medical AI.55,56 As mentioned at the beginning of this review, these technologies will likely play a crucial role in realizing the digitalomics concept.
In conclusion, digitalomics represents a novel transdisciplinary concept that integrates multimodal digital data with clinical, imaging, environmental, and personal health information to provide a comprehensive understanding of individual health backgrounds and risks. The incorporation of digitalomics into traditional multiomics approaches creates augmented multiomics, enabling more precise personalized health risk prediction and optimized therapeutic recommendations. AI/ML technology, in conjunction with statistical methods, serves as a fundamental digital analytical tool in achieving personalized cardiovascular health optimization through this augmented multiomics approach. This field continues to evolve rapidly, with numerous AI/ML-powered medical services already being implemented, demonstrating steady progress in clinical integration. Much as in the early days of now-indispensable infrastructure such as the Internet and smartphones, our role is not to resist this technological evolution. Instead, it is to properly understand and embrace AI/ML as a new partner in clinical practice. We can work towards creating a better future for cardiovascular medicine by maximizing the potential of these technologies through appropriate implementation.
This review is based on the presentation delivered at the Circulation Journal (CJ) - European Heart Journal (EHJ) joint session at the 88th Annual Meeting of the Japanese Circulation Society. The authors thank Toshihisa Anzai for organizing this session. The authors also express their gratitude to all their colleagues who have contributed to their biomedical research, with special thanks to Hajime Takeuchi and Shumpei Saito at AMI, Inc. During the preparation of this manuscript, the authors used Claude 3.5 Sonnet (LLM developed by Anthropic PBC, CA, USA) for English editing. In addition, English language assistance was provided by Enago’s English Editing service.
None.
A.N. and M.T. have received commissioned research grants from AMI, Inc.
All authors conceptualized the contents. A.N. researched, wrote, and edited all content in this review. All authors critically reviewed and agreed to the final version of the manuscript. A.N. is responsible for the integrity of the work as a whole.
Not applicable (no new data were analyzed in this review).