International Journal of World Medicine, 2025, 6(1); doi: 10.38007/IJWM.2025.060103.
Jun Ye
Electrical and computer engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania, 15213, United States
Intelligent diagnosis and treatment play a crucial role in the era of medical big data, especially in processing multimodal electronic medical record data. This article focuses on the patient similarity classification task and proposes two effective classification methods to address challenges such as poor data integrity and diverse expressions. A patient similarity learning method based on multimodal summary extraction is proposed by utilizing the GPT model for TLDR summary extraction and combining it with an improved Support Vector Machine (SVM). A patient similarity learning method based on unstructured text approximate matching is proposed using Jieba segmentation, bag of words model, and improved Jaccard function, combined with XGBoost method. Based on the above method, a patient similarity classification system was designed and implemented, providing doctors with a tool for rapid diagnosis. These methods still have shortcomings, and future work will explore lightweight network architectures, design algorithms that consider dynamically adding patient features and time series, and integrate image classification and text classification to promote further development of intelligent diagnosis and treatment.
Intelligent diagnosis and treatment, multimodal electronic medical records, patient similarity classification, GPT model, XGBoost method
Jun Ye. Multimodal Medical Data Intelligent Classification Method and System Implementation Based on Improved SVM and Similarity Learning Algorithm. International Journal of World Medicine (2025), Vol. 6, Issue 1: 19-27. https://doi.org/10.38007/IJWM.2025.060103.
[1] Gu, Yiting. "Practical Approaches to Develo**High-performance Web Applications Based on React." Frontiers in Science and Engineering 5.2 (2025): 99-105.
[2] Shi, Chongwei. "Ovarian Hereditary Diseases: Progress in Prevention and Treatment and Research on Prenatal Diagnosis." Scientific Journal of Technology 7.2 (2025): 125-131.
[3] Shi, Chongwei. "Research on Gene Identification Algorithms Based on Signal Processing Techniques." 2024 6th International Conference on Artificial Intelligence and Computer Applications (ICAICA). IEEE, 2024.
[4] Liu, Boyang. "Study on the Frequency of Computer Language Use Based on Big Data Analysis." Academic Journal of Computing & Information Science 7.10 (2024).
[5] Yang J. Research on the Strategy of MedKGGPT Model in Improving the Interpretability and Security of Large Language Models in the Medical Field[J]. Academic Journal of Medicine & Health Sciences, 5(9): 40-45.
[6] Chen, H., Wang, Z., & Han, A. (2024). Guiding Ultrasound Breast Tumor Classification with Human-Specified Regions of Interest: A Differentiable Class Activation Map Approach. In 2024 IEEE Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium (UFFC-JS) (pp. 1-4). IEEE.
[7] Varatharajah, Y., Chen, H., Trotter, A., & Iyer, R. K. (2020). A Dynamic Human-in-the-loop Recommender System for Evidence-based Clinical Staging of COVID-19. In HealthRecSys@ RecSys (pp. 21-22).
[8] Cao, Y., Cao, P., Chen, H., Kochendorfer, K. M., Trotter, A. B., Galanter, W. L., ... & Iyer, R. K. (2022). Predicting ICU admissions for hospitalized COVID-19 patients with a factor graph-based model. In Multimodal AI in healthcare: A paradigm shift in health intelligence (pp. 245-256). Cham: Springer International Publishing.
[9] Chen, H., Zuo, J., Zhu, Y., Kabir, M. R., & Han, A. (2024). Generalizable Deep Learning for Pulse-echo Speed of Sound Imaging via Time-shift Maps. In 2024 IEEE Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium (UFFC-JS) (pp. 1-4). IEEE.
[10] Zhao F. Application and Performance Improvement of K-Means Algorithm in Collaborative[C]//2025 International Conference on Intelligent Systems and Computational Networks (ICISCN). IEEE, 2025: 1-6.
[11] Fan, Sunjia, et al. "Defense methods against multi-language and multi-intent LLM attacks." International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2024). Vol. 13403. SPIE, 2024.
[12] Maqsood S, Damaševičius R, Maskeliūnas R. Multi-modal brain tumor detection using deep neural network and multiclass SVM[J]. Medicina, 2022, 58(8): 1090.
[13] Zhu P. Construction and Experimental Verification[C]//Cyber Security Intelligence and Analytics: The 6th International Conference on Cyber Security Intelligence and Analytics (CSIA 2024), Volume 1. Springer Nature, 2025, 1351: 391.
[14] Dong P. Research on the Application of Knowledge Graph-Driven Two-Dimensional Convolutional Embedding Methods in Recommender Systems[C]//2025 International Conference on Intelligent Systems and Computational Networks (ICISCN). IEEE, 2025: 1-6.
[15] Amal S, Safarnejad L, Omiye J A, et al. Use of multi-modal data and machine learning to improve cardiovascular disease care[J]. Frontiers in cardiovascular medicine, 2022, 9: 840262.
[16] Abid M H, Ashraf R, Mahmood T, et al. Multi-modal medical image classification using deep residual network and genetic algorithm[J]. Plos one, 2023, 18(6): e0287786.
[17] Navaz A N, T. El-Kassabi H, Serhani M A, et al. A novel patient similarity network (PSN) framework based on multi-model deep learning for precision medicine[J]. Journal of Personalized Medicine, 2022, 12(5): 768.
[18] Kumar S, Rani S, Sharma S, et al. Multimodality Fusion Aspects of Medical Diagnosis: A Comprehensive Review[J]. Bioengineering, 2024, 11(12): 1233.
[19] Chen, H., Varatharajah, Y., de Ramirez, S. S., Arnold, P., Frankenberger, C., Hota, B., & Iyer, R. (2020). A retrospective longitudinal study of COVID-19 as seen by a large urban hospital in Chicago. medRxiv, 2020-11.
[20] Chen, H., Zhu, Y., Zuo, J., Kabir, M. R., & Han, A. (2024). TranSpeed: Transformer-based Generative Adversarial Network for Speed-of-sound Reconstruction in Pulse-echo Mode. In 2024 IEEE Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium (UFFC-JS) (pp. 1-4). IEEE.
[21] Yang, **zhu. "Application of Multi-model Fusion Deep NLP System in Classification of Brain Tumor Follow-Up Image Reports." Cyber Security Intelligence and Analytics: The 6th International Conference on Cyber Security Intelligence and Analytics (CSIA 2024), Volume 1. Vol. 1351. Springer Nature, 2025.
[22] Chen, H., Ma, K., & Shen, J. (2024). Interpretable Machine Learning Facilitates Disease Prognosis: Applications on COVID-19 and Onward. International Journal of Computer Science and Information Technology, 3(3), 428-436.
[23] Zhang, Yiru. "Design and Implementation of a Computer Network Log Analysis System Based on Big Data Analytics." Advances in Computer, Signals and Systems,(2024) 8(6),40-46.
[24] Liu, Yu. "Build an Audit Framework for Data Privacy Protection in Cloud Environment." Procedia Computer Science 247 (2024): 166-175.
[25] Liu, Boyang. "Design and Application of Experimental Data Management System Integrating Remote Monitoring and Historical Data Analysis." Journal of Electronics and Information Science 9.3 (2024): 160-167.
[26] Yang, Jinzhu "Integrated Application of LLM Model and Knowledge Graph in Medical Text Mining and Knowledge Extraction."Social Medicine and Health Management (2024), 5(2): 56-62
[27] Zhao, Fengyi. "Risk Assessment Model and Empirical Study of in Vitro Diagnostic Reagent Project Based on Analytic Hierarchy Process." International Journal of New Developments in Engineering and Society 8.5 (2024), 76-82
[28] Wang, Yuxin "Research on Intelligent Macro Image Recognition Algorithm of Oil Pipe Failure Based on Deep Learning." Journal of Image Processing Theory and Applications (2025), 8(1): 1-7
[29] Xu, Y. (2025). Research on Maiustream Web Database Development Technclogy. Journal of Computer Science and Artificial Intelligence, 2(2),29-32
[30] Zhao, Fengyi "Development Design and Signal Processing Algorithm Optimization of Traditional Chinese Medicine Pulse Acquisition System Based on CP301 Sensor." Advances in Computer, Signals and Systems (2024), 8(6): 106-111
[31] Shi C. Research on Deep Learning Algorithms for Predicting DNA-Binding Proteins Based on Sequence Information[C]//2024 IEEE 2nd International Conference on Electrical, Automation and Computer Engineering (ICEACE). IEEE, 2024: 1566-1570.
[32] Xu, Yue. "Research on Graph Network Social Recommendation Algorithm Based on AGRU-GNN." 2024 IEEE 4th International Conference on Data Science and Computer Application (ICDSCA). IEEE, 2024.
[33] Shi, C. (2024). Research on the Application of Computer Technology in Biostatistics. Journal of Computing and Electronic Information Management.Vol. 14, No. 3, 2024,12-15
[34] Wang Y. Design and Implementation of a General Data Collection System Architecture Based on Relational Database Technology[C]//The International Conference on Cyber Security Intelligence and Analytics. Cham: Springer Nature Switzerland, 2024: 561-572.
[35] Zhang, Jinshuo "Research on Real Time Condition Monitoring and Fault Warning System for Construction Machinery under Multi Source Heterogeneous Data Fusion." Journal of Engineering Mechanics and Machinery (2024), 9(2): 139-144