International Journal of World Medicine, 2025, 6(1); doi: 10.38007/IJWM.2025.060105.
Chun Chen
Quanzhou Huaguang Vocational College, Quanzhou 362121, Fujian, China
This paper systematically discusses the integrated application path of natural language processing (NLP) and knowledge graph (KG) in medical text mining and knowledge extraction. When unstructured text data such as electronic medical records and clinical records emerge rapidly, traditional text processing methods are unable to meet the needs of medical information understanding and reasoning. NLP can improve the depth of semantic understanding, while KG can provide support for structured knowledge. The combination of the two can achieve semantic enhancement, entity standardization and clinical reasoning. The article accurately analyzes the processing challenges caused by the professionalism, ambiguity and heterogeneity of medical texts, and also reviews pre-trained models such as BioBERT and MedGPT and their performance in medical entity recognition and relationship extraction. The author introduces a system architecture solution that integrates the "extraction-standardization-fusion-reasoning" process, including the establishment of multi-level knowledge graphs, GNN reasoning principles and cross-platform deployment methods. At the end of the article, three application scenarios, namely auxiliary diagnosis, clinical decision support and public health monitoring, are used to illustrate the effectiveness of the system in actual medical environments. This study has built a theoretical and practical foundation for the advancement of intelligent medical knowledge computing.
natural language processing, knowledge graph, medical text mining, medical entity recognition
Chun Chen. Integrated Application of Natural Language Processing and Knowledge Graph in Medical Text Mining and Knowledge Extraction. International Journal of World Medicine (2025), Vol. 6, Issue 1: 34-44. https://doi.org/10.38007/IJWM.2025.060105.
[1] Ma, K., & Shen, J. (2024). Interpretable Machine Learning Enhances Disease Prognosis: Applications on COVID-19 and Onward. arXiv preprint arXiv:2405.11672.
[2] Yang, **zhu. "Application of Multi-model Fusion Deep NLP System in Classification of Brain Tumor Follow-Up Image Reports." Cyber Security Intelligence and Analytics: The 6th International Conference on Cyber Security Intelligence and Analytics (CSIA 2024), Volume 1. Vol. 1351. Springer Nature, 2025.
[3] Jun Ye, Multimodal Medical Data Intelligent Classification Method and System Implementation Based on Improved SVM and Similarity Learning Algorithm, International Journal of World Medicine, 2025, 6(1),19-27
[4] Yang J. Research on the Strategy of MedKGGPT Model in Improving the Interpretability and Security of Large Language Models in the Medical Field[J]. Academic Journal of Medicine & Health Sciences, 5(9): 40-45.
[5] Jinshuo Zhang, Research on Intelligent Power Electronic Inverter Control System Based on Knowledge Base and Data Driven, Journal of Electrotechnology, Electrical Engineering and Management (2024), 7(3), 69-74
[6] Jun Ye, Design of a Non-Invasive Brain Computer Interface System for Handwritten Text Based on L2 Regularization and Attention Supervision Paradigm, and Optimization of EEG Signal Decoding, International Journal of Big Data Intelligent Technology, 2025, 6(1),126-134
[7] Shi C. Research on Gene Identification Algorithms Based on Signal Processing Techniques[C]//2024 6th International Conference on Artificial Intelligence and Computer Applications (ICAICA). IEEE, 2024: 72-77.
[8] Hui X. Medical Entity Recognition Based on Bidirectional LSTM-CRF and Natural Language Processing Technology and Its Application in Intelligent Consultation[J]. 2025, 6(1),1-8
[9] Ding M. Application of AI Technology in Improving Design Process Efficiency[J]. European Journal of AI, Computing & Informatics, 2025, 1(1): 101-106.
[10] Liu Z. Research on the Application of Signal Integration Model in Real-Time Response to Social Events[J]. Journal of Computer, Signal, and System Research, 2025, 2(2): 102-106.
[11] Shi, Chongwei. "Ovarian Hereditary Diseases: Progress in Prevention and Treatment and Research on Prenatal Diagnosis." Scientific Journal of Technology 7.2 (2025): 125-131.
[12] Li, X.(2025)“Research on the application of GPS, total station and CAD Technology in architectural Grid.” Computer Life (2024),12(3),36-39.
[13] Lu, Chuying, Research on Intelligent 3D Reconstruction System Integrating Transformer and Adaptive Point Cloud Registration, International Journal of Big Data Intelligent Technology, 2025, 6(1),101-108
[14] Bukun Ren, Efficient Multimodal Visual Segmentation Model Based on Phased Fusion of Differential Modalities, International Journal of Big Data Intelligent Technology, 2025, 6(1), 109-117
[15] Xiangtian Hui, Research on Medical Named Entity Recognition Technology Based on Prompt BioMRC Model for Deep NLP Algorithm, International Journal of Big Data Intelligent Technology, 2025, 6(1),118-125
[16] Zhu Z. Research on Parallel Execution Techniques for Improving the Expandability of Database Systems[J]. Journal of Computer, Signal, and System Research, 2025, 2(3): 69-74.
[17] Xuan Li, Research on integration technology of architectural CAD and budget estimate, Journal of Computing and Electronic Information Management, 2024, 15(1), 20-23
[18] Zhang M. Optimization of Medical Device Software Lifecycle Management Based on DevOps[J]. Journal of Medicine and Life Sciences, 2025, 1(3): 8-13.
[19] Shi, C. (2024). Research on the Application of Computer Technology in Biostatistics. Journal of Computing and Electronic Information Management.Vol. 14, No. 3, 2024,12-15
[20] Shi C. Research on Deep Learning Algorithms for Predicting DNA-Binding Proteins Based on Sequence Information[C]//2024 IEEE 2nd International Conference on Electrical, Automation and Computer Engineering (ICEACE). IEEE, 2024: 1566-1570.
[21] Zhou Y. Application of Anomaly Detection Mechanism in Large-Scale Data Processing[J]. European Journal of AI, Computing & Informatics, 2025, 1(1): 78-84.
[22] Shi, C. (2024). DNA Microarray Technology Principles and Applications in Genetic Research. Computer Life.Vol. 12, No. 3, 2024,19-24
[23] Liu, Boyang. "Study on the Frequency of Computer Language Use Based on Big Data Analysis." Academic Journal of Computing & Information Science 7.10 (2024): 55-59.
[24] Dong P. Research on the Application of Knowledge Graph-Driven Two-Dimensional Convolutional Embedding Methods in Recommender Systems[C]//2025 International Conference on Intelligent Systems and Computational Networks (ICISCN). IEEE, 2025: 1-6.
[25] Xingyu Liu, Exploration of Personalized User Experience Optimization in Virtual Reality with Deep Learning, International Journal of Finance and Investment, 2025, 2(2),15-19
[26] Liu Z. Application of Machine Learning in Financial Risk Classification and Account Verification Optimization Strategy[J]. Economics and Management Innovation, 2025, 2(2): 64-70.
[27] Chen J. Design and Implementation of a Personalized Recommendation System Based on Deep Learning Distributed Collaborative Filtering Algorithm on Social Media Platforms[C]//2025 3rd International Conference on Integrated Circuits and Communication Systems (ICICACS). IEEE, 2025: 1-5.
[28] Li, X. (2025). Research on Three-dimensional Modeling of Urban Buildings based on CityGM. Scientific Journal of Technology, 7(3), 302-306
[29] Gu Y. Javascript Code Simplification And Optimization Based On Hybrid Static and Dynamic Analysis Techniques[C]//2025 IEEE 14th International Conference on Communication Systems and Network Technologies (CSNT). IEEE, 2025: 826-833.
[30] Wang, Buqin. "Strategies and Practices for Load Test Optimization in Distributed Systems." Scientific Journal of Technology 7.2 (2025): 132-137.
[31] Xu, Qianru. "Practical Applications of Large Language Models in Enterprise-Level Applications." Journal of Computer Science and Artificial Intelligence 2.2 (2025): 17-21.
[32] Fan Y. Automatic Optimization of Trading Strategies Based on Reinforcement Learning[C]//2025 IEEE 14th International Conference on Communication Systems and Network Technologies (CSNT). IEEE, 2025: 59-64.
[33] Chen, H., Varatharajah, Y., de Ramirez, S. S., Arnold, P., Frankenberger, C., Hota, B., & Iyer, R. (2020). A retrospective longitudinal study of COVID-19 as seen by a large urban hospital in Chicago. medRxiv, 2020-11.
[34] Cao, Y., Cao, P., Chen, H., Kochendorfer, K. M., Trotter, A. B., Galanter, W. L., ... & Iyer, R. K. (2022). Predicting ICU admissions for hospitalized COVID-19 patients with a factor graph-based model. In Multimodal AI in healthcare: A paradigm shift in health intelligence (pp. 245-256). Cham: Springer International Publishing.
[35] Zhang J. Research on User Behavior Interest Feature Extraction and Accurate Advertising Recommendation Algorithm based on Deep Learning and Improved LDA Model[C]//2025 3rd International Conference on Integrated Circuits and Communication Systems (ICICACS). IEEE, 2025: 1-5.
[36] Varatharajah, Y., Chen, H., Trotter, A., & Iyer, R. K. (2020). A Dynamic Human-in-the-loop Recommender System for Evidence-based Clinical Staging of COVID-19. In HealthRecSys@ RecSys (pp. 21-22).
[37] Fan, Sunjia, et al. "Defense methods against multi-language and multi-intent LLM attacks." International Conference on Algorithms, High Performance Computing, and Artificial Intelligence (AHPCAI 2024). Vol. 13403. SPIE, 2024.
[38] Tan, Weiyan, Shujia Wu, and Ke Ma. "Freight Volume Prediction for Logistics Sorting Centers Using an Integrated GCN-BiLSTM-Transformer Model." Advances in Computer and Engineering Technology Research 1.4 (2024): 320-324
[39] Shanshan Feng, Ke Ma, Gongpin Cheng, Risk Evolution along the Oil and Gas Industry Chain: Insights from Text Mining Analysis, Finance Research Letters, 2025, 106813, ISSN 1544-6123
[40] Fan Y. Automatic Optimization of Trading Strategies Based on Reinforcement Learning[C]//2025 IEEE 14th International Conference on Communication Systems and Network Technologies (CSNT). IEEE, 2025: 59-64.
[41] Chen, H., Wang, Z., & Han, A. (2024). Guiding Ultrasound Breast Tumor Classification with Human-Specified Regions of Interest: A Differentiable Class Activation Map Approach. In 2024 IEEE Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium (UFFC-JS) (pp. 1-4). IEEE.
[42] Li, Bin. "Application of Data Analysis in Climate Policy in Environmental Planning." Frontiers in Science and Engineering 5.2 (2025): 106-112.
[43] Ma, K., Zhang, N., Mei, X., Feng, C., Hou, W., & Ye, Z. (2024, October). Research on Optimization of Shared Bicycle Scheduling Based on Genetic Algorithm and LSTM. In 2024 IEEE 6th International Conference on Civil Aviation Safety and Information Technology (ICCASIT) (pp. 936-940). IEEE.