International Journal of Multimedia Computing, 2025, 6(1); doi: 10.38007/IJMC.2025.060114.
Chuying Lu
University of Michigan, University of Michigan, Michigan 48109, USA
High-resolution remote sensing images play an important role in urban decision-making, natural resource supervision and environmental assessment, etc. To meet the demands of such fields, deep detection methods such as FasterRCNN and YOLOv5 were deeply analyzed. Multi-scale feature fusion was achieved using FPN, attention expression was enhanced through Transformer and CBAM, and detection efficiency was improved through MobileNet and pruning techniques. In terms of segmentation, the semantic description of spatial information is highlighted by using dilated convolution and jump connection. Multimodal fusion can improve the recognition accuracy of complex ground objects. The semantic model ability is enhanced through the Transformer module. Finally, a new edge detection branch is proposed to improve the contour extraction effect.
High-resolution remote sensing; Object detection; Image segmentation
Chuying Lu. Object Detection and Image Segmentation Algorithm Optimization in High-Resolution Remote Sensing Images. International Journal of Multimedia Computing (2025), Vol. 6, Issue 1: 144-151. https://doi.org/10.38007/IJMC.2025.060114.
[1] Gong H, Sun Q, Fang C, et al. TreeDetector: Using Deep Learning for the Localization and Reconstruction of Urban Trees from High-Resolution Remote Sensing Images. Remote Sensing, 2024, 16(3):22.
[2] Wang M, Shen L. High-Resolution Remote Sensing Imagery for the Recognition of Traditional Villages. Journal of Architectural Research and Development, 2024, 8(1):75-83.
[3] Sun Y, Chen J, Huang X Z H. Multi-Level Perceptual Network for Urban Building Extraction from High-Resolution Remote Sensing Images. Photogrammetric Engineering & Remote Sensing: Journal of the American Society of Photogrammetry, 2023, 89(7):427-434.
[4] Fan L, Zeng C, Li Y, et al. GRC-Net: Fusing GAT-Based 4D Radar and Camera for 3D Object Detection. SAE International Journal of Advances and Current Practices in Mobility, 2024(5):6.
[5] Cuevas E, Héctor Becerra, Luque A, et al. Fast multi-feature image segmentation. Applied Mathematical Modelling, 2021, 90(5):742-757.
[6] Zou, Y. (2025). Design and Implementation of a Cloud Computing Security Assessment Model Based on Hierarchical Analysis and Fuzzy Comprehensive Evaluation. arXiv preprint arXiv:2511. 05049.
[7] Su H, Luo W, Mehdad Y, et al. Llm-friendly knowledge representation for customer support[C]//Proceedings of the 31st International Conference on Computational Linguistics: Industry Track. 2025: 496-504.
[8] Liu, B. (2025). Design and Implementation of Data Acquisition and Analysis System for Programming Debugging Process Based on VS Code Plug-In. arXiv preprint arXiv: 2511. 05825.
[9] Zhu, P. (2025). The Role and Mechanism of Deep Statistical Machine Learning In Biological Target Screening and Immune Microenvironment Regulation of Asthma. arXiv preprint arXiv:2511. 05904.
[10] Sun, Jiahe. "Research on Sentiment Analysis Based on Multi-source Data Fusion and Pre-trained Model Optimization in Quantitative Finance.” (2025).
[11] Chang, Chen-Wei. "Compiling Declarative Privacy Policies into Runtime Enforcement for Cloud and Web Infrastructure.” (2025).
[12] F. Liu, "Transformer XL Long Range Dependency Modeling and Dynamic Growth Prediction Algorithm for E-Commerce User Behavior Sequence,” 2025 2nd International Conference on Intelligent Algorithms for Computational Intelligence Systems (IACIS), Hassan, India, 2025, pp. 1-6.
[13] F. Liu, "Architecture and Algorithm Optimization of Realtime User Behavior Analysis System for Ecommerce Based on Distributed Stream Computing, " 2025 International Conference on Intelligent Communication Networks and Computational Techniques (ICICNCT), Bidar, India, 2025, pp. 1-8.
[14] Q. Hu, "Research on Dynamic Identification and Prediction Model of Tax Fraud Based on Deep Learning,” 2025 2nd International Conference on Intelligent Algorithms for Computational Intelligence Systems (IACIS), Hassan, India, 2025, pp. 1-6.
[15] D. Shen, "Complex Pattern Recognition and Clinical Application of Artificial Intelligence in Medical Imaging Diagnosis,” 2025 International Conference on Intelligent Communication Networks and Computational Techniques (ICICNCT), Bidar, India, 2025, pp. 1-8.
[16] Wu Y. Optimization of Generative AI Intelligent Interaction System Based on Adversarial Attack Defense and Content Controllable Generation. 2025.
[17] Sun J. Quantile Regression Study on the Impact of Investor Sentiment on Financial Credit from the Perspective of Behavioral Finance. 2025.
[18] Wang Y. Application of Data Completion and Full Lifecycle Cost Optimization Integrating Artificial Intelligence in Supply Chain. 2025.
[19] Chen M. Research on Automated Risk Detection Methods in Machine Learning Integrating Privacy Computing. 2025.
[20] Wei, X. (2025). Deployment of Natural Language Processing Technology as a Service and Front-End Visualization. International Journal of Engineering Advances, 2(3), 117-123.