International Journal of Art Innovation and Development, 2023, 4(1); doi: 10.38007/IJAID.2023.040108.
Yang Nan
Jinzhong University, Jinzhong, China
Philippine Christian University, Manila, Philippine
The main melodic information of music can be used for content-based audio retrieval, management as features of audio files and to meet the business needs of specific departments and scenarios. The purpose of this paper is to study music scene recognition methods and system design based on artificial intelligence. The overall system architecture design is integrated by analysing the current stage of music scene recognition, proposing a modelling approach for music scene recognition system for music scene recognition needs and a CNN-based music scene recognition approach. A music recognition module was implemented based on the music scene recognition framework, and the generic scene text recognition system was tested. The experimental results show that the average time consumed for model computational efficiency is 60ms and the cpu resource consumption is maintained at about 28%.
Artificial Intelligence, Music Scene, Scene Recognition, System Design
Yang Nan. The Design of a Music Scene Recognition Method and System Based on Artificial Intelligence. International Journal of Art Innovation and Development (2023), Vol. 4, Issue 1: 102-109. https://doi.org/10.38007/IJAID.2023.040108.
[1] Gwenaelle Cunha Sergio, Minho Lee. Scene2Wav: a deep convolutional sequence-to-conditional SampleRNN for emotional scene musicalization. Multim. Tools Appl. (2021) 80(2): 1793-1812.
[2] Bela Usabaev, Anna Eschenbacher, Angela Brennecke. The Virtual Theremin: Designing an Interactive Digital Music Instrument for Film Scene Scoring. I-com. (2022) 21(1): 109-121.
[3] Neeraj Gupta, Anand Singh Jalal. Traditional to transfer learning progression on scene text detection and recognition: a survey. Artif. Intell. Rev. (2022) 55(4): 3457-3502.
[4] Recep Sinan Tümen, T. Metin Sezgin. Segmentation and Recognition of Offline Sketch Scenes Using Dynamic Programming. IEEE Computer Graphics and Applications. (2022) 42(1): 56-72.
[5] T. Mithila, R. Arunprakash, A. Ramachandran. CNN and Fuzzy Rules Based Text Detection and Recognition from Natural Scenes. Comput. Syst. Sci. Eng. (2022) 42(3): 1165-1179.
[6] Veronica Naosekpam, Nilkanta Sahu. Text detection, recognition, and script identification in natural scene images: a Review. Int. J. Multim. Inf. Retr. (2022) 11(3): 291-314.
[7] Khosro Rezaee, Mohammad Reza Khosravi, Maryam Saberi Anari. Deep-Transfer-Learning-Based Abnormal Behavior Recognition Using Internet of Drones for Crowded Scenes. IEEE Internet Things Mag. (2022) 5(2): 41-44.
[8] Leena Mary Francis, N. Sreenath. Robust scene text recognition: Using manifold regularized Twin-Support Vector Machine. J. King Saud Univ. Comput. Inf. Sci. (2022) 34(3): 589-604.
[9] Riadh Harizi, Rim Walha, Fadoua Drira, Mourad Zaied. Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition. Multim. Tools Appl. (2022) 81(3): 3091-3106.
[10] Riadh Harizi, Rim Walha, Fadoua Drira, Mourad Zaied. Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition. Multim. Tools Appl. (2022) 81(3): 3091-3106.
[11] Fatemeh Naiemi, Vahid Ghods, Hassan Khalesi. Scene text detection and recognition: a survey. Multim. Tools Appl. (2022) 81(14): 20255-20290.
[12] Andreea Glavan, Estefanía Talavera. InstaIndoor and multi-modal deep learning for indoor scene recognition. Neural Comput. Appl. (2022) 34(9): 6861-6877.
[13] Nitika Nigam, Tanima Dutta, Hari Prabhat Gupta. FactorNet: Holistic Actor, Object, and Scene Factorization for Action Recognition in Videos. IEEE Trans. Circuits Syst. Video Technol. (2022) 32(3): 976-991.
[14] Cinu C. Kiliroor, S. Shrija, R. Ajay. Augmented Scene Text Recognition Using Crosswise Feature Extraction. Wirel. Pers. Commun. (2022) 123(1): 421-436.
[15] Payel Sengupta, Ayatullah Faruk Mollah. Journey of scene text components recognition: Progress and open issues. Multim. Tools Appl. (2021) 80(4): 6079-6104.
[16] L. T. Akin Sherly, T. Jaya. An efficient indoor scene character recognition using Bayesian interactive search algorithm-based adaboost-CNN classifier. Neural Comput. Appl. (2021) 33(22): 15345-15356.
[17] S. Chandrakala, S. L. Jayalakshmi. Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition. IEEE Trans. Multim. (2020) 22(1): 3-14.
[18] Anna Zhu, Seiichi Uchida. Scene word recognition from pieces to whole. Frontiers Comput. Sci. (2019) 13(2): 292-301.