Welcome to Scholar Publishing Group

Machine Learning Theory and Practice, 2021, 2(1); doi: 10.38007/ML.2021.020104.

Machine Learning-Based Study on the Identification of Misstatements in Annual Reports of Listed Companies - A Financial Restatement Prediction Perspective

Author(s)

Ga dan Cai Rang

Corresponding Author:
Ga dan Cai Rang
Affiliation(s)

Qinghai Normal University, Qinghai, China

Abstract

The significance and importance of financial reporting as the basis and foundation for relevant decision makers to make judgments cannot be overstated, and both capital market participants and regulators attach great importance to the disclosure of financial reports. Although relevant regulatory authorities such as China's Securities Regulatory Commission and Accounting Standards Board have issued corresponding standards and regulations to strictly regulate the disclosure behaviour of companies' financial reports, due to insufficient supervision or relatively low costs of non-compliance, listed companies have committed FF in violation of relevant laws and regulations in order to preserve their own interests, causing unmeasurable losses to stakeholders. The main objective of this paper is to launch a study on the identification of misstatements in listed companies' annual reports based on ML under the perspective of financial restatement prediction. Based on the research on FF patterns and FFI in the first part, we firstly select the set of primary features based on the theory of FF motive, then perform Mann-Whitney test on the set of primary features to obtain the set of original features, and then use Bortua algorithm to select the final set of FI features from the set of original features. The data of the fraud samples and non-fraud samples were loaded into the four types of financial fraud identification models (FFIM) in the order of the original FI features and the FI features constructed by the BA. The model identification results showed that the combination of the FFI features constructed by the BA and the RFM had a good identification effect, and the overall evaluation indexes of G-mean and F-value were 75.9% and 78.3% respectively.

Keywords

ML, Listed Companies, Annual Report Misstatement Identification, Financial Restatement Prediction

Cite This Paper

Ga dan Cai Rang. Machine Learning-Based Study on the Identification of Misstatements in Annual Reports of Listed Companies - A Financial Restatement Prediction Perspective. Machine Learning Theory and Practice (2021), Vol. 2, Issue 1: 26-37. https://doi.org/10.38007/ML.2021.020104.

References

[1] Jay Henderson, Tanya R. Jonker, Edward Lank, Daniel Wigdor, Ben Lafreniere: Investigating Cross-Modal Approaches for Evaluating Error Acceptability of a Recognition-Based Input Technique. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6(1): 15:1-15:24 (2020).Candace E. Peacock, Ben Lafreniere, Ting Zhang, Stephanie Santosa, Hrvoje Benko, Tanya R. Jonker: Gaze as an Indicator of Input Recognition Errors. Proc. ACM Hum. Comput. Interact. 6(ETRA): 1-18 (2020). https://doi.org/10.1145/3517262

[2] Thomas C. Omer, Marjorie K. Shelley, Frances M. Tice: Do Director Networks Matter for Financial Reporting Quality? Evidence from Audit Committee Connectedness and Restatements. Manag. Sci. 66(8): 3361-3388 (2020). https://doi.org/10.1287/mnsc.2019.3331

[3] Recep Sinan Arslan, Necaattin Barisçi, Nursal Arici, Sabri Koçer: Detecting and correcting automatic speech recognition errors with a new model. Turkish J. Electr. Eng. Comput. Sci. 29(5): 2298-2311 (2021). https://doi.org/10.3906/elk-2010-117

[4] Rebecca Brooke Bays, Mary Ann Foley, Annelise Cohen: Is it all in the details? Description content and false recognition errors. Cogn. Process. 21(2): 185-196 (2020). https://doi.org/10.1007/s10339-019-00945-8

[5] A. B. Dhivya, M. Sundaresan: Tablet identification using support vector machine based text recognition and error correction by enhanced n-grams algorithm. IET Image Process. 14(7): 1366-1372 (2020). https://doi.org/10.1049/iet-ipr.2019.0993

[6] Farhad Abedini, Mohammad Reza Keyvanpour, Mohammad Bagher Menhaj: Correction Tower: A General Embedding Method of the Error Recognition for the Knowledge Graph Correction. Int. J. Pattern Recognit. Artif. Intell. 34(10): 2059034:1-2059034:38 (2020). https://doi.org/10.1142/S021800142059034X

[7] Nacereddine Hammami, Isah Abdullahi Lawal, Mouldi Bedda, Nadir Farah: Recognition of Arabic speech sound error in children. Int. J. Speech Technol. 23(3): 705-711 (2020). https://doi.org/10.1007/s10772-020-09746-3

[8] Roghayeh Mojarad, Ferhat Attal, Abdelghani Chibani, Yacine Amirat: Automatic Classification Error Detection and Correction for Robust Human Activity Recognition. IEEE Robotics Autom. Lett. 5(2): 2208-2215 (2020). https://doi.org/10.1109/LRA.2020.2970667

[9] Se-In Jang, Geok-Choo Tan, Kar-Ann Toh, Andrew Beng Jin Teoh: Online Heterogeneous Face Recognition Based on Total-Error-Rate Minimization. IEEE Trans. Syst. Man Cybern. Syst. 50(4): 1286-1299 (2020). https://doi.org/10.1109/TSMC.2017.2724761

[10] Leandro Miranda, José Viterbo, Flávia Bernardini: A survey on the use of ML methods in context-aware middlewares for human activity recognition. Artif. Intell. Rev. 55(4): 3369-3400 (2020). https://doi.org/10.1007/s10462-021-10094-0

[11] Esma Uzunhisarcikli, Erhan Kavuncuoglu, Ahmet Turan Özdemir: Investigating classification performance of hybrid deep learning and ML architectures on activity recognition. Comput. Intell. 38(4): 1402-1449 (2020). https://doi.org/10.1111/coin.12517

[12] Syed Saqib Raza Rizvi, Muhammad Adnan Khan, Sagheer Abbas, Muhammad AsadUllah, Nida Anwer, Areej Fatima: Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages. Comput. J. 65(2): 331-344 (2020). https://doi.org/10.1093/comjnl/bxaa042

[13] Miroslav Stampar, Kresimir Fertalj: Applied ML in recognition of DGA domain names. Comput. Sci. Inf. Syst. 19(1): 205-227 (2020). https://doi.org/10.2298/CSIS210104046S

[14] Saswati Debnath, Pinki Roy: Audio-visual speech recognition based on ML approach. Int. J. Adv. Intell. Paradigms 21(3/4): 211-224 (2020). 

[15] Leila Boussaad, Aldjia Boucetta: Extreme Learning Machine-Based Age-Invariant Face Recognition with Deep Convolutional Descriptors. Int. J. Appl. Metaheuristic Comput. 13(1): 1-18 (2020). https://doi.org/10.4018/IJAMC.290540

[16] R. Vinodini, M. Karnan: Face detection and recognition system based on hybrid statistical, ML and nature-based computing. Int. J. Biom. 14(1): 3-19 (2020). 

[17] Soumia Faouci, Djamel Gaceb, Mohammed Haddad: Offline Arabic handwritten character recognition: from conventional ML system to deep learning approaches. Int. J. Comput. Sci. Eng. 25(4): 385-398 (2020).

[18] Maria Chiara Fastame: Are subjective cognitive complaints associated with executive functions and mental health of older adults? Cogn. Process. 23(3): 503-512 (2020).