International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 08 | Aug 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1492
A Comparative Study to Detect Fraud Financial Statement using Data
Mining and Machine Learning Algorithms
Ankita Sharma
1
, Mayank Patel
2
, Manish Tiwari
3
1
M.Tech. Student(IV sem) , Dept. of Computer Science, GITS, Udaipur, Rajasthan ,India
2
Professor, Dept. of Computer Science, GITS, Udaipur, Rajasthan, India
3
Professor, Dept. of Computer Science, GITS, Udaipur, Rajasthan, India
----------------------------------------------------------------------***---------------------------------------------------------------------
Abstract — Financial frauds are increasing day by day within the world. The enterprise acknowledges the problem
and is just now beginning to act. Many data mining algorithms are developed that permit one to extract relevant
data from a large quantity of information like false financial statements. In recent years, machine learning has
developed and received major attention in the predictive analytics in audit analysis A careful reading of the financial
statements will indicate whether or not the corporate is running smoothly or is in crisis. Financial statements are
records of economic flows of a business. Generally, they include balance sheets, income statements, cash flow
statements, statements of retained earnings, and a few different statements. during a shell, the financial statements
are the mirrors of a company‘s financial status. This paper contains the analysis of various classification techniques
to detect frauds, can sometimes find ways in which to avoid such measures and compares classification techniques
and produce the result based on accuracy level.
Index Terms — Data mining, Fraud Detection, Financial Fraud, Financial Statements- Means clustering, Machine
Learning.
I. INTRODUCTION
Financial statement frauds (FSF) have received wide attention from the general public, the financial community and
restrictive bodies due to many high profile frauds reportable at massive companies like Enron, Lucent, and WorldCom and
Satyam computers over the previous few years. Finding financial statements primarily contains elements manipulating by
overstating assets, profit, or understating liabilities. Detecting management fraud using traditional audit procedures could
be a tough task. First, there's a shortage of knowledge regarding the characteristics of management fraud. Second, most
auditors lack the expertise necessary to find it. Finally, financial managers and accountants are deliberately trying to
deceive the auditors for such managers, who perceive the restrictions of an audit, standard auditing procedures could also
be limited. These limitations recommend the requirement for extra analytical procedures for the effective detection of
false financial statements. Statistics and data mining strategies are applied with success to find activities like money
laundering, e-commerce master card fraud, telecommunications fraud, insurance fraud, and laptop intrusion etc. However,
FSF is difficult and detecting them is tough. Individuals tend to question regarding the way to do it and the way effective
they're. The main objective this paper is to provide a comprehensive review on financial fraud detection (FFD) method.
Selected data-mining- based strategies that are utilized in FFD were examined.
II. RELATED WORK
Data mining has been applied in several aspects of financial analysis. Few areas wherever data processing techniques
have already getting used include: bankruptcy prediction, master card approval, loan decision, money-laundering
detection, stock analysis, etc. However, analysis associated with the utilization of data mining for detection of financial
statement fraud is restricted. The most objective of this analysis is to predict the occurrence of financial statement fraud
in financial as accurately as possible using intelligent techniques. There has been a restricted use of data mining
techniques for detection of financial statement fraud.
[1] Ravisankar applied six data mining techniques specifically Multilayer Feed Forward Neural Network (MLFF)
Sensitivity (50.6%), Specificity (65.6%), Support Vector Machines (SVM) Sensitivity (45.6%), Specificity (65.6%),
Genetic Programming (GP), group methodology of data Handling (GMDH), logistic Regression (LR), and Probabilistic
Neural Network (PNN) to spot financial that resort to financial statement fraud on an information set obtained from
202 Chinese financial. They found Probabilistic neural network because the best techniques without feature choice.
Multilayer Feed Forward Neural Network and PNN outperformed others with feature choice and with
marginally equal accuracies Sensitivity (65.6%), Specificity (74.6%).