Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Open Access Journal

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

ISSN : 2394-2320 (Online)

Big Data Analytics Based Approach to Tax Evasion Detection

Author : Yashashwita Shukla 1 Neena Sidhu 2 Akshita Jain 3 T.B. Patil 4 S.T. Sawant-Patil 5

Date of Publication :29th March 2018

Abstract: Tax evasion is the illegal underpayment or non-payment of tax to the government. Detecting these illegal tax evasion activities is an important as well as challenging issue for the tax administration system of any country and the biggest challenge is the increasing volume of tax data. In this paper, we propose a system that can help to characterize and detect the probable tax evaders using the information in their tax payment through big data analysis techniques. After pre-processing the available tax data, K-mean clustering algorithm is applied to form clusters of taxpayers with similar behaviour. Then decision trees are used to classify each cluster into tax payers with and without fraud and patterns in their associated behaviour are detected. Using these characteristics and patterns, the artificial neural network is trained and potential fraudsters could be detected based on the information available. This system will help detect tax fraudsters and enhance knowledge on their patterns of fraud which will be helpful in the prevention of fraud.

Reference :

    1.  Pamela Castellón González, Juan D. Velásquez, “Characterization and detection of taxpayers with false invoices using data mining techniques”, Expert Systems with Applications: An International Journal, Volume 40, Issue 5, Pages 1427-1436, April, 2013.
    2. Roung-Shiunn Wu, C.S. Ou, Hui-ying Lin, She-I Chang, David C. Yen, “Using data mining technique to enhance tax evasion detection performance”, Expert Systems with Applications: An International Journal, Volume 39, Issue 10, Pages 8769-8777 , August, 2012.
    3. Fend Tian, Tian Lan, Kuo-Ming Chao, Nick Godwin, “Mining suspicious Tax Evasion groups in Big Data”, IEEE Transactions on Knowledge and Data Engineering, Volume 28, Issue 10, Pages 2651 - 2664, October 1st, 2016.
    4. Freddy Duitama-Munoz, Julian D. Arias-Lundono, “Fraud detection in big data using supervised and semi-supervised learning techniques”, IEEE Columbian Conference on Communications and Computing, August, 2017.
    5. You Dai, Jin Yan, Xiaoxin Tang, Han Zhao, Minyi Gua, “Online credit card fraud detection: A hybrid framework with Big Data technologies”, TrustCom/BigDataSE/ISPA 2016 IEEE, August, 2016.
    6. Andrea Trevino, “Introduction to K-means Clustering”, 2016, [Online] Available: https://www.datascience.com /blog /k-meansclustering, [Accessed: February 15, 2018].
    7. Bc. Miroslav Hlav´aˇcek, “Multilayer feedforward neural networks based on multi-valued neurons”, 2014, [Online] Available: https://is.muni.cz /th/359995/ fi_m/dp_text.pdf, [Accessed: February 16, 2018].
    8. Barry de Ville, “Decision Trees for Business Intelligence and Data Mining: Using SAS Enterprise Miner”, SAS Institute Inc., Cary, NC, USA, 2006.

Recent Article