Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Open Access Journal

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

ISSN : 2394-2320 (Online)

Content-based SMS Spam Messages classification using Natural Language Processing and Machine Learning

Author : S. Sumahasan 1 Uday Kumar Addanki 2 Anjani Kintali 3 Srilekha Bontu 4

Date of Publication :31st July 2021

Abstract: Spam is any unwanted digital communication that is sent in bulk from compromised machines. In this work, the suggested strategy is a model that uses the Bag of Words technique to calculate the frequency of words and Supervised Machine Learning techniques such as Naïve Bayes and Support Vector Machine to categorise the message. The suggested system shows the message's categorization as well as the most prevalent spam terms discovered in the message. We compare the performance of the Naïve Bayes and Support Vector Machine algorithms in this study. The feature of adding additional spam messages to the collection improves accuracy as well.

Reference :

    1. Pavas Navaney, Gaurav Dubey, Ajay Rana. "SMS Spam Filtering Using Supervised Machine Learning Algorithms", 2018 8th International Conference on Cloud Computing, Data Science & Engineering (Confluence), 2018.
    2. M. Singh, R. Pamula and S. k. Shekhar, "Email Spam Classification by Support Vector Machine," 2018 International Conference on Computing, Power and Communication Technologies (GUCON), 2018, pp. 878-882.
    3. N. Kumar, S. Sonowal and Nishant, "Email Spam Detection Using Machine Learning Algorithms," 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA), 2020, pp. 108-113.
    4. S. Bosaeed, I. Katib, and R. Mehmood, "A FogAugmented Machine Learning based SMS Spam Detection and Classification System," 2020 Fifth International Conference on Fog and Mobile Edge Computing (FMEC), 2020, pp. 325-330.
    5. S. B. Rathod and T. M. Pattewar, "Content-based spam detection in email using Bayesian classifier," 2015 International Conference on Communications and Signal Processing (ICCSP), 2015, pp. 1257-1261.
    6. W. Etaiwi and A. Awajan, "The Effects of Features Selection Methods on Spam Review Detection Performance," 2017 International Conference on New Trends in Computing Sciences (ICTCS), 2017, pp. 116-120.
    7. P. Sethi, V. Bhandari and B. Kohli, "SMS spam detection and comparison of various machine learning algorithms," 2017 International Conference on Computing and Communication Technologies for Smart Nation (IC3TSN), 2017, pp. 28-31
    8. A. Junnarkar, S. Adhikari, J. Fagania, P. Chimurkar, and D. Karia, "E-Mail Spam Classification via Machine Learning and Natural Language Processing," 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), 2021, pp. 693-699.
    9. K. Jayamalini, M. Ponnavaikko, J. Kothandan. "A COMPARATIVE ANALYSIS OF VARIOUS MACHINE LEARNING BASED SOCIAL MEDIA SENTIMENT ANALYSIS AND OPINION MINING [10] APPROACHES", Advances in Mathematics: Scientific Journal, 2020.
    10. W. A. Qader, M. M. Ameen and B. I. Ahmed, "An Overview of Bag of Words; Importance, Implementation, Applications, and Challenges," 2019 International Engineering Conference (IEC), 2019, pp. 200-204.

Recent Article