Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Open Access Journal

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

ISSN : 2394-2320 (Online)

Design of Automatic Text Summarization Approach for Hindi Text Document Using Semantic Graph and Particle Swarm Optimization

Author : Vipul Dalal 1 Manisha Waze 2 Dr. Latesh Malik 3

Date of Publication :7th February 2017

Abstract: Automatic text summarization is the process of summarizing given document using intelligent algorithms. Many techniques have been suggested by researchers in past for summarization of English text. Not much work is found in the literature for summarization of Hindi text even though Hindi is an official language of India. In this paper, we propose a design for summarizing Hindi text based on semantic graph of the document using Particle Swarm Optimization (PSO) algorithms. The subject-object-verb (SOV) triples are extracted from the document. These triples are used to construct semantic graph of the document. A trained classifier using PSO algorithm generates semantic sub-graph which is then used to obtain document summary. The approach is under implementation phase and expected to give better results as compared to traditional summarizers.

Reference :

    1. Vipul Dalal, Dr. Latesh Malik, “A Survey of Extractive & Abstractive Text Summarization”, 6th International Conference on Emerging Trends in Engineering & Tecnology (ICETET), 2013
    2. M. S. Binwahlan, Salim, N., & Suanmali, L., "Swarm based features selection for text summarization," International Journal of Computer Science and Network Security IJCSNS, vol. 9, pp. 175-179, 2009b.
    3. M. S. Binwahlan, et al., "Swarm Based Text Summarization," in Computer Science and Information Technology – Spring Conference, 2009. IACSITSC '09. International Association of, 2009, pp. 145-150.
    4. Albaraa Abuobieda M. Ali, Naomie Salim, Rihab Eltayeb Ahmed, Mohammed Salem Binwahlan, Ladda Sunamali, Ahmed Hamza, “Pseudo Genetic And Probabilistic-Based Feature Selection Method For Extractive Single Document Summarization”, Journal of Theoretical and Applied Information Technology, 15th October 2011. Vol. 32 No.1, ISSN: 1992-8645, E-ISSN: 1817-3195.
    5. Alkesh Patel, Tanveer Siddiqui, U. S. Tiwary, “A language independent approach to multilingual text summarization”, Conference RIAO2007, Pittsburgh PA, U.S.A. May 30-June 1, 2007 - Copyright C.I.D. Paris, France
    6. Naresh Kumar Nagwani, Shrish Verma, “A Frequent Term and Semantic Similarity based Single Document Text Summarization Algorithm”, International Journal of Computer Applications (0975 – 8887) Volume 17– No.2, March 2011.
    7. Kamal Sarkar, “Bengali Text Summarization By Sentence Extraction
    8. Upendra Mishra, Chandra Prakash, “MAULIK: An Effective Stemmer for Hindi Language” International Journal on Computer Science and Engineering (IJCSE), ISSN : 0975-3397, Vol. 4 No. 05 May 2012
    9. Vishal Gupta, Gurpreet Singh Lehal, “Preprocessing Phase of Punjabi language Text Summarization”
    10. Jurij Leskovec, Natasa Milic-Frayling, Marko Grobelnik, “Extracting Summary Sentences Based on the Document Semantic Graph” Microsoft Research, Microsoft Corporation
    11. Regina Barzilay, Michael Elhadad. “Using Lexical Chains for Text Summarization”. In Proceedings of the Intelligent Scalable Text Summarization Workshop (ISTS'97). Madrid: ACL, 1997. 10-17.
    12. Kavita Ganesan, ChengXiang Zhai, Jiawei Han, “Opinosis: A Graph-Based Approach to Abstractive Summarization of Highly Redundant Opinions”.
    13. Eduard Hovy and Chin-Yew Lin. “Automated Text Summarization in SUMMARIST”. In I. Mani and M. Maybury (eds), Advances in Automated Text Summarization. MIT Press.
    14. Joel Larocca Neto, Alex A. Freitas Celso, A. A. Kaestner, “Automatic Text Summarization using a Machine Learning Approach”, Advances in Artificial Intelligence, Lecture Notes in Computer Science Volume 2507, 2002, pp 205-215
    15. Aysun Güran, Eren Bekar, Selim AkyokuÅŸ, “A Comparison of Feature and Semantic-Based Summarization Algorithms for Turkish”, INISTA 2010, International Symposium on Innovations in Intelligent Systems and Applicaitons, 21-24June 2010, Kayseri & Cappadocia,TURKEY
    16. Alkesh Patel, Tanveer Siddiqui, U. S. Tiwary, "A language independent approach to multilingual text summarization", Conference RIAO2007, Pittsburgh PA, U.S.A., (2007).
    17. Nguyen Quang Uy, Pham Tuan Anh, Truong Cong Doan, Nguyen Xuan Hoai, “A Study on the Use of Genetic Programming for Automatic Text Summarization”, 2012 Fourth International Conference on Knowledge and Systems Engineering
    18. Massih R. Amini, Nicolas Usunier, and Patrick Gallinari, “Automatic Text Summarization Based onWord-Clusters and Ranking Algorithms”, ECIR 2005, LNCS 3408, pp. 142–156, 2005.Springer-Verlag Berlin Heidelberg 2005
    19. Khosrow Kaikhah, “Automatic Text Summarization with Neural Networks”, SECOND IEEE International Conference On Intelligent Systems, June 2004.
    20. Kamal Sarkar, “Bengali Text Summarization By Sentence Extraction“.
    21. Albaraa Abuobieda M. Ali, Naomie Salim, Rihab Eltayeb Ahmed, Mohammed Salem Binwahlan, Ladda Suanmali, Ahmed Hamza, “Pseudo Genetic And Probabilistic-Based Feature Selection Method For Extractive Single Document Summarization”, Journal of Theoretical and Applied Information Technology 15th October 2011. Vol. 32 No.1.
    22. Reddy Siva, Natural Language Processing Tools. December. 2012 URL: http://sivareddy.in/downloads.

Recent Article