Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Open Access Journal

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

ISSN : 2394-2320 (Online)

Part of Speech Tagging for Konkani Corpus

Author : Meghana Mahesh Pai Kane 1

Date of Publication :8th June 2017

Abstract: The wide spectrum of languages are been used for communication around the world , utilization of world wide web for searching information requires computational linguistics because majority of the search engines uses bag of words that causes problem in extracting of the information due to use of Multi words . This has made to think beyond the boundaries about what kinds of query a human can submit and also its interpretation in forms of its annotation could be used to obtain good result. The essential st ep in the Natural Language Processing resides in obtaining the grammatical information of the words used in the input as per it appearance in the text .POS taggers for several other Indian languages have been developed but assumption of unavailability of the POS tagger for the Konkani language aims at developing the same. Further POS tagging to do manually is much tougher job due to huge content of data. This paper aims at part of speech tagging for Konkani corpus.

Reference :

    1. Ed. T. Jaynes, “Information Theory”, dated 1957 http://homepages.inf.ed.ac.uk/lzhang10/maxent.html
    2. Abney, “Stochastic Attribute-Value Grammars”, dated 1997 http://citeseer.ist.psu.edu/490897.html
    3. Christopher D. Manning, Hinrich Schutze, “Foundations of statistical natural language processing”
    4. Experiences in Building the Konkani WordNet Using the Expansion Approach http://www.cfilt.iitb.ac.in/gwc2010/pdfs/54_Konkani_Wo rdNet__Walawalikar.pdf
    5. Daniel Jurafsky and James H.Martin, “Speech and Language Processing”Adam L. Berger, Stephen A. Della Pietra and Vincent J. Della Pietra, “A maximum entropy approach to natural language processing”
    6. Stochastic Algorithm http://citeseer.ist.psu.edu/rosenfeld94adaptive.html
    7. Morphological Analyzer http://Morphadorner.northwestern.edu/morphadorner/post agger/example
    8. “A Part Of Speech Tagger For Indian Languages” http://shiva.iiit.ac.in/SPSAL2007/iiit_tagset_guidelines.pd f
    9. Hindi POS Tagging and Chunking Itrc.ac.in/nlpai_contest06/papers/msrindia.pdf
    10. Sanskrit Tagger, a stochastic lexical and pos tagger for Sanskrit http://hal.inria.fr/inria-00203467/fr/
    11. A maximum entropy model for Part of Speech tagging www.Idc.upenn.edu/acI/W/W96/W96-0213.pdf
    12. Natural Language Processing cnlp.syr.edu/publications/03NLP.LIS.Encyclopedia.pdf
    13. BIS Annotation Standards With Reference to Konkani Language – Goa university
    14. Multiword Expressions Dataset for Indian Languages https://www.cse.iitb.ac.in/~pb/papers/lrec16-m w-resource.pdf

Recent Article