Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Information Extractor: A Unified Framework for Information Extraction from Multimodal Sources

Authors: Hemant Suteri, Subodh Sharma, Vandana Choudhary, Namita Goyal

Date of Publication: 4th April 2024

Abstract: In today's digital world, nearly everything is either already digital or in the process of being converted to digital form, and documents such as books, newspapers, and vehicle papers are no exception. Digital forms of books are now common, whether accessed through services such as Google Scholar, read on e-readers such as Amazon's Kindle, or heard in audio formats, and newspapers and magazines increasingly focus on their digital editions for reasons of convenience. With every kind of document going digital, we can confidently say that the future belongs to digital documents, and managing such documents requires a range of technologies. One such technology is Tesseract OCR, an open-source optical character recognition (OCR) engine whose development was sponsored by Google. It uses artificial intelligence to locate and recognize the text in an image of a document.
