IJRTI
International Journal for Research Trends and Innovation
International Peer Reviewed & Refereed Journals, Open Access Journal
ISSN Approved Journal No: 2456-3315 | Impact factor: 8.14 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Call For Paper

For Authors

Forms / Download

Published Issue Details

Editorial Board

Other IMP Links

Facts & Figure

Impact Factor : 8.14

Issue per Year : 12

Volume Published : 10

Issue Published : 114

Article Submitted : 18456

Article Published : 7827

Total Authors : 20673

Total Reviewer : 756

Total Countries : 142

Indexing Partner

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Published Paper Details
Paper Title: A Computer Vision Powered OCR Framework for Extracting Tabular Data from Scanned PDFs
Authors Name: Krishnika Thirunavukkarasu , Haritha Shankar
Download E-Certificate: Download
Author Reg. ID:
IJRTI_203110
Published Paper Id: IJRTI2504315
Published In: Volume 10 Issue 4, April-2025
DOI:
Abstract: As data-driven decision-making is becoming more prevalent, extracting tabular data from scanned documents and images is a big issue. In this paper, we present an automated table extraction pipeline that employs both OpenCV for image pre-processing and Tesseract OCR for text extraction. The system applies grayscale, binarization, and morphological processing to detect lines and isolate text, thereby facilitating correct tabular data extraction. The extracted data is later transformed into a significant table structure utilizing Python's pandas package and eventually saved as an Excel file. The method suggested is effective for those documents that possess clearly defined tabular structures and acts as a stepping stone for more complex document analysis systems. Index Terms— OpenCV, Tesseract OCR, Table Extraction, Python, Image Pr ocessing, Document Analysis. _______________________________________________________________________________________________
Keywords: Index Terms— OpenCV, Tesseract OCR, Table Extraction, Python, Image Pr ocessing, Document Analysis.
Cite Article: "A Computer Vision Powered OCR Framework for Extracting Tabular Data from Scanned PDFs", International Journal of Science & Engineering Development Research (www.ijrti.org), ISSN:2455-2631, Vol.10, Issue 4, page no.d126-d130, April-2025, Available :http://www.ijrti.org/papers/IJRTI2504315.pdf
Downloads: 000416
ISSN: 2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Publication Details: Published Paper ID: IJRTI2504315
Registration ID:203110
Published In: Volume 10 Issue 4, April-2025
DOI (Digital Object Identifier):
Page No: d126-d130
Country: Chennai, Tamil Nadu, India
Research Area: Information Technology 
Publisher : IJ Publication
Published Paper URL : https://www.ijrti.org/viewpaperforall?paper=IJRTI2504315
Published Paper PDF: https://www.ijrti.org/papers/IJRTI2504315
Share Article:

Click Here to Download This Article

Article Preview
Click Here to Download This Article

Major Indexing from www.ijrti.org
Google Scholar ResearcherID Thomson Reuters Mendeley : reference manager Academia.edu
arXiv.org : cornell university library Research Gate CiteSeerX DOAJ : Directory of Open Access Journals
DRJI Index Copernicus International Scribd DocStoc

ISSN Details

ISSN: 2456-3315
Impact Factor: 8.14 and ISSN APPROVED, Journal Starting Year (ESTD) : 2016

DOI (A digital object identifier)


Providing A digital object identifier by DOI.ONE
How to Get DOI?

Conference

Open Access License Policy

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Creative Commons License This material is Open Knowledge This material is Open Data This material is Open Content

Important Details

Join RMS/Earn 300

IJRTI

WhatsApp
Click Here

Indexing Partner