Our 1st Digital Newsletter

Introducing Our Latest

Introducing Our Latest

Introducing Our Latest

Introducing Our Latest

Podcast

Podcast

OCR Of Graphic

Comics Novels

Overview

This project leverages advanced image processing techniques to extract text from multiple PDF files and generate customizable tables with a cell structure. The intuitive interface allows users to effortlessly add, delete, or modify cells and their structures, tailored for seamless data organization. The designed border-less cell structure ensures smooth data transfer. Users can export cells to Excel or JSON format for further analysis. The aim is to offer an intuitive solution for efficient data management from PDFs.
Plus Sign

Challenges

Language Barrier:
Inconvenient Reading:
Text Extraction Difficulty:
Reactivity:

Solution

Multilingual Translation:
Optimized Mobile Reading:
Accurate OCR:

Development Process

Research

Planning

Designing

Development

Maintenance

Tools/Technologies

Team & Role

01

Versatile OCR Expertise:

Proficient usage of diverse OCR engines, such as Tesseract, Paddle OCR, and Easy OCR, to extract text from comics.

02

Deep Learning Segmentation Pipeline:

Development of a sophisticated deep learning pipeline for segmentation, accurately identifying text regions for subsequent OCR processing.
Shopping Basket