LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 12-05-2018, 09:04 AM   #1
qrange
Senior Member
 
Registered: Jul 2006
Location: Belgrade, Yugoslavia
Distribution: Debian stable/testing, amd64
Posts: 1,030

Rep: Reputation: 47
tesseract ocr v4.0


I'd like to create a new "language set" for recognizing hand written numbers.
(lstmtraining)

There's this tutorial:

https://github.com/tesseract-ocr/tes...Tesseract-4.00

but its not clear how to do it with my own samples?

thanks.
 
Old 12-05-2018, 03:36 PM   #2
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware, Pi OS & Android
Posts: 12,010

Rep: Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416Reputation: 1416
Email them and ask them to clarify. I'm sure they would be glad of help, especially if you're working in Cyrillic. Abbyy is about the best at that, but they do seem to want money. They're a windows lot that made a linux version. At least sometimes it's free for 1 month trial.

Tesseract is building a new (for tesseract) OCR approach, and it's in beta. It has some catchup to do. Abbyy is far from perfect either. Fonts vary so much.
 
Old 12-06-2018, 01:23 AM   #3
qrange
Senior Member
 
Registered: Jul 2006
Location: Belgrade, Yugoslavia
Distribution: Debian stable/testing, amd64
Posts: 1,030

Original Poster
Rep: Reputation: 47
um, AFAIK, there are no "Cyrillic numbers".

Tesseract-OCR is open-source. The tutorial uses fonts, it seems incomplete.


edit:

it seems " JTessBoxEditor/VietOCR" should help creating boxes, etc.
will try that.

edit2:
found someone with similar problem:
https://github.com/tesseract-ocr/tesseract/issues/1536

Last edited by qrange; 12-06-2018 at 08:40 AM.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
LXer: gImageReader (Tesseract OCR GUI) Gets Multipage Recognition Support LXer Syndicated Linux News 0 03-25-2011 06:12 PM
LXer: Extract Text From PDFs And Images With gImageReader, A Tesseract OCR GUI LXer Syndicated Linux News 0 01-04-2011 10:00 AM
LXer: Optical Character Recognition With Tesseract OCR On Ubuntu 7.04 LXer Syndicated Linux News 0 08-30-2007 07:30 PM
OCR & Tesseract...Anyone tried it ? 2GNUBY Linux - Desktop 0 10-10-2006 04:39 PM
LXer: Google's Tesseract OCR engine is a quantum leap forward LXer Syndicated Linux News 0 09-28-2006 02:54 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 12:39 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration