LinuxQuestions.org
Share your knowledge at the LQ Wiki.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices

Reply
 
Search this Thread
Old 11-22-2005, 06:49 PM   #1
moxieman99
Member
 
Registered: Feb 2004
Distribution: Dabble, but latest used are Fedora 13 and Ubuntu 10.4.1
Posts: 413

Rep: Reputation: 88
OCR woes with Kooka


Tried Google for answers and saw that others had the problems I have, but no solutions.

I have Kooka .44 installed as part of FC4 (2.6.11-1 kernal)

I am trying to do some OCR (optical character recognition) stuff with the scanner.

I installed GOCR (a/k/a JOCR) and OCRAD.

Three questions:

1. Kooka insists on saving the scanned piece as an image/picture (jpeg or some other formats) before I can try OCR on it. Is this how it should be/How to change it?

2. Kooka does not use GOCR, even though I set it to use GOCR. It immediately fires up OCRAD. I have the path to GOCR in the settings and GOCR as the app to use, and even removed OCRAD entirely. No luck. It still tries for OCRAD and gives me no option to use GOCR. What am I overlooking?

3. What specific file from my OCRAD install should I put in as the final part of the OCRAD path? I figured out which file to use in GOCR (which I can't get kooka to use), but can't figure out which OCRAD file to use.

I get no error message. A screen comes up for OCRAD and the little KDE wheels spin, but it goes for hours with nothing happening.

Any help appreciated.


Moxieman99
 
Old 11-25-2005, 08:03 AM   #2
aikempshall
Member
 
Registered: Nov 2003
Location: Bristol, Britain
Distribution: Slackware
Posts: 387

Rep: Reputation: 40
Possibly the file that you're trying to OCR is too noisey. Save the scanned file as a JPG format and try running OCRAD from the command line. I'm not at my machine right now but for noisey files I ran it through a filter to clean the file before putting it through OCRAD.
 
Old 11-25-2005, 12:22 PM   #3
moxieman99
Member
 
Registered: Feb 2004
Distribution: Dabble, but latest used are Fedora 13 and Ubuntu 10.4.1
Posts: 413

Original Poster
Rep: Reputation: 88
Thanks, I will try that (cleaning up the image first), but why no choice and no ability to use GOCR?

Moxieman99
 
Old 11-25-2005, 04:01 PM   #4
aikempshall
Member
 
Registered: Nov 2003
Location: Bristol, Britain
Distribution: Slackware
Posts: 387

Rep: Reputation: 40
I soometimes scan the financial papers such as the Financial Times which is printed on "pink" paper.

I scan in Gray mode at a resolution of 600 and save as a jpeg
Clean the images with the following command
jpegtopnm kscan_0001.jpeg | pamditherbw -threshold -value 0.50 | pamtopnm > kscan_0001.pbm
then in kooka ocr the kscan_0001.pbm image

both gocr and ocrad work on my system. ocrad gives by far the better result. For me ocrad has a 3% errror rate whereas gocr on the same document has a 30% error rate.

I would suggest that you scan your document and try ocrad and gocr at the command line.


Edit:

Looking at the code ksaneocr.cpp my inpression is that kooka will only ever use ocrad or, if available, Kadmos. This page suggests that there may be a change on the way http://cvs-digest.org/index.php?diff...evision=434828

Last edited by aikempshall; 11-26-2005 at 05:49 AM.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Best OCR application? dgermann Linux - Software 13 10-14-2010 08:39 PM
Anyone using kooka + ocr for accented chars? J_Szucs Suse/Novell 12 09-08-2005 08:11 AM
ocr John Master Linux - Software 7 06-12-2005 06:56 PM
Ocr apffal Linux - Software 1 06-12-2005 06:01 AM
OCR initialization failed accessing OCR device: PROC-26 cheeku Linux - Software 0 09-19-2004 09:36 AM


All times are GMT -5. The time now is 10:36 PM.

Main Menu
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration