LinuxQuestions.org
Download your favorite Linux distribution at LQ ISO.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices

Reply
 
Search this Thread
Old 08-21-2005, 08:20 PM   #1
J_Szucs
Senior Member
 
Registered: Nov 2001
Location: Budapest, Hungary
Distribution: SuSE 6.4-11.3, Dsl linux, FreeBSD 4.3-6.2, Mandrake 8.2, Redhat, UHU, Debian Etch
Posts: 1,126

Rep: Reputation: 58
Kooka and UTF-8


I tried to use kooka in SuSE 9.1, but I failed, as its ocr function returns much garbage in the text because the ocr engine returns ISO 8859-2 text, while kooka shows that in UTF-8. (The conversion is out of question taking into account how often I must use ocr).

I know that both ocr engines (gocr and ocrad) could return UTF-8 text, too, but kooka calls them without the necessary command line parameter, so they output ISO 8859-2 text.

Is there a way to make kooka run those ocr engines with the correct command line parameter?

Last edited by J_Szucs; 08-21-2005 at 08:21 PM.
 
Old 08-23-2005, 07:30 AM   #2
aikempshall
Member
 
Registered: Nov 2003
Location: Bristol, Britain
Distribution: Slackware
Posts: 393

Rep: Reputation: 41
In the past I used omnipage under wine for ocr with reasonable results. It's always bugged me that this was the only non-linux software. So recently I've started using ocrad with kooka and the results are on a par with what I achieved from omnipage.

My configuration is straight out of the box.

I'm using kooka 0.44 and ocrad 0.12.

And am now windows free!
 
Old 08-23-2005, 11:06 AM   #3
J_Szucs
Senior Member
 
Registered: Nov 2001
Location: Budapest, Hungary
Distribution: SuSE 6.4-11.3, Dsl linux, FreeBSD 4.3-6.2, Mandrake 8.2, Redhat, UHU, Debian Etch
Posts: 1,126

Original Poster
Rep: Reputation: 58
Well, I believe that kooka actually works well on slackware, but I have a SuSE with default utf-8 charcoding.

Here, even when I enter some accented characters into the recognized text window of kooka, and copy & paste it into an other editor, I get nothing but garbage.

And I do not know how to configure kooka to use utf-8 by default; or how to convert kooka's ocr results to utf-8. (Doing an iso_8859-1 > utf-8 conversion produced garbage, too)

Last edited by J_Szucs; 08-23-2005 at 04:48 PM.
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Problem with Kooka dyrer Linux - Hardware 0 10-06-2005 02:27 PM
Scanner KOOKA auliahazza Linux - Newbie 2 05-28-2005 04:45 AM
How do I know how a file is encoded? UTF-8, UTF-16, etc.. ?? brynjarh Linux - General 1 12-03-2004 12:11 PM
[Enter] in text documents diffrent on Windows and Linux? UTF-8/UTF-16 problem or? brynjarh Linux - General 1 11-24-2004 06:20 AM
X11 / UTF-8 locale seems missing 'fr_FR.UTF-8' chrsitophermann Debian 11 07-17-2004 03:04 PM


All times are GMT -5. The time now is 05:15 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration