LinuxQuestions.org
Linux - Software This forum is for Software issues.
Old 08-21-2005, 07:20 PM   #1
J_Szucs
Senior Member
 
Registered: Nov 2001
Location: Budapest, Hungary
Distribution: SuSE 6.4-11.3, Dsl linux, FreeBSD 4.3-6.2, Mandrake 8.2, Redhat, UHU, Debian Etch
Posts: 1,126

Rep: Reputation: 58
Kooka and UTF-8


I tried to use Kooka on SuSE 9.1, but its OCR function returns a lot of garbage in the text: the OCR engine outputs ISO 8859-2 text, while Kooka displays it as UTF-8. (Converting the output by hand every time is out of the question, given how often I need OCR.)

I know that both OCR engines (gocr and ocrad) can return UTF-8 text as well, but Kooka calls them without the necessary command-line parameter, so they output ISO 8859-2 text.
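To make the mismatch concrete, here is a minimal Python sketch of what happens when ISO 8859-2 bytes are read as UTF-8. The sample sentence and the function names are made up for illustration. Substituting U+FFFD for invalid bytes is only an approximation of what a UTF-8 text widget does, not a description of Kooka's actual code.

```python
# Hypothetical sample text; any string with Hungarian accented letters will do.
SAMPLE = "árvíztűrő tükörfúrógép"


def as_engine_output(text: str) -> bytes:
    """Encode text as ISO 8859-2, the charset the OCR engine is said to emit."""
    return text.encode("iso8859_2")


def as_shown_by_utf8_viewer(raw: bytes) -> str:
    """Decode bytes as UTF-8, replacing every invalid sequence with U+FFFD."""
    return raw.decode("utf-8", errors="replace")


raw = as_engine_output(SAMPLE)
shown = as_shown_by_utf8_viewer(raw)

# The plain ASCII letters survive; the accented ones turn into U+FFFD.
print(shown)

# Decoding the same bytes with the right charset restores the text.
print(raw.decode("iso8859_2"))
```

In ISO 8859-2 every accented letter is a single byte above 0x7F, and such a byte on its own is never valid UTF-8, so only the unaccented letters come through intact.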

Is there a way to make Kooka run those OCR engines with the correct command-line parameter?

Last edited by J_Szucs; 08-21-2005 at 07:21 PM.
 
Old 08-23-2005, 06:30 AM   #2
aikempshall
Member
 
Registered: Nov 2003
Location: Bristol, Britain
Distribution: Slackware
Posts: 900

Rep: Reputation: 153
In the past I used OmniPage under Wine for OCR, with reasonable results. It always bugged me that this was the only non-Linux software I relied on, so I recently started using ocrad with Kooka, and the results are on a par with what I achieved with OmniPage.

My configuration is straight out of the box.

I'm using Kooka 0.44 and ocrad 0.12.

And I am now Windows-free!
 
Old 08-23-2005, 10:06 AM   #3
J_Szucs
Senior Member
 
Registered: Nov 2001
Location: Budapest, Hungary
Distribution: SuSE 6.4-11.3, Dsl linux, FreeBSD 4.3-6.2, Mandrake 8.2, Redhat, UHU, Debian Etch
Posts: 1,126

Original Poster
Rep: Reputation: 58
Well, I believe that Kooka actually works well on Slackware, but I am on SuSE, where UTF-8 is the default character encoding.

Here, even when I type some accented characters into Kooka's recognized-text window and copy and paste them into another editor, I get nothing but garbage.

And I do not know how to configure Kooka to use UTF-8 by default, or how to convert Kooka's OCR results to UTF-8. (An iso_8859-1 > utf-8 conversion produced garbage, too.)
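One possible reason the iso_8859-1 conversion fails is the source charset: ő and ű exist in ISO 8859-2 but not in ISO 8859-1, so the same bytes map to different letters. The Python sketch below shows the difference. The function name `to_utf8` and the sample word are made up for illustration, and it assumes you have the raw bytes exactly as the OCR engine wrote them. Text that has already been through a lossy UTF-8 decode cannot be repaired this way.

```python
def to_utf8(raw: bytes, source_charset: str) -> bytes:
    """Re-encode raw bytes from source_charset to UTF-8."""
    return raw.decode(source_charset).encode("utf-8")


# Stand-in for raw OCR engine output: a Hungarian word in ISO 8859-2.
raw = "tűrő".encode("iso8859_2")

# Treating the bytes as ISO 8859-1 silently picks the wrong letters.
wrong = to_utf8(raw, "iso8859_1").decode("utf-8")

# Treating them as ISO 8859-2 gives the intended text.
right = to_utf8(raw, "iso8859_2").decode("utf-8")

print(wrong)  # tûrõ
print(right)  # tűrő
```

On the command line, the equivalent would be an iconv call with ISO-8859-2 rather than ISO-8859-1 as the source encoding.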

Last edited by J_Szucs; 08-23-2005 at 03:48 PM.
 
  

