LinuxQuestions.org
Did you know LQ has a Linux Hardware Compatibility List?
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Hardware
User Name
Password
Linux - Hardware This forum is for Hardware issues.
Having trouble installing a piece of hardware? Want to know if that peripheral is compatible with Linux?

Notices



Reply
 
Search this Thread
Old 03-24-2010, 01:49 AM   #1
Ml9xLi6kvpVZ09r
LQ Newbie
 
Registered: Oct 2009
Posts: 6

Rep: Reputation: 1
HP scanner text recognition quality


Just installed Ubuntu K-K 9.10 4 months ago, still a newbie. Got my HP PSC 1410 all-in-one hooked up and the HP Device Manager installed OK and xsane. Printer works good and scanner handles pics great but I don't have any luck scanning a menu to doc format. The menu does have fancy scroll-work around the edges and main headings are a script font and Course names are underlined. All other items are clear font. I figured this all added to the problem of very unreadable scan results till I tried a page with all clear block font and it's still totally unreadable. I mean totally! I've searched till I'm out of ideas. There has to be a way to tweak the recognition of text.
 
Old 03-24-2010, 02:28 AM   #2
tommyttt
Member
 
Registered: Oct 2009
Location: Federal Way, WA
Distribution: openSUSE 11.4 x86_64, openSuSE 12.1, Fedora 15
Posts: 207

Rep: Reputation: 34
Quote:
Originally Posted by Ml9xLi6kvpVZ09r View Post
Just installed Ubuntu K-K 9.10 4 months ago, still a newbie. Got my HP PSC 1410 all-in-one hooked up and the HP Device Manager installed OK and xsane. Printer works good and scanner handles pics great but I don't have any luck scanning a menu to doc format. The menu does have fancy scroll-work around the edges and main headings are a script font and Course names are underlined. All other items are clear font. I figured this all added to the problem of very unreadable scan results till I tried a page with all clear block font and it's still totally unreadable. I mean totally! I've searched till I'm out of ideas. There has to be a way to tweak the recognition of text.
You can't scan directly to a text document, only to an image. You need to scan the page into an image format file, generally *.tiff works best. Then it has to be run through an OCR (Optical Character Recognition) program such as Tesseract. Your HP device probably came with a windblows program that accompliched all that but it won't work in linux unless you use virtualization of some flavor.

I don't use Ubuntu but think they should have Tesseract in their repository.
 
Old 03-24-2010, 04:25 AM   #3
remmilou
LQ Newbie
 
Registered: Mar 2010
Location: Amsterdam
Distribution: Debian
Posts: 9

Rep: Reputation: 3
Try gscan2pdf. One pass scanning and ocr to pdf or tiff. It's in the repositories.
It has options for choosing the ocr engine (and more). Tesseract works best (at least for me), so install that from the repo as well.
"sudo apt-get update && sudo apt-get install tesseract gscan2pdf" should do the trick. Or use synaptic.
It has an issue with saving PDF's. I solved that long ago, but don't know how anymore (Google is your friend). To tiff works flawless.
 
  


Reply

Tags
hp, recognition, scanner, text


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
usb scanner premissions issue when running sane-find-scanner vlsd Linux - Hardware 4 08-20-2009 03:20 PM
LXer: Get top-quality scans from your scanner with Lprof LXer Syndicated Linux News 0 11-16-2006 03:21 AM
scanner to text ilnli Linux - General 1 03-28-2005 03:19 PM
Poor text quality after upgrade cyclic Fedora 0 01-06-2005 11:09 AM


All times are GMT -5. The time now is 12:25 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
identi.ca: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration