LinuxQuestions.org
Help answer threads with 0 replies.
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Fedora
User Name
Password
Fedora This forum is for the discussion of the Fedora Project.

Notices


Reply
  Search this Thread
Old 04-26-2010, 01:01 PM   #1
mickeyboa
Senior Member
 
Registered: May 2004
Location: Indianapolis, Indiana
Distribution: FC-KDE, 32 and 64 bit
Posts: 1,544

Rep: Reputation: 55
Scan and Save As a Text file


Fedora12/Kde4

After a Sane Scan how do I Save a scan as a *.txt file ?

gocr does not work, I can save in any other format but a txt file.
 
Old 04-26-2010, 06:34 PM   #2
choogendyk
Senior Member
 
Registered: Aug 2007
Location: Massachusetts, USA
Distribution: Solaris 9 & 10, Mac OS X, Ubuntu Server
Posts: 1,194

Rep: Reputation: 105Reputation: 105
So, the magic search keyword here is OCR. You have scanned an image. You have a page of pixels. You need to process that through something that will do optical character recognition and give you the text in the image.

http://www.xs4all.nl/~ljm/SANE-faq.html
http://www.xs4all.nl/~ljm/SANE-faq.html#110 <--Check item 11.11 in the middle of that page. It talks about OCR packages that are available.
 
Old 04-26-2010, 08:35 PM   #3
choogendyk
Senior Member
 
Registered: Aug 2007
Location: Massachusetts, USA
Distribution: Solaris 9 & 10, Mac OS X, Ubuntu Server
Posts: 1,194

Rep: Reputation: 105Reputation: 105
hmm. Just checking something & realized that you did mention gocr. That's actually one that was mentioned in the link I posted above, but the person writing that said they didn't have much luck with gocr. They did mention another possibility, and said it needed training.

You could try the other or post some additional details about what you did in the way of installing and configuring gocr and what happened.
 
Old 04-28-2010, 05:56 PM   #4
mickeyboa
Senior Member
 
Registered: May 2004
Location: Indianapolis, Indiana
Distribution: FC-KDE, 32 and 64 bit
Posts: 1,544

Original Poster
Rep: Reputation: 55
I didn't do any Configuring to gocr, I just installed the gocr package.
 
Old 04-29-2010, 05:35 AM   #5
decrepit
Member
 
Registered: May 2006
Distribution: Ubuntu10.04, < fedora12
Posts: 347

Rep: Reputation: 34
I've used a couple of OCR programs a few years ago, and found them less than satisfactory, takes longer to correct the mistakes, than just type the thing out.
The quality and size of the image is very important!!!! If the image is too big or small, there can be problems. Contrast and clarity also have to be right. Light coloured ink on a dark background, or a low resolution/fuzzy pic aren't going to work.

And if there's any complicated formatting, forget it!
 
Old 04-29-2010, 07:19 AM   #6
choogendyk
Senior Member
 
Registered: Aug 2007
Location: Massachusetts, USA
Distribution: Solaris 9 & 10, Mac OS X, Ubuntu Server
Posts: 1,194

Rep: Reputation: 105Reputation: 105
That's unfortunate. I've used commercial OCR software on the Mac for many years and had good luck. Adobe Acrobat Pro has an OCR capability. You can have a scanned text image and use OCR to build text search capability and put it all in a pdf. Thus, you are seeing and reading the scanned image, but if you do a search, it uses the underlying text and then highlights the corresponding space in the scanned image. This worked well for a paper of mine that had a lot of mathematical equations. It looks good, because you are seeing the scanned images of the equations, and they wouldn't do well in OCR. You can't search on what doesn't work with OCR, but you can on the body text.

Anyway, I just checked and Adobe Acrobat Pro doesn't seem to be available for Linux.

There's gotta be something that works for Linux. http://www.claraocr.org/ has a lot of information. It says that ClaraOCR no longer is available, but the site is now dedicated to information about OCR and lists both free and commercial programs available for various platforms, including Linux. You could also check through http://www.thefreecountry.com/utilities/ocr.shtml, and there are a bunch of products listed at http://en.wikipedia.org/wiki/OCR_Software. Some of the better commercial ones can be quite expensive, but there seem to be a number open source ones to choose from.
 
Old 09-30-2010, 03:58 PM   #7
ilnli
Member
 
Registered: Jul 2004
Location: Pakistan
Distribution: Slackware 10.0, SUSE 9.1, RH 7, 7.3, 8, 9, FC2
Posts: 413

Rep: Reputation: 32
Just use online OCR conversion utility http://www.ocrconvert.com
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
gedit cannot save text file, type mailbox file ! snailchess Linux - Newbie 10 02-26-2011 04:57 AM
Save text to the beginning of file. c0m4r Linux - General 3 05-24-2009 11:53 AM
How to save text file with ASCII code?? Nejad General 7 01-04-2009 02:17 AM
wiring to a text file and being able to save and add to it mrobertson Programming 1 07-27-2005 02:27 PM
how to cat a text file and save it as a variable mrobertson Programming 37 07-05-2005 08:20 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Fedora

All times are GMT -5. The time now is 05:11 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration