LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 11-24-2016, 11:09 PM   #1
michael diemer
Member
 
Registered: Jul 2016
Location: Maine, USA
Distribution: Bodhi 6, Debian 11 LXDE
Posts: 158

Rep: Reputation: Disabled
Software To Scan Old Text Documents, Then Edit


I want to put all my old poems into my computer, and edit them. Is there a program to scan them in, and then edit them?
 
Old 11-24-2016, 11:33 PM   #2
ferrari
LQ Guru
 
Registered: Sep 2003
Location: Auckland, NZ
Distribution: openSUSE Leap
Posts: 5,805

Rep: Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140Reputation: 1140
Scanning and capturing as an image is the easy part. I'll assume that you have a scanner and are familiar with the usual scanning applications. After that you'll need optical character recognition software to undertake this task of capturing the hand-written (or typed) text and YMMV. Anyway, I'll point you at the following pages to get started...

https://help.ubuntu.com/community/OCR
https://sourceforge.net/projects/lios/

Good luck with your project.
 
Old 11-25-2016, 12:59 AM   #3
michael diemer
Member
 
Registered: Jul 2016
Location: Maine, USA
Distribution: Bodhi 6, Debian 11 LXDE
Posts: 158

Original Poster
Rep: Reputation: Disabled
Thanks ferrari, I'll check those out.
 
Old 11-25-2016, 03:31 PM   #4
jefro
Moderator
 
Registered: Mar 2008
Posts: 21,982

Rep: Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625
When you say poems, do you mean OCR readable or handwritten letters?

I think this is the one that did the best last time I saw a comparison. http://www.ocr4linux.com/en:start

I could never get much out of this. https://github.com/tesseract-ocr/tesseract/wiki


Ended up on one project taking the stuff to a high end business MFP where they had the ability to scan to file in various formats. I think the scan was almost perfect.

*(no scanner seems to be 100% unless you started with special OCR fonts)
 
Old 11-26-2016, 12:21 AM   #5
michael diemer
Member
 
Registered: Jul 2016
Location: Maine, USA
Distribution: Bodhi 6, Debian 11 LXDE
Posts: 158

Original Poster
Rep: Reputation: Disabled
jefro, the poems were typed on an ancient manual typewriter, with the expected fuzzy and irregular results. They are from the 1970's.
 
Old 11-26-2016, 11:30 AM   #6
DavidMcCann
LQ Veteran
 
Registered: Jul 2006
Location: London
Distribution: PCLinuxOS, Debian
Posts: 6,142

Rep: Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314Reputation: 2314
I used Tesseract once on a long typescript and it went quite well. The fact that the letters are monospaced reduces some of the usual OCR problems, like "ll" > "U".
 
Old 11-26-2016, 05:18 PM   #7
jefro
Moderator
 
Registered: Mar 2008
Posts: 21,982

Rep: Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625Reputation: 3625
https://finereaderonline.com/en-us 10 pages to start.


https://acrobat.adobe.com/us/en/acro...f-to-text.html Think I used this in wine before.

Might try some of the online ones if this is just a once in a while deal.

I have used Adobe's professional product and it was very good. Of course they usually try to force you into a new pdf format.

Not sure how trustworthy the online places are.

Last edited by jefro; 11-26-2016 at 05:20 PM.
 
Old 11-28-2016, 10:52 AM   #8
michael diemer
Member
 
Registered: Jul 2016
Location: Maine, USA
Distribution: Bodhi 6, Debian 11 LXDE
Posts: 158

Original Poster
Rep: Reputation: Disabled
I've been trying out some freeware on my Windows install. My Canon printer came with some Nuance software, plus I downloaded a couple other freebies. Also tried One Note, which has some OCR capability. But I'm considering buying something at this point, as the type is so poor that the free stuff is missing a lot of it.

There are some things that Windows is still better for.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Scan documents & make writeable EDDY1 Linux - Software 3 08-18-2011 04:23 PM
[SOLVED] Scan documents into OO dvjackson Linux - General 4 08-16-2011 09:43 AM
[SOLVED] copy and paste text among text documents in Linux ethereal1m Linux - Newbie 5 03-28-2010 03:14 AM
how can I view and edit "Documents to Go" documents in Linux? izquierdista Linux - Software 7 08-30-2007 07:58 AM
Want to scan documents to a database...how? glenn69 Linux - Newbie 2 01-06-2005 10:57 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 04:04 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration