LinuxQuestions.org
Old 03-18-2016, 11:48 AM   #1
business_kid
LQ Guru
 
Registered: Jan 2006
Location: Ireland
Distribution: Slackware & Android
Posts: 11,007

Rep: Reputation: 1250
Dumb PDF to epub or similar?


What's the current best way to recover text from a PDF made up of low-res scans of printed pages?

These (typically large) books are difficult for me to read due to font sizing and clumsy PDF zooming, especially on Android tablets.

For my sins, I worked on making such books for a while. We would scan at 400 DPI if the print was at all small (300 DPI otherwise), process under Windows with ABBYY FineReader and Adobe Acrobat, then put the text under the images and shrink the file to under 300 MB by reducing image resolution. The reason for that last stage was that Windows had memory issues with the Adobe tools and large files.

I run Slackware 14.1, and have Fedora locked up in a VM.
 
Old 03-18-2016, 02:33 PM   #2
NGIB
Member
 
Registered: Sep 2013
Location: Sumter SC, USA
Distribution: MX, Lubuntu
Posts: 449

Rep: Reputation: Disabled
The text on the page isn't text at all; the page is a graphic image. Years ago, I experimented with a scanner and OCR software, but it was not very good, and fixing the errors was not worth the effort...
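
For anyone who wants to try OCR on such a file anyway, here is a minimal sketch, not something from the post above. It assumes the poppler-utils `pdftoppm` and `tesseract` command-line tools are installed; the function names, the `page` file prefix and the 300 DPI default are illustrative choices only. Rendering above the resolution of the original scan will not recover detail that was never captured, so results on low-res scans may still be poor.

```python
import shutil
import subprocess
from pathlib import Path


def rasterize_command(pdf_path, out_prefix, dpi=300):
    # pdftoppm renders each PDF page to an image: PREFIX-1.png, PREFIX-2.png, ...
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(out_prefix)]


def ocr_command(image_path, lang="eng"):
    # tesseract IMAGE OUTPUTBASE -l LANG writes the recognised text to OUTPUTBASE.txt
    output_base = Path(image_path).with_suffix("")
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def pdf_to_text(pdf_path, work_dir, dpi=300, lang="eng"):
    """Rasterize a scanned PDF and OCR every page; returns the combined text."""
    for tool in ("pdftoppm", "tesseract"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(rasterize_command(pdf_path, work_dir / "page", dpi), check=True)
    chunks = []
    # pdftoppm zero-pads page numbers, so a plain sort keeps page order
    for image in sorted(work_dir.glob("page-*.png")):
        subprocess.run(ocr_command(image, lang), check=True)
        chunks.append(image.with_suffix(".txt").read_text(encoding="utf-8"))
    return "\n".join(chunks)


if __name__ == "__main__":
    # "book.pdf" and "ocr_work" are placeholder names
    print(pdf_to_text("book.pdf", "ocr_work")[:500])
```

The recognised text would still need proofreading, which is the effort described above.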
 
Old 03-18-2016, 04:01 PM   #3
toothandnail
Member
 
Registered: Apr 2007
Location: Oxfordshire, UK
Distribution: Arch, Sparky, Salix64
Posts: 115

Rep: Reputation: 22
Results vary, but calibre can convert .pdf to a number of ebook formats, including epub. Not sure what the results would be like with a low-res pdf, but it's worth a try...

Paul.
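
A minimal sketch of the calibre route, assuming calibre's `ebook-convert` command-line tool is installed and on the PATH. The helper names and the `book.pdf` file name are made up for illustration. As far as I know, calibre's PDF input reads whatever text layer the PDF already has rather than doing OCR, so an image-only scan may come out with little or no usable text.

```python
import shutil
import subprocess
from pathlib import Path


def convert_command(pdf_path, out_format="epub"):
    # ebook-convert picks the output format from the output file's extension
    pdf_path = Path(pdf_path)
    out_path = pdf_path.with_suffix("." + out_format)
    return ["ebook-convert", str(pdf_path), str(out_path)]


def convert(pdf_path, out_format="epub"):
    """Run calibre's converter and return the path of the new ebook."""
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found; is calibre installed?")
    cmd = convert_command(pdf_path, out_format)
    subprocess.run(cmd, check=True)
    return Path(cmd[-1])


if __name__ == "__main__":
    # "book.pdf" is a placeholder file name
    print(convert("book.pdf"))
```

The same conversion can be done from the calibre GUI; the script form is mainly handy for batches of files.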
 
Old 03-19-2016, 11:26 PM   #4
sgosnell
Senior Member
 
Registered: Jan 2008
Location: Baja Oklahoma
Distribution: Debian
Posts: 1,054

Rep: Reputation: 281
Yes, I would start with calibre. If that doesn't work, there are websites that do conversions. Large files can be iffy, though.
 
  

