Is there a way to export pdf files to mobi or epub without weird artifacts/characters
Linux - SoftwareThis forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Is there a way to export pdf files to mobi or epub without weird artifacts/characters
Hi,
Is there an alternative to calibre. Calibre does not seem export certain pdf files that well.
Simple pdf files are fine when exported to mobi/epub, but the more complex pdfs files don't render that well. Sometimes they have weird artifacts and characters. Is this normal when converting pdfs with calibre or are pdf files difficult to render no matter what application does it?
After doing more research. Adobe pdf files have features that are proprietary such as tables in pdf files and mobi/epub doesn't support tables. I read a work-a-round is to export the pdf file to epub and then use an application call sigil, a WYSIWYG ebook editor to make adjustments to the epub file.
Distribution: K/Ubuntu 18.04-14.04, Scientific Linux 6.3-6.4, Android-x86, Pretty much all distros at one point...
Posts: 1,802
Rep:
PDF files are my speciality... Just kidding,... But I do know a bit about them...
One possible way around the problem is to break up the PDFs with a tool like PDFsam, pdfedit, or some others that can be found, exporting problematic things like tables to images, and reassembling the PDF for conversion. It's time consuming and labor intensive for automatic conversion though, unless the real goal is getting the document into reader form for wider distribution, or for self-publishing.
Anything that is in the ISO 32000-1 standard for PDFs has been essentially open sourced (Adobe has granted a public patent license for free use). Of course, there are other features that extend PDFs from there that are not open. Most of what can be found in ISO 32000-1 has been incorporated, to some extent, into open source PDF editors. One suggestion is to make sure you're using the most up to date versions of the editing tools available (so not necessarily repo versions).
Working from the other end, I use Aldiko on Android. Imports/converts technical (reference manuals) pdfs fine. They are page-format just like the pdf rather than a truly flowable epub, but work just fine for me when read on a tablet in portrait.
Might be an option.
Kindlegen produce the same results as calibre. If a pdf has tables or a complex layout it is not formatted as the original pdf and the table of contents i.e chapters are replace by dots, cryptic symbols and incomplete sentences. See below
I tried this experiment to see if kindle can open native pdf files and it did. I drop the original unmodified pdf file into the kindle directory and then launch the kindle app. The only drawback is I have to zoom in to read it. But hey, it works!
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.