LinuxQuestions.org
Review your favorite Linux distribution.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 02-07-2019, 07:24 AM   #1
newbiesforever
Senior Member
 
Registered: Apr 2006
Location: Iowa
Distribution: Debian distro family
Posts: 2,377

Rep: Reputation: Disabled
Abiword converts PDF to Word easily if imperfectly; surprised Libreoffice won't


Of course I know Libreoffice Writer can convert its documents to PDF; you simply select Export to PDF. But I was hoping to do the opposite: I downloaded a PDF of a doctor's new-patient form before my appointment, and wanted to convert it to a Word document and edit it in Libreoffice. I researched this and found out that Abiword does the conversion easily. It's not perfect--the fonts and other formatting generally aren't there--but I can use it, and the provider and staff can read it and enter it into the computer. I'll settle for it because I don't like writing detailed answers on pre-made forms with a limited amount of space, such as I often face on medical history.

Great, I solved my issue; but could Libreoffice do it? If Abiword can, I guessed the superior Libreoffice can probably do it too. To my surprise, I found a seemingly "official" statement that no, it can't: https://ask.libreoffice.org/en/quest...o-a-word-file/ . Although that post is going on three years old. I imagine the Libreoffice designers simply don't want to incorporate whatever Abiword did, because the conversion doesn't meet their high standards: they would want their conversion to look exactly like the PDF, and Abiword's conversion is crude.

Last edited by newbiesforever; 02-07-2019 at 07:50 AM.
 
Old 02-07-2019, 07:41 AM   #2
wpeckham
LQ Guru
 
Registered: Apr 2010
Location: Continental USA
Distribution: Debian, Ubuntu, RedHat, DSL, Puppy, CentOS, Knoppix, Mint-DE, Sparky, VSIDO, tinycore, Q4OS,Manjaro
Posts: 5,640

Rep: Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697
LO can convert a document CREATED in LO between document format and pdf. I have no problem converting a PDF using ABIWORD, then always handling it using LO forever after.
 
Old 02-07-2019, 07:45 AM   #3
sevendogsbsd
Senior Member
 
Registered: Sep 2017
Distribution: FreeBSD
Posts: 2,252

Rep: Reputation: 1011Reputation: 1011Reputation: 1011Reputation: 1011Reputation: 1011Reputation: 1011Reputation: 1011Reputation: 1011
Interesting - I have attempted to replace Libreoffice with Abiword and Gnumeric for a couple of years but every time I try Abiword, it is horrible: the UI is black and flickers and is unusable. This is on both Linux and FreeBSD.

Good to know it works for someone!
 
Old 02-07-2019, 07:45 AM   #4
Turbocapitalist
LQ Guru
 
Registered: Apr 2005
Distribution: Linux Mint, Devuan, OpenBSD
Posts: 7,313
Blog Entries: 3

Rep: Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723Reputation: 3723
It also depends on what is in the PDF. The format PDF is a terminal stage format. Your document goes there while waiting either to go to the printer or the bit bucket. Trying to recover data from a PDF is a fool's errand.

tldr; Go get the original which was used to create the PDF and work with that.
 
Old 02-07-2019, 07:52 AM   #5
newbiesforever
Senior Member
 
Registered: Apr 2006
Location: Iowa
Distribution: Debian distro family
Posts: 2,377

Original Poster
Rep: Reputation: Disabled
Quote:
Originally Posted by wpeckham View Post
LO can convert a document CREATED in LO between document format and pdf. I have no problem converting a PDF using ABIWORD, then always handling it using LO forever after.
I don't particularly like ABiword either, and this is the first useful purpose I've had for it.
 
Old 02-07-2019, 08:52 AM   #6
TB0ne
LQ Guru
 
Registered: Jul 2003
Location: Birmingham, Alabama
Distribution: SuSE, RedHat, Slack,CentOS
Posts: 26,637

Rep: Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965
Quote:
Originally Posted by newbiesforever View Post
I don't particularly like ABiword either, and this is the first useful purpose I've had for it.
Converting a PDF back into 'text' is *NEVER* going to work 100%, unless you just have a basic text-document, single column. Any formatting (dual columns, etc.), is going to throw off whatever you convert.

Personally, if you can't get a hold of the source that the PDF I'd use the pdftotext utility from the command line, and make peace with the fact you're not going to get good results. When I've had to do such things and the PDF's contained images, I'd extract the images from the PDF's first, and then get the text. Copy/paste the text into LibreOffice Write, shove in the images, and go from there. There just isn't a good way to do this with PDF's.
 
2 members found this post helpful.
Old 02-07-2019, 07:22 PM   #7
wpeckham
LQ Guru
 
Registered: Apr 2010
Location: Continental USA
Distribution: Debian, Ubuntu, RedHat, DSL, Puppy, CentOS, Knoppix, Mint-DE, Sparky, VSIDO, tinycore, Q4OS,Manjaro
Posts: 5,640

Rep: Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697Reputation: 2697
Quote:
Originally Posted by TB0ne View Post
Converting a PDF back into 'text' is *NEVER* going to work 100%, unless you just have a basic text-document, single column. Any formatting (dual columns, etc.), is going to throw off whatever you convert.

Personally, if you can't get a hold of the source that the PDF I'd use the pdftotext utility from the command line, and make peace with the fact you're not going to get good results. When I've had to do such things and the PDF's contained images, I'd extract the images from the PDF's first, and then get the text. Copy/paste the text into LibreOffice Write, shove in the images, and go from there. There just isn't a good way to do this with PDF's.
Good advice, but I take one exception: if you are talking about a LO PDF file, LO leaves adequate clues in the metadata to do a (Near)prefect conversion back to LO Writer. If the PDF was created by anything else, it will lack that kind of metadata. Somethign may be able to read and convert it, but it may not look as you think it should. Always best to have the source.
 
1 members found this post helpful.
Old 02-08-2019, 07:10 AM   #8
TB0ne
LQ Guru
 
Registered: Jul 2003
Location: Birmingham, Alabama
Distribution: SuSE, RedHat, Slack,CentOS
Posts: 26,637

Rep: Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965Reputation: 7965
Quote:
Originally Posted by wpeckham View Post
Good advice, but I take one exception: if you are talking about a LO PDF file, LO leaves adequate clues in the metadata to do a (Near)prefect conversion back to LO Writer. If the PDF was created by anything else, it will lack that kind of metadata. Somethign may be able to read and convert it, but it may not look as you think it should. Always best to have the source.
Quite correct, and great observation. The PDF's I had to work did NOT have that metadata, so I had to improvise.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] Libreoffice - how can I convert many odt files to doc or pdf automatically? charlemagne-is-my-son Linux - Software 3 12-10-2012 11:32 AM
problem saving doc file as pdf in abiword ahurd Linux - Newbie 0 03-16-2009 03:24 PM
LXer: OggConvert makes Ogg converts (and converts to Oggs) LXer Syndicated Linux News 0 12-22-2007 06:00 AM
How to convert a PDF to DOC (word compatible) ? Xeratul Linux - General 9 02-06-2007 07:29 PM
trouble finding policy doc as pdf in doc mirrors stardotstar Debian 2 05-12-2005 10:56 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 02:59 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration