02-13-2009, 06:57 PM
|
#1
|
Senior Member
Registered: Feb 2002
Location: harvard, il
Distribution: Ubuntu 11.4,DD-WRT micro plus ssh,lfs-6.6,Fedora 15,Fedora 16
Posts: 3,233
|
pdf table to html table?
The problem is simple: I have a PDF document that is essentially a table of data, and I need it parsed into an HTML table. I know there is pdftohtml, but that converts using <div> tags and I need a real table (<th>, <td>, etc.). Is there any way of doing that, and if so, how?
|
|
|
02-14-2009, 06:40 PM
|
#2
|
Senior Member
Registered: Jan 2002
Location: germany
Distribution: ubuntu, mint, suse
Posts: 1,591
|
If you are familiar with a scripting language, it is not a problem. In Perl you could use the PDF::API2 module, among others, to parse the PDF and transform it to HTML. In PHP there is fpdf.org.
Why do you need this table-style HTML?
|
|
|
02-14-2009, 06:43 PM
|
#3
|
Senior Member
Registered: Jul 2006
Location: Kolkata, India
Distribution: Debian 64-bit GNU/Linux, Kubuntu64, Fedora QA, Slackware,
Posts: 2,766
|
Another way is pdf --> txt --> (Gnumeric) --> table in various formats: convert the PDF to plain text first, open the text in Gnumeric, and export the table from there.
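
If you want to script the txt --> table step instead of going through Gnumeric, here is a minimal Python sketch. It assumes the text was produced with something like `pdftotext -layout report.pdf report.txt` (the file names are made up), that columns are separated by two or more spaces, and that the first non-blank line is the header row. The function name is just for illustration. Real reports may need extra handling, for example empty cells will shift the columns.

```python
import html
import re


def layout_text_to_html_table(text: str) -> str:
    """Turn column-aligned plain text into an HTML table.

    Assumption: columns are separated by runs of two or more spaces and
    the first non-blank line holds the column headings.
    """
    rows = [
        re.split(r"\s{2,}", line.strip())
        for line in text.splitlines()
        if line.strip()
    ]
    if not rows:
        return "<table></table>"

    header, *body = rows
    out = ["<table>"]
    # Header cells become <th>, data cells become <td>; escape everything.
    out.append(
        "  <tr>" + "".join(f"<th>{html.escape(c)}</th>" for c in header) + "</tr>"
    )
    for row in body:
        out.append(
            "  <tr>" + "".join(f"<td>{html.escape(c)}</td>" for c in row) + "</tr>"
        )
    out.append("</table>")
    return "\n".join(out)


# Made-up sample standing in for the contents of report.txt.
sample = """\
Item        Qty   Price
Widget      10    2.50
Gear box    3     19.99
"""

if __name__ == "__main__":
    print(layout_text_to_html_table(sample))
```

Splitting on two or more spaces keeps a value like "Gear box" in one cell, which splitting on any whitespace would not.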
|
|
|
02-14-2009, 07:04 PM
|
#4
|
Senior Member
Registered: Feb 2002
Location: harvard, il
Distribution: Ubuntu 11.4,DD-WRT micro plus ssh,lfs-6.6,Fedora 15,Fedora 16
Posts: 3,233
Original Poster
|
Quote:
Originally Posted by j-ray
If you are familiar with a scripting language, it is not a problem. In Perl you could use the PDF::API2 module, among others, to parse the PDF and transform it to HTML. In PHP there is fpdf.org.
Why do you need this table-style HTML?
|
I am using a site that keeps track of inventory, and it outputs its reports as PDF. I'm looking for a way to convert the PDF table into a form that can be processed by a PHP script (such as an array).
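
One possible way to get an array-like structure, sketched in Python rather than PHP: read the column-aligned text that a tool such as `pdftotext -layout` writes, and turn each row into a record keyed by the column headings. The function name, the sample data, and the two-or-more-spaces column separator are all assumptions for illustration, not something the inventory site guarantees. The result can be dumped as JSON, which a PHP script could then load with `json_decode`.

```python
import json
import re


def layout_text_to_records(text: str) -> list[dict[str, str]]:
    """Parse column-aligned text into a list of {heading: value} records.

    Assumption: the first non-blank line is the header and columns are
    separated by runs of two or more spaces.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return []

    header = re.split(r"\s{2,}", lines[0])
    records = []
    for ln in lines[1:]:
        cells = re.split(r"\s{2,}", ln)
        # Pad short rows so every record has every column;
        # zip() silently drops any cells beyond the header width.
        cells += [""] * (len(header) - len(cells))
        records.append(dict(zip(header, cells)))
    return records


# Made-up sample standing in for the extracted report text.
sample = """\
Item        Qty   Price
Widget      10    2.50
Gear box    3     19.99
"""

if __name__ == "__main__":
    # JSON output can be read on the PHP side with json_decode($json, true).
    print(json.dumps(layout_text_to_records(sample), indent=2))
```

All values stay strings here; converting quantities or prices to numbers is left to whatever script consumes the data.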
|
|
|
02-14-2009, 07:21 PM
|
#5
|
Senior Member
Registered: Jan 2002
Location: germany
Distribution: ubuntu, mint, suse
Posts: 1,591
|
The fpdf library only supports generating PDFs, not parsing them. Sorry, no idea in PHP (I had thought it could do both). If you can switch to Perl, you will be done soon...
|
|
|
All times are GMT -5. The time now is 11:01 AM.