LinuxQuestions.org
04-13-2010, 06:05 AM   #1
dili
LQ Newbie
 
Registered: Apr 2010
Posts: 16

Rep: Reputation: 0
Problems with PDF files

Hi,
I have been using Linux for less than six months, and I have run into a problem with PDF files. I want to join pages from different PDF files into a single PDF file. The software I have found does this by page number, but I need to select pages by the keywords that appear on them. For example, there are three PDF files:

india.pdf

contents:languages
.........
........
places
......
......
achievements
......
......


china.pdf

contents:languages
.........
........
places
......
......
achievements
......
......

america.pdf

contents:languages
.........
........
places
......
......
achievements
......
......

Now I need to create a PDF file, language.pdf, that combines the "languages" topic from the three files america.pdf, india.pdf, and china.pdf.
How can I do it?
Is there any open source software that does this? (If I could get the source code of that software, it would be very helpful to me.)

Last edited by dili; 04-13-2010 at 06:34 AM.
 
04-13-2010, 11:20 AM   #2
amani
Senior Member
 
Registered: Jul 2006
Location: Kolkata, India
Distribution: Debian 64-bit GNU/Linux, Kubuntu64, Fedora QA, Slackware,
Posts: 2,766

Rep: Reputation: Disabled
You should write a shell script for that.

You can use pdftotext, grep, and pdftk, for example, or pdfedit.
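A minimal sketch of the pdftotext + grep + pdftk idea, written in Python rather than shell so that the page bookkeeping is easier to follow. It assumes pdftotext and pdftk are installed, that pdftotext ends each page with a form feed character (its default behaviour), and that the keyword appears as text on every page you want. The function names (split_pages, pages_with_keyword, pdftk_command, build_topic_pdf) are made up for this example.

```python
import re
import string
import subprocess


def split_pages(text):
    """Split pdftotext output into pages (pdftotext ends each page with a form feed)."""
    pages = text.split("\f")
    if pages and not pages[-1].strip():
        pages.pop()  # drop the empty piece left after the final form feed
    return pages


def pages_with_keyword(pages, keyword):
    """Return the 1-based numbers of the pages whose text contains the keyword."""
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    return [number for number, text in enumerate(pages, start=1)
            if pattern.search(text)]


def pdftk_command(selections, output_path):
    """Build a pdftk 'cat' command from (pdf_path, page_numbers) pairs."""
    used = [(path, pages) for path, pages in selections if pages]
    if len(used) > len(string.ascii_uppercase):
        raise ValueError("this sketch supports at most 26 input files")
    handles = string.ascii_uppercase[:len(used)]
    command = ["pdftk"]
    # Give each input file a one-letter handle: A=india.pdf B=china.pdf ...
    command += [f"{handle}={path}" for handle, (path, _) in zip(handles, used)]
    command.append("cat")
    # Refer to pages as handle + page number: A1 A3 B2 ...
    for handle, (_, pages) in zip(handles, used):
        command += [f"{handle}{number}" for number in pages]
    command += ["output", output_path]
    return command


def build_topic_pdf(pdf_paths, keyword, output_path):
    """Collect every page mentioning the keyword into one output PDF."""
    selections = []
    for path in pdf_paths:
        result = subprocess.run(["pdftotext", path, "-"], check=True,
                                capture_output=True, text=True)
        pages = split_pages(result.stdout)
        selections.append((path, pages_with_keyword(pages, keyword)))
    if not any(pages for _, pages in selections):
        raise ValueError(f"no page mentions {keyword!r}")
    subprocess.run(pdftk_command(selections, output_path), check=True)
```

Calling build_topic_pdf(["india.pdf", "china.pdf", "america.pdf"], "languages", "language.pdf") would then write the matching pages to language.pdf. Note the limits: a topic that runs over several pages is only picked up on the pages where the keyword itself occurs, and scanned PDFs that contain only images produce no text to search.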
 
04-13-2010, 11:23 AM   #3
TB0ne
LQ Guru
 
Registered: Jul 2003
Location: Birmingham, Alabama
Distribution: SuSE, RedHat, Slack,CentOS
Posts: 17,948

Rep: Reputation: 3693
Quote:
Originally Posted by dili
Hi,
I have been using Linux for less than six months, and I have run into a problem with PDF files. I want to join pages from different PDF files into a single PDF file. The software I have found does this by page number, but I need to select pages by the keywords that appear on them. For example, there are three PDF files:

Now I need to create a PDF file, language.pdf, that combines the "languages" topic from the three files america.pdf, india.pdf, and china.pdf.
How can I do it?
Is there any open source software that does this? (If I could get the source code of that software, it would be very helpful to me.)

What version/distro of Linux are you using? Which "softwares" are you using? Provide details if you want anyone to help. Also, you give only a vague idea of what you want to do, but if it were me, I'd convert the PDFs into a different format, run the collation on that, then convert the result back to PDF.

Also, you can find the source code for pretty much ANY Linux program easily.
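One way the convert-then-collate idea could look, as a rough sketch only. It assumes plain text as the intermediate format (via pdftotext, which must be installed), and that each topic heading sits on a line of its own, which may not match the real files. The function names (pdf_to_text, extract_section, collate) are invented for this example. Converting the collated text back to PDF is left to whatever text-to-PDF tool you prefer, and the original page layout is lost along the way.

```python
import subprocess


def pdf_to_text(pdf_path):
    """Convert a PDF to plain text with pdftotext ('-' sends the text to stdout)."""
    result = subprocess.run(["pdftotext", "-layout", pdf_path, "-"],
                            check=True, capture_output=True, text=True)
    return result.stdout


def extract_section(text, heading, all_headings):
    """Return the lines from `heading` up to the next known heading."""
    wanted = heading.strip().lower()
    known = {name.strip().lower() for name in all_headings}
    section = []
    inside = False
    for line in text.splitlines():
        label = line.strip().lower()
        if label == wanted:
            inside = True       # the wanted topic starts here
        elif label in known:
            inside = False      # a different topic starts, stop copying
        if inside:
            section.append(line)
    return "\n".join(section)


def collate(texts_by_name, heading, all_headings):
    """Join the same topic from several documents, labelled by source name."""
    parts = []
    for name, text in texts_by_name.items():
        body = extract_section(text, heading, all_headings)
        if body:
            parts.append(f"== {name} ==\n{body}")
    return "\n\n".join(parts)
```

For the three example files, you would call pdf_to_text on each one, pass the results to collate with the heading "languages" and the heading list ["languages", "places", "achievements"], and feed the combined text to a text-to-PDF converter.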
 
04-13-2010, 08:45 PM   #4
John VV
LQ Muse
 
Registered: Aug 2005
Location: A2 area Mi.
Posts: 16,825

Rep: Reputation: 2408
What TB0ne said.
But this can be done in OpenOffice.

Depending on WHAT the PDFs are, though, I might use GIMP.

If they are, say, scanned docs, and the PDFs are just "photos" of said docs, then GIMP will work.
 
  


All times are GMT -5. The time now is 02:45 PM.
