LinuxQuestions.org
Old 11-30-2007, 07:15 AM   #1
frenchn00b
Senior Member
 
Registered: Jun 2007
Location: E.U., Mountains :-)
Distribution: Debian, Etch, the greatest
Posts: 2,546

Rep: Reputation: 51
How to get all the PDF files of a website recursively?


How can I recursively download all the PDF files from a website?

Quote:
Yes, I have, but wget does not get all the PDF files:

wget -r -l15 -A.pdf http://www.rasmusen.org/

And when I run:
wget -r -nd --convert-links -l15 -A.pdf http://www.rasmusen.org/
it does not convert a single link.

Last edited by frenchn00b; 11-30-2007 at 08:57 AM.
 
Old 11-30-2007, 07:18 AM   #2
pixellany
LQ Veteran
 
Registered: Nov 2005
Location: Annapolis, MD
Distribution: Arch/XFCE
Posts: 17,802

Rep: Reputation: 728
Do you mean the HTML files? (E.g., my site has no PDF files.)

Have you looked at wget?
 
Old 11-30-2007, 08:56 AM   #3
frenchn00b
Senior Member
 
Registered: Jun 2007
Location: E.U., Mountains :-)
Distribution: Debian, Etch, the greatest
Posts: 2,546

Original Poster
Rep: Reputation: 51
Yes, I have, but wget does not get all the PDF files:

wget -r -l15 -A.pdf http://www.rasmusen.org/

And when I run:
wget -r -nd --convert-links -l15 -A.pdf http://www.rasmusen.org/
it does not convert a single link.
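
A possible explanation for the --convert-links behaviour, offered as a sketch rather than a diagnosis of this particular site: with -A.pdf, wget still downloads HTML pages to find links but deletes them afterwards because they do not match the accept list, so no HTML files remain for --convert-links to rewrite. PDFs can also be missed when robots.txt disallows crawling or when they are hosted on another domain. The helper name fetch_pdfs and the DRY_RUN variable below are made up for illustration; the wget options themselves are standard.

```shell
#!/bin/sh
# fetch_pdfs: hypothetical wrapper around wget for PDF-only recursive downloads.
# Set DRY_RUN=1 to print the command instead of running it.
fetch_pdfs() {
    url=$1
    # -r -l 15      : recurse up to 15 levels deep
    # -nd           : save all files into the current directory
    # -A pdf        : keep only files whose names end in "pdf"
    # -e robots=off : ignore robots.txt (assumption: it may be blocking the crawl)
    ${DRY_RUN:+echo} wget -r -l 15 -nd -A pdf -e robots=off "$url"
}
```

Call it as fetch_pdfs http://www.rasmusen.org/ to try it. If the PDFs turn out to live on other hosts, adding -H (span hosts) together with -D and a list of allowed domains may help.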
 
Old 11-30-2007, 09:55 AM   #4
brianmcgee
Member
 
Registered: Jun 2007
Location: Munich, Germany
Distribution: RHEL, CentOS, Fedora, SLES (...)
Posts: 399

Rep: Reputation: 38
Have a look at httrack [1]. It is a website copier, and you can specify which files to include or exclude.

[1] http://www.httrack.com/
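
As a rough illustration of the include/exclude idea, here is a minimal sketch that assumes httrack's command-line filter syntax, where a pattern prefixed with + is accepted and one prefixed with - is rejected. The wrapper name mirror_pdfs, the default output directory, and the DRY_RUN variable are hypothetical and only there for illustration.

```shell
#!/bin/sh
# mirror_pdfs: hypothetical wrapper around httrack.
# Set DRY_RUN=1 to print the command instead of running it.
mirror_pdfs() {
    url=$1
    outdir=${2:-./mirror}   # assumed default output directory
    # -O sets the output path; the "+*.pdf" filter accepts any link ending in .pdf
    ${DRY_RUN:+echo} httrack "$url" -O "$outdir" "+*.pdf"
}
```

Call it as mirror_pdfs http://www.rasmusen.org/ ./rasmusen to try it. Note that httrack will normally keep the HTML pages it crawls as well, so the PDFs may need to be picked out of the mirror afterwards.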
 
  

