LinuxQuestions.org
Register a domain and help support LQ
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Newbie
User Name
Password
Linux - Newbie This Linux forum is for members that are new to Linux.
Just starting out and have a question? If it is not in the man pages or the how-to's this is the place!

Notices


Reply
  Search this Thread
Old 07-06-2011, 10:47 AM   #1
Jennifer Corpus
LQ Newbie
 
Registered: Jul 2011
Posts: 3

Rep: Reputation: Disabled
Unhappy Wget command


Hi
What is the Wget command to perform the following

download only html from the url and save it in a directory

other file extentions like.doc,.xls etc should be excluded automatically
 
Old 07-06-2011, 10:52 AM   #2
szboardstretcher
Senior Member
 
Registered: Aug 2006
Location: Detroit, MI
Distribution: GNU/Linux systemd
Posts: 3,788
Blog Entries: 1

Rep: Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340
What command are you trying now to get all of the site? If you provide that, I could modify it to help you out.
 
Old 07-06-2011, 10:57 AM   #3
Jennifer Corpus
LQ Newbie
 
Registered: Jul 2011
Posts: 3

Original Poster
Rep: Reputation: Disabled
wget -r -a.html http://linuxreviews.org/ -o e:\websites\linux

basically im trying to perform a website copier like htttrak using wget. But im not sure about the command formatt

Last edited by Jennifer Corpus; 07-06-2011 at 10:58 AM.
 
Old 07-06-2011, 11:02 AM   #4
szboardstretcher
Senior Member
 
Registered: Aug 2006
Location: Detroit, MI
Distribution: GNU/Linux systemd
Posts: 3,788
Blog Entries: 1

Rep: Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340Reputation: 1340
Quote:
Originally Posted by Jennifer Corpus View Post
wget -r -a.html http://linuxreviews.org/ -o e:\websites\linux

basically im trying to perform a website copier like htttrak using wget. But im not sure about the command formatt
You have asked for two different things. The first thing you asked for was to be able to download ONLY the html files from a site, here is how you do that.

Code:
wget --recursive --accept=html http://linuxreviews.org
if you want to 100% mirror a website, as you requested in your second post, just use the -m or --mirror parameter. (look at the manual for wget for details)

Code:
wget --mirror http://linuxreviews.org

Last edited by szboardstretcher; 07-06-2011 at 11:05 AM.
 
Old 07-06-2011, 11:07 AM   #5
Jennifer Corpus
LQ Newbie
 
Registered: Jul 2011
Posts: 3

Original Poster
Rep: Reputation: Disabled
Thank you so much will try that
 
  


Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
regarding wget command shanthini Linux - General 6 02-21-2010 10:54 PM
Wget command help divyahm Linux - Software 12 09-04-2007 02:25 PM
wget command help munkeevegetable Linux - Newbie 8 11-03-2004 07:37 PM
wget command juanb Linux - General 5 12-11-2003 04:23 PM
wget command glock19 Linux - General 0 11-29-2001 03:20 PM


All times are GMT -5. The time now is 10:37 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Facebook: linuxquestions Google+: linuxquestions
Open Source Consulting | Domain Registration