LinuxQuestions.org
Visit Jeremy's Blog.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - General
User Name
Password
Linux - General This Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.

Notices


Reply
  Search this Thread
Old 11-15-2002, 07:59 PM   #1
PinkJin
LQ Newbie
 
Registered: Nov 2002
Location: Manchester, UK
Distribution: Red Hat, Mandrake
Posts: 5

Rep: Reputation: 0
wget 1.8.2


I've been using wget -m to mirror some websites and version 1.8.1 worked exactly as I wanted. However I've now upgraded my Red Hat to 8.0 & Mandrake to 9.0, so I now have wget 1.8.2. Now my wget scripts are spanning external hosts, which didn't happen before. The documentation says that 1.8.2 is just a bug fix with no change for users and also, the man page says that hosts shoulg not be spanned by default. I don't have a .wgetrc file and I can't find anything suspicious in /etc/wgetrc. Any ideas?
 
Old 11-16-2002, 02:43 PM   #2
RijilV
Member
 
Registered: Sep 2002
Location: somewhere
Distribution: gentoo
Posts: 123

Rep: Reputation: 15
hrm, odd, a quick fix would be to use the -L option.... but -m should be working...

for whats it worth, I just used wget 1.8.2 to mirror a website with plenty of external links, and it worked fine....perhaps the sites you are trying to mirror are doing something that is tricking wget? could you give me an example of a site you tried to mirror with the -m option that grabbed content from elsewhere?

I did blackops.rubi-con.org just fine...
 
Old 11-17-2002, 04:14 AM   #3
PinkJin
LQ Newbie
 
Registered: Nov 2002
Location: Manchester, UK
Distribution: Red Hat, Mandrake
Posts: 5

Original Poster
Rep: Reputation: 0
Thanks for the reply.

I tried your rubi-con site which worked fine. One of the sites I use is www.manchester.com, which still doesn't. I just used 'wget -m www.manchester.com -o log.txt&' (I usually also include -w, but I wanted a quick result). After about 8 hours, it was still running and I had lots of directories representing external sites; they all contained just a few files.
 
Old 11-17-2002, 05:50 AM   #4
RijilV
Member
 
Registered: Sep 2002
Location: somewhere
Distribution: gentoo
Posts: 123

Rep: Reputation: 15
I going to guess then its something about the web-code on that manchester website, I took a look at it and it seemed pretty goofy, with tons of javascript in there. I bet wget is just getting confused regarding some of the external links.

unless that is, wget was caching that site correctly before, and now it isn't. in that case, I would probably submit a bug to the makers of wget
 
Old 11-24-2002, 04:20 PM   #5
PinkJin
LQ Newbie
 
Registered: Nov 2002
Location: Manchester, UK
Distribution: Red Hat, Mandrake
Posts: 5

Original Poster
Rep: Reputation: 0
I tried using -L option, but no change. Also I tried reinstalling 1.8.1 from the Mandrake 8.2 distro; that didn't make any difference either.

I've been looking on the wget list at sunsite.dk and there seem to be other people reporting similar things. It doesn't look like a problem with the wget binaries; maybe something changed elsewhere?

GTS
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
can't wget!! aru_04 Linux - General 5 08-13-2005 05:07 AM
wget noir911 Linux - Newbie 8 07-30-2005 08:57 AM
wget Harp00 Linux - Newbie 4 11-15-2004 07:27 PM
wget toastermaker Linux - Software 4 11-13-2004 10:59 AM
wget filex Linux - Security 4 09-08-2004 08:02 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - General

All times are GMT -5. The time now is 07:39 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration