Linux - GeneralThis Linux forum is for general Linux questions and discussion.
If it is Linux Related and doesn't seem to fit in any other forum then this is the place.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
Sorry for the cryptic title but I am not sure exactly what is going on here, as it may be apache or php or....
I was looking at my apache access logs and noticed entries such as:
/sitemap.bot/writing/music/unix/code/photos/july_august_2005/
As far as I can tell all these were created by Yahoo! Slurp indexing my site.
Now: /sitemap.bot is a static page that has a link to every page on my site.
"writing", "music", "unix" and all other elements of this 'path' are top level directories of my webroot. So I figure Slurp is a little confused, but no big deal, right? But then I notice, these requests are returning code 200 - ok
Whaaa?
This got me curious so I tried this url:
http://badcomputer.org/index.bot/foo/bar/baz/this/doesn't/exist/
Sure enough, it returns 200 and shows me my homepage (badcomputer.org/index.bot) but without a stylesheet images etc (presumably because the path is wrong).
So my question is, what is causing this, and how to I get requests such as these to return 404 as they should?
My apache is 2.0.58
My PHP is 5.1.4
If I have forgotten something you need to diagnose this please ask.
Thanks for consideration.
this is probably a security patch. Cause its good that any page you open returns a 200 request, since automatic scanners wont work against your domain to find any vulnerabilities. Suppose the scanner is searching for some php flaws like remote inclusion it will get all false positives. So the hacker will have a hard time figuring out which are the true vulns on your site.
Anyway, search for something you installed to secure apache. That should be the cause to this.
Cause its good that any page you open returns a 200 request, since automatic scanners wont work against your domain to find any vulnerabilities.
Not sure that any good would outweigh the fact that Yahoo! is indexing pages on my site that don't exist. I don't think that is good at all. Besides, it seems to fly in the face of the entire HTTP protocol.
Quote:
Anyway, search for something you installed to secure apache.
I've done nothing of the sort, in fact, I do not even have mod_security installed. I do however have hardened php. Could this be it then? I still want it disabled.
Actually, after a test I don't think you are right at all...as when I try: http://badcomputer.org/somepagethatdoesntexist.html
I get a 404 as I should. It just seems to do this when a directory path is grafted onto the en of the URL...
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.