GeneralThis forum is for non-technical general discussion which can include both Linux and non-Linux topics. Have fun!
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
I have been going through the website stats for my site using Analog and Webalizer in Cpanel and found a very interesting thing. I also have some questions regarding this. I would be glad if anybody could answer this.
The MSN bot has been sucking up a lot of bandwidth on my site. For example, on a single visit I found that it had used approximately about 10 MB (that is 10000+ Kilobytes) of the bandwidth. Although this may not sound like much, I'm not sure how much total MB of bandwidth the MSN bots use up in my site per month because it seems to be a frequent visitor on my site. Do you think this is normal?
Is it OK to completely ban this MSN bot from my site? The problem is that, it appears to be indexing my site for search at MSN, but so far my site doesn't even register on an MSN search page. Even when I type the full URL the MSN search doesn't find my site.
Has anybody else owning a website had this problem of MSN bots using up a lot of bandwidth? My host provides Apache 1.x/Linux webserver.
How do I submit my web page to be indexed in MSN Search?
MSNBot is not contributing directly to MSN Search at this time. Please visit the MSN Search submit a site page.
and here
Quote:
Why is MSNBot trying to access a robots.txt file that is not on my server?
The robots.txt file is used by webmasters to prevent web crawlers from downloading some or all of the information on their websites. For information on how to create a robots.txt file, see The Robot Exclusion Standard. If you want to prevent the "File not found" error messages from appearing in your server log, create an empty file named robots.txt.
also here
Quote:
How do I prevent MSNBot from crawling some or all of my website?
The robots.txt file is used to prevent web crawlers from accessing a web site. The format of the robots.txt file is specified in The Robot Exclusion Standard. MSNBot analyzes all instances where the User-Agent is specified as either "msnbot" or "*". Based on this, MSNBot crawls only the web pages that allow it to do so.
But in typical Microsoft fashion, they want us to "submit" our site manually for it to be included in their search, a euphemism for "advertise". But I'm going to ban that robot from my site anyway. They use up my paid-for bandwidth in this manner and my site does not even get listed in their search engine
The MSN bot was in the top 5 visitors list in my site stats in terms of visits and also in terms of KB. I can live without MSN search, but I will not pay for its visits to my site with my bandwidth which I want to conserve for the genuine visitors.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.