LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Server
User Name
Password
Linux - Server This forum is for the discussion of Linux Software used in a server related context.

Notices


Reply
  Search this Thread
Old 05-05-2015, 10:50 PM   #1
said76
Member
 
Registered: Aug 2011
Posts: 113

Rep: Reputation: Disabled
Get /robots.txt in my apache log


Hi,

I'm unable to access to my webmail site. Then, to find out why, I went to the apache log file and found this:

66.249.67.252 - - [06/May/2015:10:34:37 +1000] "GET /robots.txt HTTP/1.1" 200 26
66.249.67.240 - - [06/May/2015:11:11:46 +1000] "GET /robots.txt HTTP/1.1" 200 26

I got a feeling this is to do with googlebot. Could anyone share their thoughts on how to fix this.

My system runs on Ubuntu Server 12.04.5 32bit with apache version 2.4.12.

Thank you in advance
 
Old 05-06-2015, 02:06 AM   #2
bathory
LQ Guru
 
Registered: Jun 2004
Location: Piraeus
Distribution: Slackware
Posts: 13,165
Blog Entries: 1

Rep: Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032Reputation: 2032
Quote:
Originally Posted by said76 View Post
Hi,

I'm unable to access to my webmail site. Then, to find out why, I went to the apache log file and found this:

66.249.67.252 - - [06/May/2015:10:34:37 +1000] "GET /robots.txt HTTP/1.1" 200 26
66.249.67.240 - - [06/May/2015:11:11:46 +1000] "GET /robots.txt HTTP/1.1" 200 26

I got a feeling this is to do with googlebot. Could anyone share their thoughts on how to fix this.

My system runs on Ubuntu Server 12.04.5 32bit with apache version 2.4.12.

Thank you in advance
This has nothing to do with your problem. It's legitimate traffic from the googlebot, trying to index your site.
From this
Quote:
A robots.txt file is a file at the root of your site that indicates those parts of your site you don’t want accessed by search engine crawlers. The file uses the Robots Exclusion Standard, which is a protocol with a small set of commands that can be used to indicate access to your site by section and by specific kinds of web crawlers (such as mobile crawlers vs desktop crawlers).
If you can't access your webmail URL, check the apache error_log for errors.

Regards
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Which Unix utilities should be allowed in robots.txt? Soderlund Linux - General 3 01-18-2014 07:04 AM
[SOLVED] robots.txt ignored on vsftpd hemite Linux - Server 3 05-11-2012 07:17 PM
I need to stop the viewing of robots.txt shongale Linux - Server 3 07-07-2010 01:29 PM
robots.txt paleogryph Linux - Software 1 11-11-2005 02:32 PM
configuring robots.txt jc materi Linux - Security 1 04-09-2005 10:37 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Server

All times are GMT -5. The time now is 03:00 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration