LinuxQuestions.org
Old 07-07-2010, 04:45 AM   #1
shongale
LQ Newbie
 
Registered: Jul 2010
Location: Eugene, Oregon USA
Distribution: CentOS 5.4
Posts: 5

Rep: Reputation: 0
Stop viewing of robots.txt in browser


I need to stop robots.txt from being viewable on my website. The contents of the file are displayed in my browser when I open this URL:
http://www.inwoon.net/robots.txt
Please help me stop this, as it displays all the directories I don't want visitors to go to. I greatly appreciate the help, as I am right at the level of Linux knowledge to be dangerous: capable, but still dangerous.
Thanks much,
 
Old 07-07-2010, 05:03 AM   #2
irmin
Member
 
Registered: Jan 2010
Location: the universe
Distribution: Slackware (modified), Slackware64 (modified), openSuSE (modified)
Posts: 342

Rep: Reputation: 62
Welcome to LQ,

robots.txt lists directories or files that should not be indexed by web crawlers. However, the rules are not mandatory for those programs; crawlers follow them voluntarily. Consequently, robots.txt must be available to the public to give the crawlers a chance to read your rules. Furthermore, robots.txt rules never apply to web browsers.

Thus, if you do not want to show this file, make it unreadable for all clients or for certain user agents. If the directories contain secret content, why are they accessible from the internet? Perhaps you should require users to authenticate before accessing these directories.
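
For the authentication suggestion, here is a minimal sketch, assuming the Apache 2.2 httpd that ships with CentOS 5.4 and its standard basic-auth modules. The directory path, realm name, and password file location are placeholders I made up for illustration; substitute your own.

```apache
# Hypothetical directory: replace with the real path of a directory
# that should only be reachable after logging in.
<Directory "/var/www/html/private">
    # Ask the browser to show a username/password dialog.
    AuthType Basic
    # Realm text shown in the dialog (placeholder).
    AuthName "Restricted area"
    # Password file kept outside the document root (placeholder path).
    AuthUserFile /etc/httpd/conf/.htpasswd
    # Any user listed in the password file may enter.
    Require valid-user
</Directory>
```

The password file can be created with the htpasswd tool that comes with Apache, for example `htpasswd -c /etc/httpd/conf/.htpasswd someuser` (the user name is also a placeholder), and httpd has to be reloaded afterwards. Note that Basic authentication sends the password essentially in the clear unless the site is served over HTTPS.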
 
Old 07-07-2010, 09:29 AM   #3
shongale
LQ Newbie
 
Registered: Jul 2010
Location: Eugene, Oregon USA
Distribution: CentOS 5.4
Posts: 5

Original Poster
Rep: Reputation: 0
Thank you for the response. Are you saying I should pop up an authentication dialog before letting anyone view the contents of a directory?
Is this something I have to do in Apache? I am hosting my own web server in house with my own static IP, and I have total access to the configuration. If you could direct me to relevant documentation on how to set this up, I would be forever in your debt.
Once again, thank you for the answer!!
 
Old 07-07-2010, 11:33 AM   #4
repo
LQ 5k Club
 
Registered: May 2001
Location: Belgium
Distribution: Arch
Posts: 8,529

Rep: Reputation: 899
Take a look at
http://forums.ukwebmasterworld.com/p...ed-robots.html
 
Old 07-07-2010, 12:09 PM   #5
shongale
LQ Newbie
 
Registered: Jul 2010
Location: Eugene, Oregon USA
Distribution: CentOS 5.4
Posts: 5

Original Poster
Rep: Reputation: 0
@repo: Thanks, that's an awesome lead. I will implement it and report back.
That still leaves the problem of the directories being listable in a browser,
e.g. http://www.inwoon.net/EnergyLibrary/
I want this one to be displayed, but I don't want visitors to be able to look at any of the others.
Any suggestions will be greatly appreciated.
 
Old 07-07-2010, 12:24 PM   #6
repo
LQ 5k Club
 
Registered: May 2001
Location: Belgium
Distribution: Arch
Posts: 8,529

Rep: Reputation: 899
To disable directory listing in Apache:
http://www.ducea.com/2006/06/26/apac...ctory-indexes/
http://httpd.apache.org/docs/1.3/misc/FAQ.html
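
As a rough illustration of what those links describe, the sketch below turns automatic directory listings off for the whole site and back on for the one directory that should stay browsable. It assumes Apache 2.2 on CentOS 5.4 with the default document root /var/www/html; if your DocumentRoot differs, the paths are only placeholders.

```apache
# Assumed document root (the CentOS default); adjust if yours differs.
<Directory "/var/www/html">
    # Remove Indexes from the inherited options: no auto-generated listings.
    Options -Indexes
</Directory>

# Re-enable listings only for the directory meant to be browsable.
<Directory "/var/www/html/EnergyLibrary">
    Options +Indexes
</Directory>
```

With Indexes off, a request for a directory that has no index file should return 403 Forbidden instead of a file list. If httpd.conf already contains a Directory block for the document root with an Options line, it is probably cleaner to edit that line than to add a second block. Keep in mind this only hides the listings; files inside those directories can still be fetched by anyone who knows their URL.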
 
  

