[UPHPU] Default robots.txt file
Velda Christensen
velda at novapages.com
Tue Aug 7 17:11:27 MDT 2007
Jon Jensen wrote:
> It's a way for web sites to tell bots/spiders/crawlers how to behave
> (e.g. what they can and cannot view), though it is of course entirely
> up to the bot to comply.
>
> http://www.robotstxt.org/wc/norobots.html
We find that there are a ton of bots that do the exact opposite of what
you tell them to. I'd use .htaccess instead: deny all traffic to
directories and files that no one needs to reach over HTTP, password
protect admin / private directories, and deny bots outright in
directories you simply don't want indexed.
Robots.txt files are great, though, for telling Google which pictures it
can and can't use :-)
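
For anyone curious how a well-behaved crawler reads those rules, here is a
rough sketch using Python's standard urllib.robotparser module. The
robots.txt content and the paths in it are made up for illustration;
Googlebot-Image is the user-agent token Google's image crawler goes by.
This only shows what a compliant bot would decide, and does nothing to
stop the ones that ignore the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the directory names are illustrative only.
ROBOTS_TXT = """\
User-agent: Googlebot-Image
Disallow: /images/private/

User-agent: *
Disallow: /admin/
"""


def build_parser(text):
    """Load robots.txt rules from a string instead of fetching a URL."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rp = build_parser(ROBOTS_TXT)

# The image crawler is kept out of the private pictures only.
print(rp.can_fetch("Googlebot-Image", "/images/private/photo.jpg"))  # False
print(rp.can_fetch("Googlebot-Image", "/images/public/photo.jpg"))   # True

# Any other bot falls through to the "*" entry.
print(rp.can_fetch("SomeOtherBot", "/admin/index.php"))              # False
```

Whether a given bot actually runs a check like this is up to the bot, which
is why the .htaccess rules are still worth having.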
-V