Message boards : Web interfaces : robots.txt / snapshot robot
Message board moderation
Author | Message |
---|---|
Send message Joined: 27 Jun 06 Posts: 305 |
The snapshot robot from snaps.com does not obey robots.txt Project admins who want to protect their pages from beeing accessed through this snapshot stuff can try this in .htaccess : deny from 38.98.19 It worked on my domain although I'm not sure wether the excluded IP range is sufficient. p.s.: I have to correct what I wrote ... show_user.php is just missing in robots.txt so it's not the fault of snaps.com - sorry for that |
Send message Joined: 19 Jan 07 Posts: 1179 |
The snapshot robot from snaps.com does not obey robots.txt That domain name doesn't exist. |
Send message Joined: 27 Jun 06 Posts: 305 |
http://www.snap.com Agent identifier (HTTP_USER_AGENT) are - "Snapbot/1.0 (Snap Shots, +http://www.snap.com)" - "Mozilla/5.0 (SnapPreviewBot) Gecko/20061206 Firefox/1.5.0.9" But it was my fault, the bot does obey robots.txt - I assumed that the BOINC sample robots.txt contained all database driven pages but it does not. |
Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.