The Robots Exclusion Protocol
The Robots Exculsion Protocol is a mechanism for telling robots and crawlers which files and directories they can access as they flow through your website.
When a well behaved robot starts crawling a website it checks for the robots.txt file, analyses its contents and cralws only through the directories and files which are allowed by the contents of the robots.txt file.
The syntax of the entries in the robots.txt file is of the type:
User-agent: <Name of user agent>
Dissallow: <Name of directory>
Wild cards are not allowed and each of the directories or files that you need to exclude for a specific robot should be denoted explicitly.
Advanced Robots.txt Generator allows you to select robots and directories in a WYSIWYG manner and has the most updated list of robots available. It will take seconds to create and upload your robots.txt file and maintaining your robots.txt files can be done at a glance.
Check the full features and download Advanced Robots.txt Generator!
|