The Robots Exclusion Protocol (REP), or robots.txt, is a text file that webmasters create to tell robots (typically search engine robots) how to crawl and index pages on their website.
Robots.txt must be placed in the top-level (root) directory of a web host in order to be effective. Example: http://www.apnaypages.com/robots.txt
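Because the file always sits at the root of the host, its address can be worked out from any page URL on the same site. The following is a minimal sketch using Python's standard library; the function name robots_txt_url and the page path /blog/post.html are assumptions made up for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the robots.txt URL for the host that serves page_url."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host; drop the page path, query and fragment,
    # because robots.txt is looked up at the root of the host.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# The page path below is hypothetical.
print(robots_txt_url("http://www.apnaypages.com/blog/post.html?id=7"))
# prints: http://www.apnaypages.com/robots.txt
```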
Robots.txt is the common name of a text file that is uploaded to a website's root directory. The robots.txt file is used to provide instructions about the website to web robots and spiders. Web authors can use robots.txt to keep cooperating web robots from accessing all or parts of a website that they want to keep private.
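To make this concrete, here is a small sketch of how a cooperating robot might read such a file, using Python's standard urllib.robotparser module. The rules, the /private/ path and the page names are hypothetical and are not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: every robot is asked to stay out of /private/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the parsed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


print(is_allowed("http://www.apnaypages.com/index.html"))          # prints: True
print(is_allowed("http://www.apnaypages.com/private/notes.html"))  # prints: False
```

In real use a robot would download the file from the site first; the rules are inlined here so the example runs without network access.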
Understand the limitations of robots.txt:
Before you build your robots.txt, you should know the risks of this URL blocking method. At times, you might want to consider other mechanisms to make sure your URLs are not findable on the web.
Robots.txt instructions are directives only:
The instructions in robots.txt files cannot enforce crawler behavior on your site; instead, these instructions act as directives to the crawlers accessing your site. While Googlebot and other respectable web crawlers obey the instructions in a robots.txt file, other crawlers might not. Therefore, if you want to keep information secure from web crawlers, it's better to use other blocking methods, such as password-protecting private files on your server.
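The sketch below illustrates why the file is advisory: the check happens inside the client, so it only protects a page when the client chooses to run it. The names polite_fetch and fake_download, the ExampleBot user agent and the /private/ path are all assumptions invented for this example.

```python
from typing import Callable, Optional
from urllib.robotparser import RobotFileParser

# Hypothetical rules for the site being crawled.
parser = RobotFileParser()
parser.parse(["User-agent: *", "Disallow: /private/"])


def fake_download(url: str) -> str:
    """Stand-in for a real HTTP request, so the example needs no network."""
    return f"<contents of {url}>"


def polite_fetch(url: str, download: Callable[[str], str],
                 user_agent: str = "ExampleBot") -> Optional[str]:
    """Download url only if the robots.txt rules allow it, else return None."""
    if not parser.can_fetch(user_agent, url):
        return None
    return download(url)


private_url = "http://www.apnaypages.com/private/notes.html"
print(polite_fetch(private_url, fake_download))  # prints: None
# A client that skips the check is not stopped by robots.txt at all:
print(fake_download(private_url))
```

Nothing on the server side is involved in the refusal, which is why password protection is the safer choice for private files.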
Different crawlers interpret syntax differently:
Although respectable web crawlers follow the directives in a robots.txt file, each crawler might interpret the directives differently. You should know the proper syntax for addressing different web crawlers, as some might not understand certain instructions.
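One known source of disagreement is how a crawler resolves overlapping Allow and Disallow rules: some parsers apply the first rule that matches, while others (Google documents this approach) apply the most specific, longest matching path. The sketch below is a deliberately simplified model of both strategies. It ignores wildcards and user-agent groups, and the rule list and paths are hypothetical.

```python
from typing import List, Tuple

Rule = Tuple[str, str]  # (directive, path prefix)

# Hypothetical rules, in the order they appear in the file.
RULES: List[Rule] = [
    ("disallow", "/private/"),
    ("allow", "/private/press/"),
]


def first_match(rules: List[Rule], path: str) -> bool:
    """Allow or deny based on the first rule whose prefix matches path."""
    for directive, prefix in rules:
        if path.startswith(prefix):
            return directive == "allow"
    return True  # no rule matched, so crawling is allowed


def longest_match(rules: List[Rule], path: str) -> bool:
    """Allow or deny based on the matching rule with the longest prefix."""
    best_directive, best_length = "allow", -1
    for directive, prefix in rules:
        if path.startswith(prefix) and len(prefix) > best_length:
            best_directive, best_length = directive, len(prefix)
    return best_directive == "allow"


path = "/private/press/kit.html"
print(first_match(RULES, path))    # prints: False
print(longest_match(RULES, path))  # prints: True
```

The same file therefore blocks the page for one crawler and exposes it to another, so it is worth testing rules against the documentation of each crawler you care about.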
Your robots.txt directives can't prevent references to your URLs from other sites:
While Google won't crawl or index the content blocked by robots.txt, it might still find and index a disallowed URL from other places on the web. As a result, the URL address and, potentially, other publicly available information such as anchor text in links to the site can still appear in Google search results. You can stop your URL from appearing in Google Search results completely by using your robots.txt in combination with other URL blocking methods, such as password-protecting the files on your server, or inserting indexing directive meta tags into your HTML.
Note: combining multiple crawl and indexing directives might cause some directives to counteract other directives. Learn how to configure these directives properly by reading the "combining crawl with indexing / serving directives" section of the Google Developers documentation.
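As an illustration of the meta tag approach, the following sketch checks whether a page carries a robots meta tag with the noindex directive, using Python's standard html.parser module. The class name RobotsMetaParser, the helper has_noindex and the sample HTML are assumptions made up for this example.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives found in <meta name="robots" content="..."> tags."""

    def __init__(self) -> None:
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("name", "").lower() != "robots":
            return
        # The content attribute is a comma-separated list such as "noindex, nofollow".
        for token in attributes.get("content", "").split(","):
            token = token.strip().lower()
            if token:
                self.directives.add(token)


def has_noindex(html: str) -> bool:
    """Return True if the page asks robots not to index it."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


# Hypothetical page markup.
page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
print(has_noindex(page))  # prints: True
```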
How to use robots.txt: video tutorial in Urdu/Hindi, part one