A basic guide on Robots.txt with Best Practices


#robots #seo #technicalseo



A robots.txt is a file that most search engines, including Google, Bing, and Yahoo, obey to decide which pages on a website they may or may not crawl. A basic file looks like this:

User-agent: *
Disallow:
Sitemap: https://example.com/sitemap.xml

Importance of Robots.txt File

  • Block non-public pages – Certain pages, such as a staging site or a login page, should not show up in search because you don’t want random people to find them. This is a case where robots.txt plays a crucial role by blocking crawlers from such pages (a short example follows this list).
  • Maximize crawl budget – If your crawl budget is limited and you want the important pages crawled, you can block the irrelevant pages in the robots.txt file. This way Google spends its crawl budget on the pages that matter to you.
  • Prevent indexing of resources – Meta robots directives work well for HTML pages, but they do not work for resources such as PDFs and images. This is where robots.txt plays a crucial part, and you can always check the index status in Search Console.
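For illustration, here is a minimal sketch of such a file that blocks a few non-public areas while leaving the rest of the site open; the /staging/ and /login/ paths are hypothetical placeholders for your own URLs:

User-agent: *
Disallow: /staging/
Disallow: /login/
Sitemap: https://example.com/sitemap.xml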

How does the robots.txt file work?

Search engines do two main jobs: crawling the web to discover content and indexing that content so it can be shown in results. Before crawling a site, a crawler fetches the robots.txt file and reads its directives; from these it learns which URLs it is allowed to crawl and which it should skip.
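To make this concrete, here is a small Python sketch using the standard library's urllib.robotparser; the rules and URLs are made up for the example, and a real crawler would fetch the live robots.txt rather than parsing an inline string:

from urllib.robotparser import RobotFileParser

# Inline rules for the demo; a real crawler would call
# set_url("https://example.com/robots.txt") followed by read().
rules = """\
User-agent: *
Disallow: /images
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# Before fetching a URL, the crawler asks whether it is allowed.
print(parser.can_fetch("*", "https://example.com/images/photo.jpg"))  # False
print(parser.can_fetch("*", "https://example.com/about"))             # True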

Where should you put the robots.txt file?

The robots.txt file must be placed in the root of your domain, for example https://example.com/robots.txt. The filename is case sensitive, so make sure it is written exactly as “robots.txt”; otherwise it will not work.

Best Practices for Creating Robots.txt

1. Create proper syntax – It is vital to write robots.txt with the correct syntax so bots know which pages to crawl and which not to crawl, and you can write different rules for different bots. Keep in mind that the paths in Allow and Disallow rules are case sensitive. For instance:

User-agent: Googlebot
Disallow: /images

The above directive tells the spider named Googlebot to crawl everything except the images folder. Make sure to enter the disallow path exactly as it appears on the site, because it is case sensitive: /Images would not block /images. To address all bots, use * and the syntax looks like this:

User-agent: *
Disallow: /images

2. Common user agents – The most used search engines are matched with these user agents: Googlebot for Google, Bingbot for Bing, and Slurp for Yahoo. Use them in the User-agent line when you want a rule to apply to one crawler only.

3. Using wildcards / regular expressions – The * wildcard matches any sequence of characters, which lets one rule cover many URLs. For example:

Disallow: /*.php
Disallow: /copyrighted-images/*.jpg

In the example above, the first line blocks every URL whose path contains .php, and the second blocks every .jpg file inside the /copyrighted-images/ folder from being crawled.

Another example:

Disallow: /*.php$

Here the $ anchors the pattern to the end of the URL, so /index.php will be blocked but /index.php?p=1 will not. Use these expressions diligently, otherwise many pages on the site may get blocked by mistake.
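As a rough illustration of how this wildcard matching behaves, here is a small Python sketch that translates a robots.txt pattern into a regular expression; the helper name and test paths are made up for the example and are not part of any library:

import re

def robots_pattern_matches(pattern: str, path: str) -> bool:
    """Check whether a robots.txt path pattern matches a URL path.

    '*' matches any sequence of characters, a trailing '$' anchors the
    pattern to the end of the URL, and matching starts at the beginning
    of the path.
    """
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    regex = "".join(".*" if ch == "*" else re.escape(ch) for ch in core)
    regex = "^" + regex + ("$" if anchored else "")
    return re.match(regex, path) is not None

# Hypothetical checks against the rule  Disallow: /*.php$
print(robots_pattern_matches("/*.php$", "/index.php"))      # True  -> blocked
print(robots_pattern_matches("/*.php$", "/index.php?p=1"))  # False -> allowed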

4. Common directives used – Most sites use the directives below, as they are simple and very readable:

User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

5. Crawl delay – Be careful when using the crawl-delay directive. There are 86,400 seconds in a day, so if you set a crawl delay of ten seconds you allow a search engine to fetch at most 8,640 pages a day.
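For reference, a minimal sketch of how the directive is written, declared per user agent; Bingbot is only an illustrative choice here, and note that Googlebot does not obey Crawl-delay:

User-agent: Bingbot
Crawl-delay: 10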

6. Use a sitemap – Referencing your sitemap in robots.txt is crucial for getting all of your webpages discovered and indexed, although you should also submit it in Search Console for crawl reports and recommendations.

Source: https://eliteseozone.com/robots-txt-guide/
