How to Block Websites from Google Crawlers

Sometimes, you don’t want your entire website - or certain pages - to appear in Google or other search results. Whether your site is under development, being redesigned, or contains private data, you can control what search engines crawl and index using a simple file called robots.txt or a robots meta tag.

This guide answers the most common questions about how to close your website (or parts of it) from search engine crawlers efficiently and safely. ⚙️

🧠 Why You Might Block a Site from Search Engines

Search engine crawlers automatically scan every page they can reach on the web 🌐. However, you might want to restrict their access for several reasons, such as:
  • Protecting admin or user areas 👥
  • Hiding outdated promotions or event pages 📅
  • Preventing scripts, banners, and heavy files from being indexed
  • Reducing server load and speeding up indexing



🛑 How to Block Your Entire Website

If your site is still in development or redesign, it’s a good idea to hide it from search engines completely. You can block all crawlers, a specific one, or allow only a single bot using the following examples 👇

Block all bots:
Code:
User-agent: *
Disallow: /

Block only Google Images:
Code:
User-agent: Googlebot-Image
Disallow: /

Allow only Googlebot, block all others:
Code:
User-agent: *
Disallow: /

User-agent: Googlebot
Allow: /
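
Before deploying rules like these, you can sanity-check them with Python’s built-in urllib.robotparser. Below is a quick local sketch (example.com is a placeholder domain); note that the standard-library parser implements the basic spec, not every extension Google supports.
Python:
from urllib.robotparser import RobotFileParser

# The "allow only Googlebot" rules from the example above
rules = """\
User-agent: *
Disallow: /

User-agent: Googlebot
Allow: /
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# Googlebot matches its own group and is allowed everywhere
print(parser.can_fetch("Googlebot", "https://example.com/page"))  # True
# Any other bot falls back to the catch-all group and is blocked
print(parser.can_fetch("Bingbot", "https://example.com/page"))    # False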



📄 How to Block Individual Pages or Sections

For most small business sites, there’s rarely a need to hide specific pages. But for larger or eCommerce websites, you may want to block service-related or non-public areas such as:
  • /admin – Admin panel
  • /login – Personal accounts
  • /cart – Shopping carts
  • /search – Site search results
  • /promotions – Outdated offers
  • /compare, /favorites, /captcha

Examples:

Block a single page:
Code:
User-agent: *
Disallow: /contact.html

Block an entire section:
Code:
User-agent: *
Disallow: /catalog/

Allow only one folder:
Code:
User-agent: *
Disallow: /
Allow: /catalog/
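
Keep in mind that Disallow values are path prefixes: /catalog/ blocks everything inside that folder, but not a sibling path like /catalog-old. Here is a quick check with Python’s urllib.robotparser, again using example.com as a placeholder:
Python:
from urllib.robotparser import RobotFileParser

rules = """\
User-agent: *
Disallow: /catalog/
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

print(parser.can_fetch("Googlebot", "https://example.com/catalog/shoes"))  # False: inside the folder
print(parser.can_fetch("Googlebot", "https://example.com/catalog-old"))    # True: different prefix
print(parser.can_fetch("Googlebot", "https://example.com/contact.html"))   # True: unaffected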



📂 How to Block Files, Scripts & Parameters

You can also block crawling of specific files, scripts, or URLs carrying tracking parameters such as UTM tags.

Examples:

Block file types (e.g., images):
Code:
User-agent: *
Disallow: /*.jpg

Block a folder:
Code:
User-agent: *
Disallow: /images/

Allow only one file in a folder:
Code:
User-agent: *
Disallow: /images/
Allow: /images/logo.jpg

Block scripts:
Code:
User-agent: *
Disallow: /plugins/*.js

Block UTM-tagged URLs (note: Clean-param is understood by Yandex only; Google ignores it and relies on the wildcard rule instead):
Code:
User-agent: *
Disallow: /*utm_
Clean-param: utm_source&utm_medium&utm_campaign
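
The * wildcard in rules like /plugins/*.js is a Google-style extension rather than part of the original robots.txt spec (Python’s urllib.robotparser, for instance, treats * literally). Conceptually, major crawlers match these patterns roughly like this simplified sketch:
Python:
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    # '*' matches any run of characters; a trailing '$' anchors the
    # pattern to the end of the URL. Simplified sketch, not the full spec.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    regex = re.escape(pattern).replace(r"\*", ".*")
    return re.compile("^" + regex + ("$" if anchored else ""))

scripts = rule_to_regex("/plugins/*.js")
print(bool(scripts.match("/plugins/slider/main.js")))  # True: blocked
print(bool(scripts.match("/plugins/readme.txt")))      # False: not matched

utm = rule_to_regex("/*utm_")
print(bool(utm.match("/landing?utm_source=mail")))     # True: blocked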



🧾 How to Use Meta Tags Instead of Robots.txt

If you prefer not to use robots.txt, you can add a meta tag directly inside your HTML <head> section. One caveat: a crawler can only see this tag if it is allowed to fetch the page, so don’t block the same URL in robots.txt at the same time.

Example 1 – Block all crawlers:
HTML:
<meta name="robots" content="noindex, nofollow">

Example 2 – The same restriction, using the none shorthand:
HTML:
<meta name="robots" content="none">

Meta tag options explained:
  • none - No indexing or following links
  • noindex - Block content indexing
  • nofollow - Block link crawling
  • index - Allow content indexing
  • follow - Allow link crawling
💡 Tip: You can mix rules like noindex, follow to block content but still let search engines follow links.
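
If you want to confirm which robots directives a live page actually serves, a small script can extract them. Below is a minimal sketch using only Python’s standard library; https://example.com/ is a placeholder you would replace with your own page:
Python:
from html.parser import HTMLParser
from urllib.request import urlopen

class RobotsMetaFinder(HTMLParser):
    """Collects the content of every <meta name="robots"> tag."""
    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            self.directives.append(attrs.get("content") or "")

# Placeholder URL - point this at the page you want to audit
html = urlopen("https://example.com/").read().decode("utf-8", "replace")
finder = RobotsMetaFinder()
finder.feed(html)
print(finder.directives)  # e.g. ['noindex, follow'] if a tag is present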



⚠️ Common Robots.txt Mistakes

Even small syntax errors can cause major indexing issues. Watch out for these:
❌ Missing blank lines between user-agent groups
❌ Wrong letter case in file paths (paths are case-sensitive, even though directive names are not)
❌ Conflicting Allow/Disallow rules (logical errors)
❌ Forgetting the Disallow: directive in a group
✅ Always test your robots.txt using the robots.txt report in Google Search Console or in Yandex.Webmaster.
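
You can also spot-check the file you have already deployed. The sketch below uses Python’s urllib.robotparser to download and evaluate a live robots.txt (example.com is a placeholder):
Python:
from urllib.robotparser import RobotFileParser

# Download and parse the deployed robots.txt (placeholder domain)
parser = RobotFileParser("https://example.com/robots.txt")
parser.read()

# Check the URLs you care about before relying on the rules
for url in ("https://example.com/", "https://example.com/admin/"):
    print(url, "->", parser.can_fetch("Googlebot", url))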

🧩 Quick Cheat Sheet

✔️ To block everything → use robots.txt with Disallow: /
✔️ To block individual sections → specify their paths
✔️ To block via HTML → use <meta name="robots">
✔️ To prevent crawler overload → block scripts, session pages, and UTM-tagged URLs
✔️ To allow only one bot → combine Allow and Disallow rules
✔️ To verify → use tools like Linkbox or Google Index Checker

🔍 Final Thoughts

Blocking your site or pages from search engines is simple - but must be done carefully. A single misplaced line in robots.txt could make your entire site disappear from Google 😱.

Always test before deploying, double-check syntax, and update your rules regularly. By mastering these controls, you’ll protect your data, improve crawl efficiency, and keep your website running smoothly 🔧💪
 