Question 1

Should I block CCBot?

Accepted Answer

It's not advisable if you're looking for AI visibility. Common Crawl feeds dozens of models at once: blocking CCBot is like erasing yourself from the encyclopedia almost every AI uses to learn.

Question 2

Does CCBot respect robots.txt?

Accepted Answer

Yes. A simple Disallow rule for the CCBot user-agent is enough. Common Crawl also publishes its official IP ranges and offers a voluntary opt-out registry, and warns that impostors posing as CCBot exist.

Question 3

How do I know if CCBot visits my site?

Accepted Answer

Search for "CCBot" in your server logs. Legitimate visits can be verified via reverse DNS: they resolve to domains like crawl.commoncrawl.org.

CCBot

How to allow it in your robots.txt

How to block it (not recommended)

Frequently asked questions

Should I block CCBot?

Does CCBot respect robots.txt?

How do I know if CCBot visits my site?

Related resources

The bots already read your site. Do you know what AI says about your business?