Question 1

Should I block img2dataset?

Accepted Answer

It depends on whether you want your images used to train AI models. Allowing it won't improve your chances of being mentioned by ChatGPT or similar tools — it only means your photos could feed into image-generation datasets. If you'd rather keep your images out of those datasets, blocking it is a reasonable call.

Question 2

Does img2dataset affect my AI visibility?

Accepted Answer

Not directly. This crawler feeds image datasets, not the language models behind conversational AI assistants. Letting it through won't make ChatGPT or Perplexity recommend your business more often when someone asks about what you offer.

Question 3

How do I know if img2dataset is crawling my site?

Accepted Answer

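Check your server access logs for user-agent strings containing "img2dataset", for example the fragment "(compatible; img2dataset;". Because img2dataset is an open-source tool that anyone can run, requests may come from many different IP addresses, and an absence of matches does not guarantee your images were never downloaded. If you want to discourage it, you can also add a disallow rule for the "img2dataset" token in your robots.txt file. Treat that as best-effort: the crawler only partially respects such rules, and its compliance with robots.txt is not explicitly documented.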
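As a rough illustration of both checks, here is a minimal Python sketch using only the standard library. The sample log lines, the example.com URLs, the bot URL inside the sample user agent, and the helper names `find_img2dataset_hits` and `robots_blocks_img2dataset` are all hypothetical. The second helper only confirms that a robots.txt rule is written correctly for the "img2dataset" token; it says nothing about whether the crawler will actually obey it.

```python
import re
from urllib.robotparser import RobotFileParser

# Case-insensitive match for the token to look for in user-agent strings.
IMG2DATASET_PATTERN = re.compile(r"img2dataset", re.IGNORECASE)


def find_img2dataset_hits(log_lines):
    """Return the log lines that mention img2dataset anywhere in the line."""
    return [line for line in log_lines if IMG2DATASET_PATTERN.search(line)]


def robots_blocks_img2dataset(robots_txt, url):
    """Return True if this robots.txt text disallows `url` for the img2dataset token."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch("img2dataset", url)


# Hypothetical access-log lines (made-up IPs, paths, and user agents).
SAMPLE_LOG = [
    '203.0.113.7 - - [01/Jan/2024:10:00:00 +0000] "GET /images/photo.jpg HTTP/1.1" '
    '200 5120 "-" "Mozilla/5.0 (compatible; img2dataset; +https://example.com/bot)"',
    '198.51.100.4 - - [01/Jan/2024:10:00:05 +0000] "GET /index.html HTTP/1.1" '
    '200 1024 "-" "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"',
]

# A disallow rule for the img2dataset token, as described in the answer above.
SAMPLE_ROBOTS = """User-agent: img2dataset
Disallow: /
"""

if __name__ == "__main__":
    for hit in find_img2dataset_hits(SAMPLE_LOG):
        print("possible img2dataset request:", hit)
    blocked = robots_blocks_img2dataset(
        SAMPLE_ROBOTS, "https://example.com/images/photo.jpg"
    )
    print("rule disallows img2dataset:", blocked)
```

To use it on a real site, you would read lines from your own access log file instead of `SAMPLE_LOG` and pass the contents of your live robots.txt instead of `SAMPLE_ROBOTS`.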

