Omgili Bot
Omgili Bot is the crawler operated by Webz.io, a company that sells web data to media monitoring tools like Hootsuite and Sprinklr, as well as to businesses building AI models. Content indexed by this bot may end up in training datasets that influence various AI systems. It is Webz.io's legacy crawler — the current token is «omgili/0.5», with «omgilibot» being the older alias — and it honours robots.txt directives if you choose to restrict its access.
- User-agent
omgilibot- Does it respect robots.txt?
- Yes
How to allow it in your robots.txt
User-agent: omgilibot
Allow: /How to block it (not recommended)
User-agent: omgilibot
Disallow: /Frequently asked questions
Should I block Omgili Bot?
If you want your business to have better visibility in AI-powered tools, blocking it may work against you. Webz.io sells its data to AI companies, and keeping your content out means missing that channel. Bear in mind that the link between this bot and any specific AI product like ChatGPT or Gemini is indirect, so the precise impact is hard to pin down.
How does Omgili Bot affect my AI visibility?
Webz.io sells its dataset to both media monitoring platforms and companies training language models. That means content crawled by Omgili Bot can end up in the training data that shapes various AI systems. There is no guaranteed direct pipeline to a single named model, but it does contribute to the broader pool of data those models learn from.
How can I tell if Omgili Bot is visiting my site?
Check your server access logs or CDN dashboard — for example, Cloudflare — for entries containing «omgilibot» or «omgili». The current bot version identifies itself as «omgili/0.5», while «omgilibot» is the legacy token, so it is worth searching for both to get the full picture.