Skip to main content

How to detect and analyze AI and LLMs bots hits using log monitoring

Learn how to enable and analyze the detection of AI and LLM bots in Oncrawl using log monitoring.

Updated yesterday

First, if you are interested in using live log monitoring, make sure you have activated the log monitoring feature for your project and followed the setup steps here.

Which AI/LLMs bots are supported on Oncrawl ?

With Log Monitoring, you can identify and analyze hits from OpenAI, Perplexity, Claude, and Mistral LLMs directly within your projects.

Detection needs to be activated by your dedicated (and amazing) Oncrawl CSM. Feel free to reach out to them to request activation or deactivation for specific bots based on your needs. You can also submit your request directly via the in-app chat, a member of our technical support team will be happy to assist you.

Currently, we support detection for the following bots :

Open AI :

  • GPTBot

  • OAI-SearchBot

  • ChatGPT-User

Perplexity

  • Perplexity‑User

  • PerplexityBot

Claude

  • ClaudeBot

  • Claude-User

  • Claude-SearchBot

Mistral

  • MistralAI-User

How does Oncrawl detect AI bot hits?

To identify these hits in your logs, we analyzes user-agent strings and public IP addresses used by each bot platform.

For this detection to work, make sure these elements (user-agent and IP address) are clearly present in your log files. Without them, Oncrawl won’t be able to surface LLM bot activity.

Below is a list of publicly documented user-agents and IP address ranges for the LLM bots we currently support.

OpenAI :

User agent

Full user-agent string

Published IP addresses

GPTBot

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.1; +https://openai.com/gptbot

OAI-SearchBot

OAI-SearchBot/1.0; +https://openai.com/searchbot

ChatGPT-User

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; ChatGPT-User/1.0; +https://openai.com/bot

Perplexity :

User agent

Full user-agent string

Published IP addresses

Perplexity-User

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Perplexity-User/1.0; +https://perplexity.ai/perplexity-user)

PerplexityBot

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; PerplexityBot/1.0; +https://perplexity.ai/perplexitybot)

Claude :

User agent

Full user-agent string

Published IP addresses

ClaudeBot

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)

/

Claude-User

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Claude-User/1.0; +Claude-User@anthropic.com)

/

Claude-SearchBot

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Claude-SearchBot/1.0; +Claude-SearchBot@anthropic.com)

/

Mistral :

User agent

Full user-agent string

Published IP addresses

MistralAI-User

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; MistralAI-User/1.0; +https://docs.mistral.ai/robots)

⚠️ Keep in mind that enabling detection for multiple LLMs bots can significantly impact your average daily log lines quota. Make sure to take this into account before proceeding with the setup. ⚠️

Did this answer your question?