AI bots made 18,209 requests to a small blog in 9 days, outnumbering human visitors
A developer running a small self-hosting blog discovered that AI crawlers accounted for 18,209 of 348,667 total requests over nine days, surpassing human readership. The largest share came from ChatGPT-User, which performs live fetches on behalf of real users asking questions — not training data scraping — making blanket blocking counterproductive. The author distinguishes between training crawlers like GPTBot and ClaudeBot, which generally respect robots.txt, and live answer-engine fetchers like ChatGPT-User, which behave more like browsers and are not bound by it. A separate technical finding revealed that most AI crawlers do not execute JavaScript, meaning structured data or metadata injected client-side after page hydration remains invisible to them. The author warns that site owners relying on JS-rendered metadata may be misrepresented in AI-generated answers, and recommends ensuring all critical structured data is present in the initial server-side HTML.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.

Discussion (0)
Log in to join the discussion and vote.
Log in