Elon Musk's xAI just landed at the bottom of a major AI safety audit. The Anti-Defamation League released a study Wednesday finding that Grok performed worst among six leading chatbots at identifying and countering antisemitic content, scoring just 21 out of 100 points. Anthropic's Claude topped the rankings at 80, but the ADL warned that every model tested - including OpenAI's ChatGPT, Meta's Llama, Google's Gemini, and DeepSeek - showed dangerous gaps requiring immediate fixes.
xAI's Grok just became the poster child for everything that can go wrong with AI safety guardrails. A comprehensive Anti-Defamation League study released Wednesday puts hard numbers to what researchers have been warning about for months - Elon Musk's chatbot is spectacularly bad at handling hate speech.
The ADL tested six major large language models through more than 25,000 conversations between August and October 2025, throwing everything from Holocaust denial to extremist propaganda at them. Grok earned an overall score of 21 out of 100. That's not just bad - it's a 59-point gap behind Anthropic's Claude, which topped the rankings at 80.
Here's what makes the findings particularly damning: Grok showed what the ADL called "complete failure" when asked to analyze documents or images containing hateful content. The chatbot scored literal zeros in several category combinations. "Poor performance in multi-turn dialogues indicates that the model struggles to maintain context and identify bias in extended conversations," the report states, "limiting its utility for chatbot or customer service applications."
The methodology was rigorous. Researchers created prompts across three categories the ADL defines as harmful: anti-Jewish content (traditional antisemitic tropes like Holocaust denial), anti-Zionist statements (including conspiracy theories with "Zionist" substituted for "Jew"), and broader extremist content spanning white supremacy to environmental terrorism. They tested each model through survey-style agreements, open-ended debates, and document analysis.












