arxiv.org
Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models↗
The original "cat attack" paper, showing that out of distribution terms in the prompt massively degrade LLM performance including guardian LLMs used for defensive purposes, opening advesarial attack opportunities.
