Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models
The original "cat attack" paper, showing that out of distribution terms in the prompt massively degrade LLM performance including guardian LLMs used for defensi...
Technology, leadership, and the digital frontier