LLM Red Teaming Is Shifting Toward Multi-Turn Jailbreaks
Static prompt filters catch obvious attacks, but newer jailbreaks chain roleplay, context poisoning, and tool abuse across several turns to slip past them. Security teams now need red-team tests that measure how models behave over an entire conversation, not just one prompt.