Surprise Attack

This technique crafts prompts that avoid directly mentioning specific terms or names that may trigger safety mechanisms or filters. By reframing the request or using indirect language, a user guides the model toward the desired output without raising flags or causing the model to restrict its response. The method relies on subtlety and creativity in how the request is worded.

Strategy: Persuasion and Manipulation

This strategy uses rhetorical techniques to influence the model's responses, framing prompts so that they persuade or manipulate the model into producing a particular output.

Category: Rhetoric

This category employs persuasive techniques and language to shape the model's responses, often using methods like reverse psychology or Socratic questioning.