0din logo

Re-storying

This technique involves continuing a narrative in a way that misaligns the original goal of a prompt, effectively repurposing the story to achieve a different outcome than initially intended.

Category: Fictionalizing

This category involves creating scenarios or narratives that leverage existing genres or contexts to manipulate the language model's responses.

Techniques

Note Description
Goal Hijacking This technique refers to the process where an attacker misaligns the original goal of a prompt to redirect the model's output towards a new, often unintended goal, such as printing a target phrase or generating specific content that deviates from the initial intent. It often involves crafting prompts that manipulate the model's understanding and response, effectively "hijacking" the conversation or task at hand.