question
: The question asked by the usermodel_purpose (optional)
: The intended purpose of the LLM
How to use it?
By default, we are using GPT 3.5 Turbo for evaluations. If you want to use a different model, check out this tutorial.
A higher jailbreak detection score reflects an attempt to jailbreak.