question
: The question asked by the userresponse
: The response given by the model
guideline
: The guideline to be followedguideline_name (optional)
: User-assigned name of the guideline to distinguish between multiple checksresopnse_schema (optional)
: Schema of the response in case it is of type JSON, XML, etc.
How to use it?
By default, we are using GPT 3.5 Turbo for evaluations. If you want to use a different model, check out this tutorial.
A higher guideline adherence score reflects that the generated response contains adheres to defined guideline.
How it works?
We evaluate custom guidelines by determining which of the following two cases apply for the given task data:- The given guideline is strictly adhered to.
- The given guideline is strictly violated.