Custom Guideline
Grades how well the LLM adheres to a provided guideline when giving a response.
Guideline adherence measures the extent to which the generated response follows a given guideline, rule, or protocol.
Given the complexity of LLMs, it is crucial to define certain guidelines, be it in terms of the structure of the output or the constraints on the content of the output or protocols on the decision-making capabilities of the LLMs.
Columns required:
question
: The question asked by the userresponse
: The response given by the model
Parameters:
guideline
: The guideline to be followedguideline_name (optional)
: User-assigned name of the guideline to distinguish between multiple checksresopnse_schema (optional)
: Schema of the response in case it is of type JSON, XML, etc.
How to use it?
Sample Response:
The generated reponse contains numeric information about the height of Burj Khalifa, which conflicts the defined guideline.
Resulting in a low guideline adherence score.
How it works?
We evaluate custom guidelines by determining which of the following two cases apply for the given task data:
- The given guideline is strictly adhered to.
- The given guideline is strictly violated.
Was this page helpful?