Response Quality Evals
Response Relevance
Measures how relevant the generated response was to the question specified.
Response relevance is the measure of how relevant the generated response is to the question asked. It is a measure of how well the response addresses the question asked and if it contains any additional information that is not relevant to the question asked.
Columns required:
question
: The question asked by the userresponse
: The response given by the model
How to use it?
By default, we are using GPT 3.5 Turbo for evaluations. If you want to use a different model, check out this tutorial.
Sample Response:
A higher response relevance score reflects that the generated response is relevant to the question asked.
The question asked was “The benefits of regular exercise”
The response does not answer the question being asked and contains irrelevant information about the benefits of regular health check-ups.
Resulting in a low response relevance score.
Was this page helpful?