Context Utilization
Measures how completely the generated response answers the given question, based on the information provided in the context.
The Context Utilization score measures whether the generated response has made sufficient use of the retrieved context to answer the question being asked.
This check matters because poor use of the context makes it less likely that the response addresses every aspect of the question.
Columns required:
- question: The question asked by the user
- context: Information retrieved to answer the question
- response: The response given by the model
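As an illustration of the three columns, a single record might look like the Python dictionary below. The values are made up for this sketch (they mirror the Taj Mahal example discussed on this page and are not the original sample data), and `REQUIRED_COLUMNS` and `missing_columns` are hypothetical helper names, not part of any library.

```python
# Hypothetical names for illustration; not part of any library API.
REQUIRED_COLUMNS = ("question", "context", "response")

# Illustrative record: the question has two parts, the context covers both,
# and the response answers only the first part.
row = {
    "question": "Where is the Taj Mahal located, and when was it built?",
    "context": (
        "The Taj Mahal is a mausoleum in Agra, India. "
        "Its construction began in 1632."
    ),
    "response": "The Taj Mahal is located in Agra, India.",
}


def missing_columns(record):
    """Return the required column names that are absent from the record."""
    return [name for name in REQUIRED_COLUMNS if name not in record]


print(missing_columns(row))  # prints []
```

Checking for missing columns before running an evaluation is a cheap way to catch malformed rows early.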
How to use it?
Sample Response:
The question can be divided into the following two parts:
- Where is the Taj Mahal located?
- When was the Taj Mahal built?
The context contains information about both parts.
However, the response answers only the first part and provides no information on “When was the Taj Mahal built?”
This results in a low context utilization score.
How does it work?
We evaluate context utilization by determining which of the following three cases applies to the given task data:
- The generated response incorporates all the relevant information present in the context.
- The generated response incorporates some of the information present in the context but misses other information in the context that is relevant to answering the given question.
- The generated response doesn’t incorporate any information present in the context.
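The three cases above can be sketched as a small classifier. This is a minimal illustration, not the actual evaluation logic: it assumes the relevant facts in the context and the facts used by the response have already been labelled by hand, and the numeric scores (1.0, 0.5, 0.0) are an assumed mapping that this page does not state. The names `Utilization`, `ASSUMED_SCORES`, and `classify_utilization` are hypothetical.

```python
from enum import Enum


class Utilization(Enum):
    ALL = "response uses all relevant context"
    PARTIAL = "response uses some relevant context but misses some"
    NONE = "response uses no context"


# Assumed mapping from case to score; not taken from the source.
ASSUMED_SCORES = {
    Utilization.ALL: 1.0,
    Utilization.PARTIAL: 0.5,
    Utilization.NONE: 0.0,
}


def classify_utilization(relevant_facts, used_facts):
    """Pick one of the three cases from hand-labelled fact identifiers."""
    relevant = set(relevant_facts)
    used = relevant & set(used_facts)  # ignore facts that are not relevant
    if not used:
        return Utilization.NONE
    if used == relevant:
        return Utilization.ALL
    return Utilization.PARTIAL


# Taj Mahal example: the context covers both parts of the question,
# but the response only uses the location.
case = classify_utilization(
    relevant_facts=["location", "construction date"],
    used_facts=["location"],
)
print(case.name, ASSUMED_SCORES[case])  # prints: PARTIAL 0.5
```

In this sketch an empty set of relevant facts falls into the `NONE` case; a real evaluator would likely need to treat that situation separately.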