How to use it?
By default, we use GPT-3.5 Turbo. If you want to use a different model, check out this tutorial.
A higher query resolution score reflects that the LLM effectively addresses the user’s query.
How does it work?
We evaluate query resolution by determining which of the following cases applies to the given task data:
- The given responses effectively resolve the user’s query.
- The given responses partially resolve the user’s query.
- The given responses do not resolve the user’s query.
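
The three cases above can be turned into a numeric score in a few lines. The sketch below is only an illustration, not the library's implementation: the choice labels "A", "B" and "C", the names `Resolution` and `score_from_choice`, and the score values 1.0 / 0.5 / 0.0 are all assumptions made for this example. The only property taken from the text is that a higher score means the query was resolved more effectively.

```python
from enum import Enum


class Resolution(Enum):
    """Hypothetical labels for the three cases the evaluator picks from."""
    RESOLVED = "A"    # responses effectively resolve the user's query
    PARTIAL = "B"     # responses partially resolve the user's query
    UNRESOLVED = "C"  # responses do not resolve the user's query


# Assumed score mapping (not confirmed by the source): higher is better.
SCORES = {
    Resolution.RESOLVED: 1.0,
    Resolution.PARTIAL: 0.5,
    Resolution.UNRESOLVED: 0.0,
}


def score_from_choice(choice: str) -> float:
    """Map the evaluator's chosen case label to a query resolution score.

    Raises ValueError if the label is not one of the three known cases.
    """
    label = Resolution(choice.strip().upper())
    return SCORES[label]
```

For example, `score_from_choice("B")` returns `0.5` under this assumed mapping, while an unrecognized label such as `"D"` raises a `ValueError` instead of being scored silently.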