Evaluates the ability of the LLM to resolve the user’s query.
A good AI assistant should be able to effectively address the user’s query. Query resolution evaluates the ability of the AI assistant to resolve the user’s query effectively.
Sample Response:
The nurse in the conversation was not able to address the patient’s query, which was about chest pain, indicating a potential medical emergency.
Resulting in a low query resolution score.
We evaluate query resolution by determining which of the following cases apply for the given task data:
Evaluates the ability of the LLM to resolve the user’s query.
A good AI assistant should be able to effectively address the user’s query. Query resolution evaluates the ability of the AI assistant to resolve the user’s query effectively.
Sample Response:
The nurse in the conversation was not able to address the patient’s query, which was about chest pain, indicating a potential medical emergency.
Resulting in a low query resolution score.
We evaluate query resolution by determining which of the following cases apply for the given task data: