ChromaDB
ChromaDB is an open-source embedding database. While working on RAG based applications, you can use Chroma to retrieve information from your context documents.
How will this help?
Vector databases store data as high-dimensional vectors, enabling fast and efficient similarity search and retrieval of data based on their vector representations. You can use UpTrain along with vector databases such as ChromaDB for evaluations such as the relevance of the retrieved context ensuring a good retrieval quality.
How to integrate?
First, let’s import the necessary packages
Let’s define a dataset
Embedding the data in ChromaDB:
Using ChromaDB to find supporting evidence(context) for the questions in the dataset:
Evaluate the retrieval quality using UpTrain
Let’s look at the retrieval quality of the context documents
According to these evaluations:
- Example 1: The context clearly states alpha decay to be the least dangerous radioactive decay, making the context sufficient to answer the question. Hence, the context is highly relevant to the question asked.
- Example 2: The context contains information about the agents used in preparation of foods like cheese and yogurt, but does not specifically talks about their use to make yougurt and cheese. Thus, even though the context is related to the question it’s not sufficient to answer it.