Retrieval-Augmented Generation (RAG) evaluation can be categorized into several types:
- Relevance: Measures whether the retrieved documents are actually useful for answering the query.
- Faithfulness (Precision): Ensures that the generated output is grounded in the retrieved documents rather than hallucinated.
- Coherence: Evaluates if the text makes logical sense.
- Semantic Similarity: Measures similarity between generated text and reference data using embeddings.
For this project, we use Semantic Similarity. This approach measures how close the retrieved document is to the original query in meaning, ensuring high-quality retrieval.
- Semantic similarity is efficient and widely used in RAG applications.
- It helps quantify how much retrieved information aligns with the original query.
- It allows for automated and scalable evaluation using embeddings.
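The embedding-based comparison described above typically boils down to cosine similarity between two vectors. A minimal sketch follows; the toy vectors stand in for real sentence embeddings, which in practice would come from an embedding model such as one loaded via sentence-transformers:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy 4-dimensional "embeddings"; a real pipeline would obtain these
# from an embedding model rather than hard-coding them.
query_emb = np.array([0.1, 0.3, 0.5, 0.1])
doc_emb = np.array([0.1, 0.2, 0.6, 0.1])

score = cosine_similarity(query_emb, doc_emb)
print(f"similarity: {score:.3f}")  # value in [-1, 1]; closer to 1 means more similar
```

A score near 1 indicates the retrieved document closely matches the query's meaning, which is exactly the signal used to judge retrieval quality.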
Install the dependencies:

```shell
pip install faiss-cpu sentence-transformers numpy scikit-learn
```

Then run the evaluation script:

```shell
python rag_evaluation.py
```

The program logs:
- The query
- The retrieved document
- The computed similarity score
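The evaluation loop producing those three log lines can be sketched as below. This is a minimal illustration with hand-made toy embeddings: the document texts, embedding values, and the `evaluate` helper are all hypothetical, while the actual `rag_evaluation.py` would presumably embed text with a sentence-transformers model and index it with FAISS, as the dependency list suggests:

```python
import logging
import numpy as np

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger(__name__)

# Toy corpus with hand-made 3-d embeddings; a real script would compute
# these with an embedding model and store them in a FAISS index.
documents = [
    "Paris is the capital of France.",
    "The Nile is a river in Africa.",
    "Python is a programming language.",
]
doc_embs = np.array([
    [0.9, 0.1, 0.0],
    [0.1, 0.9, 0.0],
    [0.0, 0.1, 0.9],
])

def evaluate(query: str, query_emb: np.ndarray):
    """Retrieve the nearest document by cosine similarity and log the result."""
    norms = np.linalg.norm(doc_embs, axis=1) * np.linalg.norm(query_emb)
    scores = doc_embs @ query_emb / norms
    best = int(np.argmax(scores))
    log.info("Query: %s", query)
    log.info("Retrieved document: %s", documents[best])
    log.info("Similarity score: %.3f", scores[best])
    return documents[best], float(scores[best])

doc, score = evaluate("What is the capital of France?", np.array([0.8, 0.2, 0.1]))
```

Each call logs the query, the retrieved document, and the computed similarity score, matching the output described above.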