rag-evaluation

Here are 15 public repositories matching this topic...

Giskard-AI / giskard

🐢 Open-Source Evaluation & Testing for AI & LLM systems

ai-security mlops fairness-ai responsible-ai ml-validation red-team-tools trustworthy-ai ml-testing llm ai-red-team ai-testing llmops llm-security llm-eval llm-evaluation rag-evaluation agent-evaluation

Updated Apr 25, 2025
Python

Marker-Inc-Korea / AutoRAG

Star

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

python open-source qa benchmarking ops pipeline analysis optimization evaluation embeddings automl document-parser rag llm retrieval-augmented-generation llm-ops llm-evaluation rag-evaluation

Updated Apr 24, 2025
Python

vectara / open-rag-eval

Star

Open source RAG evaluation package

metrics evaluation-metrics rag vectara retrieval-augmented-generation rag-evaluation

Updated Apr 26, 2025
Python

LLAMATOR-Core / llamator

Star

Framework for testing vulnerabilities of large language models (LLM).

Updated Apr 22, 2025
Python

mts-ai / rurage

Star

information-retrieval question-answering rag llm-evaluation rag-evaluation

Updated Apr 14, 2025
Python

oztrkoguz / RAG-Framework-Evaluation

Star

This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.

swarms autogen rag langchain llamaindex rag-evaluation crewai langchain-rag autogen-rag crewai-rag llamaindex-rag swarms-rag

Updated Jul 28, 2024
Python

ioannis-papadimitriou / rag-playground

Star

A framework for systematic evaluation of retrieval strategies and prompt engineering in RAG systems, featuring an interactive chat interface for document analysis.

chatbot qa-generation llm-inference retrieval-augmented-generation rag-evaluation

Updated Dec 18, 2024
Python

rostyslavshovak / RAG-Retrieval-Augmented-Generation

Star

RAG Chatbot for Financial Analysis

open-source pdf rag gradio-interface langchain qdrant-vector-database retrieval-augmented-generation rag-evaluation

Updated Mar 9, 2025
Python

shaadclt / EvalRAG

Sponsor

Star

A comprehensive evaluation toolkit for assessing Retrieval-Augmented Generation (RAG) outputs using linguistic, semantic, and fairness metrics

rag rag-evaluation

Updated Apr 19, 2025
Python

AnasAber / MLflow_with_RAG

Star

Using MLflow to deploy your RAG pipeline, using LLamaIndex, Langchain and Ollama/HuggingfaceLLMs/Groq

deployment cicd evaluation-metrics rag mlops mlflow mlflow-tracking-server mlflow-tracking mlflow-projects mlflow-ui mlops-template mlops-project llamaindex rag-evaluation rag-pipeline llamaindex-rag mlflow-deployement

Updated Jan 20, 2025
Python

BetterRAG: Powerful RAG evaluation toolkit for LLMs. Measure, analyze, and optimize how your AI processes text chunks with precision metrics. Perfect for RAG systems, document processing, and embedding quality assessment.

optimization evaluation embeddings evaluation-framework rag embeddings-extraction rag-evaluation rag-application rag-optimization chunking-optimization embeddings-optimization

Updated Mar 26, 2025
Python

Gian207 / RAG-lego-like-component

Star

Proposal for industry RAG evaluation: Generative Universal Evaluation of LLMs and Information retrieval

benchmark rag rag-evaluation

Updated Feb 1, 2025
Python

OranDanon / RAG-application

Star

RAG Chatbot over pre-defined set of articles about LangChain

rag generative-ai rag-evaluation rag-chatbot docling

Updated Apr 7, 2025
Python

OranDanon / Gen-AI-Assignment

Star

Home assignment featuring two AI projects: a Medical Q&A Bot for Israeli HMOs and a National Insurance Form Extractor. Built with Azure OpenAI to demonstrate practical GenAI implementation skills.

azure-ai generative-ai rag-evaluation rag-chatbot azure-document-intelligence-ocr

Updated Apr 8, 2025
Python

TajaKuzman / pandachat-rag-benchmark

Star

PandaChat-RAG benchmark for evaluation of RAG systems on a non-synthetic Slovenian test dataset.

retrieval embedding-models slovenian rag slovenian-language test-dataset llm retrieval-augmented-generation rag-evaluation slovenian-language-dataset

Updated Dec 15, 2024
Python

Improve this page

Add a description, image, and links to the rag-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rag-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rag-evaluation

Here are 15 public repositories matching this topic...

Giskard-AI / giskard

Marker-Inc-Korea / AutoRAG

vectara / open-rag-eval

LLAMATOR-Core / llamator

mts-ai / rurage

oztrkoguz / RAG-Framework-Evaluation

ioannis-papadimitriou / rag-playground

rostyslavshovak / RAG-Retrieval-Augmented-Generation

shaadclt / EvalRAG

AnasAber / MLflow_with_RAG

Kaos599 / BetterRAG

Gian207 / RAG-lego-like-component

OranDanon / RAG-application

OranDanon / Gen-AI-Assignment

TajaKuzman / pandachat-rag-benchmark

Improve this page

Add this topic to your repo