Awesome-LLM-RAG

A curated list of awesome resources for Retrieval-Augmented Generation (RAG) with Large Language Models. Each paper is listed with its publication date, a link to its code where one is available, and a recommendation rating.

This repository is actively maintained and welcomes community contributions that keep the list up to date with the latest developments in RAG 😄.

📒Content

📖 Papers
- Survey
- Framework
- Retrieval
- Generation
- Augmentation
- Applications
- Evaluation
📚 Tutorials & Blogs
🗄️ Vector Database
🛠️ Tools

📖 Papers

📖 Survey

Title	Paper	Date	Code	Recom
🔥[Survey] Retrieval-Augmented Generation for Large Language Models: A Survey	[pdf]	2023.12	⚠️	⭐️⭐️
[Survey] Retrieval-Augmented Generation for AI-Generated Content: A Survey	[pdf]	2024.02	[RAG-Survey]	⭐️
🔥[Survey] A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions	[pdf]	2024.10	⚠️	⭐️⭐️
[Survey] Survey of Vector Database Management Systems	[pdf]	2023.10	⚠️	⭐️

📖 Framework

Title	Paper	Date	Code	Recom
Query Rewriting for Retrieval-Augmented Large Language Models	[pdf]	2023.05	⚠️	⭐️
Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks	[pdf]	2024.07	⚠️	⭐️
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation	[pdf]	2024.08	[RAGFoundry]	⭐️
A Collaborative Multi-Agent Approach to Retrieval-Augmented Generation Across Diverse Data	[pdf]	2024.12	⚠️	⭐️
🔥Generate rather than Retrieve: Large Language Models are Strong Context Generators	[pdf]	2022.09	[GenRead]	⭐️⭐️

📖 Retrieval

Title	Paper	Date	Code	Recom
🔥TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT	[pdf]	2023.07	⚠️	⭐️⭐️
KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases	[pdf]	2023.08	⚠️	⭐️
Knowledge Graph Prompting for Multi-Document Question Answering	[pdf]	2023.08	[KG-LLM-MDQA]	⭐️
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering	[pdf]	2024.02	[G-Retriever]	⭐️
Chain-of-Verification Reduces Hallucination in Large Language Models	[pdf]	2023.09	⚠️	⭐️
🔥Prompt-RAG: Pioneering Vector Embedding-Free Retrieval-Augmented Generation in Niche Domains, Exemplified by Korean Medicine	[pdf]	2024.01	⚠️	⭐️⭐️
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata	[pdf]	2024.06	[Multi-Meta-RAG]	⭐️

📖 Generation

Title	Paper	Date	Code	Recom
🔥Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking	[pdf]	2023.10	[llm-qlm]	⭐️⭐️
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression	[pdf]	2023.10	⚠️	⭐️
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!	[pdf]	2023.03	⚠️	⭐️

📖 Augmentation

Title	Paper	Date	Code	Recom
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions	[pdf]	2022.12	[ircot]	⭐️
🔥Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy	[pdf]	2023.05	⚠️	⭐️⭐️
🔥Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection	[pdf]	2023.10	⚠️	⭐️⭐️

📖 Applications

Title	Paper	Date	Code	Domain	Recom
Improving language models by retrieving from trillions of tokens	[pdf]	2021.12	⚠️	QA	⭐️
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services	[pdf]	2023.09	[DISC-LawLLM]	Law	⭐️
🔥EVOR: Evolving Retrieval for Code Generation	[pdf]	2024.02	[arks]	Code	⭐️⭐️
🔥CodeS: Towards Building Open-source Language Models for Text-to-SQL	[pdf]	2024.02	⚠️	SQL	⭐️⭐️
🔥Retrieval-Augmented Text-to-Audio Generation	[pdf]	2023.09	⚠️	Audio	⭐️⭐️
An Empirical Comparison of Video Frame Sampling Methods for Multi-Modal RAG Retrieval	[pdf]	2024.07	⚠️	Video	⭐️
🔥Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners	[pdf]	2022.05	[VidIL]	Video	⭐️⭐️
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension	[pdf]	2024.11	⚠️	Video	⭐️

📖 Evaluation

Title	Paper	Date	Code	Recom
Benchmarking Large Language Models in Retrieval-Augmented Generation	[pdf]	2023.09	⚠️	⭐️
🔥Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering	[pdf]	2024.11	⚠️	⭐️⭐️
RAGAS: Automated Evaluation of Retrieval Augmented Generation	[pdf]	2023.09	[ragas]	⭐️
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds	[pdf]	2024.12	⚠️	⭐️
🔥RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework	[pdf]	2024.08	[RAGEval]	⭐️⭐️

📚 Tutorials & Blogs

What is RAG? - Pinecone's comprehensive introduction to RAG
RAG Architecture Guide - DataStax's detailed RAG architecture explanation
RAG Best Practices - Anyscale's comprehensive guide for RAG applications
Best Practices for LLM Evaluation of RAG Applications - Databricks' guide on RAG evaluation
OpenAI Cookbook RAG Examples - How to Combine GPT-4o Mini with RAG - Create a Clothing Matchmaker App
Optimizing RAG Applications - Secrets to Optimizing RAG LLM Apps for Better Performance, Accuracy and Lower Costs!

🗄️ Vector Database

Name	Description	Pros	Cons	Stars
ChromaDB	Open-source embedding database	• Lightweight deployment • Easy integration • Local & server modes	• Limited scalability • Not for large-scale production
FAISS	Library for efficient similarity search	• High performance • Memory efficient • GPU acceleration	• Steep learning curve • Storage layer needed
Milvus	Highly scalable vector database	• High scalability • Rich index types • Production-ready	• Complex deployment • Resource intensive
Weaviate	Vector database with generative search	• Built-in generative search • GraphQL interface • Modular design	• Complex configuration • High memory usage
Qdrant	Vector similarity search engine	• Powerful filtering • High performance • Easy deployment	• Smaller community • Newer features
Vespa	Real-time data processing engine	• Strong real-time processing • Feature-rich • High availability	• Complex setup • Steep learning curve
pgvector	PostgreSQL vector extension	• PostgreSQL integration • Transaction support • Familiar SQL interface	• Average performance • Limited scalability
LanceDB	Serverless vector database	• Developer friendly • Serverless architecture • Lightweight	• Basic features • Newer community
Pinecone	Managed vector database	• Zero maintenance • Good scalability • Enterprise support	• Paid service • Storage limitations • Variable latency	⚠️

🛠️ Tools

Name	Description	Pros	Cons
LangChain	Framework for developing LLM applications	• Rich ecosystem • Active community • Extensive integrations	• Complex architecture • Steep learning curve
LlamaIndex	Data framework for LLM applications	• Data-centric design • Easy to use • Good documentation	• Less flexible • Limited customization
Unstructured	Data preprocessing library	• Multiple file formats • Clean extraction • Easy integration	• Limited advanced features • Slow processing speed
txtai	All-in-one embeddings database	• Lightweight • Simple API • Built-in workflows	• Fewer enterprise features • Smaller community
Semantic Kernel	Microsoft's orchestration SDK	• Strong Microsoft integration • Memory management • Enterprise support	• Microsoft ecosystem focused • Fewer community plugins
Embedchain	RAG framework	• Easy to use • Data source adaptors • Quick prototyping	• Fewer production features • Limited customization
RAGatouille	RAG experimentation toolkit	• Research focused • Advanced RAG features • Flexible architecture	• Early stage • Less documentation

🤝 Contributing

Feel free to submit Pull Requests to help grow this list! Before submitting, please make sure that:

📚 The resource is relevant to RAG
✅ The entry follows the existing format
✅ The entry does not duplicate one already in the list

Let's make this resource list even better together! 🌟

yangchou19/Awesome-LLM-Rag
