Skip to content

A curated list of awesome papers and resources for Retrieval-Augmented Generation (RAG) in Large Language Models(LLM).

Notifications You must be signed in to change notification settings

yangchou19/Awesome-LLM-Rag

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Awesome-LLM-RAG

Awesome Stars

A curated list of awesome resources for Retrieval-Augmented Generation (RAG) in Large Language Models. Each resource is carefully selected and includes relevant information.

This repository is actively maintained and welcomes contributions from the community to keep the resource list up-to-date with the latest developments in RAG technology😄.

📒Content

📖 Papers

📖 Survey

Title Paper Date Code Recom
🔥[Survey] Retrieval-Augmented Generation for Large Language Models: A Survey [pdf] 2023.12 ⚠️ ��️
[Survey] Retrieval-Augmented Generation for AI-Generated Content: A Survey [pdf] 2024.02 [RAG-Survey] ⭐️
🔥[Survey] A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions [pdf] 2024.10 ⚠️ ⭐️⭐️
[Survey] Survey of Vector Database Management Systems [pdf] 2023.10 ⚠️ ⭐️

📖 Framework

Title Paper Date Code Recom
Query Rewriting for Retrieval-Augmented Large Language Models [pdf] 2023.05 ⚠️ ⭐️
Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks [pdf] 2024.07 ⚠️ ⭐️
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation [pdf] 2024.08 [RAGFoundry] ⭐️
A Collaborative Multi-Agent Approach to Retrieval-Augmented Generation Across Diverse Data [pdf] 2024.12 ��️ ⭐️
🔥Generate rather than Retrieve: Large Language Models are Strong Context Generators [pdf] 2022.09 [GenRead] ⭐️⭐️

📖 Retrieval

Title Paper Date Code Recom
🔥TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT [pdf] 2023.07 ⚠️ ⭐️⭐️
KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases [pdf] 2023.08 ⚠️ ⭐️
Knowledge Graph Prompting for Multi-Document Question Answering [pdf] 2023.08 [KG-LLM-MDQA] ⭐️
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering [pdf] 2024.02 [G-Retriever] ⭐️
Chain-of-Verification Reduces Hallucination in Large Language Models [pdf] 2023.09 ����� ⭐️
🔥Prompt-RAG: Pioneering Vector Embedding-Free Retrieval-Augmented Generation in Niche Domains, Exemplified by Korean Medicine [pdf] 2024.01 ⚠️ ⭐️⭐️
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata [pdf] 2024.06 [Multi-Meta-RAG] ⭐️

📖 Generation

Title Paper Date Code Recom
🔥Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking [pdf] 2023.10 [llm-qlm] ⭐️⭐️
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression [pdf] 2023.10 ⚠️ ⭐️
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! [pdf] 2023.03 ⚠️ ⭐️

📖 Augamentation

Title Paper Date Code Recom
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions [pdf] 2022.12 [ircot] ⭐️
🔥Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy [pdf] 2023.05 ⚠️ ⭐️⭐️
🔥Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection [pdf] 2023.10 ⚠️ ⭐️⭐️

📖 Applications

Title Paper Date Code Domain Recom
Improving language models by retrieving from trillions of tokens [pdf] 2021.12 ⚠️ QA ⭐️
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services [pdf] 2023.09 [DISC-LawLLM] Law ⭐️
🔥EVOR: Evolving Retrieval for Code Generation [pdf] 2024.02 [arks] Code ⭐️⭐️
🔥CodeS: Towards Building Open-source Language Models for Text-to-SQL [pdf] 2024.02 ⚠️ SQL ⭐️⭐️
🔥Retrieval-Augmented Text-to-Audio Generation [pdf] 2023.09 ⚠️ Audio ⭐️⭐️
An Empirical Comparison of Video Frame Sampling Methods for Multi-Modal RAG Retrieval [pdf] 2024.07 ⚠️ Video ⭐️
🔥Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners [pdf] 2022.05 [VidIL] Video ⭐️⭐️
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension [pdf] 2024.11 ⚠️ Video ⭐️

📖 Evaluation

Title Paper Date Code Recom
Benchmarking Large Language Models in Retrieval-Augmented Generation [pdf] 2023.09 ⚠️ ⭐️
🔥Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering [pdf] 2024.11 ⚠️ ⭐️⭐️
RAGAS: Automated Evaluation of Retrieval Augmented Generation [pdf] 2023.09 [ragas] ⭐️
CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds [pdf] 2024.12 ⚠️ ⭐️
🔥RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework [pdf] 2024.08 [RAGEval] ⭐️⭐️

📚 Tutorials & Blogs

🗄️ Vector Database

Name Description Pros Cons Stars
ChromaDB Open-source embedding database • Lightweight deployment
• Easy integration
• Local & server modes
• Limited scalability
• Not for large-scale production
FAISS Library for efficient similarity search • High performance
• Memory efficient
• GPU acceleration
• Steep learning curve
• Storage layer needed
Milvus Highly scalable vector database • High scalability
• Rich index types
• Production-ready
• Complex deployment
• Resource intensive
Weaviate Vector database with generative search • Built-in generative search
• GraphQL interface
• Modular design
• Complex configuration
• High memory usage
Qdrant Vector similarity search engine • Powerful filtering
• High performance
• Easy deployment
• Smaller community
• Newer features
Vespa Real-time data processing engine • Strong real-time processing
• Feature-rich
• High availability
• Complex setup
• Steep learning curve
pgvector PostgreSQL vector extension • PostgreSQL integration
• Transaction support
• Familiar SQL interface
• Average performance
• Limited scalability
LanceDB Serverless vector database • Developer friendly
• Serverless architecture
• Lightweight
• Basic features
• Newer community
Pinecone Managed vector database • Zero maintenance
• Good scalability
• Enterprise support
• Paid service
• Storage limitations
• Variable latency
⚠️

🛠️ Tools

Name Description Pros Cons Stars
LangChain Framework for developing LLM applications • Rich ecosystem
• Active community
• Extensive integrations
• Complex architecture
• Steep learning curve
LlamaIndex Data framework for LLM applications • Data-centric design
• Easy to use
• Good documentation
• Less flexible
• Limited customization
Unstructured Data preprocessing library • Multiple file formats
• Clean extraction
• Easy integration
• Limited advanced features
• Processing speed
txtai All-in-one embeddings database • Lightweight
• Simple API
• Built-in workflows
• Less enterprise features
• Smaller community
Semantic Kernel Microsoft's orchestration SDK • Strong Microsoft integration
• Memory management
• Enterprise support
• Microsoft ecosystem focused
• Less community plugins
Embedchain RAG framework • Easy to use
• Data source adaptors
• Quick prototyping
• Less production features
• Limited customization
Ragatouille RAG experimentation toolkit • Research focused
• Advanced RAG features
• Flexible architecture
• Early stage
• Less documentation

🤝 Contributing

Feel free to submit Pull Requests to help grow this list! Please ensure:

  1. 📚The resource is relevant to RAG
  2. ✅ Follow the existing format
  3. ✅ Check for duplicates before submitting

Let's make this resource list even better together! 🌟

License

CC0

About

A curated list of awesome papers and resources for Retrieval-Augmented Generation (RAG) in Large Language Models(LLM).

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published