Knowledge-R1 is a framework designed to enhance the synergy between knowledge retrieval and reasoning capabilities. It addresses two fundamental challenges:
- Mitigating Knowledge Deficiency in Reason Models 🧐: Large reasoning models often lack sufficient knowledge to make informed decisions.
- Enhancing Reasoning in Adaptive Retrieval-Augmented Generation (RAG) Models 🔄📖: Traditional RAG models struggle with the complex reasoning needed to improve query analysis, document analysis, and retrieval decisions.
Knowledge-R1 introduces a novel agentic RAG reinforcement learning (RL) framework that enables multi-turn knowledge interaction. This approach:
- Enhances the model's ability to integrate retrieved knowledge into its reasoning process. 🏆
- Facilitates iterative refinement, allowing reasoning models to actively query and adapt retrieved knowledge. 🔄
- Optimizes knowledge-reasoning synergy through reinforcement learning. 🎯
The core methodology of Knowledge-R1 involves:
- Fast Agentic RAG Framework ⚡: Using batch inference to accelerate agentic RAG rollouts (a minimal loop sketch follows this list).
- Multi-Turn Knowledge Interaction 🔄🔍: Enabling stepwise retrieval and reasoning so the model progressively refines its understanding and decision-making.
- Reinforcement Learning Optimization 🎯🔧: Employing reinforcement learning to dynamically improve the alignment between the model's retrieval and its reasoning (a hedged reward sketch also follows below).
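The batched multi-turn loop can be pictured as in the minimal sketch below. This is not the project's actual implementation: `generate_batch`, `retrieve_batch`, the `<search>`/`<answer>`/`<information>` tags, and the `MAX_TURNS` cap are all assumptions standing in for the real inference engine, retriever, and prompt format.

```python
# Minimal sketch of a batched multi-turn agentic RAG rollout.
# generate_batch / retrieve_batch are hypothetical stand-ins for the real
# LLM engine and retriever; the tag convention is an assumption.
import re
from typing import List

MAX_TURNS = 4  # assumed cap on retrieval/reasoning turns


def generate_batch(prompts: List[str]) -> List[str]:
    """Hypothetical batched LLM call; replace with the actual inference engine."""
    return ["<answer>placeholder</answer>" for _ in prompts]


def retrieve_batch(queries: List[str], top_k: int = 3) -> List[List[str]]:
    """Hypothetical batched retrieval (e.g. BM25s over a Wikipedia corpus)."""
    return [[f"doc for: {q}"] * top_k for q in queries]


def rollout(questions: List[str]) -> List[str]:
    """Run the multi-turn retrieve-and-reason loop for a whole batch at once."""
    prompts = [f"Question: {q}\n" for q in questions]
    answers = [""] * len(questions)
    active = list(range(len(questions)))  # samples still reasoning

    for _ in range(MAX_TURNS):
        if not active:
            break
        outputs = generate_batch([prompts[i] for i in active])
        pending = []  # (sample index, search query) pairs awaiting retrieval
        for i, out in zip(active, outputs):
            prompts[i] += out
            ans = re.search(r"<answer>(.*?)</answer>", out, re.S)
            qry = re.search(r"<search>(.*?)</search>", out, re.S)
            if ans:  # the model committed to a final answer
                answers[i] = ans.group(1).strip()
            elif qry:  # the model asked for more knowledge
                pending.append((i, qry.group(1).strip()))
        if pending:
            docs = retrieve_batch([q for _, q in pending])
            for (i, _), d in zip(pending, docs):
                prompts[i] += "<information>\n" + "\n".join(d) + "\n</information>\n"
        active = [i for i, _ in pending]
    return answers


print(rollout(["Who directed the film that won Best Picture in 1998?"]))
```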
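For the RL optimization, a common choice for QA-style agentic training is an outcome reward that mixes answer quality with a format check. The sketch below only illustrates that idea; the F1-plus-format-penalty shape and the 0.1 weight are assumptions, not Knowledge-R1's actual reward.

```python
# Hedged sketch of an outcome reward for the RL stage (shape and weights assumed).
from collections import Counter


def f1_score(prediction: str, gold: str) -> float:
    """Token-level F1 between a predicted answer and the gold answer."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    if not pred_tokens or not gold_tokens:
        return 0.0
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def outcome_reward(response: str, extracted_answer: str, gold: str) -> float:
    """Answer-quality reward minus a small penalty if the tag format is missing."""
    format_ok = "<answer>" in response and "</answer>" in response
    return f1_score(extracted_answer, gold) - (0.0 if format_ok else 0.1)


# Example: a well-formatted, partially correct answer gets partial credit.
print(outcome_reward("<answer>Nolan</answer>", "Nolan", "Christopher Nolan"))  # ~0.667
```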
- ✅ Successfully reproduced results on Qwen-1.5B-Instruct, demonstrating significant improvements in knowledge reasoning tasks.
- ⚡ Partial implementation on 7B-scale models, though currently facing Out-Of-Memory (OOM) challenges.
We are still working on it! 😓💾
We have observed that response length keeps increasing over training. As responses grow longer, we run into OOM (Out of Memory) errors, so training at the 7B scale has not yet been completed. We will continue to optimize.
- Retriever: BM25s (a minimal indexing sketch is shown after this list).
- Retrieval Corpus: Wiki2018.
- Dataset: 2wikimultihopqa (Hugging Face Dataset).
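As a rough illustration of the retrieval setup, the snippet below builds a BM25s index over a toy two-passage corpus standing in for Wiki2018. The calls follow the `bm25s` library's documented usage, but treat them as an assumption and verify against the version you install.

```python
# Sketch of the retrieval setup: a BM25s index over a placeholder corpus.
import bm25s

# Toy stand-in for the Wiki2018 passage collection.
corpus = [
    "Paris is the capital and most populous city of France.",
    "The 2018 Wikipedia dump is a common retrieval corpus for open-domain QA.",
]

# Build the index, then retrieve the top passage for a query.
retriever = bm25s.BM25(corpus=corpus)
retriever.index(bm25s.tokenize(corpus))

query = "What is the capital of France?"
docs, scores = retriever.retrieve(bm25s.tokenize(query), k=1)
print(docs[0, 0], scores[0, 0])
```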
MIT