Multi-Agent Orchestrator with RAG, Web Search, and More

Overview

This repository is an advanced implementation of AI agent techniques, focusing on:

Multi-Agent Orchestration for coordinating multiple agents in AI workflows.
Retrieval-Augmented Generation (RAG) framework to improve AI-generated responses.
AI Agent Techniques such as Planning (ReAct flow), Reflection, etc. for enhanced reasoning.

1. Multi-Agent Orchestrator

This project enhances LLM capabilities using multi-agent workflows, integrating:

ReAct for planning and execution.
Reflection for iterative learning.
Multi-Agent Coordination for complex problem-solving.

Workflow:

User input is classified to determine the appropriate agent.
The orchestrator selects the best agent based on historical context and agent capabilities.
The selected agent processes the input and generates a response.
The orchestrator updates conversation history and returns the response.

For further exploration:

2. Introduction to RAG

Large Language Models (LLMs) have limitations in handling private or recent data. The Retrieval-Augmented Generation (RAG) framework mitigates this by retrieving relevant external documents before generating responses.

Key Components of RAG:

Indexing: Splits documents into chunks, creates embeddings, and stores them in a vector database.
Retriever: Finds the most relevant documents based on the user query.
Augment: Combines retrieved documents with the query for context.
Generate: Uses the LLM to generate accurate responses.

3. Advanced RAG Techniques

This repository supports several advanced RAG techniques:

Technique	Tools	Description
Naive RAG	LlamaIndex, Qdrant, Google Gemini	Basic retrieval-based response generation.
Hybrid RAG	LlamaIndex, Qdrant, Google Gemini	Combines vector search with BM25 for better results.
Hyde RAG	LlamaIndex, Qdrant, Google Gemini	Uses hypothetical document embeddings to improve retrieval accuracy.
RAG Fusion	LlamaIndex, LangSmith, Qdrant, Google Gemini	Generates sub-queries, ranks results using Reciprocal Rank Fusion.
Contextual RAG	LlamaIndex, Qdrant, Google Gemini, Anthropic	Compresses retrieved documents to keep only the most relevant details.
Unstructured RAG	LlamaIndex, Qdrant, FAISS, Google Gemini, Unstructured	Handles text, tables, and images for diverse content retrieval.

4. Other AI Technologies

🤖 Supports Claude 3, GPT-4, Gemini. For optimal performance: Use the Gemini family of models.
🧠 Advanced AI planning and reasoning capabilities
🔍 Contextual keyword extraction for focused research
🌐 Seamless web browsing and information gathering
💻 Code writing in multiple programming languages
📊 Dynamic agent state tracking and visualization
💬 Natural language interaction via chat interface
📂 Project-based organization and management
🔌 Extensible architecture for adding new features and integrations

5. Running Backend Only as API

To run the backend separately, follow the instructions in the backend README.

6. Running the Project with Docker

Prerequisites

Prerequisites for Audio/Video Processing

To process audio/video files, FFmpeg is required:

For Ubuntu/Debian

sudo apt update
sudo apt install ffmpeg

For macOS (Homebrew)

brew install ffmpeg

For Windows

Download FFmpeg from FFmpeg official website.
Extract the files and add the bin folder to your system's PATH.
Restart your terminal and verify installation with:
```
ffmpeg -version
```

Steps

1. Clone the Project

git clone https://github.com/buithanhdam/maowrag-unlimited-ai-agent.git
cd maowrag-unlimited-ai-agent

2. Configure Environment Variables

cp ./frontend/.env.example ./frontend/.env
cp ./backend/.env.example ./backend/.env

and fill values:

# For backend .env
# API key
GOOGLE_API_KEY=
OPENAI_API_KEY=
ANTHROPIC_API_KEY=
TAVILY_API_KEY=

# URL
BACKEND_API_URL=http://localhost:8000
QDRANT_URL=http://localhost:6333

# Database connection
MYSQL_USER=
MYSQL_PASSWORD=
MYSQL_ROOT_PASSWORD=
MYSQL_HOST=
MYSQL_PORT=
MYSQL_DB=
MYSQL_ALLOW_EMPTY_PASSWORD=yes

# AWS S3 connection
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
AWS_REGION_NAME=
AWS_STORAGE_TYPE=
AWS_ENDPOINT_URL=

# For frontend .env
NEXT_PUBLIC_BACKEND_API_URL=http://localhost:8001

3. Build and Run the Project

docker-compose up --build

4. Set Up MySQL Database (if needed)

docker exec -it your-container-name bash
mysql -u root -p

Enter root password (configured in .env or docker-compose.yml).

Run SQL queries:

CREATE USER 'user'@'%' IDENTIFIED BY '1';
GRANT ALL PRIVILEGES ON maowrag.* TO 'user'@'%';
FLUSH PRIVILEGES;
CREATE DATABASE maowrag;

5. Access the Application

Frontend: http://localhost:3000
Backend: http://localhost:8000
Qdrant: Ports 6333, 6334
MySQL: Port 3306

6. Stop the Project

docker-compose down

7. Project Structure

📦 maowrag-unlimited-ai-agent
├── backend/       # Backend source code
│   ├── Dockerfile.backend
│   ├── requirements.txt
├── frontend/      # Frontend source code
│   ├── Dockerfile.frontend
│   ├── next.config.js
├── docker-compose.yml  # Docker Compose setup
├── Jenkinsfile    # CI/CD configuration

8. Contributing

Contributions are welcome! Please submit an issue or a pull request to improve this project.

9. License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Agent Orchestrator with RAG, Web Search, and More

Overview

Table of Contents

1. Multi-Agent Orchestrator

Workflow:

2. Introduction to RAG

Key Components of RAG:

3. Advanced RAG Techniques

4. Other AI Technologies

5. Running Backend Only as API

6. Running the Project with Docker

Prerequisites

Prerequisites for Audio/Video Processing

For Ubuntu/Debian

For macOS (Homebrew)

For Windows

Steps

1. Clone the Project

2. Configure Environment Variables

3. Build and Run the Project

4. Set Up MySQL Database (if needed)

5. Access the Application

6. Stop the Project

7. Project Structure

8. Contributing

9. License

10. References

About

Releases

Packages

Contributors 2

Languages

License

buithanhdam/maowrag-unlimited-ai-agent

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent Orchestrator with RAG, Web Search, and More

Overview

Table of Contents

1. Multi-Agent Orchestrator

Workflow:

2. Introduction to RAG

Key Components of RAG:

3. Advanced RAG Techniques

4. Other AI Technologies

5. Running Backend Only as API

6. Running the Project with Docker

Prerequisites

Prerequisites for Audio/Video Processing

For Ubuntu/Debian

For macOS (Homebrew)

For Windows

Steps

1. Clone the Project

2. Configure Environment Variables

3. Build and Run the Project

4. Set Up MySQL Database (if needed)

5. Access the Application

6. Stop the Project

7. Project Structure

8. Contributing

9. License

10. References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages