Web Research Agent

An AI agent that researches complex topics by browsing the web, extracting relevant information, recognizing entities, and generating structured reports. It combines a web browser, Google search (through the Serper API), and Gemini language models to answer research questions.

Features

- Automated Web Research: Search the web and browse pages to find information
- Entity Recognition: Automatically identify people, organizations, roles, and other entities
- Adaptive Search: Refine searches based on previously discovered information
- Information Synthesis: Combine information from multiple sources
- Task Analysis: Automatically determine the best approach to research tasks
- Structured Output: Organize findings into well-formatted reports
- Code Generation: Write code when required for data processing tasks

Architecture

graph TD
    A[Main] --> B[WebResearchAgent]
    B --> C1[Memory]
    B --> C2[Planner]
    B --> C3[Comprehension]
    B --> C4[ToolRegistry]
    
    C2 -->|Creates| D[Plan]
    D -->|Contains| E[PlanSteps]
    
    C4 -->|Registers| F1[SearchTool]
    C4 -->|Registers| F2[BrowserTool]
    C4 -->|Registers| F3[CodeGeneratorTool]
    C4 -->|Registers| F4[PresentationTool]
    
    C3 -->|Provides| G1[Task Analysis]
    C3 -->|Extracts| G2[Entities]
    C3 -->|Generates| G3[Summaries]
    
    B -->|Executes| H[Tasks]
    H -->|Produces| I[Results]
    
    style B fill:#f9f,stroke:#333,stroke-width:2px
    style C1 fill:#bbf,stroke:#333
    style C2 fill:#bbf,stroke:#333
    style C3 fill:#bbf,stroke:#333
    style C4 fill:#bbf,stroke:#333
    style F1 fill:#bfb,stroke:#333
    style F2 fill:#bfb,stroke:#333
    style F3 fill:#bfb,stroke:#333
    style F4 fill:#bfb,stroke:#333
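
The diagram shows a ToolRegistry that registers the search, browser, code generation, and presentation tools. As a rough illustration of that pattern only, a registry could look like the sketch below. The method names (`register`, `get`, `names`) and the lambda tools are hypothetical and are not taken from `tools/tool_registry.py`.

```python
from typing import Callable, Dict, List


class ToolRegistry:
    """Hypothetical registry that maps tool names to callables."""

    def __init__(self) -> None:
        self._tools: Dict[str, Callable[..., object]] = {}

    def register(self, name: str, tool: Callable[..., object]) -> None:
        # Refuse duplicate names so one tool cannot silently replace another.
        if name in self._tools:
            raise ValueError(f"tool already registered: {name}")
        self._tools[name] = tool

    def get(self, name: str) -> Callable[..., object]:
        try:
            return self._tools[name]
        except KeyError:
            raise KeyError(f"unknown tool: {name}") from None

    def names(self) -> List[str]:
        return sorted(self._tools)


# Stand-in tools; the real ones would call the search API or a browser.
registry = ToolRegistry()
registry.register("search", lambda query: f"results for {query}")
registry.register("browser", lambda url: f"content of {url}")
```

With a registry like this, a planner only needs a tool name per plan step, and the agent looks up the callable at execution time.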

Installation

Prerequisites

- Python 3.9 or higher
- pip (Python package installer)

Setup

Clone the repository:

git clone https://github.com/ashioyajotham/web_research_agent.git
cd web_research_agent

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Configuration

The agent requires two API keys:

- Gemini API key: For LLM services
- Serper API key: For Google search results

Setting up your API keys

Option 1: .env file (Recommended)

Create a .env file in the project root:

GEMINI_API_KEY=your_gemini_api_key
SERPER_API_KEY=your_serper_api_key

The agent will automatically load this file.
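
This README does not say how the file is loaded. As a minimal sketch of what loading a `.env` file involves, the standard-library-only code below parses `KEY=value` lines and copies them into the process environment without overwriting variables that are already set. The function names are hypothetical, and the project may well rely on a dedicated library instead.

```python
import os
from typing import Dict


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_env_file(path: str = ".env") -> Dict[str, str]:
    """Load a .env file into os.environ; existing variables take precedence."""
    if not os.path.exists(path):
        return {}
    with open(path, encoding="utf-8") as handle:
        values = parse_env_text(handle.read())
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values
```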

Option 2: Environment Variables

export GEMINI_API_KEY=your_gemini_api_key
export SERPER_API_KEY=your_serper_api_key

Option 3: Programmatically

from config.config_manager import init_config

config = init_config()
config.update('gemini_api_key', 'your_gemini_api_key')
config.update('serper_api_key', 'your_serper_api_key')

Additional Configuration Options

| Config Key | Environment Variable | Description | Default |
| --- | --- | --- | --- |
| gemini_api_key | GEMINI_API_KEY | API key for Google's Gemini LLM | - |
| serper_api_key | SERPER_API_KEY | API key for Serper.dev search | - |
| log_level | LOG_LEVEL | Logging level | INFO |
| max_search_results | MAX_SEARCH_RESULTS | Maximum number of search results | 5 |
| memory_limit | MEMORY_LIMIT | Number of items to keep in memory | 100 |
| output_format | OUTPUT_FORMAT | Format for output (markdown, text, html) | markdown |
| timeout | REQUEST_TIMEOUT | Default timeout for web requests (seconds) | 30 |
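
To make the mapping between environment variables and config keys concrete, here is a small sketch that resolves each setting from the environment and falls back to the listed default. The `SETTINGS` list and `resolve_config` function are illustrative assumptions; the actual logic lives in `config/config_manager.py` and may differ.

```python
import os
from typing import Dict, Mapping, Optional, Union

ConfigValue = Union[str, int, None]

# (config key, environment variable, default) as listed in the options table.
SETTINGS = [
    ("gemini_api_key", "GEMINI_API_KEY", None),
    ("serper_api_key", "SERPER_API_KEY", None),
    ("log_level", "LOG_LEVEL", "INFO"),
    ("max_search_results", "MAX_SEARCH_RESULTS", 5),
    ("memory_limit", "MEMORY_LIMIT", 100),
    ("output_format", "OUTPUT_FORMAT", "markdown"),
    ("timeout", "REQUEST_TIMEOUT", 30),
]


def resolve_config(env: Optional[Mapping[str, str]] = None) -> Dict[str, ConfigValue]:
    """Build a config dict from environment variables with table defaults."""
    source = os.environ if env is None else env
    config: Dict[str, ConfigValue] = {}
    for key, env_var, default in SETTINGS:
        raw = source.get(env_var)
        if raw is None:
            config[key] = default
        elif isinstance(default, int):
            # Numeric settings arrive as strings in the environment.
            config[key] = int(raw)
        else:
            config[key] = raw
    return config
```

Passing a plain dictionary as `env` makes the function easy to exercise without touching the real environment.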

Usage

Basic Usage

Create a text file with your research tasks, one per line:

# tasks.txt
Find the name of the COO of the organization that mediated secret talks between US and Chinese AI companies in Geneva in 2023.
By what percentage did Volkswagen reduce their Scope 1 and Scope 2 greenhouse gas emissions in 2023 compared to 2021?

Run the agent:
```
python main.py tasks.txt
```
Results will be saved to the results/ directory as Markdown files.
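
The one-task-per-line format can be read with a few lines of Python. The sketch below is an assumption about how such a file might be parsed, not the code in `main.py`; in particular, skipping blank lines and lines that start with `#` is a guess based on the `# tasks.txt` header in the example.

```python
from typing import List


def parse_tasks(text: str) -> List[str]:
    """Return one task per non-empty line, ignoring # comment lines."""
    tasks: List[str] = []
    for line in text.splitlines():
        task = line.strip()
        if task and not task.startswith("#"):
            tasks.append(task)
    return tasks
```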

Command Line Options

python main.py tasks.txt --output custom_output_dir

| Option | Description | Default |
| --- | --- | --- |
| task_file | Path to text file containing tasks | (required) |
| --output | Directory to store results | results/ |
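
For readers who want to see how these two options map onto Python's standard `argparse` module, here is a minimal sketch. The function names are hypothetical and the real `main.py` may define its parser differently.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Run research tasks from a text file.")
    # Positional argument: required by default.
    parser.add_argument("task_file", help="Path to text file containing tasks")
    parser.add_argument("--output", default="results/", help="Directory to store results")
    return parser


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    # argv=None makes argparse read sys.argv[1:].
    return build_parser().parse_args(argv)
```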

Project Structure

- agent/: Core agent components
  - agent.py: Main agent class
  - comprehension.py: Text understanding capabilities
  - memory.py: Memory management
  - planner.py: Plan creation and management
- tools/: Tools used by the agent
  - browser.py: Web browsing tool
  - search.py: Web search tool
  - code_generator.py: Code generation tool
  - presentation_tool.py: Information formatting
  - tool_registry.py: Tool registration system
- utils/: Utility functions
  - console_ui.py: Console interface
  - formatters.py: Output formatting
  - logger.py: Logging configuration
- config/: Configuration management
- main.py: Entry point

Advanced Usage

Entity Extraction

The agent can automatically identify and extract entities from content:

- People: Names of individuals
- Organizations: Companies, agencies, groups
- Roles: Job titles and organizational positions
- Locations: Physical places
- Dates: Temporal references

This feature helps the agent refine searches and identify key information.
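
One way to picture how extracted entities could feed back into search refinement is a small store that groups entities by type and appends them to a follow-up query. This is a hypothetical sketch: the class, its methods, and the type names are assumptions, and the extraction step itself (which the agent performs in `comprehension.py`) is not shown.

```python
from collections import defaultdict
from typing import DefaultDict, List

# Entity types mirroring the categories listed above.
ENTITY_TYPES = ("person", "organization", "role", "location", "date")


class EntityStore:
    """Hypothetical container for entities discovered during research."""

    def __init__(self) -> None:
        self._by_type: DefaultDict[str, List[str]] = defaultdict(list)

    def add(self, entity_type: str, value: str) -> None:
        if entity_type not in ENTITY_TYPES:
            raise ValueError(f"unknown entity type: {entity_type}")
        cleaned = value.strip()
        # Keep insertion order and skip duplicates.
        if cleaned and cleaned not in self._by_type[entity_type]:
            self._by_type[entity_type].append(cleaned)

    def get(self, entity_type: str) -> List[str]:
        return list(self._by_type.get(entity_type, []))

    def refine_query(self, base_query: str, entity_type: str) -> str:
        """Append known entities of one type, quoted, to a search query."""
        quoted = " ".join(f'"{value}"' for value in self.get(entity_type))
        return f"{base_query} {quoted}".strip()
```

For example, once an organization name has been found, a query for its COO can be narrowed by appending that name in quotes.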

Custom Output Formats

You can customize the output format by setting the output_format configuration:

from config.config_manager import init_config

config = init_config()
config.update('output_format', 'html')  # Options: markdown, json, html
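
To illustrate what switching formats means for the final report, the sketch below renders the same findings as Markdown or HTML. The `render_report` function is a hypothetical stand-in for the logic in `utils/formatters.py`, and it covers only these two formats.

```python
import html
from typing import List


def render_report(title: str, findings: List[str], output_format: str = "markdown") -> str:
    """Render a title and a list of findings in the requested format."""
    if output_format == "markdown":
        lines = [f"# {title}", ""] + [f"- {item}" for item in findings]
        return "\n".join(lines)
    if output_format == "html":
        # Escape user-visible text so scraped content cannot inject markup.
        items = "".join(f"<li>{html.escape(item)}</li>" for item in findings)
        return f"<h1>{html.escape(title)}</h1><ul>{items}</ul>"
    raise ValueError(f"unsupported output format: {output_format}")
```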

Troubleshooting

Common Issues

- URL Access Errors: Some websites block automated access. Try using a different source.
- API Rate Limiting: If you receive rate limit errors, space out your requests or use a premium API plan.
- Memory Issues: For very large research tasks, you may need to increase your system's memory allocation.
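
Spacing out requests after a rate limit error is commonly done with exponential backoff. The sketch below shows the idea under stated assumptions: `RateLimitError` is a hypothetical exception standing in for whatever the search or LLM client raises, and the agent itself may or may not retry this way.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical error raised when an API reports a rate limit."""


def call_with_backoff(
    func: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Retry func on RateLimitError, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return func()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # Out of attempts: surface the error to the caller.
            sleep(base_delay * (2 ** attempt))
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter is injectable so the retry schedule can be checked without actually waiting.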

Error Logs

Logs are stored in the logs/ directory for debugging.
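
For orientation, a file logger that writes to a `logs/` directory and honors the `LOG_LEVEL` setting can be set up with the standard `logging` module as sketched below. The logger name, the `agent.log` filename, and the format string are assumptions for illustration; see `utils/logger.py` for the project's actual configuration.

```python
import logging
import os
from typing import Optional


def configure_logging(
    log_dir: str = "logs",
    level_name: Optional[str] = None,
    filename: str = "agent.log",  # hypothetical filename
) -> logging.Logger:
    """Create a file logger whose level comes from LOG_LEVEL (default INFO)."""
    level_name = level_name or os.environ.get("LOG_LEVEL", "INFO")
    level = getattr(logging, level_name.upper(), logging.INFO)
    if not isinstance(level, int):
        level = logging.INFO  # Fall back if the name is not a level constant.
    os.makedirs(log_dir, exist_ok=True)
    logger = logging.getLogger("web_research_agent")
    logger.setLevel(level)
    if not logger.handlers:
        # Attach the file handler only once, even if called repeatedly.
        handler = logging.FileHandler(os.path.join(log_dir, filename), encoding="utf-8")
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
    return logger
```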

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. Check out the CONTRIBUTING.md file for more details.
