CompUse

Computer User "agent" PydanticAI + PyAutoGUI + Puppeteer (MCP server)

Overview

We have combined Pydantic AI agent with MCP so that we register MCP tools as agent tools to a pyantic agent. at the same time we implement nice GUI tools for the agent to use. and pull puppetter MCP server to use its tools for browser use. Idea is to experiment with the right abstraction layer of tools to implement for a decent computer use agent.

Features

Desktop GUI automation with PyAutoGUI
Web browser automation with Puppeteer MCP
Voice (TODO) /text-based computer control
Screenshot-based interaction (may be need to figure out things like bounding box etc to localize buttons windows)
Cross-platform support (macOS, Windows, Linux) -- haven't tested on windows..

How It Works

CompUse creates a bridge between Pydantic AI and MCP by:

Starting a Puppeteer MCP server as a subprocess
Querying available tools from the MCP server
Dynamically generating Pydantic AI-compatible tool wrappers for each MCP tool
Registering both GUI tools and MCP tools with a single agent
Providing a unified interface for users to control their computer with natural language

Installation

pip install -r requirements.txt
npm install -g @modelcontextprotocol/server-puppeteer

Quick Start

Start the GUI agent (desktop control only):
```
python gui_agent_example.py
```
Start the combined agent (desktop + browser):
```
python agent.py
```

Requirements

Python 3.7+
Node.js 16+
OpenAI API key (set in .env file)

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
misc		misc
tests		tests
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
agent.py		agent.py
agent_manager.py		agent_manager.py
audio_cli.py		audio_cli.py
cli.py		cli.py
example_gui_deps.py		example_gui_deps.py
gui_agent_example.py		gui_agent_example.py
gui_tools.py		gui_tools.py
requirements-audio.txt		requirements-audio.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CompUse

Overview

Features

How It Works

Installation

Quick Start

Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

swairshah/CompUse

Folders and files

Latest commit

History

Repository files navigation

CompUse

Overview

Features

How It Works

Installation

Quick Start

Requirements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages