Skip to content
View domsteil's full-sized avatar
💭
building the stateset blockchain
💭
building the stateset blockchain

Block or report domsteil

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.

TypeScript 107 7 Updated Mar 10, 2025

Verifiers for LLM Reinforcement Learning

Python 725 75 Updated Mar 23, 2025

Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents

TypeScript 4,065 329 Updated Mar 12, 2025

Tiny data-over-sound library

C++ 6,266 349 Updated Mar 20, 2025

Exploring Applications of GRPO

Python 120 11 Updated Mar 28, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 35,689 8,660 Updated Oct 11, 2024

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 255 15 Updated Feb 24, 2025

Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna

Python 39 5 Updated Feb 4, 2025

Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)

HTML 1,359 343 Updated Feb 18, 2019
TypeScript 6 1 Updated Feb 2, 2025

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 10,990 794 Updated Mar 29, 2025

Make websites accessible for AI agents

Python 49,972 5,239 Updated Mar 29, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,404 1,443 Updated Mar 10, 2025

An open infrastructure to democratize and decentralize the development of superintelligence for humanity.

Rust 215 23 Updated Mar 26, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 426 48 Updated Sep 27, 2024

npm for design engineers: largest marketplace of shadcn/ui-based React Tailwind components, blocks and hooks

TypeScript 4,098 177 Updated Mar 28, 2025

Autonomous agents for everyone

TypeScript 15,273 4,973 Updated Mar 29, 2025

A react-based starter app for using the Live API over websockets with Gemini

TypeScript 1,935 464 Updated Mar 27, 2025

LLM code

TypeScript 750 161 Updated Nov 30, 2024
Python 84 23 Updated Feb 28, 2025

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ

Python 2,163 273 Updated Mar 29, 2025

Every AI Agent deserves a wallet.

Python 619 374 Updated Mar 28, 2025

A full-featured, hackable Next.js AI chatbot built by Vercel

TypeScript 14,349 3,739 Updated Mar 22, 2025

Business Data Benchmark (BDB) is a set of real-world questions to evaluate AI systems connected to business data.

22 Updated Dec 3, 2024

🙌 OpenHands: Code Less, Make More

Python 51,525 5,713 Updated Mar 29, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 8,409 1,433 Updated Mar 24, 2025

A testnet open-source Layer 2 from the future, co-designed with the developer tools stack.

Rust 277 76 Updated Mar 5, 2025

Official inference framework for 1-bit LLMs

C++ 12,850 907 Updated Feb 18, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 19,472 2,076 Updated Mar 11, 2025
Next
Showing results