Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
deep-reinforcement-learning r1 multimodal o1 multimodal-large-language-models large-multimodal-models gui-agent grpo mllm-reasoning
-
Updated
Apr 26, 2025 - Python