Open-source multimodal AI agent stack designed to control computers and browsers via vision.

Library

GitHub Repository

agenticvisionframework