Browse

UI-TARS

Open-source multimodal AI agent stack for desktop and browser automation

About

UI-TARS is an open-source stack designed to automate GUI interactions on local and remote computers using vision-language models. It features a desktop application and CLI that interpret natural language instructions to perform complex tasks through visual recognition and MCP tool integration.

Details
Built with
Unknown
Creator
Listed
Added to Dropday 2h ago
Evidence
Strong

The repository is an active, highly-starred project (37k+ stars) from a verified organization (Bytedance) with a live product homepage at agent-tars.com.

Timeline
Teaser
Video
Playable
Product

Loading…

UI-TARS — Dropday.ai