UI-TARS Desktop is a native GUI Agent from Bytedance that runs locally. It can operate desktop apps, open files, browse websites, and automate multi-step tasks without sending data to the cloud. The repo is 100% open source under Apache 2.0 license.
UI-TARS Desktop is the desktop-focused piece of the broader TARS multimodal agent stack. It provides a native GUI Agent based on the UI-TARS model, with operators for local machines, remote machines, and the browser. If you are exploring desktop automation agents, you might also check out how to run Hermes Agent Desktop as a native Windows AI agent — another approach to local agent automation.

How It Works
UI-TARS Desktop exposes a set of operators that can interact with graphical elements on screen, simulate input, and combine vision with LLM reasoning to perform tasks that mimic human interaction. It ships with a CLI and a Web UI for control and debugging.
git clone https://github.com/bytedance/UI-TARS-desktop.git cd UI-TARS-desktop # follow the README to install dependencies and run the desktop app locally
| Feature | Notes |
|---|---|
| Local automation | Runs offline, no cloud dependency |
| GUI operators | Click, type, detect UI elements via vision models |
| Browser operators | Open pages, fill forms, download artifacts |
| Extensible | CLI, Web UI, and plugin-style operators |


Key Considerations
- Data isolation — Run in a sandboxed environment or dedicated VM since it requires broad desktop access
- Performance — Automation speed can be slower for trivial tasks versus native scripts
- Security — A local agent controlling your desktop is equivalent to giving someone programmatic access to your machine
For hardening agent environments, also see how to harden AI agent runtimes with NVIDIA NemoClaw — a complementary tool for securing agent deployments.
What the Community Says
“It’s like give your personal computer to hackers” — @a7tiony
“It takes a lot of time to automate that stuff which we do in seconds, it takes minutes like just sending a file or saving a file takes 1-2min” — @dr.manhattan000
Quick Start
git clone https://github.com/bytedance/UI-TARS-desktop.git ls UI-TARS-desktop # open the README and follow platform-specific install steps
Running a local agent that controls your desktop is powerful but requires trust and careful sandboxing. Treat it like physical access and lock it down accordingly.
Project link:
https://github.com/bytedance/UI-TARS-desktop
- Vibecode Terminal, Unified Agent Sandbox
Textarea.my by antonmedv is a tiny, client side text editor that embeds the note directly into the URL using deflate..
- Vibecode Terminal, Unified Agent Sandbox
NemoClaw is an early preview reference stack from NVIDIA that wraps an agent runtime with guided onboarding, a hardened blueprint,..
- Vibecode Terminal, Unified Agent Sandbox
MultipleWindow3dScene is a demo that synchronizes a single 3D world across infinite browser windows or screens in real time using..
- Vibecode Terminal, Unified Agent Sandbox
MapToPoster is a lightweight generator that renders minimalist city maps from any location you type, adds stylized text and coordinates,..
- Vibecode Terminal, Unified Agent Sandbox
Hermes Agent Desktop by RedWoodOG wraps the NousResearch Hermes Agent in a WinUI 3, .NET 10 desktop app. It gives..
- Vibecode Terminal, Unified Agent Sandbox
Hermes Agent is an open-source, terminal-first AI assistant from Nous Research that lives in your terminal or on a server…
