Cua is an open-source framework that enables AI agents to control full operating systems within high-performance, lightweight virtual containers. It delivers up to 97% of native CPU speed on Apple Silicon and works with any language model.
Cua
When we started building Computer-Use AI agents, we quickly realised that to be genuinely useful they must interact with your system's native applications. But giving an agent full access to your host device is risky and far from ideal. How can AI agents interact with operating systems, desktop applications, and browsers without jeopardizing security or sacrificing performance?
Today, we're thrilled to introduce Cua: an open-source framework that lets AI agents control full operating systems within secure, high-performance virtual containers.
Cua brings together three powerful components into one unified framework stack:
Lume - Our high-performance virtualization layer, which uses Apple's Virtualization.framework to spin up macOS or Linux environments.
Computer-Use Interface - This layer lets any process or AI agent - no matter which framework it's built on (like AutoGen, LangGraph, etc.) - see the screen and interact with Lume VMs just like a human would. It supports actions such as clicking, typing, scrolling, and extracting the visual accessibility tree.
Computer-Use Agent - A general agent framework to run intelligent workflows within these VMs. It's compatible with popular models and providers such as OpenAI, Anthropic, and UI-TARS, as well as tools such as Ollama, LM Studio, and OpenRouter.
AI agents built with Cua operate within a virtual environment to:
Navigate and interact with any application's interface
Read screen content and perform keyboard/mouse actions
Switch between applications and self-debug when needed
Operate in a secure sandbox with controlled file access
All of this happens inside a fully isolated virtual container, ensuring your host system, files, and sensitive data remain completely safe - while you continue using your device uninterrupted.
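To make the observe-and-act idea concrete, here is a minimal sketch of the kind of loop an agent runs against a sandboxed machine. It is not Cua's actual API: `ComputerInterface`, `FakeVM`, and `run_agent` are hypothetical names invented for illustration, the VM is an in-memory stand-in, and a hard-coded rule takes the place of the model that would normally choose each action.

```python
from dataclasses import dataclass, field
from typing import Protocol


class ComputerInterface(Protocol):
    """Hypothetical interface: what an agent needs from a sandboxed VM."""

    def screenshot(self) -> str: ...
    def click(self, x: int, y: int) -> None: ...
    def type_text(self, text: str) -> None: ...


@dataclass
class FakeVM:
    """In-memory stand-in for a VM so the sketch runs without virtualization."""

    focused: str = "desktop"
    typed: list[str] = field(default_factory=list)
    log: list[str] = field(default_factory=list)

    def screenshot(self) -> str:
        # A real interface would return pixels or an accessibility tree.
        return f"focused={self.focused}"

    def click(self, x: int, y: int) -> None:
        self.log.append(f"click({x},{y})")
        self.focused = "text_editor"  # pretend the click opened an editor

    def type_text(self, text: str) -> None:
        self.log.append(f"type({text!r})")
        self.typed.append(text)


def run_agent(vm: ComputerInterface, note: str, max_steps: int = 5) -> int:
    """Observe the screen, take one action, repeat until the note is typed.

    Returns the number of steps used. The if/else below is a placeholder
    for the decision a language model would make from the observation.
    """
    for step in range(1, max_steps + 1):
        observation = vm.screenshot()
        if "text_editor" not in observation:
            vm.click(100, 200)  # open the editor first
        else:
            vm.type_text(note)
            return step
    return max_steps
```

Calling `run_agent(FakeVM(), "hello")` performs one click and then one typing action. Because every action goes through the interface, the agent never touches anything outside the object it was handed, which is the property the isolated container provides at the OS level.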
But we didn’t stop there. We’ve also launched a cloud platform for Cua, enabling developers to spin up and control these remote containers via API - no need to run anything locally. This makes it easy to scale and deploy secure, cloud-hosted agents that interact with real OS environments - perfect for testing, remote execution, or powering AI-first products.
From CAD software to Photoshop, from Tableau to legacy tools with no APIs, Cua opens up entirely new possibilities for human-AI collaboration.
We built Cua because we believe AI should enhance the tools we already use - not replace them. It should operate alongside us, securely, transparently, and on our terms.
The journey is just beginning, and we can’t wait to see what you’ll build!
Check out our website at cua.sh
GitHub repo at github.com/trycua/cua
@pupposandro can I run my own custom GPT agent inside Cua?
Cua
@antonyo_demydov Hey Antonyo - just to clarify, when you say "custom GPT," do you mean setting up a custom prompt connected to different models? If so, you can absolutely do that! Let me know if you meant something else.
Den
@pupposandro @antonyo_demydov I was able to pretty easily
Manna
This could redefine cross-platform automation! Love the Apple Virtualization.framework integration for macOS devs! How does Cua handle permissions? Will it need human intervention for kernel-level actions?
Cua
@desmond_ren1 Thanks Desmond! For regular dev stuff (coding, testing, running apps), Cua handles permissions automatically within the VM - no need to worry! But for hardcore kernel-level actions or loading custom kexts, you'll need to step in manually. Cua intentionally doesn't automate these high-risk operations to protect your systems!
Den
@desmond_ren1 @pupposandro Wow you have VM-level permissions? Crazy
FuseBase
congrats @francesco_bonacci1 @pupposandro @morgan_dean
I’m not super technical, but would Cua make sense for someone just experimenting with AI tools?
Cua
@francesco_bonacci1 @morgan_dean @kate_ramakaieva Hi Kate, thanks for the kind words!
Cua is primarily designed for developers and more technical users who want to create AI agents that can control full operating systems in virtual environments. But we're also supporting non-technical teams in building complete automation workflows. If you'd be interested in a demo or a chat about it, here is my calendar: https://cal.com/alessandro-puppo-cfqtqe/cua-demo
Prit
While others say 'Finish with a few clicks', Cua will create an environment where users don't even need a single click!
Congratulations :)
Cua
@pritraveler Hey Yong, thanks for the awesome comment! 🙌
You nailed it - while other tools are still in the "few clicks" era, Cua is pushing toward true zero-click automation. The whole idea is to create these super secure, isolated environments where AI agents can just do their thing without constant human babysitting :)
Den
@pritraveler max 1 click haha!
FuseBase
congrats! @morgan_dean can I connect Cua to APIs or databases for more complex workflows?
Cua
@morgan_dean @kuzma_safonov Hi Kuzma - absolutely! You can definitely connect Cua to APIs and databases for more complex workflows. The whole system is designed with integration in mind.
cubic
Huge congrats on the launch—this is a game-changer for anyone working on agentic workflows or automation that needs real OS access. Love the focus on security and the flexibility for both local and cloud use.
Quick question: how are you thinking about scaling up concurrent agent sessions in the cloud? Is there any orchestration layer for managing lots of VMs/containers, or is that something you expect users to handle themselves? Would love to hear more about your roadmap for multi-agent or team scenarios!
Really excited to see where Cua goes next. 🚀
Super interesting project. Can it use Apple Events on macOS when the app supports it instead of relying on clicks? I ask because Photoshop supports it, allowing every action in the app to be recorded and automated. It's a much more native and efficient way to automate an app.