open-cuak
<img src="./apps/web/public/assets/icons/open-cuak-logo.png" alt="Open CUAK Logo">

<p align="center">
  <a href="https://aident.ai/cloud">Aident Cloud</a> ·
  <a href="https://docs.aident.ai/docs/get-started">Host locally</a> ·
  <a href="https://docs.aident.ai/">Documentation</a>
</p>

<p align="center">
  <img src="https://img.shields.io/github/stars/Aident-AI/open-cuak">
  <a href="https://discord.gg/SHT4etYuX2" target="_blank">
  <img src="https://img.shields.io/discord/1129411337727000606?logo=discord&labelColor=%20%235462eb&logoColor=%20%23f5f5f5" alt="Chat on Discord"></a>
  <a href="https://twitter.com/intent/follow?screen_name=Aident_AI" target="_blank">
  <img src="https://img.shields.io/twitter/follow/Aident_AI?logo=X" alt="Follow on X"></a>
</p>

<h1 align="center">🤖 - Free OpenAI Operator alternative - 👥</h1>

Open CUA (Computer Use Agent) Kit, or Open-CUAK (pronounced "quack" 🦆🗣️), is THE platform for teaching, hiring and managing automation agents at scale, starting with browsers.

### 🎯 Why Open-CUAK?

In the real world, for real businesses, working with real people, reliability is everything. When automation becomes reliable, it becomes scalable. And when it becomes scalable, it becomes profitable.

That’s why Open-CUAK is designed to run and manage **thousands of** automation agents, ensuring each one is **reliable**.

This project is still in its very early days, but our team is working very hard to make it a reality soon. This is just the beginning of a new era in work, a new path to a world of **abundant productivity**.

And when productivity becomes truly abundant, we want to make sure it is **equally distributed**. That's why we are open-sourcing it today.

_Read more in our [launch blog](https://aident.ai/blog/openai-operator-open-source-alternative)._

---

<p align="center">❤️ Built and open-sourced by the <a href="https://aident.ai">Aident AI</a> team.
❤️</p>

<p align="center"><a href="https://aident.ai"><img src="./apps/web/public/assets/icons/aident-logo-rounded-512.png" alt="Aident AI Logo" width="30" height="30"></a></p>

## Quick Start

### 👉 Start Local Production Build

0. (optional) Make sure you have [`brew`](https://brew.sh/) for package management

   > Works on Mac and Linux. For Windows, use WSL2 for now.

   ```bash
   /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

   # (optional) on Linux, if `brew` command is not available in terminal, use this to register `brew`
   test -d ~/.linuxbrew && eval "$(~/.linuxbrew/bin/brew shellenv)"
   test -d /home/linuxbrew/.linuxbrew && eval "$(/home/linuxbrew/.linuxbrew/bin/brew shellenv)"
   echo "eval \"\$($(brew --prefix)/bin/brew shellenv)\"" >> ~/.bashrc

   # (optional) verify the successful installation of `brew`
   brew doctor
   ```

1. Install Open-CUAK package

   ```bash
   brew install Aident-AI/homebrew-tap/open-cuak

   # or use this to update to the latest version
   brew update && brew upgrade Aident-AI/homebrew-tap/open-cuak
   ```

2. Start Open-CUAK services

   > Downloading images can take a while. (Sorry! We will optimize this soon.)

   ```bash
   open-cuak start
   ```

3. Ta-da! It is now ready locally at [http://localhost:11970](http://localhost:11970).

   > Don't forget to go to the ⚙️ Configurations page to set your OpenAI or other major model API key to chat with Aiden!

<p align="center"> </p>

## Demos

1. ### Agent Demo #1: Canva Use Agent

   An automation agent uses Canva to create a poster for President's Day. When it encounters issues, such as whether to use Pro templates (which require a paid Canva plan), it proactively asks the user for additional instructions.

   https://github.com/user-attachments/assets/f283189b-bc90-4875-8bd5-75b2a6a4bf9f

2. ### Agent Demo #2: Expedia Flight Search

   An agent uses the built-in remote browser, running locally, to search for flight tickets on Expedia.

   https://github.com/user-attachments/assets/e2dd7276-fff7-4ed1-a042-3c1a6bbecef8

3. ### Feature Demo: Account Management

   Account Management lets agents use your accounts and tools on your behalf.

   https://github.com/user-attachments/assets/1de1069d-053f-408f-b07b-61b399bfcc1f

<p align="center"> </p>

<p align="center">Watch more demos on our <a href="https://www.youtube.com/@aident-ai">YouTube channel</a>, and subscribe for new ones.</p>

<p align="center"> </p>

## Core Features

- ✅ Run Operator-like automation workflows locally, ensuring full privacy
- ✅ Use vision-based automation with more flexibility and reliability, just like a human
- ✅ Turn any browser into an Operator-companion, with a browser extension
- ✅ Use a dedicated remote browser to mitigate the associated risks, without sharing your own
- ✅ Use any vision-compatible model, whether frontier or open-source (Claude, Gemini, LLaVA, etc.)
- ✅ Bypass frustrating bot detection, unlocking more automation possibilities
- ✅ Manage cookies for easy login, without the need for manual re-login
- 🔜 Teach agents new workflows reliably, with SOP-based training
- 🔜 Centralize all account access in one place, managing everything agents have access to
- ⏳ Monitor and manage a large number of tasks, with built-in observability tools
- ⏳ Deploy and scale hundreds of agents to execute real-world tasks, in parallel
- ⏳ Open source an RL-trained CUA model to run automations, for free

<p align="center"> </p>

## Development Setup

### ⚡ Start Development Servers

1. Install dependencies.

   ```bash
   # on mac
   brew install node

   # on linux
   sudo apt install nodejs npm
   ```

   ```bash
   npm install -g pnpm
   ```

   ```bash
   # at repo root

   # on mac
   npm run init:mac

   # on linux
   npm run init:linux
   ```

2. Start the dev servers.

   ```bash
   # at repo root
   npm run dev
   ```

3. Now, it is ready locally at [http://localhost:3000](http://localhost:3000).

### ▶️ Build and Run Production Build from Local

1. Run local production build (with `docker`).

   ```bash
   # at repo root
   npm run docker:start
   ```
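
Because the services can take a while to come up, it may help to wait until the local URL actually responds before opening it. The sketch below is a hypothetical helper and is not part of Open-CUAK: `retry_until` is a name introduced here for illustration, and the commented example assumes `curl` is installed. The URLs in the comments are the ones this README lists (`http://localhost:3000` for the dev servers, `http://localhost:11970` for the Quick Start build).

```bash
#!/usr/bin/env bash
# Hypothetical helper (not shipped with Open-CUAK): retry a command until it succeeds.
# Usage: retry_until <max_attempts> <delay_seconds> <command> [args...]
retry_until() {
  local max_attempts="$1" delay="$2"
  shift 2
  local attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    # Run the given command; stop as soon as it exits with status 0.
    if "$@"; then
      echo "ready after ${attempt} attempt(s)"
      return 0
    fi
    attempt=$((attempt + 1))
    sleep "$delay"
  done
  echo "gave up after ${max_attempts} attempt(s)" >&2
  return 1
}

# Example calls, left commented out so sourcing this file has no side effects:
# retry_until 30 2 curl -fsS -o /dev/null http://localhost:3000
# retry_until 30 2 curl -fsS -o /dev/null http://localhost:11970
```

The helper takes the probe as an arbitrary command rather than hard-coding `curl`, so the same loop can wrap any other check you prefer.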