Home
Softono
i

ifuryst

Professional software vendor delivering innovative solutions on the Softono platform. Specialized in both open-source and proprietary software development.

Total Products
2

Software by ifuryst

open-codex-computer-use
Open Source

open-codex-computer-use

# open-computer-use [![English](https://img.shields.io/badge/English-Click-yellow)](./README.md) [![简体中文](https://img.shields.io/badge/简体中文-点击查看-orange)](./README.zh-CN.md) [![Release](https://img.shields.io/github/v/release/iFurySt/open-codex-computer-use)](https://github.com/iFurySt/open-codex-computer-use/releases) [![Ask DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/iFurySt/open-codex-computer-use) <a href="https://llmapis.com?source=https%3A%2F%2Fgithub.com%2FiFurySt%2Fopen-codex-computer-use" target="_blank"><img src="https://llmapis.com/api/badge/iFurySt/open-codex-computer-use" alt="LLMAPIS" width="20" /></a> > [!TIP] > Interested in Browser Use? Check out [open-browser-use](https://github.com/iFurySt/open-codex-browser-use). --- `open-computer-use` is an open-source `Computer Use` service wrapped as `MCP`. Any AI agent or MCP client can use it to run Computer Use on macOS, Linux, and Windows. This project was inspired by OpenAI's [Codex Computer Use](https://openai.com/index/codex-for-almost-everything/). It showed that non-intrusive CUA can be built on top of Accessibility, so I decided to build an open-source version. I started this repo with my [harness template](https://github.com/iFurySt/harness-template), a template for quickly spinning up AI-first projects. It has been one of our most useful workflows lately, especially for nearly 100% AI-generated projects. I also wrote [a post](https://www.ifuryst.com/blog/2026/speedrunning-the-ai-era/) about the methodology behind it. ## Demos ### Codex App and Codex CLI [![Open Computer Use custom demo cover](./docs/generated/readme-assets/open-computer-use-demo-cover.png)](https://youtu.be/2s6aVpGiwaQ) <sub><em>`open-computer-use` used as Computer Use in Codex App and Codex CLI, matching the official experience.</em></sub> ### Gemini CLI https://github.com/user-attachments/assets/eacb3b15-f939-46c7-b3b3-6f876977a58d <sub><em>Gemini CLI connects to `open-computer-use` through MCP and runs full Computer Use actions.</em></sub> ### Linux https://github.com/user-attachments/assets/e036b1c8-2200-4896-abd4-19225915cf66 <sub><em>`open-computer-use` running on Linux.</em></sub> ## Quick Start ```bash npm i -g open-computer-use ``` **On macOS, run it once and grant `Accessibility` and `Screen Recording`. Windows and Linux do not need this step.** ```bash open-computer-use ``` Before using it, install it into your agent: ```bash # Install into Codex by writing to ~/.codex/config.toml open-computer-use install-codex-mcp ``` Or add it to your own client manually: ```json { "mcpServers": { "open-computer-use": { "command": "open-computer-use", "args": ["mcp"] } } } ``` ### Skill Install the skill directly: ```bash # Install for Codex npx skills add iFurySt/open-codex-computer-use -g -a codex --skill open-computer-use -y npx skills ls -g -a codex | rg 'open-computer-use' ``` Install for Claude Code: ```bash npx skills add iFurySt/open-codex-computer-use -g -a claude-code --skill open-computer-use -y ``` Update an existing global install, including the Codex install created above: ```bash npx skills update open-computer-use -g -y ``` You can also manually download and install the [`open-computer-use` skill](./skills/open-computer-use). ## More Besides the MCP JSON config above, you can also use the built-in commands: ```bash # Install into Codex by writing to ~/.codex/config.toml open-computer-use install-codex-mcp # Install as a Codex plugin, mainly for Codex App open-computer-use install-codex-plugin # Install into Claude Code by writing to ~/.claude.json open-computer-use install-claude-mcp # Install into Gemini CLI for the current project by writing to ./.gemini/settings.json open-computer-use install-gemini-mcp # Install into Gemini CLI user config instead open-computer-use install-gemini-mcp --scope user # Install into opencode by writing to ~/.config/opencode/opencode.json (or the active config file) open-computer-use install-opencode-mcp # Call a single Computer Use tool and print the MCP-style JSON result open-computer-use call list_apps open-computer-use call get_app_state --args '{"app":"TextEdit"}' # Run a sequence in one process so element_index state can be reused # Sequence runs sleep 1s between successful operations by default open-computer-use call --calls '[{"tool":"get_app_state","args":{"app":"TextEdit"}},{"tool":"press_key","args":{"app":"TextEdit","key":"Return"}}]' open-computer-use call --calls-file examples/textedit-overlay-seq.json --sleep 0.5 # Check permissions; onboarding only opens when something is missing open-computer-use doctor # Run local validation from a source checkout make smoke OPEN_COMPUTER_USE_STRESS_LOOPS=20 make stress make agent-smoke make agent-smoke SCENARIO=fixture-full node ./scripts/run-agent-smoke-tests.mjs --agents=claude,codex --command=open-computer-use node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture --agents=claude,codex --command=open-computer-use node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=claude,codex --command=open-computer-use OPEN_COMPUTER_USE_HERMES_PROVIDER=anthropic OPEN_COMPUTER_USE_HERMES_MODEL=claude-opus-4-20250514 make agent-smoke AGENTS=hermes SCENARIO=fixture-full node ./scripts/run-agent-smoke-tests.mjs --agents=hermes --hermes-provider=anthropic --hermes-model=claude-opus-4-20250514 node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture --agents=hermes --hermes-provider=anthropic --hermes-model=claude-opus-4-20250514 node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=hermes --hermes-provider=anthropic --hermes-model=claude-opus-4-20250514 --hermes-max-turns=12 # Show help open-computer-use -h ``` ## Cursor Motion Cursor Motion is an open-source cursor motion system for macOS, based on public information shared by members of the Software.Inc team. You can download the app from the [Releases page](https://github.com/iFurySt/open-codex-computer-use/releases). [![Cursor Motion custom demo cover](./docs/generated/readme-assets/cursor-motion-demo-cover.png)](https://youtu.be/KRUq5GUHv1Q) ## Star History <a href="https://www.star-history.com/?repos=iFurySt%2Fopen-codex-computer-use&type=date&legend=top-left"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/chart?repos=ifuryst/open-codex-computer-use&type=date&theme=dark&legend=top-left" /> <source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/chart?repos=ifuryst/open-codex-computer-use&type=date&legend=top-left" /> <img alt="Star History Chart for open-computer-use" src="https://api.star-history.com/chart?repos=ifuryst/open-codex-computer-use&type=date&legend=top-left" /> </picture> </a> ## License [MIT](./LICENSE)

AI Agents RPA Code Editors & IDEs
1K Github Stars
open-browser-use
Open Source

open-browser-use

# Open Browser Use [![English](https://img.shields.io/badge/English-Click-yellow)](./README.md) [![简体中文](https://img.shields.io/badge/简体中文-点击查看-orange)](./README.zh-CN.md) [![Release](https://img.shields.io/github/v/release/iFurySt/open-browser-use)](https://github.com/iFurySt/open-browser-use/releases) [![npm SDK](https://img.shields.io/npm/v/open-browser-use-sdk?label=npm%20SDK)](https://www.npmjs.com/package/open-browser-use-sdk) [![PyPI SDK](https://img.shields.io/pypi/v/open-browser-use-sdk?label=PyPI%20SDK)](https://pypi.org/project/open-browser-use-sdk/) [![Go SDK](https://pkg.go.dev/badge/github.com/ifuryst/open-browser-use/packages/open-browser-use-go.svg)](https://pkg.go.dev/github.com/ifuryst/open-browser-use/packages/open-browser-use-go) [![Ask DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/iFurySt/open-browser-use) --- > [!TIP] > Interested in Computer Use? Check out [open-computer-use](https://github.com/iFurySt/open-codex-computer-use). `open-browser-use` is a browser automation layer that stays neutral across agent runtimes. It is also an open-source alternative to the Chrome Browser Use capability recently shipped in Codex.app. For the story behind it, see the [Browser Use Deep Dive](https://www.ifuryst.com/en/blog/2026/open-browser-use/). Under the hood, it pairs a browser extension with the `open-browser-use` CLI. You can integrate it through the JavaScript SDK, Python SDK, Go SDK, or the CLI. https://github.com/user-attachments/assets/bcfba878-f6a8-44b9-b84b-29c7e0285687 ## Quick Start ```bash npm i -g open-browser-use open-browser-use setup ``` ### Install the CLI ```bash # npm (macOS, Linux, Windows) npm i -g open-browser-use # Homebrew (macOS, Linux) brew tap iFurySt/open-browser-use && brew install open-browser-use # Upgrade npm update -g open-browser-use ``` ### Set Up Chrome Register the native host for the extension. The setup command also opens the Chrome Web Store page so you can install or enable the matching Chrome extension, then restart Chrome if needed. ```bash open-browser-use setup ``` If the Chrome Web Store item is temporarily unavailable, use `open-browser-use setup beta` to prepare a keyed release ZIP for manual installation from `chrome://extensions/`. ### Use It #### SDK ```bash # JavaScript / TypeScript npm install open-browser-use-sdk # Python pip install open-browser-use-sdk # Go go get github.com/ifuryst/open-browser-use/packages/open-browser-use-go ``` The SDK package name is `open-browser-use-sdk` on both npm and PyPI. Python code imports it as `open_browser_use`; Go code imports the SDK package as `obu`. #### Skill Install the skill directly: ```bash # Install for Codex npx skills add iFurySt/open-browser-use -g -a codex --skill open-browser-use -y npx skills ls -g -a codex | rg 'open-browser-use' codex exec --skip-git-repo-check "Use open-browser-use to check today’s Hacker News and summarize the most worth-reading posts." # Install for Claude Code npx skills add iFurySt/open-browser-use -g -a claude-code --skill open-browser-use -y ``` Update an existing global install, including the Codex install created above: ```bash npx skills update open-browser-use -g -y # `upgrade` is an alias for `update` npx skills upgrade open-browser-use -g -y ``` You can also manually download and install the [`open-browser-use` skill](./skills/open-browser-use), then start using it from your agent. Downloadable `.skill` and `.zip` packages are available in [GitHub Releases](https://github.com/iFurySt/open-browser-use/releases). #### MCP Install the MCP server into all supported global agent configs: ```bash npx add-mcp "obu mcp" --name open_browser_use --all -g -y npx add-mcp list -g ``` You can also configure an agent runtime with local MCP stdio support manually: ```toml [mcp_servers.open_browser_use] command = "obu" args = ["mcp"] ``` The server exposes browser tools for tab listing, opening, claiming, navigation, CDP, action plans, and cleanup. ## License [MIT](./LICENSE)

AI Agents Browser Automation
140 Github Stars