AIMAXXING
Windows-first, local-model-friendly AgentOS for personal AI workers
简体中文文档 | English Docs | Development Standards

What AIMAXXING Is
AIMAXXING is a local-first autonomous agent runtime built around one public front agent and many narrow worker agents.
The product direction is Windows native first. WSL/Linux paths are used for development, compatibility checks, and bridge-based testing, but they are not the default product deployment story.
The current architecture optimizes for:
- one clear user-facing agent:
aimaxxingis the public conversation entry point - specialist workers: worker agents equip narrow tools or skills instead of dumping every tool into the front prompt
- local model ownership: local GGUF/ONNX/Windows ML style model flows are treated as first-class citizens
- governed memory: short-term continuity, long-term facts, RAG, and sensory writeback are separated but coordinated
- safe execution: tools, skills, browsers, shell commands, and external runtimes run behind explicit security and policy boundaries
- observable runtime behavior: traces, witness records, task state, resource pressure, and failures are meant to be diagnosable
Current Runtime Shape
User / Panel / Gateway
|
v
aimaxxing front agent
|
| delegate / shared board / memory / RAG / voice coordination
v
specialist worker agents
|
| explicitly equipped builtin tools or installed skills
v
files, browser, knowledge base, models, media, OS runtimes
The front agent should stay small and focused. It routes, remembers, retrieves, coordinates, and explains. Specialized workers execute the concrete tools.
Applications
| App | Role |
|---|---|
| apps/gateway | Local API gateway, OpenAI-compatible surface, chat/runtime coordination, and panel-facing service layer. |
| apps/panel | Desktop control panel for model configuration, agent setup, tool/skill equipment, chat, logs, and runtime operations. |
Crate Map
The crate list below is based on the current crates/*/README.md files. Each
crate owns a narrow part of the system; detailed behavior should live inside
the crate README, not in this root document.
Agent Brain And Runtime Policy
| Crate | Responsibility |
|---|---|
| brain | The cognitive runtime: foreground conversation, worker delegation, memory policy, context assembly, and agent execution flow. |
| hardness | Task-pressure classification, Reflexion gating, failure classification, and tool-first recovery decisions. |
| routing | Shared routing contracts, capability route hints, coordinator modes, and route helper logic extracted from brain/tool paths. |
| loop-guard | Repeat-loop protection for agent/tool execution without becoming the main reasoning policy. |
| intervention | Explicit user interruption and intervention policy, such as stop/wait style turn handling. |
| runtime-policy-core | Runtime-neutral policy data shapes for context windows, retry strategy, tool approval, and executor configuration. |
Memory, Knowledge, And Compression
| Crate | Responsibility |
|---|---|
| memory-core | Canonical memory-system architecture and shared memory data contracts. |
| memory-api | Backend-neutral memory service trait used by brain, tools, kernel, and durable stores. |
| engram | Durable local memory and retrieval substrate with redb storage, lexical search, vector search, lifecycle governance, and telemetry. |
| knowledge | Semantic orchestration over knowledge: fact promotion, knowledge graph logic, and retrieval refinement. |
| compression | Shared compression/output-shaping primitives for tool output, command output, RAG snippets, browser evidence, and future runtime compression work. |
Models, Sensory, And Resource Control
| Crate | Responsibility |
|---|---|
| inference | Local/cloud model execution substrate, inference factory, GGUF/Candle/ONNX-style backend work, VLM/OCR/audio support, and KV-cache machinery. |
| sensory | Sensory bus for STT, TTS, vision, OCR, audio, video, and model-backed perception plugins. |
| orchestrator | Resource arbitration for VRAM/RAM pressure, local-model workload gating, and hardware-aware runtime safety. |
| provider-core | Shared provider data contracts without binding to a full provider execution implementation. |
| providers | LLM provider adapters, provider classification, streaming parsing, and cloud/local provider bridge behavior. |
Tools, Skills, And Execution
| Crate | Responsibility |
|---|---|
| builtin-tools | Native Rust tool library for worker agents: files, browser, knowledge, media, desktop, vault, image, voice, and runtime surfaces. |
| runtimes | Zero-dependency skill execution runtimes: QuickJS, Bun/Node, Python provisioning, WASM, shell, and portable toolchain support. |
| security | Execution security: shell firewall, secret guard, OS sandboxing, WASM policy guard, audit, and secure process cleanup. |
| mcp | Model Context Protocol integration for external tool servers and data sources. |
| auth | Local secret vault, OS keyring integration, OAuth token handling, and API guard support. |
State, Scheduling, Telemetry, And Infrastructure
| Crate | Responsibility |
|---|---|
| kernel | AgentOS core boot/service layer, service factory, and global runtime wiring. |
| state | Durable task, snapshot, artifact, checkpoint, and state persistence on local storage. |
| scheduler | Scheduled job contracts and cron-like execution for delayed/repeated work and maintenance tasks. |
| telemetry | Runtime metrics, run traces, witness summaries, scorecards, and local observability. |
| infra | Operational backbone: gateway helpers, observation traits, prompt/cache utilities, and non-cognitive runtime support. |
Communication And Protocols
| Crate | Responsibility |
|---|---|
| comm | Internal A2A communication substrate for front-agent to specialist-worker coordination. |
| connectors | External messaging bridges such as Telegram, Discord, Slack, Feishu/DingTalk, and email. |
| protocol-core | Stable shared protocol types without runtime orchestration. |
Design Commitments
Windows Native Is The Product Path
AIMAXXING may use WSL bridges during development to test Windows-native model and browser behavior from a Linux workspace. The product target remains native Windows execution, with Windows model/runtime configuration exposed through the panel instead of hidden scripts.
Model Selection Belongs To Users
Language models are equipped on agents. Global small models and sensory models are configured through panel-owned model bindings. Runtime code should resolve models from configuration and factory detection, not from hardcoded model names.
Memory And RAG Are Related But Not The Same
Personal memory is governed fact and continuity state. RAG/knowledge is durable retrieval over imported or captured documents. Both share storage/retrieval infrastructure, but user intent and routing must keep them separate.
Tools Are Equipped, Not Assumed Global
Installed skills and most builtin tools are not automatically available to the front agent. A worker agent equips a narrow tool or skill, and the front agent delegates when needed. This keeps prompts smaller and failures easier to debug.
Real Agent Tests Matter
For agent behavior, panel/gateway chat tests are preferred over mock-only tests. Unit tests still matter, but routing, memory, multimodal, browser, and tool orchestration must be validated through real runtime paths when they affect user behavior.
Documentation Entry Points
- Documentation Hub, Chinese
- Documentation Hub, English
- Development Standards
- Memory Core
- Builtin Tools
- Engram
- Compression Migration Plan
Development Notes
Before changing architecture or shared behavior, read docs/DEVELOPMENT_STANDARDS_AGENTOS.md.
Useful checks:
cargo check --workspace
git diff --check
For narrower work, prefer crate-specific checks, for example:
cargo check -p aimaxxing-brain
cargo check -p aimaxxing-memory-core -p aimaxxing-memory-api
cargo check -p aimaxxing-builtin-tools
Built for local ownership, practical autonomy, and a Windows-first personal AI runtime that ordinary users can actually live with.