About Decepticon

Autonomous Hacking Agent for Red Team

p

Published by

purpleailab

Visit View Profile

README.md

View on GitHub

Decepticon — Autonomous Red Team Agent

"Another AI hacker? Let us guess — it runs nmap and writes a report."

Install

Prerequisites: Docker and Docker Compose v2. Supported on macOS (Apple Silicon + Intel), Linux (amd64 + arm64), and Windows (amd64 + arm64) — native via PowerShell or via WSL2 (Ubuntu / Kali).

macOS / Linux / WSL2

curl -fsSL https://decepticon.red/install | bash
decepticon onboard   # Interactive setup wizard (provider, API key, model profile)
decepticon           # Start the core stack and drop into the terminal CLI

The default start brings up the core management plane (LiteLLM, PostgreSQL, Neo4j, Skillogy, LangGraph, sandbox) and launches the terminal CLI. Specialist workloads (BloodHound CE, Sliver C2, Ghidra MCP, …) and the web dashboard come up on demand — the orchestrator spawns specialists via ops_start("ad") etc., and you bring up the dashboard from inside the CLI with /web (see Web Dashboard).

Windows (PowerShell, native)

irm https://decepticon.red/install.ps1 | iex
decepticon onboard
decepticon

→ Quick start · Full setup walkthrough

Use as a library (pip)

Building on top of the agents — a SaaS, a research integration, or a custom orchestrator? Install the SDK from PyPI:

pip install decepticon              # core SDK
pip install "decepticon[neo4j]"     # + the knowledge-graph attack-chain tools

decepticon is a client SDK: it ships the agent factories, middleware, tools, and skills, and routes LLM calls and sandbox execution to runtime services over HTTP (DECEPTICON_LLM__PROXY_URL, SAAS_SANDBOX_URL). Running agents still needs those services — use the Docker stack above, or point the URLs at your own equivalents. See Decepticon as a library for the factory override surface, declarative PluginBundle plugins, and the safety gate.

💖 Support Decepticon

We're building Decepticon toward an Offensive Vaccine for the AI-driven threat landscape. If you believe in autonomous red teaming as a path to stronger defense, consider supporting the project.

Benchmark

Decepticon — XBOW pass rate 102/104 (98.08%)

Benchmark	Difficulty	Pass Rate
XBOW validation-benchmarks	Easy (Level 1)	45 / 45 (100 %)
XBOW validation-benchmarks	Medium (Level 2)	50 / 51 (98.0 %)
XBOW validation-benchmarks	Hard (Level 3)	7 / 8 (87.5 %)
XBOW validation-benchmarks	All levels	102 / 104 (98.08 %)

What is Decepticon?

The "AI + hacking" space is full of demos that run nmap and print a report. That's not what this is.

Decepticon is a professional autonomous Red Team agent. It executes realistic attack chains — reconnaissance, exploitation, privilege escalation, lateral movement, C2 — the way a real adversary would, not the way a scanner does.

But more importantly: it operates under the discipline that separates red teamers from script kiddies. Before a single packet leaves the wire, Decepticon generates a complete engagement package — RoE, ConOps, Deconfliction Plan, and OPPLAN with MITRE ATT&CK mapping — and every action runs inside those defined rules.

→ Engagement workflow deep dive

Why Decepticon?

Real kill chains, not checkbox scans. Decepticon reads an OPPLAN and pursues objectives through whatever path opens up — pivoting, adapting, chaining techniques.

Interactive shells, actually. Real offensive tools are interactive (msfconsole, sliver-client, evil-winrm). Decepticon runs every command inside persistent tmux sessions with automatic prompt detection — so when a tool drops into an interactive prompt, the agent sends follow-up commands without workarounds.

Hardened sandbox isolation. All commands run inside a Kali Linux sandbox on a dedicated operational network (sandbox-net), separate from the management plane (decepticon-net). LangGraph drives the sandbox via the Docker socket. → Architecture

Offense serves defense. The planned Offensive Vaccine loop will turn findings into defense improvements through an attack → defend → verify cycle.

Architecture

Two-network design. The always-on management plane (LiteLLM, PostgreSQL, Skillogy, LangGraph) and the always-on sandbox plane stay up across the whole engagement; everything else is dynamic-spawn — the Web dashboard comes up on /web from the CLI, and specialist workloads (BloodHound CE, Sliver C2, Ghidra MCP, …) come up only when the orchestrator calls ops_start(...) (see ADR-0006). Networks: management on decepticon-net; sandbox + C2 server + targets on sandbox-net. Neo4j is dual-homed so the agent (on management) can persist findings written from inside the sandbox.

→ Architecture deep dive · Knowledge graph

Agents

16 specialist agents organized by kill chain phase, with a fresh context window per objective — no accumulated noise.

Orchestration · Reconnaissance · Exploitation · Post-Exploitation · Vulnerability Research · Domain Specialists (AD, Cloud, Smart Contracts, Reversing, Analyst).

→ Full agent roster and middleware stack

Models & Providers

Tier-based, credentials-aware fallback chain. You declare which credentials you have in priority order; Decepticon builds the primary→fallback chain at every tier from there.

Profile	Tier per agent	Use case
eco (default)	Per-agent (HIGH for orchestrator/exploiter/patcher/analyst, MID for execution, LOW for recon/soundwave)	Production
max	Every agent on HIGH	High-value targets
test	Every agent on LOW	Development / CI

Tier-mapped providers: Anthropic, OpenAI, Google Gemini, MiniMax, DeepSeek, xAI, Mistral, OpenRouter, Nvidia NIM, Ollama (local). Subscription OAuth: Claude Max/Pro/Team, ChatGPT Pro/Plus/Team, Gemini Advanced, Copilot Pro, SuperGrok, Perplexity Pro.

Configure via decepticon onboard. → Full model reference & fallback examples

Documentation

Topic	Doc
Installation and first engagement	Getting Started
Complete setup, OAuth, providers, dashboard	Setup Guide
All CLI commands and keyboard shortcuts	CLI Reference
All `make` targets	Makefile Reference
Agent roster and middleware	Agents
Model profiles and fallback chain	Models
Skill system and format spec	Skills
Web dashboard features and setup	Web Dashboard
System architecture and network isolation	Architecture
Neo4j knowledge graph	Knowledge Graph
End-to-end engagement workflow	Engagement Workflow
Offensive Vaccine loop	Offensive Vaccine
Contributing to Decepticon	Contributing

Contributing

git clone https://github.com/PurpleAILAB/Decepticon.git
cd Decepticon
make dogfood  # Full OSS UX (launcher → onboard → CLI) on local code
make dev      # Backend hot-reload (compose watch) — daily dev loop

→ Contributing guide

Community

Join the Discord — ask questions, share engagement logs, discuss techniques.

Disclaimer

Do not use this project on any system or network without explicit written authorization from the system owner. Unauthorized access to computer systems is illegal. You are solely responsible for your actions. The authors and contributors assume no liability for misuse.

License

Apache-2.0

Decepticon