📖 Full doc index → · 🌐 flow-next.dev · 👥 Teams guide · 💬 Discord
Flow-Next is an AI agent orchestration plugin. Twenty-six agent-native skills for the full lifecycle: idea → spec → tasks → review → ship → maintain. Bundled task tracking, dependency graphs, re-anchoring before every task, multi-model reviews, decay-aware project memory, GitHub PR creation and resolution, agent-readiness audits. Everything lives in your repo — no external services, no global config. Uninstall: delete .flow/.
- Spec-first. Every unit of work belongs to a spec
fn-N. Tasksfn-N.Minherit context. - Fresh-context workers. Each task runs in its own subagent. No token bleed between tasks.
- Cross-model reviews. A different model (RepoPrompt / Codex / Copilot) gates every implementation.
- R-IDs frozen at handover. Acceptance criteria numbered once, never renumbered.
First-class on Claude Code, OpenAI Codex (CLI + Desktop), and Factory Droid. Also runs on xAI Grok Build and Cursor (local plugin), plus OpenCode via the community port.
🆕 v1.0+ —
flowctl epic→flowctl spec. The 1.0 release renames the canonical primitive across the entire flow-next surface. All 0.x scripts and CLAUDE.md examples keep working — the legacy CLI is preserved as a deprecation alias layer through all of 1.x. See the CHANGELOG for the migration path (interactive via/flow-next:setupor deterministic viaflowctl migrate-rename --yes, both transactional with rollback).
| Claude Code | OpenAI Codex | Factory Droid |
/plugin marketplace add \
https://github.com/gmickel/flow-next
/plugin install flow-next
/reload-plugins
/flow-next:setup |
git clone https://github.com/gmickel/flow-next.git
cd flow-next
./scripts/install-codex.sh flow-next
# then: /flow-next:setup |
droid plugin marketplace add \
https://github.com/gmickel/flow-next
# /plugins → install flow-next |
Why a script for Codex? Codex's plugin protocol only registers skills from plugin.json — not custom .toml agents or hooks. install-codex.sh merges all 21 agents + hooks into ~/.codex/config.toml. Idempotent — safe to re-run. Full platform matrix + community ports in docs/platforms.md.
Grok Build (xAI)? If flow-next is already installed in Claude Code, Grok Build picks it up automatically — grok inspect shows the skills + hook loaded, zero extra setup. The /flow-next:* commands run when typed and the multi-agent flows work (a full /flow-next:plan fanned out all seven scout subagents end-to-end, verified). Grok's slash autocomplete + grok inspect just under-list flow-next's commands/agents — cosmetic, they work when invoked. (Don't grok plugin install the repo — it's a marketplace, not a single plugin.) See docs/platforms.md.
/flow-next:capture # 1. Synthesize conversation → .flow/specs/<id>.md
/flow-next:plan <spec-id> # 2. Break the spec into dependency-ordered tasks
/flow-next:work <spec-id> # 3. Execute tasks in fresh-context worker subagents
/flow-next:make-pr <spec-id> # 4. Render a cognitive-aid PR body (9 input streams)
/flow-next:resolve-pr <PR#> # 5. Fetch review threads → triage → resolveThat's the inner loop. Branch in (/flow-next:prospect for ranked candidates, /flow-next:interview for structured discovery), branch out (/flow-next:ralph-init for autonomous overnight runs, /flow-next:audit for memory garbage collection).
flowchart LR
Idea([💡 Idea]) --> P[/flow-next:prospect/]
Idea --> C[/flow-next:capture/]
P --> C
P -.->|direct via promote| L[/flow-next:plan/]
C --> L
C --> I[/flow-next:interview/]
I --> L
L --> W[/flow-next:work/]
W --> R[/flow-next:impl-review/]
R -->|SHIP| Q[/flow-next:qa/]
R -->|NEEDS_WORK| W
Q -->|YES| Done([🚀 Ship])
Q -->|NO| W
Done -.maintenance.-> A[/flow-next:audit/]
A -.-> M[(.flow/memory/)]
/flow-next:qais an opt-in live-app QA stage (after work / around make-pr) — it drives the deployed app like a real user and only runs when there's a live deploy + a driver; with neither it surfaces the limitation rather than blocking.
The loop is spec-driven. Each step below maps to one skill; click through to flow-next.dev for the full page.
Either synthesize an existing conversation into a structured spec (source-tagged, mandatory read-back), or — when starting from scratch — generate ranked candidate ideas grounded in the repo. Both land in .flow/specs/<id>.md.
/flow-next:capture # from a conversation
/flow-next:prospect <focus-hint> # from a focus hint (concept, path, constraint, volume)→ flow-next.dev/skills/capture · flow-next.dev/skills/prospect
Deep Q&A pass over a spec or task: lead-with-recommendation, confidence tiers, codebase-first investigation. Use to flesh out an ambiguous spec before breaking it down. --scope=business|technical|both symmetrically narrows the pass.
/flow-next:interview <spec-id>→ flow-next.dev/skills/interview
Research the codebase, then write the spec + tasks together. Tasks fn-N.M declare blockers, inherit context from the parent spec, and stay dependency-ordered. This skill does not write code — only the plan.
/flow-next:plan <spec-id> # or <free-form text>Execute tasks systematically: each runs in a fresh-context worker subagent, re-anchors against the spec before starting, then implements + commits + records evidence. Cross-model review gates (impl-review, plan-review) wrap the loop.
/flow-next:work <spec-id> # or <task-id>Render a PR body from nine flow-next input streams (spec R-IDs, per-task evidence, memory hits, glossary changes, strategy alignment, deferred review findings, the diff itself). Optional mermaid diagrams on module-boundary changes. Pushes via gh.
/flow-next:make-pr <spec-id> # auto-detects from current branch→ flow-next.dev/skills/make-pr
Fetch unresolved threads + top-level comments + review-submission bodies, cluster them, dispatch per-thread resolver agents (parallel on Claude Code, serial elsewhere), validate, commit, then reply + resolve via GraphQL.
/flow-next:resolve-pr <PR#>→ flow-next.dev/skills/resolve-pr
Going autonomous? /flow-next:ralph-init scaffolds a repo-local Ralph harness under scripts/ralph/. Ralph loops the same steps overnight with fresh context per iteration, multi-model review gates, and auto-block on stuck tasks. → flow-next.dev/ralph
| Problem | Solution |
|---|---|
| Context drift | Re-anchoring before every task — re-reads specs + git state |
| Context window limits | Fresh context per task — worker subagent starts clean |
| Single-model blind spots | Cross-model reviews — RepoPrompt, Codex, or Copilot as second opinion |
| Forgotten requirements | Dependency graphs — tasks declare blockers, run in order |
| "It worked on my machine" | Evidence recording — commits, tests, PRs tracked per task |
| Infinite retry loops | Auto-block stuck tasks — fails after N attempts, moves on |
| Duplicate implementations | Pre-implementation search — worker checks for similar code before writing new |
| Hallucinated specs from "I think we discussed…" | Source-tagged capture — every acceptance criterion marked [user] / [paraphrase] / [inferred], mandatory read-back loop |
| Stale project memory polluting future work | /flow-next:audit + categorized memory schema — agent reviews each entry, flags stale (never deletes) |
| GitHub PR review threads piling up | /flow-next:resolve-pr — fetch → triage → dispatch resolver agents → reply → resolve via GraphQL |
| Command | What it does |
|---|---|
/flow-next:strategy |
Write STRATEGY.md — target problem, approach, users, metrics, active tracks |
/flow-next:prospect |
Generate ranked candidate ideas grounded in the repo, upstream of capture/interview/plan |
/flow-next:capture |
Synthesize conversation context into a spec (source-tagged, mandatory read-back) |
/flow-next:interview |
Deep spec refinement with lead-with-recommendation + confidence tiers + codebase-first investigation; --scope=business|technical|both |
/flow-next:plan |
Research codebase, create spec + dependency-ordered tasks |
/flow-next:work |
Execute tasks with re-anchoring + worker subagents + review gates. Opt-in: offload implementation to a local codex exec with delegate:codex (or work.delegate=codex config) — OFF by default, consent-gated, host keeps all judgment (config keys) |
/flow-next:impl-review |
Cross-model implementation review (RepoPrompt, Codex, or Copilot) |
/flow-next:plan-review |
Cross-model plan review |
/flow-next:spec-completion-review |
Spec-completion review gate — verify combined implementation matches the spec (renamed from /flow-next:epic-review in 1.0.0; soft-removal target 2.0.0) |
/flow-next:qa |
Live-app real-user QA — derives scenarios from the spec (AC / R-IDs / boundaries), drives the running app via flow-next-drive, files P0/P1/P2 findings with evidence, ends with a YES/NO ship verdict receipt. Forbidden from marking PASS by reading source. Opt-in — needs a live deploy + a driver |
/flow-next:make-pr |
Render a cognitive-aid PR body (9 input streams) and open via gh |
/flow-next:resolve-pr |
Resolve GitHub PR review threads (fetch → triage → fix → reply → resolve via GraphQL) |
/flow-next:audit |
Agent-native review of .flow/memory/ entries against current code (Keep / Update / Consolidate / Replace / Delete) |
/flow-next:memory-migrate |
Lift legacy flat memory files into the categorized schema |
/flow-next:prime |
8-pillar agent-readiness assessment with parallel scouts; remediation via consent prompts |
/flow-next:ralph-init |
Scaffold autonomous loop (scripts/ralph/) |
/flow-next:sync |
Plan-sync — update downstream task specs after implementation drift inside flow-next |
/flow-next:tracker-sync |
Tracker bridge (distinct from /flow-next:sync) — project a spec to a Linear/GitHub issue and reconcile body/status/comments two-way; projection, not coordination (docs) |
/flow-next:map |
Optional — wrap openclaw/clawpatch's clawpatch map for a semantic feature index (.clawpatch/features/*.json); scouts read it when present, fall back to grep/glob when absent. Requires Node 22+ + pnpm add -g clawpatch |
Full command reference (every flag, every default) in docs/flowctl.md.
Run overnight. Fresh context per iteration + multi-model review gates + auto-block stuck tasks.
/flow-next:ralph-init # One-time setup
scripts/ralph/ralph.sh # Run from terminal📖 Ralph deep dive · Ralph TUI (bun add -g @gmickel/flow-next-tui)
The repo holds the offline-resilient reference. flow-next.dev holds the narrative, browseable guide. Pick by audience.
| Looking for… | Repo file | Website |
|---|---|---|
| 5-minute pitch + install | README.md (this page) |
flow-next.dev |
| Adopting in a team, handover objects, Spec-as-PR, adoption ladder | docs/teams.md |
Teams guide |
Full flowctl CLI reference — every command, every flag |
docs/flowctl.md |
— |
| Ralph autonomous mode internals — hooks, receipts, DCG | docs/ralph.md |
— |
Live-app QA — /flow-next:qa, spec-derived scenarios, P0/P1/P2 findings, qa_verdict receipt |
skills/flow-next-qa/SKILL.md |
— |
.flow/ directory layout, spec-first task model, ID format |
docs/architecture.md |
— |
| Spec template — R-ID rules, confidence anchors, receipt schema | docs/spec-template.md · canonical scaffold at templates/spec.md |
— |
| Memory schema — bug / knowledge tracks, frontmatter, audit lifecycle | docs/memory-schema.md |
— |
Tracker-sync bridge — projection model, hybrid id, transport ladder, /flow-next:tracker-sync vs /flow-next:sync |
docs/tracker-sync.md |
— |
Project glossary — GLOSSARY.md shape, R17 forbidden-vocabulary guard |
docs/glossary.md · GLOSSARY.md |
— |
Project strategy — STRATEGY.md shape, downstream skill grounding |
docs/strategy.md · STRATEGY.md |
— |
| Cross-platform install matrix + Codex / Droid / OpenCode notes | docs/platforms.md |
— |
scripts/sync-codex.sh pipeline, plain-text transform, validation guards |
docs/sync-codex.md |
— |
| Troubleshooting — stuck tasks, Ralph debug, receipt validation, uninstall | docs/troubleshooting.md |
— |
Adding a new /flow-next:<name> skill |
agent_docs/adding-skills.md |
— |
| Cutting a release | agent_docs/releasing.md |
— |
| Local plugin dev + smoke tests + Ralph e2e | agent_docs/local-dev.md |
— |
| Repo strategic intent + active tracks | STRATEGY.md |
— |
| Canonical vocabulary | GLOSSARY.md |
— |
| Visual overview, diagrams, methodology | — | mickel.tech/apps/flow-next · flow-next.dev |
Doc index with one-line descriptions: plugins/flow-next/docs/README.md.
- Python 3.8+ — bundled
flowctlCLI is pure-stdlib. jqandgh— required for the review subsystem and PR plumbing.bun(optional) — only needed for the Ralph TUI.
| Platform | Status |
|---|---|
| Claude Code | First-class (canonical surface) |
| OpenAI Codex (CLI + Desktop) | First-class (mirror at plugins/flow-next/codex/, regenerated by scripts/sync-codex.sh) |
| Factory Droid | First-class (regex-OR matchers handle Execute ↔ Bash) |
| Grok Build (xAI) | Works via Claude Code compatibility — skills, /flow-next:* commands, hooks, and multi-agent flows (verified). UI under-lists commands/agents (cosmetic); Ralph TBD — details |
| Cursor | Works via a local plugin (./scripts/install-cursor.sh, or install-cursor.ps1 on Windows) — skills, commands, and multi-agent flows (verified). No plugin card + autocomplete under-list (cosmetic); Ralph unsupported (hook-schema mismatch) — details |
| OpenCode | Community port: flow-next-opencode |
Detailed install + cross-platform patterns in docs/platforms.md.
| Project | Platform |
|---|---|
| flow-next-opencode | OpenCode |
| FlowFactory | Factory.ai Droid |
| Ralph TUI | Cross-platform TUI for Ralph runs |
GNO — local hybrid search for your notes, docs, and code. Long-term memory over your files via MCP.
bun install -g @gmickel/gno && gno mcp install --target claude-code
MIT — see LICENSE.
Made by Gordon Mickel · @gmickel · gordon@mickel.tech