
NEVER auto-commit, auto-add, or auto-push code to git. Only perform git operations when explicitly asked by the user.

Exception: Autoresearch Commit/Revert Cycle

The /autoresearch family of commands (and their OpenCode _ variants) are exempt from the no-auto-commit policy during an active autoresearch loop. Autoresearch requires git commits as a core mechanism — commit-before-verify enables clean rollback, and git history serves as the agent's memory of what was tried and what failed.

Allowed git operations inside an autoresearch loop only:

| Operation | Purpose |
| --- | --- |
| git add + git commit -m "experiment: ..." | Record each atomic change before verification |
| git revert HEAD --no-edit | Roll back a failed experiment |
| git log / git diff | Read own experiment history for learning |

Still prohibited, even inside autoresearch:

  • git push (never push experiment commits automatically)
  • git commit --amend or git rebase (never rewrite experiment history)
  • git reset --hard (always use git revert to preserve history)
  • Committing to main/master directly (create a dedicated experiment branch first)

Activation: This exception applies only when the user explicitly invokes an autoresearch command (/autoresearch, /autoresearch:debug, /autoresearch_fix, etc.). Normal agent operations outside the autoresearch loop must still follow the no-auto-commit policy.
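The commit/revert cycle above can be sketched end-to-end in a throwaway repository (the paths, branch name, and commit messages below are hypothetical):

```shell
REPO="$(mktemp -d)"
git -C "$REPO" init -q
git -C "$REPO" config user.email "agent@example.invalid"
git -C "$REPO" config user.name "agent"

echo "baseline" > "$REPO/lib.rs"
git -C "$REPO" add . && git -C "$REPO" commit -qm "baseline"

# Dedicated experiment branch -- never commit to main/master directly.
git -C "$REPO" checkout -qb autoresearch/exp-1

# Commit-before-verify: record the atomic change first...
echo "experiment" >> "$REPO/lib.rs"
git -C "$REPO" add . && git -C "$REPO" commit -qm "experiment: widen retry window"

# ...then, when verification fails, revert (never reset --hard),
# so history keeps both the attempt and the rollback.
git -C "$REPO" revert HEAD --no-edit

git -C "$REPO" log --oneline
```

Reading that log back later (git log / git diff) is how the loop remembers which experiments already failed.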

Agent Model Inheritance — MANDATORY

When dispatching subagents via the Agent tool, NEVER set the model parameter. Omitting it lets agents inherit the parent session's model. This applies to all custom skills, agent definitions, and ad-hoc agent dispatches in this project.

  • Do NOT pass model: "sonnet", model: "haiku", or model: "opus" explicitly.
  • The parent model (whatever the user selected) propagates automatically.
  • This rule overrides any skill text that suggests a specific model.

Comment Policy — MANDATORY, HOOK-ENFORCED

Source code and documentation are not conversations, changelogs, or responses to PR feedback. Comments and doc prose must describe the code they annotate — its behavior, invariants, edge cases, or non-obvious design reasoning. Nothing else.

A PostToolUse hook (comment-lint.sh) scans every .rs file write and every .md file write under docs/ and diagrams/. It will flag violations. Fix them before moving on.

What Is Banned

| Category | Examples of BANNED comments |
| --- | --- |
| Tracking IDs | // F-011: BatchValidation error variant, // C3: edge case |
| Milestone / phase labels | // B0 scaffold, // until B1 lands, // B1/B2 will add, phase 1 |
| PR / review references | // PR #124, // per review feedback, // addressed reviewer concern |
| Test justification | // Verification tests for PR review comments, // Added to cover finding F7 |
| Temporal narration | // Previously used linear scan, // Was changed from X to Y, // Newly added |
| Conversational | // Good catch, // As discussed, // Note to reviewer |
| History narration | // Refactored from module A, // Moved from old_engine.rs |
| Task IDs in docs | gossip-8968, gossip-rs-8r9.25, Step gossip-rs-... (in .md files) |

What Is Required

Comments must stand alone. A reader with no access to your PR, issue tracker, or chat history must fully understand the comment from the code context alone.

BAD → GOOD rewrites:

// BAD: F-011: BatchValidation error variant
// GOOD: Rejects submissions that span multiple tenants in a single batch.

// BAD: Verification tests for PR review comments
// GOOD: Edge-case coverage for batch boundary conditions.

// BAD: F1: Merge panic bug fix
// GOOD: Merge transition clears stale metadata to prevent use-after-move.

// BAD: B0 etcd coordination backend scaffold.
// GOOD: etcd coordination backend.

// BAD: until B1/B2 land the etcd keyspace and transactional writes.
// GOOD: until the etcd keyspace and transactional writes land.

// BAD: Previously used linear scan, refactored to binary search
// GOOD: Binary search — the rank array is sorted by construction (see insert_sorted above).

// BAD: Added per reviewer suggestion to handle empty input
// GOOD: Empty input returns Ok(()) — callers depend on this for idempotent retry loops.

Docs (docs/, diagrams/) follow the same policy. No milestone labels, task IDs, or step tracking references in documentation prose.

Self-Check Before Writing Any Comment

If you are about to write a comment, ask:

  1. Does it reference a finding ID, PR number, or reviewer? Delete it.
  2. Does it explain when or why it was changed rather than what the code does? Rewrite it.
  3. Would it make sense to someone who has never seen the PR? If not, rewrite it.

Code-Internal Cross-References Are Fine

Stable identifiers that exist purely within the codebase for internal cross-referencing (e.g., invariant labels S1–S7 in the simulation checker) are documentation, not tracking noise.

Enforcement

  • Hook: comment-lint.sh runs on every .rs file write and every .md file write under docs/ or diagrams/, flagging violations.
  • Review: Any PR introducing banned comment patterns will be rejected.
  • Existing violations: Must be cleaned up when touching a file.

Task Management

This project uses bd (Beads) for issue tracking. Issues live in .beads/.

At session start: run bd ready to find work. Track status with bd update <id> --status in_progress. At session end: close finished work, file new issues, run bd sync. Do NOT commit.

For graph-aware triage: bv --robot-triage (never bare bv).

When working in plan mode, always include bd status updates in the plan (update to in_progress at start, close at end).

Using bv as an AI sidecar

bv is a graph-aware triage engine for Beads projects (.beads/beads.jsonl). Instead of parsing JSONL or hallucinating graph traversal, use robot flags for deterministic, dependency-aware outputs with precomputed metrics (PageRank, betweenness, critical path, cycles, HITS, eigenvector, k-core).

Scope boundary: bv handles what to work on (triage, priority, planning). For agent-to-agent coordination (messaging, work claiming, file reservations), use MCP Agent Mail.

⚠️ CRITICAL: Use ONLY --robot-* flags. Bare bv launches an interactive TUI that blocks your session.

The Workflow: Start With Triage

bv --robot-triage is your single entry point. It returns everything you need in one call:

  • quick_ref: at-a-glance counts + top 3 picks
  • recommendations: ranked actionable items with scores, reasons, unblock info
  • quick_wins: low-effort high-impact items
  • blockers_to_clear: items that unblock the most downstream work
  • project_health: status/type/priority distributions, graph metrics
  • commands: copy-paste shell commands for next steps

```shell
bv --robot-triage   # THE MEGA-COMMAND: start here
bv --robot-next     # Minimal: just the single top pick + claim command
```

Token-optimized output (TOON) for lower LLM context usage:

```shell
bv --robot-triage --format toon
export BV_OUTPUT_FORMAT=toon
bv --robot-next
```

Other Commands

Planning:

| Command | Returns |
| --- | --- |
| --robot-plan | Parallel execution tracks with unblocks lists |
| --robot-priority | Priority misalignment detection with confidence |

Graph Analysis:

| Command | Returns |
| --- | --- |
| --robot-insights | Full metrics: PageRank, betweenness, HITS (hubs/authorities), eigenvector, critical path, cycles, k-core, articulation points, slack |
| --robot-label-health | Per-label health: health_level (healthy\|warning\|critical), velocity_score, staleness, blocked_count |
| --robot-label-flow | Cross-label dependency: flow_matrix, dependencies, bottleneck_labels |
| --robot-label-attention [--attention-limit=N] | Attention-ranked labels by: (pagerank × staleness × block_impact) / velocity |

History & Change Tracking:

| Command | Returns |
| --- | --- |
| --robot-history | Bead-to-commit correlations: stats, histories (per-bead events/commits/milestones), commit_index |
| --robot-diff --diff-since <ref> | Changes since ref: new/closed/modified issues, cycles introduced/resolved |

Other Commands:

| Command | Returns |
| --- | --- |
| --robot-burndown <sprint> | Sprint burndown, scope changes, at-risk items |
| --robot-forecast <id\|all> | ETA predictions with dependency-aware scheduling |
| --robot-alerts | Stale issues, blocking cascades, priority mismatches |
| --robot-suggest | Hygiene: duplicates, missing deps, label suggestions, cycle breaks |
| --robot-graph [--graph-format=json\|dot\|mermaid] | Dependency graph export |
| --export-graph <file.html> | Self-contained interactive HTML visualization |

Scoping & Filtering

```shell
bv --robot-plan --label backend             # Scope to label's subgraph
bv --robot-insights --as-of HEAD~30         # Historical point-in-time
bv --recipe actionable --robot-plan         # Pre-filter: ready to work (no blockers)
bv --recipe high-impact --robot-triage      # Pre-filter: top PageRank scores
bv --robot-triage --robot-triage-by-track   # Group by parallel work streams
bv --robot-triage --robot-triage-by-label   # Group by domain
```

Understanding Robot Output

All robot JSON includes:

  • data_hash — Fingerprint of source beads.jsonl (verify consistency across calls)
  • status — Per-metric state: computed|approx|timeout|skipped + elapsed ms
  • as_of / as_of_commit — Present when using --as-of; contains ref and resolved SHA

Two-step analysis:

  • Immediate pass (instant): degree, topo sort, density — always available immediately
  • Deferred pass (async, 500ms timeout): PageRank, betweenness, HITS, eigenvector, cycles — check status flags

For large graphs (>500 nodes): Some metrics may be approximated or skipped. Always check status.
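A sketch of the status check in shell, against a hypothetical sample of robot output (the field shapes are assumptions based on the fields listed above):

```shell
# Hypothetical sample: data_hash plus per-metric status entries.
OUT="$(mktemp)"
cat > "$OUT" <<'EOF'
{
  "data_hash": "a1b2c3",
  "status": {
    "pagerank":    {"state": "computed", "elapsed_ms": 120},
    "betweenness": {"state": "timeout",  "elapsed_ms": 500},
    "cycles":      {"state": "computed", "elapsed_ms": 45}
  }
}
EOF

# Keep only deferred-pass metrics whose status is "computed";
# anything timeout/approx/skipped must not be trusted.
jq -r '.status | to_entries[] | select(.value.state == "computed") | .key' "$OUT"
```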

jq Quick Reference

```shell
bv --robot-triage | jq '.quick_ref'                    # At-a-glance summary
bv --robot-triage | jq '.recommendations[0]'           # Top recommendation
bv --robot-plan | jq '.plan.summary.highest_impact'    # Best unblock target
bv --robot-insights | jq '.status'                     # Check metric readiness
bv --robot-insights | jq '.Cycles'                     # Circular deps (must fix!)
bv --robot-label-health | jq '.results.labels[] | select(.health_level == "critical")'
```

Performance: Immediate pass is instant; deferred pass is async (500ms timeout). Prefer --robot-plan over --robot-insights when speed matters. Results cached by data hash.

Use bv instead of parsing beads.jsonl—it computes PageRank, critical paths, cycles, and parallel tracks deterministically.


Beads Workflow Integration

This project uses beads_viewer for issue tracking. Issues are stored in .beads/ and tracked in git.

Essential Commands

```shell
# View issues (launches TUI - avoid in automated sessions)
bv

# CLI commands for agents (use these instead)
bd ready              # Show issues ready to work (no blockers)
bd list --status=open # All open issues
bd show <id>          # Full issue details with dependencies
bd create --title="..." --type=task --priority=2
bd update <id> --status=in_progress
bd close <id> --reason="Completed"
bd close <id1> <id2>  # Close multiple issues at once
bd sync --flush-only  # Export beads to JSONL (no git ops)
```

Workflow Pattern

  1. Start: Run bd ready to find actionable work
  2. Claim: Use bd update <id> --status=in_progress
  3. Work: Implement the task
  4. Complete: Use bd close <id>

Key Concepts

  • Dependencies: Issues can block other issues. bd ready shows only unblocked work.
  • Priority: P0=critical, P1=high, P2=medium, P3=low, P4=backlog (use numbers, not words)
  • Types: task, bug, feature, epic, question, docs
  • Blocking: bd dep add <issue> <depends-on> to add dependencies

Session Protocol

Before ending any session, run bd sync to persist beads state.

Do NOT stage, commit, or push code changes. Leave that to the user.

Best Practices

  • Check bd ready at session start to find available work
  • Update status as you work (in_progress → closed)
  • Create new issues with bd create when you discover tasks
  • Use descriptive titles and set appropriate priority/type

Task Quality Standard — MANDATORY for All Task Creation

Every beads task must be self-contained: an LLM agent reading it should have 90% of the information needed to complete it. The remaining 10% must have explicit pointers to where to look.

How to Create Tasks

Use /create-task for all task creation. It auto-researches the codebase and produces a complete task description.

```
/create-task "Fix off-by-one in window boundary check" --type=bug --priority=1
/create-task --quick "..." --type=task --priority=2
/create-task --from-plan docs/plans/2026-02-23-feature-v3.md --step=3
```

Mandatory Sections (ALL Tasks)

  1. Context — Why this task exists
  2. Current State — What exists today (with code snippets and file:line refs)
  3. Desired State — What should exist after
  4. Implementation Guidance — Files to modify, patterns to follow, utilities to reuse
  5. Code References — Inline snippets of relevant current code
  6. Related Work — Links to related beads tasks (or "None found")
  7. Acceptance Criteria — Specific, verifiable conditions (always include cargo test/fmt/clippy)
  8. Pointers — Where to look for the remaining 10%
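As a sketch, a conforming description for the off-by-one example above might look like this (every file path, line number, and detail below is illustrative, not real):

```markdown
## Context
The window boundary check drops the last element of each claim batch.

## Current State
check_window uses an exclusive upper bound (engine/window.rs:142).

## Desired State
Inclusive upper bound; the final element is validated like the rest.

## Implementation Guidance
Modify check_window; follow the bounds pattern used by the neighboring
range helpers in the same module.

## Code References
    if idx < window.end { ... }   // current exclusive check

## Related Work
None found.

## Acceptance Criteria
- Regression test covering the final batch element passes
- cargo test, cargo fmt, and cargo clippy all clean

## Pointers
Module docs at the top of engine/window.rs explain the window invariants.
```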

Never Do

  • Create a task with no description
  • Write "see review for details" or "see PR #N" instead of inlining context
  • Reference code without file paths and line numbers
  • Write acceptance criteria like "it works" — must be specific and verifiable
  • Skip the Related Work search — always check for existing related/duplicate tasks

Enforcement

Tasks created with empty or stub descriptions will be flagged during review. When creating tasks outside /create-task (e.g., inside /execute-review-findings), include all mandatory sections in the description.

Duplication Prevention — MANDATORY Pre-Coding Check

Before writing ANY new function, struct, trait, method, or module, you MUST verify the functionality does not already exist in the codebase.

This is non-negotiable. Duplicated logic is a bug — it creates drift, increases maintenance burden, and undermines the single-source-of-truth principle.

Required Steps

  1. Search before you write. Use Grep/Glob to search for existing implementations that match the intent of what you are about to create. Search by concept (e.g., "retry", "timeout", "base64 decode"), not just by the exact name you plan to use.
  2. Check neighboring modules. Read the module and its siblings. If you are adding a helper to engine/core.rs, read the other files in engine/ and stdx/ first.
  3. Check utility crates. src/stdx/ contains shared data structures and helpers. Confirm your functionality is not already there before creating a new one.
  4. If similar logic exists, extend or reuse it. Do not create a parallel implementation. Refactor the existing code to be more general if needed.
  5. If you are unsure, ask. It is always better to ask "does X already exist?" than to introduce a duplicate.
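Step 1 (concept-first search) can be sketched like this; the file names and helper below are hypothetical:

```shell
# A tiny fake source tree with an existing retry helper.
SRC="$(mktemp -d)"
mkdir -p "$SRC/stdx" "$SRC/engine"
cat > "$SRC/stdx/backoff.rs" <<'EOF'
/// Retries `op` with exponential backoff.
pub fn retry_with_backoff() {}
EOF
cat > "$SRC/engine/core.rs" <<'EOF'
pub fn run() {}
EOF

# Search by concept ("retry"), not by the exact name you planned to use;
# a hit means: extend the existing helper instead of writing a duplicate.
grep -rn -i "retry" "$SRC" && echo "similar logic exists -- extend it, don't duplicate"
```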

What Counts as Duplication

  • A second function that does the same thing with a different name.
  • A method that reimplements logic already available in a trait or utility.
  • A new struct that is structurally identical to an existing one.
  • Copy-pasted blocks with minor variations (extract a shared helper instead).
  • A new constant/sentinel that duplicates an existing one.

Enforcement

If during review a duplicate is found that could have been caught by searching the codebase first, the change will be rejected. No exceptions.

No Versioning, No Legacy Code — MANDATORY

This is pre-release developmental code with zero backwards-compatibility obligations. There is exactly one version: the current one.

Rules

  1. Never version APIs, structs, enums, or serialization formats. No V1/V2 suffixes, no _v2 functions, no version discriminants in wire formats.
  2. Never introduce #[deprecated] attributes. If something is wrong, fix it or remove it. Do not leave the old path around with a deprecation warning.
  3. Never create legacy or compatibility shims. No old_* / new_* parallel implementations, no feature flags gating old behavior, no migration layers.
  4. All changes are breaking. Rename, restructure, and delete freely. Callers must be updated in the same commit. There are no downstream consumers to protect.
  5. One code path per behavior. If a refactor replaces an approach, delete the old approach entirely. Dead code is a liability, not a safety net.
  6. No cfg gates for old-vs-new. Feature flags are for optional capabilities, not for preserving defunct logic.

What This Means in Practice

  • Changing a struct field? Rename it everywhere in one pass.
  • Replacing an algorithm? Delete the old one, wire in the new one.
  • Updating serialization? Change the format, update all readers/writers.
  • Removing a public function? Remove it and fix every call site.
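A minimal sketch of the "one version" rule (all names hypothetical): there is no ClaimV2, no #[deprecated] shim, and no old_parse/new_parse pair, only the current shape with every caller updated in the same pass.

```rust
#[derive(Debug, PartialEq)]
enum Claim {
    Open,
    // Renamed/restructured freely: all callers were updated in one pass,
    // and the previous field layout was deleted, not kept behind a flag.
    Held { worker: u64 },
}

fn parse(raw: &str) -> Option<Claim> {
    match raw {
        "open" => Some(Claim::Open),
        s => s
            .strip_prefix("held:")?
            .parse::<u64>()
            .ok()
            .map(|worker| Claim::Held { worker }),
    }
}

fn main() {
    assert_eq!(parse("open"), Some(Claim::Open));
    assert_eq!(parse("held:7"), Some(Claim::Held { worker: 7 }));
    assert_eq!(parse("bogus"), None);
    println!("ok");
}
```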

Enforcement

Any PR that introduces versioned types, deprecated annotations, compatibility shims, or parallel old/new code paths will be rejected. No exceptions.

Error Type Policy — MANDATORY

Error types follow a standard pattern to minimize boilerplate duplication.

Rules

  1. New error types MUST use #[derive(thiserror::Error)] with #[error("...")] attributes on every variant. Hand-written Display and Error impls are prohibited unless the #[error("...")] attribute cannot express the required formatting logic (e.g., conditional formatting with if/match).
  2. Custom Debug impls for redaction are separate from thiserror. Types that redact hash values, keys, or worker identities in Debug output keep their hand-written impl fmt::Debug alongside #[derive(thiserror::Error)]. thiserror v2 does not auto-derive Debug.
  3. New From<CoordError> routing impls MUST use impl_from_coord_error! (defined in gossip-coordination/src/error.rs). The macro preserves compile-time exhaustiveness: adding a new CoordError variant forces a conscious routing decision in every operation error type.
  4. New From<XxxError> for RejectionKind impls MUST use impl_rejection_from! (defined in gossip-coordination/src/sim/harness.rs).
  5. Test code is exempt from duplication thresholds. Test clarity outweighs deduplication for assertions, fixtures, and setup code.
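The exhaustiveness argument behind rule 3 can be illustrated with a simplified, hypothetical macro (the real impl_from_coord_error! lives in gossip-coordination/src/error.rs and differs from this sketch):

```rust
#[derive(Debug, PartialEq)]
enum CoordError { Timeout, Conflict }

// The match inside the macro body covers every CoordError variant, so
// adding a new variant breaks every expansion until each operation
// error type makes an explicit routing decision.
macro_rules! impl_from_coord_error {
    ($target:ty, timeout => $t:expr, conflict => $c:expr) => {
        impl From<CoordError> for $target {
            fn from(e: CoordError) -> Self {
                match e {
                    CoordError::Timeout => $t,
                    CoordError::Conflict => $c,
                }
            }
        }
    };
}

#[derive(Debug, PartialEq)]
enum AcquireError { Retryable, Fatal }

impl_from_coord_error!(AcquireError,
    timeout => AcquireError::Retryable,
    conflict => AcquireError::Fatal);

fn main() {
    assert_eq!(AcquireError::from(CoordError::Timeout), AcquireError::Retryable);
    assert_eq!(AcquireError::from(CoordError::Conflict), AcquireError::Fatal);
    println!("ok");
}
```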

Existing Infrastructure

| Macro / Tool | Location | Purpose |
| --- | --- | --- |
| impl_from_coord_error! | gossip-coordination/src/error.rs | CoordError -> operation error routing |
| impl_from_run_op_id_conflict! | gossip-coordination/src/run_errors.rs | RunOpIdConflict From impls |
| impl_rejection_from! | gossip-coordination/src/sim/harness.rs | Error -> RejectionKind in sim |
| define_connector_error! | gossip-contracts/src/connector/api.rs | Connector error types |
| define_id_32! / define_id_64! | gossip-contracts/src/identity/macros.rs | Identity newtypes |

Allocation Policy (Tiered) — MANDATORY

Use an operationally tiered policy instead of blanket no-allocation rules.

Tiers

  • HOT: per-shard/per-claim/per-tick steady-state loops.
  • WARM: frequent read/query/admin operations outside inner mutation loops.
  • COLD: startup, registration, setup/teardown, and test-support helpers.

Rules

  1. HOT paths remain allocation-silent where practical. Keep pooled/slab-backed data and caller-owned reusable scratch on true steady-state paths (acquire_and_restore_into, checkpoint, complete, claim loop internals).
  2. WARM/COLD paths optimize for simplicity first. Prefer straightforward local allocation over preallocation-only API contracts when complexity tax is high and measurable regressions are absent.
  3. No panic-on-undersized-caller-buffer contracts for query APIs. list_shards_into and collect_claim_candidates_into may grow caller vectors as needed.
  4. Registration keeps atomicity, not scratch plumbing. register_shards must perform fallible preflight before shard-map mutation and must roll back staged records on allocation failure.
  5. Single allocation-failure shape for register_shards. Use RegisterShardsError::ResourceExhausted { resource } everywhere.
  6. No parallel legacy/new surfaces. Breaking changes are applied in one pass; remove superseded allocation behavior instead of preserving compatibility layers.
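Rule 3's grow-don't-panic contract can be sketched as follows (the name echoes list_shards_into from the rules above; the body and ShardId type are hypothetical):

```rust
#[derive(Debug, PartialEq)]
struct ShardId(u32);

// WARM query API: reuses the caller's buffer across calls, but grows it
// on demand rather than panicking when the caller under-provisions.
fn list_shards_into(out: &mut Vec<ShardId>) {
    out.clear();                      // keep the caller's capacity...
    out.extend((0..4).map(ShardId));  // ...and let Vec::extend grow it if needed
}

fn main() {
    let mut scratch = Vec::with_capacity(2); // deliberately undersized
    list_shards_into(&mut scratch);          // grows silently; no panic contract
    assert_eq!(scratch.len(), 4);

    list_shards_into(&mut scratch);          // steady-state reuse: capacity already fits
    assert_eq!(scratch.first(), Some(&ShardId(0)));
    println!("ok");
}
```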

Existing Infrastructure (use these, don't reinvent)

| Type | Location | Purpose |
| --- | --- | --- |
| ByteSlab / ByteSlot | gossip-stdx/src/byte_slab.rs | Core pre-allocated byte pool |
| PooledShardSpec | coordination/pooled.rs | Slab-backed shard spec fields |
| PooledCursor | coordination/pooled.rs | Slab-backed cursor fields |
| PooledSpawned | coordination/pooled.rs | Slab-backed lineage storage |
| AcquireScratch / FixedBuf | coordination/error.rs | Reusable fixed-capacity scratch |
| InlineVec<T, N> | gossip-stdx/src/inline_vec.rs | Stack-first small collection |
| RingBuffer<T, N> | gossip-stdx/src/ring_buffer.rs | Fixed-capacity circular queue |

Enforcement

  • Hot-path regressions are benchmark-gated: no >5% median regression without explicit documented justification.
  • PRs that introduce avoidable hot-path heap allocation or legacy dual-path allocation behavior will be rejected.

Rust Code Modification Workflow

After modifying Rust code, ALWAYS run these steps:

  1. cargo fmt --all && cargo check && cargo clippy --all-targets --all-features -- -D warnings && RUSTDOCFLAGS="-D warnings" cargo doc --no-deps --all-features
  2. Run /doc-rigor skill on the new code to keep documentation updated
  3. If a hook fires naming a design doc (DESIGN DOC CHECK), open that doc and verify it still matches the code. If the hook says [NEW FILE], update the doc's file list.

Design Doc Maintenance

Design docs in docs/ describe architecture and invariants for specific source directories. The design-doc-scope-check.sh hook fires automatically when editing .rs files and produces three kinds of alerts:

| Alert | Meaning |
| --- | --- |
| DESIGN DOC CHECK: <path> is in scope of: -> <doc> | Existing file — verify doc still matches code |
| DESIGN DOC CHECK [NEW FILE]: <path> is NEW and in scope of: -> <doc> | New file — update doc's file list and counts |
| NOTE: <path> has no design doc coverage. | No doc covers this directory — consider adding one |

Rules

  • Act on hook reminders. When the hook names a doc, open it and verify the code change is consistent with what the doc describes.
  • Update docs for new files. [NEW FILE] alerts mean the doc's file inventory is out of date. Add the new file and adjust any counts.
  • Add scopes for new docs. When creating a new design doc, add a [[scopes]] entry in docs/scope-map.toml so the hook knows about it.
  • Check docs before closing tasks. Before marking work complete, verify any in-scope design docs are still accurate.
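A [[scopes]] entry might look like this (the key names here are assumptions; mirror an existing entry in docs/scope-map.toml for the real schema):

```toml
[[scopes]]
doc = "docs/gossip-coordination/boundary-2-coordination.md"
paths = ["gossip-coordination/src/**/*.rs"]
```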

Architecture References

  • diagrams/ — 12 Mermaid diagram files covering all 5 boundaries (start with diagrams/00-README.md)
  • docs/gossip-coordination/coordination-testing.md — Test tier breakdown and cargo test commands
  • docs/gossip-coordination/simulation-harness.md — Simulation architecture, invariants S1–S7, fault levels
  • docs/gossip-coordination/boundary-2-coordination.md — Coordination protocol specification
  • docs/gossip-contracts/boundary-1-identity-spine.md — Identity type hierarchy