Pillar 4: War-Rooms

War-rooms are OSTwin’s core execution primitive. Each war-room is a self-contained directory where a team of agents collaborates on a single epic. Everything an agent needs — its brief, channel, artifacts, progress state — lives inside the room directory.

Directory Layout

Every war-room follows a strict directory structure:

.agents/war-rooms/room-042/
  config.json          # Goal contract: epic ref, roles, lifecycle
  brief.md             # What the team must accomplish
  channel.jsonl        # Append-only message log
  progress.json        # Current completion percentage
  status.txt           # Current lifecycle state
  lifecycle.json       # State machine definition
  mcp.json             # Room-specific MCP config (optional)
  artifacts/           # Deliverables produced by agents
    report.md
    implementation.py
  skills/              # Room-local skill overrides (optional)
  roles/               # Room-local role overrides (optional)

Goal Contract: config.json

The config.json file is the war-room’s contract. It defines what the room exists to accomplish and how agents should operate within it.

{
  "room_id": "room-042",
  "plan_id": "plan-001",
  "epic_ref": "EPIC-007",
  "epic_title": "Implement user authentication flow",
  "roles": ["engineer", "qa", "architect"],
  "skill_refs": ["implement-epic", "write-tests", "security-review"],
  "model_override": null,
  "no_mcp": false,
  "lifecycle": "standard",
  "max_retries": 3,
  "timeout_seconds": 900,
  "dependencies": ["EPIC-003", "EPIC-005"],
  "created_at": "2025-01-15T10:00:00Z"
}

Message Channel

The channel.jsonl file is an append-only log of structured messages. Every inter-agent communication is recorded here.

{"id":"msg-001","ts":"2025-01-15T10:01:00Z","from":"manager","to":"engineer","type":"task","ref":"TASK-001","body":"Implement login endpoint per brief.md"}
{"id":"msg-002","ts":"2025-01-15T10:15:00Z","from":"engineer","to":"qa","type":"done","ref":"TASK-001","body":"Login endpoint implemented. See artifacts/auth.py. All tests pass."}
{"id":"msg-003","ts":"2025-01-15T10:20:00Z","from":"qa","to":"engineer","type":"pass","ref":"TASK-001","body":"Code review passed. 94% coverage. No security issues found."}

Message Types

Type	Direction	Purpose
`task`	manager -> agent	Assign work with requirements
`done`	agent -> qa/manager	Report task completion
`review`	qa -> engineer	Detailed review feedback
`pass`	qa -> manager	Task meets acceptance criteria
`fail`	qa -> engineer	Task needs fixes, with specifics
`fix`	engineer -> qa	Resubmission after fixing issues
`error`	any -> manager	Unrecoverable error, needs escalation
`signoff`	role -> manager	Role confirms epic completion

Channel Scripts

The channel is managed through dedicated MCP tools and PowerShell scripts:

Script	Purpose
`channel_post_message`	Append a message with file locking
`channel_read_messages`	Read with optional filters (role, type, ref)
`channel_get_latest`	Get the most recent message of a given type

All writes use fcntl.LOCK_EX (exclusive file locking) to prevent corruption from concurrent agent writes.

Multi-Agent Collaboration Flow

A typical war-room execution follows this pattern:

Manager                Engineer              QA
  │                       │                   │
  ├─── task ─────────────>│                   │
  │                       ├── implements ──>  │
  │                       ├─── done ─────────>│
  │                       │                   ├── reviews
  │                       │<──── fail ────────┤
  │                       ├── fixes ────────> │
  │                       │                   ├── reviews
  │                       │<──── pass ────────┤
  │<───── signoff ────────┤                   │
  │<──────────────────── signoff ─────────────┤
  ├── updates status to "passed"              │

5 Isolation Guarantees

War-rooms provide five levels of isolation:

Guarantee	Mechanism	What It Prevents
Filesystem isolation	Each room has its own directory tree	Agents in room-042 cannot read room-043’s files
Channel isolation	Separate `channel.jsonl` per room	Messages don’t leak between epics
Memory isolation	Filtered views via `exclude_room`	Agents query cross-room memory without seeing their own
MCP isolation	Per-room `mcp.json` configuration	Tools are scoped to the room’s needs
Model isolation	Per-room `model_override`	Different rooms can use different LLM models

Room Creation Flow

War-rooms are created by the manager agent during plan execution:

DAG resolution — manager reads the DAG to find the next executable epics
Room scaffolding — New-WarRoom.ps1 creates the directory with config, brief, and lifecycle
Role assignment — manager selects roles from the registry based on epic requirements
Skill linking — relevant skills are union-merged from role, room, and plan levels
Agent invocation — Invoke-Agent.ps1 launches each role in sequence

New-WarRoom -PlanId "plan-001" -EpicRef "EPIC-007" `
  -Roles @("engineer","qa") -Brief $briefContent

Room Teardown

When a war-room reaches a terminal state (passed or failed-final):

Final status is written to status.txt
All artifacts are preserved in the artifacts/ directory
Memory entries published during execution remain in the shared ledger
The room directory is retained for audit and debugging purposes

Concurrency: 50 Rooms, Wave-Based

OSTwin supports up to 50 concurrent war-rooms organized in waves:

The DAG determines which epics can execute in parallel (no unsatisfied dependencies)
Each wave is a set of rooms that can run simultaneously
When all rooms in a wave complete, the next wave begins
The manager monitors all active rooms via progress polling

Wave-based execution maximizes parallelism while respecting epic dependencies.

Monitoring: Dashboard SSE

The FastAPI dashboard provides real-time war-room monitoring via Server-Sent Events:

Room status — current lifecycle state for all active rooms
Progress — completion percentage updated by agents
Channel activity — live message feed from all rooms
Error alerts — immediate notification of agent failures

The Next.js frontend subscribes to the SSE endpoint and renders a live dashboard view of all war-rooms in the current plan.

Key Source Files

File	Purpose
`engine/New-WarRoom.ps1`	Room creation and scaffolding
`engine/Invoke-Agent.ps1`	Agent invocation within a room
`mcp_servers/warroom/`	Channel and status MCP server
`.agents/war-rooms/`	All war-room directories
`dashboard/api/rooms.py`	Room status API endpoint