Current Paper · Updated 2026-04-16

Project Phoenix

Open-core standards for grounded domain systems.

Back to Overview

Current Position

Project Phoenix is best understood as an open-core framework for grounded domain systems rather than as a single interface, a single benchmark story, or a single agent architecture.

Its public core consists of principles, validation standards, grounded architecture patterns, and white papers that explain how deterministic substrates, supervision, provenance, and bounded agent use can be combined into useful systems.

The portfolio now spans 17 primary papers across two tracks: Phoenix Operating Discipline (1.1–1.7, 1.11–1.17) and RVH/ML Evaluation (1.8–1.10). All papers have full drafts; empirical results incorporated in 1.9, 1.10, and the current measurement-integrity and operator-shell line.

What Is Public

Principles

Engineering rules for grounded, inspectable systems.

Standards

Validation, variation, and operating-discipline patterns.

Architecture

Generic grounded-domain and bounded-agent patterns.

White Papers

Public argument, historical support, and current interpretation across 17 primary papers plus companion format sources.

Featured Current Papers

The full inventory remains in canonical portfolio order. The papers below are featured because they define the current measurement-integrity and OpenClaw x Project Phoenix line rather than because the full portfolio has been reordered around them.

Paper 1.16

The Model Did Not Fail The Protocol, The Terminal Did shows that thinking-mode protocol scores can be invalidated by capture-path artifacts and that corrected clean capture materially changes the local-model ranking.

Paper 1.17

The Operator Shell Pattern argues that OpenClaw closes a real outer-layer gap around Project Phoenix by adding access, compression, incident discipline, and operator visibility without moving the authority boundary.

Paper Tracks

The portfolio divides into two tracks, with the operating discipline track further grouped into primary and detail papers.

#TitleTrack
Primary Papers — Operating Discipline
1.1Project Phoenix — Open-Core StandardsFramework
1.2Offline Grounded Domain AgentGrounding
1.3Ski Chalet Harness BoundaryGrounding
1.4Fab Simulation & RVHGrounding
1.5LocalLLMTSP — Solver-Backed OrchestrationOrchestration
1.6Where Orchestration Beats Raw Model PowerOrchestration
1.7Agentic Coding Failure PatternsOperations
RVH / ML Evaluation
1.8Rough Volatility — Cross-Domain Benchmark PrincipleRVH
1.9Rough Volatility — ML Evaluation DomainRVH
1.10Grounded Agent Failure Is Structurally DeterminedBoundary
Details — Local Model & Boundary
1.11Local Model Role SuitabilityLocal Model
1.12ShowcaseAgent Routing And CompressionLocal Model
1.13TourAgent Local Model ScreenLocal Model
1.14True Ski Chalet Boundary ResultBoundary
1.15When The Organized Stack LosesBoundary
1.16The Model Did Not Fail The Protocol, The Terminal DidMeasurement
1.17The Operator Shell PatternOperator Layer

Special Case, Not Whole Project

The offline-grounded-agent work (Paper 1.2) is one of the strongest current Project Phoenix results, but it is a special case within a broader framework. The grounded-agent result is better understood as a special case of Project Phoenix — a particularly important recent one, and the clearest current answer to the local-usefulness question.

The larger claim is that useful systems require deterministic grounding, explicit validation, clear trust boundaries, and disciplined operating practice.

Key Results

Reading Order

Why It Matters

Project Phoenix argues that reliability is a systems problem, not just an intelligence problem. The emphasis is therefore on grounded domains, deterministic substrates, validation, and operational discipline rather than on prompt optimism or maximal autonomy.

Without this framing, the current strong grounded-agent result can overshadow the broader framework, and older broad framework papers can overstate stale implementation details. The useful middle position: keep the broad Project Phoenix frame; keep the grounded-agent result visible; do not collapse one into the other.