Frozen and published (as of 2026-05-01).
This is a published paper, promoted from active draft following the capture of the 2026-04-30 PPR evidence packet and the 2026-05-01 DocDrop same-task frontier comparison ledger.
PPR evidence packet: docs/PRIVACY_IS_WORTH_PAYING_FOR_PACKET_2026-04-30.md
DocDrop same-task ledger: docs/domain_runs/PRIVACY-DOCDROP-001/comparison_report.md
It is the third frozen companion line in the local-LLM operator-judgment cluster:
GEMMA_4_IS_SMARTER_GEMMA_3_IS_SAFER.md — handoff-discipline doctrine; model-behavior conflationSLOW_IS_NOT_SMART.md — size / quality conflationPLEASE_IS_SAND_OFF_A_BEACH.md — token-cost attribution; ritual-vs-structural waste conflationThe lines share evidence (DocDrop, PPR Lane 2, the protocol matrix, the suppression-regression probe) but make distinct claims.
The narrow form:
The broader form, and why this paper exists:
The honest deployment case is "privacy is worth paying for," not "privacy is free."
The DocDrop privacy lane is currently the cleanest production-shape privacy deployment in Project Phoenix. Its design is correct: local inference on localhost, no API egress, sensitive documents on operator-controlled media, strict 5-field JSON contract, schema validation at the receive boundary.
The privacy argument for that design is bulletproof and survives any accounting.
The trouble is that the privacy argument and a "no cost" argument get bundled in operator framing — both for DocDrop specifically and for local-LLM offloading more generally. Bundling weakens the privacy argument because the "free" half does not survive scrutiny. Separating the two strengthens the privacy half.
This paper exists to keep the two arguments separate, and to make the cost ledger explicit so the privacy-vs-cost trade is a deliberate operator choice, not a hidden one.
DocDrop processes sensitive meeting documents on USB media without data egress. All inference is local Ollama on 127.0.0.1:11434. The deterministic boundary is preserved end-to-end. Pipeline run and verified against real documents. The privacy benefit is unambiguous and not in dispute.
A frontier model with native function-calling can typically extract the DocDrop 5-field schema from a single document in a few hundred input + output tokens, often on first pass.
The same task on gemma4:26b /no_think with format: "json" and a strict receive-side validator costs more tokens for several structural reasons (see next section), with the multiplier varying by document and retry rate.
The frontier path costs cents per document at the API boundary. The local path costs zero at the API boundary. That is the only place the ledgers are even close to symmetric.
Operators choosing local for DocDrop-class work usually have at least one of these in mind:
Reasons 1, 2, and 4 are clean and survive any cost analysis. Reason 3 is the one that gets confused with "free" — and is the one this paper line is about.
The local lane is not free at the resource level. The cost lives in several places that an API-boundary view misses.
/no_think) and monitored (the suppression-regression probe Project Phoenix built); even when suppressed, the suppression contract has to be tested on a scheduleSLOW_IS_NOT_SMART.mdLocal lane is the right deployment when privacy or availability constraints are real and load-bearing:
In any of these cases, the resource cost is the cost of doing it right. The claim is not "do not deploy local." The claim is "deploy local because of privacy, not because of free."
When the privacy argument does not apply — work that is not sensitive, data the operator is allowed to send to a frontier API, batch sizes where API spend would be trivial — the cost case has to stand on its own merits.
In those cases, "use local because it is free" often does not survive a real volume × cost-per-call vs. capex + ops-time accounting. The honest framing is that the operator is paying with hardware, electricity, and wall-time instead of with API tokens. That can still be the right choice, but it is not free.
The paper does not need to moralize about this. The corrective is just making the ledger visible.
The PPR Lane 2 packet measured a bounded tool-dispatch task on local gemma4:26b versus a frontier-native function-calling lane. The headline is not subtle: the local lane paid a **2.0x retry penalty** and carried much more prompt-maintenance overhead to preserve strict machine-facing behavior.
That does not make the local lane wrong. It makes the deployment reason matter. If the task is privacy-bounded, offline, or contractually restricted, the extra cost may be justified. If the task is ordinary non-sensitive dispatch, the local lane has to win on some other ledger.
DocDrop is the stronger privacy exhibit because the real production lane is privacy-bounded by design. Sensitive documents remain local. The frontier comparison therefore uses synthetic .docx fixtures only, while the local lane uses the same synthetic fixtures for apples-to-apples measurement.
The 2026-05-01 ledger compared:
| Lane | Model | Success | Input tok | Output tok | Total tok | Wall (s) |
|---|---|---|---|---|---|---|
| local | gemma4:26b | 1 / 3 | 1502 | 2553 | 4055 | 25.083 |
| frontier | gemini-2.5-flash | 3 / 3 | 1460 | 757 | 2217 | 10.224 |
The local lane used:
schema_fail, parse_fail)The frontier lane succeeded on all three documents. That is exactly why the privacy argument must stand on privacy, not on an inherited claim that local is cheaper or simpler.
The accounting does not stop at tokens. The local lane also requires:
/no_think must keep doing what the lane depends on)Project Phoenix treats those obligations as engineering work, not as hidden externalities. That is the point: privacy can justify the local lane, but it does not erase the cost of operating it.
This paper does not argue against local LLM deployment. It argues against the wrong justification for local LLM deployment.
The correct argument for DocDrop is:
The incorrect argument is:
That chain fails under measurement.
Privacy is worth paying for.
That is stronger, more honest, and more durable than saying privacy is free.