The Number They Threw Away

Three layers of this ground look unrelated. A medical test that is “99% accurate” and is right about half the time. A p-value that almost nobody — including the people who report it — can read correctly. A fleet of bombers that came home covered in bullet holes, and an Air Force that wanted to armor exactly the wrong places.

They are the same mistake. Each one quotes a conditional probability and reads it backwards — takes P(B | A), the chance of the evidence given the cause, and treats it as P(A | B), the chance of the cause given the evidence. Those are different numbers. The lawyer's version even has a name — the prosecutor's fallacy: “the chance of this DNA match by coincidence is one in a million, so the chance he's innocent is one in a million.” The first clause is P(match | innocent); the verdict needs P(innocent | match); the swap is silent and the floor falls out.

And in every case the reversal is made possible by the same thing — and that thing is exactly what the naive reading discards. It is the base rate: how common the cause was before any evidence arrived. Drag it below and watch the two readings come apart.

The two readings of one arrow — set the world, then read the arrow both ways

The arrow, forward

P(B | A)

99%

what gets quoted

The arrow, backward

P(A | B)

50%

what was actually asked

everyone who tests positive →

cause present & flagged cause absent & flagged (false alarms)

base rate P(A)1%

how common the cause is before any evidence

P(B | A) — hit rate99%

the evidence shows up when the cause is present

P(B | not A) — false-alarm rate1%

the evidence shows up even when the cause is absent

Read it. The forward arrow and the backward arrow are different numbers, and the gap is set by the base rate you just dragged. Pull the base rate toward zero — the cause is rare — and the backward reading collapses while the forward reading never moves. That is the whole trap: the quoted number, however impressive, says almost nothing about the question asked, until you put the base rate back in.

The hinge — why the base rate is not a footnote

Here is the fact under the instrument, and you can confirm it by dragging: hold the forward number fixed and the backward number can still be anything. A test that fires on 99% of the sick and only 1% of the healthy — leave those dials alone — gives a “chance you're sick” that runs from under 0.1% to over 99.9% as the base rate sweeps from rare to common. The forward number pins nothing about the backward one. The base rate isn't a correction to the answer; in a real sense it is the answer.

The bridge between the two readings is a single identity — Bayes' theorem — and the base rate P(A) is the term sitting right in the middle of it:

P(A | B) = P(B | A) · P(A) / P(B) To turn the arrow around you must multiply by the base rate. Throw it away and the turn is impossible — which is exactly what each of the three readings below does.

The same number, thrown away three times

Switch the instrument between the three lenses above and the sliders relabel, but the engine never changes. What changes is the name of the number that got discarded.

Three layers · one error · one missing number

Lens 1 · the test

The Positive Test That's Probably Wrong

Thrown away: the prevalence. A 99%-accurate test for a 1-in-100 disease leaves a positive result a coin flip — exactly 50%. Real doctors, asked the 1-in-1000 version, said 95%; the answer is under 2%.

Lens 2 · the p-value

The Null World

Thrown away: the prior over hypotheses — and here it's worse, because P(H₀ | data) isn't even defined without it. A p-value of 0.05 is consistent with the null still being true at least 29% of the time.

Lens 3 · the bombers

The Planes That Didn't Come Back

Thrown away: the exposure baseline — and the planes that never returned to be counted. Armor where the survivors are unhit. The most-damaged region on the survivors can be the least lethal one.

The p-value lens is the sharpest, so it's worth being exact about it. The other two throw the base rate away and get a wrong number; the p-value throws away something it never had. A p-value is honestly P(data this extreme | the null is true) — a forward arrow. The question everyone actually wants answered is P(the null is true | this data) — the backward arrow. But that backward arrow cannot be computed at all without a prior over the hypotheses, which classical testing refuses to supply. The Null World shows the floor: across every alternative, with no prior favoritism, a result at p = 0.05 still leaves the null true at least 28.9% of the time (the Sellke–Berger–Bayarri bound). The famous 0.05 and the ≥0.29 people read it as are not close. The distance between them is the missing prior.

Why this is one piece, not three

The Wasteland already holds these three layers separately, each true and checked on its own. What none of them says — because each is busy with its own subject, its medicine or its statistics or its aircraft — is that they are the same shape. The reason the bombers and the blood test rhyme isn't a metaphor; it is literally P(B|A) ↦ P(A|B) performed wrong, and the literal cure is to reinstate the same term, P(A), in the same identity. That's what a portal is for: to supply the load-bearing thing that's true of the whole and stated in none of the parts. Once you've seen it, the prosecutor's fallacy, the false-positive paradox, base-rate neglect, and survivorship bias stop being four entries in a list of biases and become one thing wearing four coats.

This place already holds a broader portal next door — The Condition You Weren't Told — which walks the wide version of this idea: any deleted baseline, across probability, economics, and physics (a host's protocol, a base rate, a system of governance, a question of scale). This one is the deliberate zoom. It takes a single rung of that spine and makes it the whole subject — not just that a baseline was dropped, but that the dropping is always the same arithmetic move, a conditional read in reverse, cured by the same identity — and it carries one case the broader portal does not: Wald's bombers. Read that one for the breadth; this one for the single mechanism, worked both directions at once.

It is also why the rule of this place — never lie about anything real, and show the check — is not a constraint bolted on after. The whole error above is what happens when a true forward statement (“the test is 99% accurate”) is allowed to imply a false backward one. Keeping the base rate in view is, exactly, the discipline of not letting a true thing imply a false thing. The moat and the math turn out to be the same move.

Show the check — every number on this page is recomputed offline

The forward reading P(B|A) fixed at 99/1, the backward reading P(A|B) sweeps from <0.1% to >99.9% as the base rate moves — the forward number constrains nothing.
The live engine matches a brute-force count over a 1,000,000-person cohort (no formula) to 6 decimals.
Lens 1: 99%-accurate test, 1% prevalence → P(disease | +) = exactly 50%. Casscells 1978 (1/1000, Se 100%, FPR 5%) → 1.96%. Eddy 1982 → 7.8%.
Lens 2: two-sided exact binomial p for 60 heads in 100 flips = 0.0569; Sellke–Berger–Bayarri minimum P(H₀ | data) at p=.05 with equal priors = 28.93% (≥29%).
Lens 3: survivor holes qᵢ ∝ aᵢ(1−vᵢ); a region with the most survivor damage (fuselage) is the least lethal; keeping the exposure baseline aᵢ recovers the true lethality order.

15/15 checks pass · node research/the-number-they-threw-away/verify.mjs · sources: Casscells, Schoenberger & Grayboys (1978); Eddy (1982); Sellke, Bayarri & Berger (2001); Wald (1943, Statistical Research Group). The three member layers carry their own verifiers.

The other portals in this place — the form, reused

Portal · physics

Before You Looked

Three experiments refute one classical assumption — counterfactual definiteness — that none of them names.

Portal · pattern

Any Loop You Can Draw

The nontransitivity spine: three games where “better than” goes in a circle, with McGarvey's theorem under all of them.

Portal · number

The Common Measure

Three faces of one algorithm — anthyphairesis under √2, the fifth, and φ.