Artificial Wasteland · Pattern

The Average That Never Arrives

A coin game with an infinite expected value — that no one in three centuries has been willing to pay much to play. The paradox isn't in the game. It's in the word average: this one game has three of them, and they disagree by everything between $2 and infinity.

Here is the whole game. A fair coin is tossed until it first comes up tails. The pot starts at $2. Every heads doubles it. The first tails ends the game and you take whatever is in the pot. So a tails on the first toss pays $2; heads-then-tails pays $4; two heads then tails, $8; and a first tails on toss k pays exactly 2k — with probability 2−k.

Now the question Nicolas Bernoulli asked in a letter in 1713, and his cousin Daniel answered in print from the St. Petersburg academy in 1738:1 what is a fair price to play once? Multiply each prize by its chance and add:

The expected value
E = ½·$2 + ¼·$4 + ⅛·$8 + … = $1 + $1 + $1 + … =
Each doubling of the prize is cancelled exactly by the halving of its chance, so every term contributes the same $1 — and the sum never stops. By the one rule every gambler is taught — pay less than the expected value — you should hand over your house, your city, everything, for one play. Nobody does. That refusal is the paradox.

01 — PLAY ITPlay the game, and watch the average refuse to settle

Flip by hand, or let it run thousands of times. Keep your eye on the average won per game — the one number that should, by every ordinary law of averages, settle down. It doesn't.

The game · pot doubles on heads, tails pays out
$2
Pot
$2
Press Toss to begin.
0
games
avg / game
log₂(games)
$0
biggest pot
your running average / game log₂(games) — the reference climb

Play a few by hand and it looks tame: most games pay $2 or $4, you feel like you're losing. Then run it a hundred thousand times. The average per game climbs — past 10, past 15 — and it climbs alongside the pale-blue curve log₂(games), always a couple of dollars above it, never levelling off. Reset and run it again and the same thing happens at a different height, because somewhere in there a single game paid a colossal pot and dragged the whole average up with it. There is no number the average is heading toward. That is not a bug in the simulation. It is a theorem — and we'll name it below.

02 — THREE AVERAGESOne game, three answers, all exact

"The average" is not one thing. Here are the three that matter, drawn straight from the exact distribution — no sampling, no noise.

The exact payout distribution · bar height = probability
$2
median
$4
geometric mean
arithmetic mean

The median — $2. Half of all games end on the very first toss, so half of all games pay exactly $2 and no more. The typical game — the one in the middle — is the cheapest possible outcome. (87.5% of games pay $8 or less; 96.9% pay $32 or less.) By the standard of the median player, this game is nearly worthless.

The geometric mean — $4. Instead of averaging the prizes, average their logarithms — the right thing to do when quantities multiply, or when what you feel is proportional change, not absolute dollars. Then E[log₂ payout] = Σ k·2−k = 2 exactly, so the geometric mean is 22 = $4. This is precisely Daniel Bernoulli's 1738 escape: he proposed that we value not money but its utility, which grows like a logarithm, and a logarithmic valuer prices this game at a small, finite figure.1 Gabriel Cramér had reached almost the same place in 1728 with a square-root valuation (≈$2.9).2

The arithmetic mean — ∞. The one every textbook computes, and the one no one obeys. It is infinite not because any prize is infinite — every prize is a finite power of two — but because the tail of rare, enormous jackpots is just heavy enough that its contribution never converges. Which average is "right"? The paradox's lasting lesson is that the question has no answer in the abstract: it depends entirely on whether you play once, play with borrowed money, or play a great many times — and on who is holding the bank.

03 — THE FINITE HOUSEWho can actually pay infinity?

No real casino has infinite reserves. If the house can pay at most W dollars, the pot is capped there — and the fair price collapses from infinity to something you could carry in a wallet. Drag the bankroll and watch.

The house's bankroll → the fair price of the capped game
bankroll W$1,048,576
$21 fair price to play once

A house holding $1,048,576 can afford at most 20 doublings — so the fair price is $21.

a millionaire's bank ($1.05M = 2²⁰)$21
a billionaire's bank ($1.07B = 2³⁰)$31
$1.1 trillion (= 2⁴⁰)$41
every dollar on Earth (~$450 trillion)$49
a dollar per atom in the universe (~10⁸⁰)$266

With the payout capped at 2L — where L = ⌊log₂ W⌋ is how many doublings the bank can survive — each affordable round still contributes exactly $1, and the truncated tail adds one more, so the fair price is exactly L + 1. The consequence is the quiet dissolver of the paradox: the value grows like the logarithm of the bankroll. To make the game worth one more dollar, the house must double its entire fortune. A banker as rich as the whole world economy makes this "infinitely valuable" game worth about the price of a paperback.3

04 — THE AVERAGE THAT NEVER ARRIVESWhy the running total refused to settle

Back to instrument 01 — the average that kept climbing. Ordinary randomness obeys the law of large numbers: average enough coin flips, dice, or heights and the running average homes in on a fixed value. The St. Petersburg game breaks that law, because its true mean is infinite — there is no fixed value to home in on.

In 1945 William Feller found what happens instead.4 Add up the winnings Sn from n games. Then

Feller's weak law (1945)
Sn / (n · log₂ n) → 1  in probability, as n → ∞.
Equivalently, the average per game Sn/n grows like log₂ n — exactly the pale-blue curve the simulation tracked. The right yardstick isn't n, as it is for every well-behaved average; it's n·log₂ n, and the extra log₂ n is the signature of the heavy tail.

So a "fair fee" exists only for a fixed session length: to play n games, pay about log₂ n per game. Play a thousand and roughly $10 each is fair; play a million and it's about $20; and as n grows the fair fee grows with it, without bound. The single-play fair price the Bernoullis hunted for doesn't exist because the fair price depends on how long you intend to play, and keeps rising.

And Feller's is only a weak law — convergence "in probability," which tolerates rare, wild excursions. There is no strong law here at all: the ratio Sn/(n·log₂ n) does not settle down for an individual endless player. Its lim sup is infinite almost surely — every so often, forever, a jackpot arrives large enough to yank the lifetime average back up (Chow & Robbins, 1961).5 That is the precise sense in which the average never arrives. It is always still climbing, and always still being ambushed by a prize you hadn't yet seen.

A modern reading pushes on the phrase "if you play long enough." Ole Peters argued in 2011 that a single person living through repeated plays experiences the time-average growth of their wealth, not the ensemble average across parallel universes — and that time-average, computed without any utility function, lands on the same finite value Bernoulli got from logarithms, tying the paradox to the Kelly betting criterion of 1956.6 It's an elegant lens, and a contested one — decision theorists still reach first for Bernoulli and Menger, and Peters' framing has published rebuttals.7 We flag it as a live debate, not a settled verdict — because the one thing this page will not do is tell you which average to believe.

✓ Show the check

Every number here is recomputed live in this page and, independently, proven offline in research/st-petersburg-paradox/verify.mjs31 / 31 checks green. The exact facts (mean = 1+1+1+…, median = $2, E[log₂]=2 so geometric mean = $4, capped fair price = ⌊log₂ W⌋+1) are settled by arithmetic; Feller's asymptotic is checked by a seeded simulation against its structure — a running average that keeps climbing, tracks log₂ n, and whose mean-across-lifetimes stays stubbornly above the typical one. The 2k payout convention is used throughout; the 2k−1 variant halves the median (to $1.50) and the geometric mean (to $2), but the arithmetic mean is infinite either way.

Sources & notes.

1. Nicolas Bernoulli posed the problem to Pierre Rémond de Montmort (letter of 9 Sept. 1713). Daniel Bernoulli, "Specimen theoriae novae de mensura sortis," Commentarii Academiae Scientiarum Imperialis Petropolitanae, Tomus V (the volume "for the years 1730 and 1731," actually published 1738), pp. 175–192 — the paper the paradox is named for, after the St. Petersburg academy that printed it. English: L. Sommer trans., "Exposition of a New Theory on the Measurement of Risk," Econometrica 22 (1954), 23–36.

2. Gabriel Cramér, letter to Nicolas Bernoulli, 21 May 1728, proposing decreasing marginal utility (a square-root valuation gives ≈$2.9). Karl Menger later (1934) showed only a bounded utility escapes every inflated variant of the game.

3. Fair price of the game capped at bankroll W is ⌊log₂ W⌋+1, derived and checked in the verifier; the figures match the standard worked examples (a millionaire banker ≈ $20). "Every dollar on Earth" uses total global wealth ≈ $450 trillion.

4. W. Feller, "Note on the Law of Large Numbers and 'Fair' Games," Annals of Mathematical Statistics 16 (1945), 301–304; and An Introduction to Probability Theory and Its Applications, Vol. 1, ch. X. The normalizing sequence is n·log₂ n (base-2 log, tied to the doubling prizes).

5. Y. S. Chow & H. Robbins, "On Sums of Independent Random Variables with Infinite Moments and 'Fair' Games," Proc. Natl. Acad. Sci. USA 47 (1961), 330–335 — the failure of the strong law (the lifetime average is almost surely unbounded).

6. O. Peters, "The time resolution of the St Petersburg paradox," Phil. Trans. R. Soc. A 369 (2011), 4913–4931 (preprint arXiv:1011.4404). J. L. Kelly Jr., "A New Interpretation of Information Rate," Bell System Technical Journal 35 (1956), 917–926.

7. E.g. J. R. Varma, "Time Resolution of the St. Petersburg Paradox: A Rebuttal" (2013). The ergodicity-economics framing is a modern minority view; expected-utility (Bernoulli/Menger) remains the textbook resolution.

← the wasteland