The Prisoner's Dilemma & Bitcoin Mining
Lesson by Uvin Vindula
The Prisoner's Dilemma is perhaps the most famous concept in game theory, and it provides a powerful lens for understanding Bitcoin mining incentives. It demonstrates how rational self-interest can lead to suboptimal outcomes — and how Bitcoin's design brilliantly solves this problem.
The Classic Prisoner's Dilemma
Two suspects are arrested and held separately. Each can either cooperate (stay silent) or defect (betray the other). The payoffs are:
| Partner Cooperates | Partner Defects | |
|---|---|---|
| You Cooperate | Both get 1 year | You get 10 years, partner goes free |
| You Defect | You go free, partner gets 10 years | Both get 5 years |
The dilemma: regardless of what your partner does, defecting is always better for you individually. But when both defect, both get 5 years — worse than if both had cooperated (1 year each). Individual rationality leads to a collectively worse outcome.
Mining as a Repeated Prisoner's Dilemma
Bitcoin mining can be modeled as a repeated Prisoner's Dilemma. Each miner faces a choice: mine honestly (cooperate with the network) or attempt to cheat (defect). Cheating could include trying to double-spend, mining empty blocks to attack rivals, or attempting a 51% attack.
In a one-shot game, cheating might be profitable. But Bitcoin mining is a repeated game — miners interact over thousands of blocks. In repeated games, strategies that punish defection become viable. A miner who cheats:
- Risks losing block rewards: Honest nodes will reject invalid blocks, wasting the cheater's electricity.
- Damages the network: A successful attack would crash Bitcoin's price, destroying the value of the attacker's own holdings and mining equipment.
- Faces retaliation: Other miners and node operators can identify and respond to attacks.
Why Honest Mining Wins
Satoshi's genius was designing Bitcoin so that the cooperative strategy (honest mining) is also the individually rational strategy. This solves the Prisoner's Dilemma by aligning individual incentives with collective welfare:
Cost of attack vs. reward: A 51% attack requires billions of dollars in hardware and electricity. Even if successful, the attacker can only double-spend their own transactions — they cannot steal others' Bitcoin or create new coins beyond the protocol's rules. Meanwhile, the attack would crash Bitcoin's price, making their mining hardware worthless.
Repeated game dynamics: Because miners plan to mine for years, short-term cheating is irrational. The long-term profits from honest mining vastly exceed any one-time gain from cheating.
Sunk costs: Miners invest heavily in specialized hardware (ASICs) that can only mine Bitcoin. This creates a massive incentive to protect the network's value rather than attack it.
The Sri Lanka Analogy
Sri Lankans understand this dynamic intuitively through everyday economics. A shopkeeper in Pettah market who cheats a customer might profit once, but loses that customer forever. In a repeated interaction (the customer comes back weekly), honesty is the more profitable long-term strategy. Bitcoin formalizes this same principle through mathematics and code.
Key Takeaways
- •The Prisoner's Dilemma shows how individual rationality can lead to collectively worse outcomes
- •Bitcoin mining is a repeated Prisoner's Dilemma where long-term honesty beats short-term cheating
- •The cost of a 51% attack vastly exceeds any potential gains from cheating
- •Miners' sunk costs in ASIC hardware create strong incentives to protect the network
- •Bitcoin solves the Prisoner's Dilemma by aligning self-interest with honest behavior
Quick Quiz
Question 1 of 3
0 correct so far
Why does honest mining win in Bitcoin's game theory?