Classify incentive problems into stag hunts and prisoner’s dilemmas

game-theory — 4 minutes read — 07 Apr, 2025
🪴 Plant

In the limited time I’ve spent learning about game theory, these are some of the core ideas I’ve absorbed:

(a) to look at every encounter as a game with upsides and downsides due to various actions.

(b) to truly win the game, minimise the downsides and increase the upsides as much as possible.

This is the crux of what I have understood, and it has been a highly influential heuristic in my decision making.

The first thought experiment everyone encounters when they hear “game theory” is the prisoner’s dilemma. As a recap, this is a situation where two suspects are arrested and interrogated separately:

  • If both stay silent (co-operate), they get 1 year each.
  • If one betrays (defects) while the other stays silent, the betrayer goes free and the silent one gets 10 years.
  • If both betray, they get 5 years each.

Sentences are listed as row player / column player:

| | Cooperate (Silent) | Defect (Betray) |
| --- | --- | --- |
| Cooperate | 1y / 1y | 10y / 0y |
| Defect | 0y / 10y | 5y / 5y |

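If it’s a single-shot prisoner’s dilemma (i.e. it happens only once), the best strategy is to defect: whatever the other suspect does, betraying gets you a shorter sentence than staying silent. This holds as long as the payoffs are ordered as (temptation to defect > mutual cooperation > mutual defection > being the lone cooperator, the worst possible outcome).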
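As a quick sanity check, here is a minimal Python sketch that confirms defection is the best response to either choice by the other suspect. The sentences come from the table above. The names `YEARS`, `best_response` and `dominant_action` are my own, purely for illustration.

```python
# Years in prison as (row player, column player); lower is better.
# Values are copied from the prisoner's dilemma table in this post.
YEARS = {
    ("cooperate", "cooperate"): (1, 1),
    ("cooperate", "defect"): (10, 0),
    ("defect", "cooperate"): (0, 10),
    ("defect", "defect"): (5, 5),
}
ACTIONS = ("cooperate", "defect")


def best_response(opponent_action):
    """Return the row player's action that minimises their own sentence.

    Ties are not handled specially; this table has none.
    """
    return min(ACTIONS, key=lambda mine: YEARS[(mine, opponent_action)][0])


def dominant_action():
    """Return the action that is a best response to every opponent action, or None."""
    responses = {best_response(theirs) for theirs in ACTIONS}
    return responses.pop() if len(responses) == 1 else None


print(dominant_action())  # defect
```

Since the same action wins against both of the opponent's choices, expectations about the other player do not matter here.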

Contrast this with another scenario, the stag hunt. In this example, two hunters can:

  • Cooperate to hunt a stag (big reward), but it requires both.
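  • Hunt a hare alone (small reward), which is guaranteed.

Payoffs are listed as row hunter / column hunter:

| | Hunt Stag | Hunt Hare |
| --- | --- | --- |
| Hunt Stag | 4 / 4 | 0 / 3 |
| Hunt Hare | 3 / 0 | 3 / 3 |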

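So the best payoff comes when both hunters coordinate on the stag. But if each expects the other to hunt hare, neither gains by switching, so that outcome is stable too.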
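To see which outcomes are stable, here is a small Python sketch that checks every cell of the stag hunt table for a pure-strategy Nash equilibrium, meaning a cell where neither hunter gains by switching alone. The payoffs are from the table above. The function names are hypothetical.

```python
from itertools import product

# Payoffs as (row hunter, column hunter); higher is better.
# Values are copied from the stag hunt table in this post.
PAYOFFS = {
    ("stag", "stag"): (4, 4),
    ("stag", "hare"): (0, 3),
    ("hare", "stag"): (3, 0),
    ("hare", "hare"): (3, 3),
}
ACTIONS = ("stag", "hare")


def is_equilibrium(row, col):
    """True if neither hunter can do better by unilaterally changing action."""
    row_payoff, col_payoff = PAYOFFS[(row, col)]
    row_ok = all(PAYOFFS[(alt, col)][0] <= row_payoff for alt in ACTIONS)
    col_ok = all(PAYOFFS[(row, alt)][1] <= col_payoff for alt in ACTIONS)
    return row_ok and col_ok


def pure_equilibria():
    """List every action pair that is a pure-strategy Nash equilibrium."""
    return [pair for pair in product(ACTIONS, ACTIONS) if is_equilibrium(*pair)]


print(pure_equilibria())  # [('stag', 'stag'), ('hare', 'hare')]
```

Two stable outcomes and no single best action: that is why the stag hunt comes down to what each hunter expects of the other.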

| Feature | Prisoner’s Dilemma | Stag Hunt |
| --- | --- | --- |
| Type of game | Dilemma of incentives | Dilemma of coordination |
| Risk | Getting exploited by a defector | Being abandoned in cooperation |
| Dominant strategy | Defection | None (depends on expectations) |
| Equilibrium outcome | (Defect, Defect) | Either (Stag, Stag) or (Hare, Hare) |
| Cooperation motive | Must overcome self-interest | Must overcome uncertainty/trust |

Let’s take an example of two tech startups planning to create a joint standard for “blockchain” ethics.

If we look at this in terms of the prisoner’s dilemma:

Each firm can cheat by ignoring the shared standard. If one cheats (ignores the standard to move faster) while the other cooperates (develops together), the cheater wins the market and the cooperator loses. Both can cooperate, but the incentive to defect never goes away.

In the same way, we can also look at it in terms of the stag hunt:

Both firms can either align on a unified blockchain safety protocol (which might lead to a big win), or do their own thing (which is less risky, but with a smaller gain).

The stag hunt framing applies if both parties benefit most from mutual cooperation, but suffer only minor losses if they act alone. The prisoner’s dilemma framing applies if defection yields a significant individual advantage, and mutual defection is net worse than cooperation.
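One rough way to apply this test is to write down four payoffs for a symmetric two-player situation and compare their ordering. The sketch below uses the conventional textbook labels, which are not from this post: reward (both cooperate), temptation (I defect, you cooperate), punishment (both defect) and sucker (I cooperate, you defect). The orderings it checks are the standard ones, and real situations will rarely come with clean numbers.

```python
def classify(reward, temptation, punishment, sucker):
    """Classify a symmetric 2x2 game by the ordering of its payoffs.

    Higher numbers are better. Parameter names follow the usual
    textbook convention and are an assumption of this sketch.
    """
    if temptation > reward > punishment > sucker:
        # Defecting beats cooperating no matter what the other side does.
        return "prisoner's dilemma"
    if reward > temptation >= punishment > sucker:
        # Mutual cooperation is best, but going alone is the safe option.
        return "stag hunt"
    return "other"


# Prison sentences from this post, negated so that higher is better.
print(classify(reward=-1, temptation=0, punishment=-5, sucker=-10))

# Hunting payoffs from this post.
print(classify(reward=4, temptation=3, punishment=3, sucker=0))
```

With the numbers used earlier in this post, the first call returns "prisoner's dilemma" and the second returns "stag hunt".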

So which is it: a stag hunt or a prisoner’s dilemma?

Reality is likely far more complex, and in this case the situation could alternate between a stag hunt and a prisoner’s dilemma.

Which one it is could depend on (a) the number of players: more actors means more incentive to defect, and (b) asymmetry in resources: if one company has more compute or data, it can afford to defect.
