The Monty Hall Problem

A Counterintuitive Probability Brain Teaser

Author

Röx

Published

December 29, 2025

The Monty Hall Problem is a famous probability puzzle based on a game show scenario. It is named after Monty Hall, the original host of the television game show “Let’s Make a Deal.” The problem is known for its counterintuitive solution, which often surprises people.

The problem was first posed by Steve Selvin in a letter to the journal “The American Statistician” in 1975. It gained widespread attention after being featured in Marilyn vos Savant’s “Ask Marilyn” column in Parade magazine in 1990.

Marilyn’s response sparked a significant public debate, as many readers found the solution counterintuitive and disagreed with her explanation. The Monty Hall Problem has since become a classic example in probability theory and is often used to illustrate the importance of considering all available information when making decisions.

Savant’s solution has been hotly contested and debated, with many people finding it counterintuitive. But why is it so?

The basic problem statement

The original problem is as follows:1. You are a contestant on a game show. 2. There are three doors: behind one door is a car (the prize you want), and behind the other two doors are goats (which you don’t want). 3. You pick one door (say, Door 1). 4. The host, who knows what’s behind all the doors, opens another door (say, Door 3) that has a goat behind it. 5. The host then gives you the option to switch your choice to the remaining unopened door (Door 2) or stick with your original choice (Door 1).

What should you do to maximize your chances of winning the car?

The immediate intuition might suggest that since we now have two doors to choose from, the odds are 50/50, and it doesn’t matter whether we switch or stay. However, this is not the case.

The correct solution is that you should always switch doors. By switching, you have a 2/3 chance of winning the car, while if you stick with your original choice, you only have a 1/3 chance. This is the solution given by Marilyn vos Savant. However, this solution is dependent on the exact problem statement given. If the problem statement is changed, in particular the host’s behavior, the solution may also change.

The disagreements and debates around the Monty Hall Problem often stem from misunderstandings of the problem’s conditions and the role of the host’s actions. Many people assume that the host’s choice to open a door is random, but in reality, the host always opens a door with a goat behind it, which provides additional information that affects the probabilities.

It’s all about what information is known

The fundamentals of probabilities and conditional problems are directly related to what information is known and by whom.

In the beginning, we have three doors, and the probability of the car being behind any door is 1/3. When you pick a door, you have a 1/3 chance of having picked the car and a 2/3 chance of having picked a goat. We do not know where the car is, and do not have any other information at this time to help us make a better choice. The probabilities are unformly distributed between all options.

However, after we make our initial choice, the host’s actions are not random. Therein lies the key to understanding the problem. The host knows where the car is and will always open a door that has a goat behind it. This action provides us with new information that changes the probabilities. We can reinforce this understanding by considering the following: 1. The host will never open the door you initially picked. 2. The host knows where the car is and will always open a door with a goat behind it.

Out of the two doors we did not choose, the host’s choice between those two doors provides valuable information to our problem which is not random, and thus allows us to update our probabilities for the remaining two unopened doors.

However, had the host opened a door at random, even possibly the one we initially picked, the probabilities would remain 50/50 between the two remaining doors, as no new information would have been provided that allows us to updated our probabilities from the initial uniform probabilities.

Consider also an innocent bystander who knows nothing about what has happened, which door the contenstant chose and just sees two doors left unopened. For this bystander, the probabilities are indeed 50/50 between both doors - as they have no additional information - but for you as the contenstant, the probabilities are 2/3 in favour of switching from your initial choice.

The solution

So, here is the full solution.

We have gone over the probabilities at the initial choice of door by the contestant, the one chosen has probability \(p=1/3\) of having the car, the other two doors have a combined probability of \(1-p=2/3\) of having the car. The probabilities add up to 1 since there is only one car and one door must be openend - i.e. something will happen, either a car is revealed or not.

Although there is 2/3 probability of the car being behind either of the other two doors, we don’t know which one out of those two to pick. We need more information to make a better choice.

However, the host’s knowledge and actions will help us with this problem. Since the host will only choose a door out of the two unchosen doors, and will never open the door with the car - should it be behind either of them - the other door he did not choose becomes more valuable to us, since the full 2/3 probability of the car being beind either of the two unchosen doors now tranfers fully to the door not chosen by the host. We now don’t have to choose between the other two doors, the host has eliminated one for us free of charge. This is called variable change - i.e the actions by the host changes the variables of our problem.

There is a famous classroom scene in the movie “21” from 2008 where this is explained quite well (see below). There is also a great numberphile video explaining the problem and solution in detail.

Here is the scene from 21:

Here is the numberphile video:

The scene from 21 is also of interest from the other topic discussed, namely Newton’s method - or Newton-Raphson method - for finding roots of functions and the possible issues with that method. This is a topic for another time though.

Different host behavior

It is important to note that the solution to the Monty Hall Problem is dependent on the host’s behavior. If the host’s actions were different, the probabilities and optimal strategy could change. For example, if the host were to open a door at random, without knowledge of what is behind the doors, the probabilities would indeed be 50/50 between the two remaining doors after one is opened, and switching or staying would not matter. In other words, such behaviour by the host would not provide any additional information to the contestant.

We can look at various different host behaviors and see how they affect the solution:

Host always opens a door with a goat (standard Monty Hall Problem): The optimal strategy is to always switch, giving a 2/3 chance of winning the car.
Host opens a door at random: The optimal strategy is to stick with the original choice, as the probabilities are 50/50 between the two remaining doors.
Host only opens a door if the contestant initially picked the car: In this case, the optimal strategy is to always stick with the original choice, as switching would lead to a guaranteed loss.
Host opens a door only if the contestant initially picked a goat: Here, the optimal strategy is to always switch, as switching would lead to a guaranteed win.

These variations illustrate how the host’s behavior can significantly impact the probabilities and optimal strategies in the Monty Hall Problem.

Reinforcement through simulation

To further reinforce the understanding of the Monty Hall Problem, we can simulate the game multiple times to see how often switching versus staying leads to winning the car. Below is a simple Python simulation that demonstrates this. The contestant and host choices are randomly (pseud-randomly) generated to mimic the game scenario. The simulation is run for a large number of trials to get statistically significant results, providing a point estimate of the probability of choosing the right door \(p\) vs. switching \(1-p\).

This code is essentially simulating a Bernoulli process where each trial results in either a win or a loss based on the chosen strategy (switching or staying). By running the simulation multiple times, we can estimate the sample probabilities of winning for each strategy, taking advantage of the law of large numbers for the sample probabilities convering towards the true probabilities.

Run the following code cell by clicking “RUN CODE” in the upper right corner. Adjust the code as you please to experiment with different numbers of trials or strategies.

/Röx