09/10/26 Quick links about privacy and technology
09/10/25 Bayes' theorem
09/10/22 The return of Monty Hall
09/10/18 Monty Hall revisited
09/10/17 The Monty Hall problem
09/10/10 Doctor Who: Silence in the library / Forest of the dead
09/10/04 Quick links about privacy and technology
Privacy. Security. The FTC and blogger accountability. Everything else.
Privacy.
Gartner: Forget privacy, it is just an illusion
Burton Group: Gartner gets privacy dead wrong
Freedom to tinker: Privacy as a social problem, not a technology problem
PolicyBeta: Surveillance for profit and th incredible shrinking expectation of privacy
Ars technica: Anonymized genetic research data still carries privacy risks
The technology liberation front: Privacy polls v. real-world trade-offs
Tech and law: Profiling - draft recommendation on the protection of individuals with regard to automatic processing of personal data in the framework of profiling
Bottom-up: Privacy norms should come before privacy laws
Wired GeekDad: Is online privacy a generational issue?
The Register: Privay policy tool failed because of browser rejection
Visualisation & other methods of expression
Security.
DarkReading: Bankers gone bad - financial crisis making the threat worse
DarkReading: Passwords posted online by phishers
DarkReading: Databases' most serious vulnerability - authorized users
DarkReading: Factoring malware into your web application design
David Lacey's security blog: A new kind of security
The FTC and blogger accountability.
Ars technica: More transparency coming to blog reviews under new FTC rules
FastCompany: FTC responds to blogger fears - "That $11,000 fine is not true"
The technology liberation front: FTC to regulate blogger claims (I was not paid to say this)
The technology liberation front: What I don't get about the FTC's new blogger guidelines
Eric Goldman: Do the FTC's new endorsement/testimonial rules violate 47 USC 230?
MediaPost: Legal expert questions FTC's new blogger rules
Everything else.
Wired Threat level: Probe targets archives' handling of data on 70 million vets
Wired Threat level: Judge revives question of retail liability in Hannaford breach case
Wired Threat level: Credit card skimming survey - What's your magstripe worth
More math. Boys, girls and Bayes, oh my.
Since a vague disclaimer is nobody's friend: I have no formal training in mathematics, and all I know (or think I know) about Bayes' theorem is based on stuff that I found on the internet.
To learn about Bayes' theorem, this might be a good starting point:
Wikipedia: Bayes' theorem
And here's a Bayesian analysis of the Monty Hall problem (which was what got me interested in Bayes' theorem in the first place):
Wikipedia: Monty Hall problem - Bayesian analysis
More about Bayes' theorem:
Stanford Encyclopaedia of Philosophy: Bayes' theorem
This problem is used on several Wikipedia pages as an example of the application of Bayes' theorem:
"Suppose there is a school having 60% boys and 40% girls as students. The female students wear trousers or skirts in equal numbers; the boys all wear trousers. An observer sees a (random) student from a distance; all the observer can see is that this student is wearing trousers. What is the probability this student is a girl?"
Wikipedia: Bayes' theorem - Simple example
I know next to nothing about mathematics, and I found this problem very useful in understanding (at least some of) the reasoning behind Bayes' theorem.
Bayes' theorem gives us the following formula for problems like these:
P(A|B) = P(B|A) P(A) / P(B)
In our case, A
means the child is a girl, B
means that the child wears trousers and P(A|B)
is the probability of A given B which is, in this case, the probability of a trouser-wearing child being a girl.
A visual representation of the variables:
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | ||
not B (child wears skirt) |
A visual representation of P(A|B)
, the answer we're looking for:
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | P(A|B) |
P(B|A)
is the probability of B given A or, in our case, the probability of any girl wearing trousers.
A (child is a girl) | |
B (child wears trousers) | P(B|A) |
not B (child wears skirt) |
This is what we know:
"The female students wear trousers or skirts in equal numbers; [...]"
This means that P(B|A)
is 0,5.
P(A)
is the probability of A or, in our case, the probability of any student being a girl.
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | P(A) | |
not B (child wears skirt) |
This is what we know:
"[...] a school having 60% boys and 40% girls as students."
This means that P(A)
is 0,4.
Multiplying P(B|A)
by P(A)
gives us P(A and B)
, the probability that A and B are both true or, in our case, the probability of any of the students being a trouser-wearing girl.
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | P(A and B) | |
not B (child wears skirt) |
P(B|A) P(A)
is 0,5 * 0,4 = 0,2.
P(B)
is the probability of B or, in our case, the probability of any student at the school wearing trousers.
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | P(B) | |
not B (child wears skirt) |
This is what we know:
"[...] a school having 60% boys and 40% girls as students. The female students wear trousers or skirts in equal numbers; the boys all wear trousers.
Since P(B)
isn't given, we'll have to determine it based on what we do know:
P(B|A)
, the probability of any girl wearing trousers;P(A)
, the probability of any student being a girl;P(B|not A)
, the probability of any boy wearing trousers;P(not A)
, the probability of any student being a boy.We can determine P(B)
using the following formula:
P(B) = P(B|A) P(A) + P(B|not A) P(not A)
P(B)
is 0,5 * 0,4 + 1 * 0,6 = 0,2 + 0,6 = 0,8.
Dividing P(B|A) P(A)
by P(B)
gives us P(A|B)
, the probability of A given B or, in our case, the probability of any child in trousers being a girl. And here's our solution:
A (child is a girl) | not A (child is a boy) | |
B (child wears trousers) | P(A|B) |
P(A|B)
is 0,5 * 0,4 / 0,8 = 0,25.
We've got answers, but what is the question?
The Monty Hall problem in its best-known form:
"Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, "Do you want to pick door No. 2?" Is it to your advantage to switch your choice?"
Whitaker, Craig F. (1990). [Letter]. "Ask Marilyn" column, Parade Magazine p. 16 (9 September 1990), as quoted on the Wikipedia page about the Monty Hall problem.
Wikipedia: Monty Hall problem
The problem as it is worded gives us information about four different things:
The wording of the problem raises a number of questions:
In order to answer the question, we'll need to assume that a number of things will be true whenever the game is played. Based on the information that we have, none of the following assumptions seems too far-fetched:
Based on these assumptions, we can chart everything that might happen in the game as follows:
Car | You | Host | Probability | Assumptions | Result of switching | Assumptions |
1 | 1 | 1 | 0 | 3B and / or 3C | n/a | n/a |
2 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
2 | 1 | 0 | 3C | n/a | n/a | |
2 | 0 | 3B | n/a | n/a | ||
3 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 1 | 0 | 3C | n/a | n/a | |
2 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 0 | 3B | n/a | n/a | ||
2 | 1 | 1 | 0 | 3B | n/a | n/a |
2 | 0 | 3C | n/a | n/a | ||
3 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
2 | 1 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | |
2 | 0 | 3B and / or 3C | n/a | n/a | ||
3 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 1 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | |
2 | 0 | 3C | n/a | n/a | ||
3 | 0 | 3B | n/a | n/a | ||
3 | 1 | 1 | 0 | 3B | n/a | n/a |
2 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 0 | 3C | n/a | n/a | ||
2 | 1 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | |
2 | 0 | 3B | n/a | n/a | ||
3 | 0 | 3C | n/a | n/a | ||
3 | 1 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | |
2 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 0 | 3B and / or 3C | n/a | n/a |
The numbers in the Car column represent the location of the car. The numbers in the You column represent the door picked by you. The numbers in the Host column represent the door opened by the host. The Probability column gives the probability for each combination of car location, door picked by you and door opened by the host. The leftmost Assumptions column lists the assumptions on which the probability is based. The Results of switching column lists, well, the expected results of switching, and the second Assumptions column lists the underlying assumptions.
When we add up all the odds in the table, we find a 6/18 (or 1/3) probability of losing by switching, and a 6/9 (or 2/3) probability of winning by switching.
Actually, we don't need the entire table to figure that out. All we need to know is the probability of a player picking the door with the car on their first guess. The 1/3 probability is based on the following assumptions:
If we prefer not to assume anything (at least not more than strictly necessary) we could also say that if P is the probability of a player picking the car on their first guess, then the probability of any player losing when they switch is P and the probability of any player winning when they switch is 1-P.
For switching to be a winning strategy, we also need to assume that the following applies:
Now, it's important to keep in mind that what we have here is the overall probability - out of 6,000 players who play the game and switch, we may expect 4,000 to win and 2,000 to lose (assuming that P=1/3).
Going back to the original problem:
"You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, "Do you want to pick door No. 2?" Is it to your advantage to switch your choice?"
The problem isn't asking for an overall probability of 6,000 players winning the game by switching. The problem is asking about what you should do, as you're on stage with Monty Hall, looking at two closed doors and one goat.
Let's take a look at the situation described in the original problem: you've picked door 1, the host has opened door 3 and he asks you whether you want to switch to door 2.
Car | You | Host | Probability | Assumptions | Result of switching | Assumptions |
1 | 1 | 1 | 0 | 3B and / or 3C | n/a | n/a |
2 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
2 | 1 | 0 | 3C | n/a | n/a | |
2 | 0 | 3B | n/a | n/a | ||
3 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 1 | 0 | 3C | n/a | n/a | |
2 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 0 | 3B | n/a | n/a | ||
2 | 1 | 1 | 0 | 3B | n/a | n/a |
2 | 0 | 3C | n/a | n/a | ||
3 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
2 | 1 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | |
2 | 0 | 3B and / or 3C | n/a | n/a | ||
3 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 1 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | |
2 | 0 | 3C | n/a | n/a | ||
3 | 0 | 3B | n/a | n/a | ||
3 | 1 | 1 | 0 | 3B | n/a | n/a |
2 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | ||
3 | 0 | 3C | n/a | n/a | ||
2 | 1 | 1/9 | 1A and 2A and 3A | you win | 4A and 4B | |
2 | 0 | 3B | n/a | n/a | ||
3 | 0 | 3C | n/a | n/a | ||
3 | 1 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | |
2 | 1/18 | 1A and 2A and 3A and 3D | you lose | 4A and 4B | ||
3 | 0 | 3B and / or 3C | n/a | n/a |
Once more, when we look at the odds in the table we find a 1/3 probability that you will lose by switching, and a 2/3 probability that you will win by switching.
In fact, we're talking about two distinct scenarios with a non-zero probability: if the car is behind door 1 you will lose by switching, and if the car is behind door 2 you will win by switching. The 1/18 probability of the car being behind door 1 is based on the fact that the host has a choice between opening door 2 and opening door 3, and on our assumption:
Assumption 3D is irrelevant to the unconditional approach to solving the problem, but it is very much relevant to the conditional approach. If the host likes door 3 and will open it whenever possible, the probability that you will win by switching could be as low as 1/2. If, on the other hand, the hosts dislikes door 3 and will only open it when he has no other choice, the probability of winning by switching could be as high as 1.
Though I usually don't do this, I updated yesterday's post to correct a number of errors.
Get the all new and improved version here:
09/10/17
I have a cold, couldn't sleep, and ended up browsing the Wikipedia pages about the Monty Hall problem. The discussion on the Talk pages is longer and more heated than you might expect, and I think I actually learned something.
"Look. Here are three doors. Behind one of them is a car, and behind the other two are goats. Pick one. If it's the one with the car, you win. Otherwise, you lose..."
"So, you've picked your door. Are you sure? Good. Now, I'm going to open one of the other doors. Oh, look, it's a goat..."
"I'm going to give you a choice. You can either stay with the door that you originally picked, or you can switch. What will it be?"
Let's say the following is true:
In that case, you should switch.
Let's look at the possibilities:
Your choice | Host's choice | Probability | You switch? | Result |
car | 1st goat | 1/6 | yes | you lose |
no | you win | |||
2nd goat | 1/6 | yes | you lose | |
no | you win | |||
1st goat | 2nd goat | 1/3 | yes | you win |
no | you lose | |||
2nd goat | 1st goat | 1/3 | yes | you win |
no | you lose |
Now, look at what would happen if you would always decide to switch:
Your choice | Host's choice | You switch? | Result | Probability |
car | 1st goat | yes | you lose | 1/6 |
2nd goat | yes | you lose | 1/6 | |
1st goat | 2nd goat | yes | you win | 1/3 |
2nd goat | 1st goat | yes | you win | 1/3 |
If you switch, you have a 2/3 chance of winning. If you don't switch, you have a 1/3 chance of winning.
Let's look at this in a different way. The host opened one door, which has a goat behind it. Behind one of the remaining closed doors is a car and behind the other one is a goat. Switching will either take you from the door with the car to the door with the goat, or from the door with the goat to the door with the car. When you initially picked your door, you had a 1/3 chance of picking the door with the car, and a 2/3 chance of picking a door with a goat. Therefore, there is a 1/3 chance that switching will take you from the door with the car to the door with the goat, and a 2/3 chance that it will take you from the door with the goat to the door with the car.
Now, let's say the following is true:
In that case, it doesn't matter what you do.
Once again, let's look at the possibilities:
Your choice | Host's choice | Probability | You switch? | Result |
car | 1st goat | 1/6 | yes | you lose |
no | you win | |||
2nd goat | 1/6 | yes | you lose | |
no | you win | |||
1st goat | car | 1/6 | n/a | you lose |
1st goat | 2nd goat | 1/6 | yes | you win |
no | you lose | |||
2nd goat | car | 1/6 | n/a | you lose |
2nd goat | 1st goat | 1/6 | yes | you win |
no | you lose |
There's a 1/3 chance that you won't even be offered the option to switch, since the host will have opened the door that has the car behind it. Again, let's look at what would happen if you would always decide to switch whenever you are given the option:
Your choice | Host's choice | You switch? | Result | Probability |
car | 1st goat | yes | you lose | 1/4 |
2nd goat | yes | you lose | 1/4 | |
1st goat | 2nd goat | yes | you win | 1/4 |
2nd goat | 1st goat | yes | you win | 1/4 |
If you do get the option of switching, it doesn't matter what you do: you'll have a 1/2 chance of winning either way.
Let's say the following is true:
In this scenario you will lose if you switch and win if you don't.
Consider the following scenario:
Since the host is going to be displaying specific behaviour in this scenario, let's look at his choices first:
Your choice | Car | Host's choice | Probability |
1st door | 1st door | 2nd door | 1/9 |
2nd door | 3rd door | 1/9 | |
3rd door | 2nd door | 1/9 | |
2nd door | 1st door | 3rd door | 1/9 |
2nd door | 1st door | 1/9 | |
3rd door | 1st door | 1/9 | |
3rd door | 1st door | 2nd door | 1/9 |
2nd door | 1st door | 1/9 | |
3rd door | 1st door | 1/9 |
Let's juggle that around a bit, so it becomes more obvious what the host's choices tell you about the whereabouts of the car:
Your choice | Host's choice | Car | Probability |
1st door | 2nd door | 1st door | 1/9 |
3rd door | 1/9 | ||
1st door | 3rd door | 2nd door | 1/9 |
2nd door | 1st door | 2nd door | 1/9 |
3rd door | 1/9 | ||
2nd door | 3rd door | 1st door | 1/9 |
3rd door | 1st door | 2nd door | 1/9 |
3rd door | 1/9 | ||
3rd door | 2nd door | 1st door | 1/9 |
In three cases, your knowledge of the host's preferences will tell you where the car is. In each of these three cases you'll need to switch in order to win.
Let's look at what happens when you play the game, knowing what you do about the host's preferences:
Your choice | Host's choice | Car | Probability | You switch? | Result |
1st door | 2nd door | 1st door | 1/9 | yes | you lose |
no | you win | ||||
3rd door | 1/9 | yes | you win | ||
no | you lose | ||||
1st door | 3rd door | 2nd door | 1/9 | of course! | you win |
2nd door | 1st door | 2nd door | 1/9 | yes | you lose |
no | you win | ||||
3rd door | 1/9 | yes | you win | ||
no | you lose | ||||
2nd door | 3rd door | 1st door | 1/9 | of course! | you win |
3rd door | 1st door | 2nd door | 1/9 | yes | you win |
no | you lose | ||||
3rd door | 1/9 | yes | you lose | ||
no | you win | ||||
3rd door | 2nd door | 1st door | 1/9 | of course! | you win |
There's a 1/3 chance that the host's actions will tell you where the car is, and switching will make you win. In all other cases it doesn't matter what you do.
You can optimise your chances of winning, even if you know nothing about the host's preferences, by always switching:
Your choice | Host's choice | Car | You switch? | Result | Probability |
1st door | 2nd door | 1st door | yes | you lose | 1/9 |
3rd door | yes | you win | 1/9 | ||
3rd door | 2nd door | yes | you win | 1/9 | |
2nd door | 1st door | 2nd door | yes | you lose | 1/9 |
3rd door | yes | you win | 1/9 | ||
3rd door | 1st door | yes | you win | 1/9 | |
3rd door | 1st door | 2nd door | yes | you win | 1/9 |
3rd door | yes | you lose | 1/9 | ||
2nd door | 1st door | yes | you win | 1/9 |
Once again, switching gives you a 2/3 chance of winning.
Switching often gives you the best chance of winning.
But consider the following scenario:
Or the following scenario, as discussed earlier:
In both scenarios, you'll always lose if you switch. Also, if the host doesn't open a door before giving you the option to switch, the benefits of switching disappear (though switching may still be an effective strategy in specific situations).
Stay out of the shadows...
Read the review here:
Silence in the library / Forest of the dead (2008)
The FCC and net neutrality. Not the usual suspects - Sears, Netflix and privacy. Consumer attitudes on privacy and behavioral marketing. Lawyers and the magical powers of metatags. Everything else. And now for something completely different.
The FCC and net neutrality.
OpenInternet.gov: Preserving a free and open internet - a platform for innovation, opportunity and prosperity
The technology liberation front: Genachowski - Internet principles aren't rules
The technology liberation front: Government thinks it can "preserve" internet
WSJ: Neutering the 'Net
Ars Technica: Editorial - "Network neutrality" or "network neutering"?
WaPo: The FCC's heavy hand
The technology liberation front
Manifest density: another day, another bad WaPo op-ed
Bottom-up: the semi-vibrant internet
Not the usual suspects - Sears, Netflix and privacy.
WalletPop: Sears gets mere wrist slap for alledgedly spying on customers
Schneier on Security: Sears spies on its customers
Freedom to tinker: Netflix's impending (but still avoidable) multi-million dollar privacy blunder
MediaPost: Netflix contest seen as posing privacy risk
Consumer attitudes on privacy and behavioral marketing.
Joseph Turow, Jennifer King, Chris Jay Hoofnagle, Amy Bleakley and Michael Hennessy: Americans reject tailored advertising and three activities that enable it (SSRN)
NYT: Two-thirds of Americans object to online tracking
Concurring opinions: Consumer attitudes on privacy and behavioral marketing
Lawyers and the magical powers of metatags.
Technology & Marketing Law Blog: Ninth Circuit groaner about metatags - Art Attacs v. MGA
Techdirt: Is it too much to expect judges in tech related cases to understand tech?
Everything else.
Office of the Privacy Commissioner of Canada: There are other social networks with privacy concerns
Wired threat level: Telco spy immunity up for grabs
Wired threat level: Lawsuit - copyright filtering technology infringes
Wired threat level: Linden lab targeted in Second Life sex-code lawsuit
Freedom to tinker: Breaking Vanish - a story of security research in action
MediaPost: Facebook to wind down Beacon to resolve privacy lawsuit
Ars technica: Not-so-anonymous speech: how to get yourself unmasked online
Wired Epicenter: DoJ asks court to nix Google Book Search settlement
Ars Technica: DRM's tentacles snare British HDTV
DarkReading: New free web service confirms theft of your identity
Slaw: Lawyer Twitter practices - 29 do's and don'ts
And now for something completely different.
Webdesigner depot: 30 artistic and creative résumés
On this page Transitional HTML 4.01 and CSS 1 are used. If you're seeing this text you either have CSS switched off in your browser, or you're using a browser that can't handle CSS. If you're using an older browser version, you might want to consider upgrading.