Answer: more than I thought.
P(x|y) = eww, gross
I finished my first midterm of the semester this week. There was a decent section that revolved around calculating P(x|y)….in other words the probability of x if we know that y has already happened. In simple terms, if you have a probability of something happening (x), and something else that’s related to x happens (y), the probability of x happening changes.
This is not an overly complex concept when it’s spelled out mathematically, but in real life it can be hard for people to remember that improbable events are often far more probable if you consider what’s already happened.
I was reminded of this when I was reading Dear Prudence (Slate.com’s advice column) last week and came across this letter from a man (born by artificial insemination from an unknown sperm donor) who decided to seek out his father, only to discover that it was the same man who had donated to his wife’s mother. Oops. And gross. Seriously, gross.
What struck me as more though, was a response that was printed from another reader:
I know you/we cannot know, but color me skeptical that this letter is legit. The odds of such a “match” have to be very small. I can’t help but wonder if this letter is a fiction pushing a political agenda.
This seems true of course…I mean, it’s a sort of Casablanca moment right? Of all the gin joints in all the towns in all the world….
But let’s think about this couple for a second:
- They’d be the same age. I don’t know about sperm banks in particular, but I do know that most stored bodily fluids are only considered “good” for a few years…so if two people were to result from one donor, they would likely be around the same age. Most people tend to marry someone within a few years plus or minus their own age.
- They’d be from the same place. Sperm donors are unlikely to trit trot around the country donating to various centers….they’re more likely to stay in the city they actually live in. I have no particular experience with this, but I’d imagine that people looking for a sperm bank stick to their same town as well….which means these two children would like be raised in the same city, making their chances of meeting go up and the culture they were raised in more similar.
- Their mothers had a lot in common. From the letter, the couple is at least over the age of 30. From what I gather, it was less common for people to use sperm banks prior to 1980. According to this article, in 1987, only 5000 single women asked for donor sperm. At the time the letter writers mother and mother-in-law were looking, it was likely even fewer. This means the two shared a very unique background, and had mothers who were both counter-cultural enough to go forward with this. This would give them quite a bit in common.
- They’d likely have at least some similar hobbies, interests, and personality traits. I think most people would agree that at least some of our hobbies/interests/personality traits/taste in friends/what have you are more nature than nurture. I would thus think it extremely likely that two people sharing the same father would have at least one major hobby or interest in common, making it much more likely that their social circles would cross and that they’d have something to talk about when it did. The more you believe genetics influence who you eventually are, the better the chances they’d meet.
- People (might) like people who look like them. I actually had some trouble finding a good paper that proves this, but it’s a theory. Interestingly, the only scholarly article cited in support of this on the Wikipedia page actually turned out to say they found no evidence of this. This is why Wikipedia =/= research.
- Their parents would likely have supported the union. In-law issues are tough, but I’d imagine if you were a lesbian mother in the 80s and found out your adult son was going to marry the daughter of another lesbian mother from the 80s, you’d be pretty psyched. Also, they’d very likely have been in a similar socio-economic class, as both their moms were well off enough to pursue this avenue.
Who talks more…men or women?
There is nothing I love more than a clever phrase to describe a phenomena that bothers me. Last night I found such a phrase in a Jezebel article about gender differences in number of words spoken per day.
Weekend Moment of Zen 2-23-13
Friday Fun Links 2-22-13
Now that Valentine’s Day is over, I thought you might want to know how to break up with someone, data style.
Remember the Sims? Here’s what happens when they go wrong.
This is a little surreal looking, but these fMRIs of fetal brains are really interesting.
Going out to Utah reminded me that my first science crush was paleontology. Here’s a size comparison chart of people and dinosaurs. Expect more dinosaur mentions in the weeks to come.
Who do you believe, me or your brain?
A few years ago, right before the 2008 election, a friend of mine put the following up on Facebook:
Wednesday Brain Teaser 2-20-13
I am thinking of a number between 1 and 1300. It meets 3 criteria:
- It is a perfect cube.
- It is less than 500.
- It is a perfect square.
Include in your answer how much googling it took you.
Fox news: the channel you love to hate
At least according to this chart of people’s most and least trusted TV news sources:
Data, elections, and how to check your facts
I’ve been meaning to post something on David Brooks (Brooks’s? Brooks’?) column from a few weeks ago on the “Philosophy of Data”. A couple readers sent this to me (thanks all!) and I thought it was pretty interesting. He questions how the rise of big data is going to change things, and raises a few pertinent questions:
Over the next year, I’m hoping to get a better grip on some of the questions raised by the data revolution: In what situations should we rely on intuitive pattern recognition and in which situations should we ignore intuition and follow the data? What kinds of events are predictable using statistical analysis and what sorts of events are not?
I think those questions are relevant, and I was thinking about them when this cartoon popped up in my newsfeed on Facebook a few days ago:
- Figure out what problem you’re trying to solve
- Get a large relevant data set
- Analyze it until you get any numbers you can think of that might be helpful
- Find several rational people who are deeply embedded in the problem area
- Ask them what they think of the data, get the gut reaction
- Explain to them where you got the data, see if their reaction is the same
- Ask them if anyone they know would disagree with this data, and if so why
- Ask them if this helps them know how to proceed
- Ask them if there’s any other data that might be useful for this problem
- Go find that, repeat 5-9.
Interesting conference data of the week
So tonight’s the last night I’m at this conference (American Society for Bone Marrow Transplant if you’re curious), so I figured it would be an appropriate moment to share some interesting data issues that came up in the sessions I went to.
The most interesting one came from a group out of Johns Hopkins, in their talk about their combined inpatient/outpatient program. About 20 years ago now, they started to transition their transplant patients from one long inpatient stay to a shorter stay with a sort of intensive outpatient clinic follow up. This worked really well, cut costs, helped patients feel more autonomous, etc. What was interesting is that as they followed up on patients and how they did, they found that patients treated under this model did better on every single quality of life metric* except one: feelings about appearance.
Since there is no reason to believe they actually looked any different, the only conclusion they could reach is that the more “normal” people the cancer patients saw, the more acutely aware they were of how they looked. When they were in the hospital, they were surrounded by other patients, but on the outside they were exposed to more healthy people.
I thought that was an interesting example of how much quality of life measures can depend on what your environment is and who you’re being exposed to. We like to act like hapiness or contentment were definitive values that are totally internally generated, but they’re not. People compare themselves to others. We just can’t help it.
*The hard health measures (recovery, blood counts, etc) were the same with either method.



