27 May 2019

Methods to the Madness

Because brackets are hard and chalk is for the faint of heart...

March Madness has come and gone and it was an exciting (albeit fairly chalk-ish) tournament! To get a little skin in the game, many people join bracket leagues where they pick a winner for each of the sixty-three match-ups in the bracket. A correct pick is worth 10 points in the round of 64, 20 in the round of 32, 40 in the Sweet Sixteen, 80 in the Elite Eight, 160 points in the Final Four, and 320 points in the championship. At the end of the tournament, the bracket with the most points wins. This year would have been my first victory, were it not for the non-existent three-pointer foul by Samir Doughty (thanks for nothing, Wahoos), so I decided to look into the different methodologies of bracket selection to hopefully improve my odds for next year. There are some pretty strange techniques out there (my personal favorite is which mascot would win in a fight), but the most logical one I could come up with uses the projections from our friends over at FiveThirtyEight:

Let’s say you have no previous experience with college basketball, but you still want to play in order to have something to talk about with coworkers/old high school buddies/random strangers. To get around your lack of experience, you decide to flip a coin to pick who wins each matchup, i.e. heads means the better seed wins, tails means the worse seed wins. However, to make sure you don’t pick a 16-seed going all the way to the Final Four, you decide to use the latest innovation in sports-related coinage: a self-biasing coin whose heads/tails probabilities shift based on the two teams in each matchup and FiveThirtyEight’s Elo ratings system. For simplicity’s sake, let’s ignore FiveThirtyEight’s travel and injury adjustments (partly because they seem to be shrouded in secrecy) as well as the four initial play-in games (because those games are pointless). Let’s also assume we only know the initial team ratings. So as an example, in the first round Gonzaga-FDU matchup (Go Zags), Gonzaga has a rating of 95.02 and Fairleigh Dickinson has a rating of 70.04. This would suggest that the probability of the coin coming up heads (and Gonzaga winning) would be

\[P_{H} = \frac{1}{1 + 10^{30.464*(70.04 - 95.02)/400}} = 98.8\%\]

and the probability of the coin coming up tails (and FDU winning) would be

\[P_{T} = \frac{1}{1 + 10^{30.464*(95.02 - 70.04)/400}} = 1.2\%.\]

For both the men’s and women’s tournaments, how much better does the specially designed self-biasing coin do compared to a plain old unbiased quarter? What about compared to “chalk” where the better seed always wins? To assess how this method would perform, we can apply it to the past three tournaments using a Monte Carlo simulation. Using the code located here, we can pick 10,000 brackets for each year and see what the distributions of point values look like for each year compared to chalk or an unbiased coin flip, distributions like the ones below!

March Madness Results - Chalk vs. Biased Coin

Season Gender Chalk Simulation Avg. Simulation St. Dev.
Unbiased Coin 314.9 134.7
2017 Men 820 680.4 221.7
2017 Women 940 930.4 176.5
2018 Men 810 662.7 261.4
2018 Women 1190 924.7 174.4
2019 Men 920 802.8 248.9
2019 Women 1470 1218.3 252.6

As you may have already guessed, the unbiased coin flip (blue traces) does not do well at all, producing far fewer points than our technically advanced currency regardless of the season. Even when we compare the biased coin results (solid lines) to chalk results (dashed lines), we can see that, on average, the sophisticated FiveThirtyEight method comes close but also falls short of a chalk bracket. However, where the benefit of this method can be seen is in the width of these distributions. If we normalize the three biased coin distributions relative to the chalk point values for the respective years and average them together, this produces the graph below. Again, for both the men’s and women’s tournaments, the average ratio is below one, but 24.7% of the men’s brackets and 21.8% of the women’s brackets earn more points than a chalk bracket. Picking chalk may give you more points on average, but if you want to win in a bracket league of ten people, you have to take risks.

The real test would be to compare this method’s results to the actual brackets that fanatical sports enthusiasts submit, but I’ll leave that for next year’s analysis. Hopefully this method helps all of us in next year’s basketball-related spring psychosis!

The python code and simulations used to generate the values and figures above are available on my GitHub page.