Data Clustering for Fitting Parameters of a Markov Chain Model of Multi-Game Playoff Series

  • Christopher M Rump
Published/Copyright: January 10, 2008

We propose a Markov chain model of a best-of-seven playoff series in which the outcome of each game depends on the current status of the series. To create a relatively parsimonious model, we seek to group the transition probabilities of the Markov chain into clusters of similar game-winning frequency. To do so, we formulate a binary optimization problem that minimizes several measures of cluster dissimilarity. We apply these techniques to Major League Baseball (MLB) data and test the goodness of fit to historical playoff outcomes. These state-dependent Markov models improve significantly on probability models based solely on home-away game dependence. It turns out that a better two-parameter model ignores where the games are played and instead considers only, for each possible series status, whether the team with home-field advantage in the series has historically been the favorite (the more likely winner) in the next game of the series.
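To make the state-dependent idea concrete, the following Python sketch computes a series-winning probability when the chance of winning the next game depends on the current series status (wins so far for each team). It is an illustration under stated assumptions, not the paper's fitted model: the function names, the set of "favored" states, and the probability values are all hypothetical, and the clustering step that would choose the states from data is not shown.

```python
from functools import lru_cache


def series_win_probability(p_next, wins_needed=4):
    """Probability that team A wins the series, starting from 0-0.

    p_next(i, j) is the probability that team A wins the next game
    when A has i wins and B has j wins (a state-dependent transition
    probability of the Markov chain).
    """

    @lru_cache(maxsize=None)
    def win(i, j):
        if i == wins_needed:   # absorbing state: A has won the series
            return 1.0
        if j == wins_needed:   # absorbing state: B has won the series
            return 0.0
        p = p_next(i, j)
        return p * win(i + 1, j) + (1.0 - p) * win(i, j + 1)

    return win(0, 0)


def two_parameter_model(favored_states, p_favored, p_unfavored):
    """Two-cluster model: one probability for states where team A is
    the favorite in the next game, another for all remaining states.
    The choice of states and probabilities here is hypothetical."""

    def p_next(i, j):
        return p_favored if (i, j) in favored_states else p_unfavored

    return p_next


# Hypothetical example: team A is the favorite whenever it is not trailing.
favored = {(i, j) for i in range(4) for j in range(4) if i >= j}
model = two_parameter_model(favored, p_favored=0.55, p_unfavored=0.45)
prob = series_win_probability(model)
```

With a constant game-winning probability, the recursion reduces to the familiar binomial-style calculation for a best-of-seven series, which gives a simple sanity check on the sketch.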

Published Online: 2008-1-10

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Downloaded on 28.9.2025 from https://www.degruyterbrill.com/document/doi/10.2202/1559-0410.1087/html?lang=en