Statistics in Baseball: Are pinch runners effective? Part 2 of 3

For the introduction (part 1), see my previous post. In this entry I'm going to estimate the increase in the probability of scoring provided by a pinch runner (PR). My next post (part 3) will attempt to answer the ultimate question: is this increase in probability enough to justify weakening the offense for future innings.

After some R coding to count the necessary information, I fit a multiple logistic regression model with:

I.run: binary response, the indicator of whether the run scored,
I.PR: an indicator of whether there was a pinch runner,
lead: the score difference between the two teams (either -1, 0, or 1). Maybe somewhat surprisingly this makes a difference,
outs: the number of outs when the runner first got on base (this is categorical because there's no reason outs would be related linearly to the log odds of scoring),
I.2nd: an indicator of whether the runner was on 2nd base (the alternative is the reference level, 1st base),
player: identity of the original runner (for parsimony I'm lumping all pinch runners together as being fast guys),

while limiting the data as described in the introduction.

I'm making the assumption that there are only three outcomes stemming from possibly pinch running: the run won't score either way, it will score either way, or the PR would score when the original player would've been stranded. I'm ignoring the possible effects - that is, on subsequent runs in this inning that are less likely because of the extra out - of the slow runner being more susceptible to getting thrown out on the bases (by a force play or otherwise). I imagine the frequency of being thrown out when the PR wouldn't have been is low compared to the frequency of times this slow runner just gets held up by the 3rd base coach when the PR wouldn't have been. Regardless, it might get too complicated figuring out how much this extra out affects the distribution of run scoring in the inning. However if I fail to find that pinch running is beneficial (in part 3), I may have to revisit this assumption. I can think of two different ways to weaken it:

instead of the binary response, the response could be a count of the number of runs scored in the inning subsequent to this runner reaching base. I balked at that for this entry because in an entry last year, I had trouble getting R to do multinomial regression with covariance matrices provided - but that data set was bigger, so it may be worth a shot in my next post,
instead of separating steps 1 and 2, just fit a binary logistic regression with win/loss as the response. This gets at the question directly, but I'm worried that the response will be pretty noisy.

I considered the past 15 seasons (1995-2009), where a total of 720 different players satisfied the following two conditions: (i) they were both pinch run for and allowed to run for themselves in the 8th inning or later of a tie or one-run game, and (ii) they scored or had their PR score. The second condition removes about half a percent of the population left after the first condition, and I think it creates a minor selection bias - non-PR situations are being deleted more than PR situations (because there are many more non-PR situations), and the proportion of deletions corresponding to successes is probably not exactly the same as the proportion left in the sample. But without the second condition it would be impossible to estimate a player effect, so I required it anyway.

There are a number of interactions that could potentially enter the logistic model, but only two interactions were close to significant (for more on the I.PR:player interaction see the end of this post); they are shown below. I used the default treatment contrasts in R for I.PR, lead, outs, and I.2nd, but sum contrasts for the players, so the intercept corresponds to the log odds of scoring for the average player, running for himself, after a lead-off single when trailing by one run. The parameter estimates and their p-values are listed below; the residual deviance is 32836 on 27742 degrees of freedom, making the overdispersion parameter estimate less than 1.09, so I didn't bother allowing for overdispersion. It would not have changed any of the conclusions anyway. I've included David Ortiz's estimate below because he will come up again in a moment.

intercept	-0.430	2e-16 ***
I.PR	0.086	0.161
lead0	0.104	0.004 **
lead1	0.216	2.48e-07 ***
outs1	-0.661	2e-16 ***
outs2	-1.476	2e-16 ***
I.2nd	0.774	2e-16 ***
outs1:I.2nd	-0.207	0.011 *
outs2:I.2nd	-0.305	0.002 **
I.PR:lead0	0.161	0.046 *
I.PR:lead1	0.146	0.209
Ortiz	0.674	0.010 *

The coefficient estimates of interest are the ones concerning I.PR. The main effect for I.PR says that exp(0.086) is the multiplicative change in the odds of scoring created by pinch running when lead=-1, and similarly adding the main effect to each interaction term allows estimation of the multiplicative change in the odds when lead=0 or 1. I expect the more meaningful number in part 3 will be the additive change in probability - this will be different depending on the situation. The following table compares the probability of scoring a run with no PR to the probability of scoring a run with a PR in all 18 situations. This is done for an average player. A Taylor series expansion was used to estimate the variance of the difference of the probability estimates, and normality was assumed to calculate the p-values for the one-sided tests of those differences.

situation	P(run) no PR	P(run) PR	p-value
lead=-1,base=1,outs=0	0.394	0.415	0.081
lead=-1,base=1,outs=1	0.251	0.268	0.082
lead=-1,base=1,outs=2	0.129	0.139	0.084
lead=-1,base=2,outs=0	0.585	0.606	0.080
lead=-1,base=2,outs=1	0.372	0.392	0.082
lead=-1,base=2,outs=2	0.192	0.206	0.084
lead=0,base=1,outs=0	0.419	0.480	0.000
lead=0,base=1,outs=1	0.271	0.323	0.000
lead=0,base=1,outs=2	0.142	0.174	0.000
lead=0,base=2,outs=0	0.610	0.667	0.000
lead=0,base=2,outs=1	0.396	0.457	0.000
lead=0,base=2,outs=2	0.209	0.252	0.000
lead=1,base=1,outs=0	0.447	0.504	0.011
lead=1,base=1,outs=1	0.294	0.344	0.013
lead=1,base=1,outs=2	0.156	0.189	0.016
lead=1,base=2,outs=0	0.636	0.688	0.009
lead=1,base=2,outs=1	0.423	0.481	0.012
lead=1,base=2,outs=2	0.228	0.271	0.015

The following table is the same as the previous one, except it's for David Ortiz instead of the average player. He often hit prior to Manny Ramirez or Kevin Youkilis (good hitters).

situation	P(run) no PR	P(run) PR	p-value
lead=-1,base=1,outs=0	0.561	0.582	0.080
lead=-1,base=1,outs=1	0.397	0.418	0.082
lead=-1,base=1,outs=2	0.226	0.241	0.086
lead=-1,base=2,outs=0	0.735	0.751	0.082
lead=-1,base=2,outs=1	0.537	0.559	0.080
lead=-1,base=2,outs=2	0.318	0.337	0.083
lead=0,base=1,outs=0	0.586	0.644	0.000
lead=0,base=1,outs=1	0.422	0.483	0.000
lead=0,base=1,outs=2	0.244	0.293	0.000
lead=0,base=2,outs=0	0.754	0.797	0.000
lead=0,base=2,outs=1	0.563	0.623	0.000
lead=0,base=2,outs=2	0.341	0.398	0.000
lead=1,base=1,outs=0	0.613	0.666	0.010
lead=1,base=1,outs=1	0.450	0.507	0.011
lead=1,base=1,outs=2	0.266	0.313	0.016
lead=1,base=2,outs=0	0.774	0.812	0.012
lead=1,base=2,outs=1	0.590	0.645	0.010
lead=1,base=2,outs=2	0.366	0.422	0.013

The Ortiz coefficient's standard error was 0.263, so these probability estimates may be larger than the true probabilities: and surely they are because even Manny Ramirez cannot hit about .400, which is what the two-out numbers above are suggesting, with RISP in the long term.

We can see from the low p-values in both tables that pinch running helps increase the chance of scoring, at least when the leading or tied. The PR effect is actually much bigger than I imagined it would be. Other than the large probability increases from the 1st table to the 2nd, there are a couple of interesting things. First, as the I.PR*lead interaction suggested, the PR effect is much smaller when the team is trailing; the raw probability of scoring is also smaller when trailing. I think the latter is easily explained by the fact that the opposing team's best pitchers tend to be their set-up man and closer, and these are the pitchers you face when trailing by one run in the 8th or 9th inning. Another less likely explanation might be that if you are losing by one run, you are not hitting as well on the day, and hence less likely to score. This gets into the "hot hand effect" though, and so I think the likely explanation is the opposing pitching.

I don't think the smaller PR effect in games where the team is trailing is as intuitive, but it probably also has a lot to do with facing the opposing team's best pitching: the speed of your base runner is not important if the subsequent hitters are not getting hits. I actually fit a poisson regression with expected runs scored subsequent to the runner reaching base as the response, and interestingly the expected runs scored with the PR when trailing by one were lower than the expected runs with no PR. The effect was extremely small and not statistically significant, but with the size of my sample I wouldn't expect to see that. I wonder if it has to do with a different strategy being employed when trailing - the PR tries to steal 2nd base, or the manager tries to bunt him over to 2nd base. Either of these moves certainly decrease the expected runs scored in the inning (although possibly increasing the chance of scoring at least one run).

By not allowing for an interaction between I.PR and player, which would measure how fast the original base runner is, I've basically assumed that David Ortiz has the same speed as the average player (at least, average among those who have ever pinch run for - which is almost everybody at one time or another). But the variation in each of these interaction estimates would be even bigger than the variation we're now seeing in the player estimates, so I didn't bother with this here; I will have to include this is my final analysis in part 3 though, perhaps averaging several players together to more precisely estimate an overall effect of pinch running for good hitters. Anyway, because Ortiz is unusually slow, you can probably safely take away a little bit from Ortiz's probabilities and add a little bit to his PR's.

This wraps up step 1. I've had to make some simplifications and assumptions to fit this logistic model (some of which I might have to try to relax in my next post), but at least the data have allowed me to show that pinch running significantly increases the chance of scoring.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at "www.retrosheet.org".

Statistics in Baseball

Saturday, July 3, 2010

Are pinch runners effective? Part 2 of 3

No comments:

Post a Comment