Fun with Numbers: Strikeouts vs. Ball-in-Play Outs
I was having a discussion on another Reds' board, and came across one of my favorite types of Reds' fans; the "put-the-ball-in-play-and-let's-manufacture-some-runs" guy. He, of course, hated Chris Dickerson. I like Chris Dickerson. He told me that strikeouts were terrible, and productive outs were awesome. I said that strikeouts are bad, productive outs are slightly less bad, and GIDP's are really bad, and that most of the gains that result from productive outs are erased by grounding into double plays. He said, "Nuh uh." I said, "Yes huh."
It actually turned into a really good discussion, with one poster making a great point after I launched into my usual tirade about strikeout rate correlating with Walks, Power, and ever so slightly, run production. The point was that we need to separate strikeouts from players who strike out. There's an inherent selection bias in the correlation rates I cited, as players who strike out have to have those secondary skills in order to make the Big Leagues in the first place.
So, this is my attempt to isolate the effects of strikeouts, and compare them to Ball-in-Play outs. I haven't seen anything like this done by anybody else, so I hope I'm showing you guys something new here. It's long, and it's got a lot of math, all of which I did myself. You've been warned.
First, we need to determine a run value for every type of out in every baserunner situation and every out situation. The most important tool in making this determination is the 2008 Run Expectancy Matrix, as compiled by Baseball Prospectus. Most of you are familiar with this, but for those who haven't seen it, a quick explanation. On the left side is the baserunner situation. 1, 2 or 3 indicates a runner on that base, 0 indicates the base is empty. So, 103 means runners on first and 3rd. On top is the Out situation. The numbers in the middle indicate the average number of runs scored by all teams in all innings (except walk-off 9th innings) throughout MLB in 2008.
So, with this information, we'll determine Run Values for Productive Outs by subtracting the "after" run expectancy from the "before" run expectancy, adding runs as applicable when there's a productive out with a runner on 3rd. For example, for a Sacrifice Fly with runners on 1st and 3rd and 0 out, the Run Value would be 1.77 - (1 + 0.53) = -0.24, where 1.77 is the run expectancy for runners on 1st and 3rd with 0 out, 0.53 is the RE for a runner on first with 1 out, and 1 is added because a runner scored. I'll make the assumption that every Productive Out happens with the same frequency regardless of the Out situation, and thus can take a simple average of the 0 out and 1 out Run Value for a Productive Out in each Baserunner Situation. Here's what I came up with:
100= -0.205
020= -0.26
003= -0.08
120= -0.22
103= -0.02
023= -0.13
123= -0.12
I made the assumption that in 1st and 3rd situations, the man on 1st advances half the time; in 2nd and 3rd situations, the man on 2nd advances half the time; and in bases loaded situations, 1/3 of the time, nobody advances; 1/3 of the time, only the man on 2nd advances; and 1/3 of the time, both men advance. It's probably not perfect, but it's probably close.
This jives pretty well with intuition. The best time for a productive out is with a man on 3rd, since at least you get a run home. However, when the bases are stacked behind him, you get some diminishing returns, as you hurt your chance for a big inning.
Now, we do the same for Strikeouts. This is considerably easier, as we don't have to consider "before" and "after" Baserunner Situations. Here's the results I got:
100= -0.34
020= -0.41
003= -0.58
120= -0.54
103= -0.65
023= -0.71
123= -0.76
So, yeah, productive outs are better than strikeouts. That shouldn't shock anybody. However, there's a 3rd type of out that can also result from putting the ball in play; the dreaded GIDP. The only real assumption I made regarding other baserunners was that any runner in front of the DP advanced 1 base when there were 0 out. I gave the same credit for a DP with a runner on 3rd that I did above. The run values for GIDP:
100= -0.63
120= -1.05
103= -0.91
123= -1.28
So, now that we have Run Values for every type of Out in every Baserunner Situation, we can calculate the weighted average, multiplying Run Value by the Percentage of Plate Appearances in that situation. When we're done, we'll divide that by the aggregate of %PA to determine a Total Run Value. Being very familiar with them, I utilized Adam Dunn's splits for the %PA's for each Baserunner Situation (come on, you knew I wouldn't go an entire post without bringing him up). Here's the Total Run Value for each type of out:
TRV (PO) = -0.19
TRV (K) = -0.46
TRV (DP) = -0.81
So, now, we need to determine the ratio of Productive Outs to GIDP's needed to break even with Strikeouts. Get ready for some Algebra:
-0.81 + -0.19*x = -0.46*(x+1)
x=1.30
We need 1.3 Productive Outs for every GIDP in order to break even. So, how do MLB teams do in that regard?
ESPN actually kept Productive Out stats for a while as Buster Olney tried to convince us that they were some sort of hidden indicator of success. ESPN hid that data once it became apparent that teams with large numbers of productive outs had bad offenses. I found Productive Out data for 2008 buried deep on ESPN's site, though. The average MLB team had 187 Productive Outs in 2008.
GIDP's were considerably easier to find. The average team had 129 of those.
187 / 129 = 1.45
So, the average MLB team does gain a little bit by making ball-in-play outs instead of strikeouts. Let's see how many runs they gain over the course of a season with their Ball-in-Play Outs.
187*(-0.19) + 129*(-0.81) = -140 Runs
(187+129) * (-0.46) = -145 Runs
So, over the course of an entire season, the average MLB team gains 5 runs as a result of their Ball-in-Play outs. The question I leave to you, dear reader, is; Are the benefits of late-count hitting (walks, high opponents' pitch counts, seeing more pitches) enough to offset those 5 runs a team gains by trading strikeouts for Ball-in-Play outs?
2 recs |
28 comments
|
Comments
Good stuff, blee
I haven’t double checked your math yet, but the logic seems reasonable. Probably should do some checks on whether your assumptions about runner advancement are kosher to say anything definitive though. I could see a small variation in those assumptions changing the 5-run differential somewhat one way or another.
"It seems like we're not hitting because we're not getting hits." - Dusty being Dusty
by Slyde on Mar 25, 2009 9:10 PM EDT reply actions 0 recs
Thanks for the promotion!
The 2 areas where I’d appreciate a double check are my assumptions about baserunners moving up, and the percentage of plate appearances for each baserunner situation, as I used Dunn’s splits instead of an overall average. I figured Dunn was as good a guy to use as any, since he’s moved all around the lineup, and he’s played on good offenses and bad offenses.
If you have any questions about anything I did (my weighted average calcs for the Total Run Value are probably not intuitive, though I’m reasonably certain they’re correct), email me.
"Karma - there it was. The meaning of life, straight from Carson Daly's lips to my morphine-laced ears." -Earl Hickey
by BLee2525 on Mar 25, 2009 9:27 PM EDT up reply actions 0 recs
base situations
http://www.baseball-reference.com/pi/bsplit.cgi?team=TOT&year=2008&lg=ML#situa-bases
"It seems like we're not hitting because we're not getting hits." - Dusty being Dusty
by Slyde on Mar 25, 2009 10:13 PM EDT up reply actions 0 recs
Where the pictures at?
I need to see a fat, shirtless guy on a tricycle or some-such to understand all this math. I’m a visual learner, y’know?
"I usually shoot it pretty straight."- a boating enthusiast
by Colin Auscapee on Mar 25, 2009 9:13 PM EDT reply actions 0 recs
He asked for shirtless.
Just sayin’.
"Karma - there it was. The meaning of life, straight from Carson Daly's lips to my morphine-laced ears." -Earl Hickey
by BLee2525 on Mar 25, 2009 9:29 PM EDT up reply actions 0 recs
Bases loaded:

It's all fun and games until someone gets herpes. - Fox 4 News
by PeteyHendrix on Mar 25, 2009 9:38 PM EDT up reply actions 0 recs
Here's the new math:
80 combined years of Whoppers + $1000 = Mack Daddy Mobility!

It's all fun and games until someone gets herpes. - Fox 4 News
by PeteyHendrix on Mar 26, 2009 12:30 PM EDT up reply actions 0 recs
Perhaps.
But how do they steer?
It's all fun and games until someone gets herpes. - Fox 4 News
by PeteyHendrix on Mar 26, 2009 6:13 PM EDT up reply actions 0 recs

My millions are unconventional!
by Charlie Scrabbles on Mar 25, 2009 10:16 PM EDT reply actions 1 recs
my reaction too
this is why I will not be a GM unless I get to bring RR with me
...because there's already someone posing as Jacob Brumfield
by Cy Schourek on Mar 26, 2009 2:06 PM EDT up reply actions 0 recs
great stuff BLee
and yes, i would be one to hastily jump to the conclusion that making the pitcher work and throw 6 pitches for a K is enough to offset the gains made by a first-pitch sac bunt.
My millions are unconventional!
by Charlie Scrabbles on Mar 25, 2009 10:40 PM EDT reply actions 0 recs
Nice work.
I would be willing to bet that there are a lot of people reading this who are like Norman the Android at the end of this sequence.
And Adam Dunn's spirit, ranging for revenge / With Ate by his side come hot from hell, / Shall in these confines with a monarch's voice / Cry 'Havoc,' and let slip the dogs of war...
by Paul Householder on Mar 25, 2009 10:42 PM EDT reply actions 0 recs
And...
It fulfills a personal dream of mine, that every team would have nine Adam Dunns batting for it.
It would drive sMarty absolutely nuts.
And Adam Dunn's spirit, ranging for revenge / With Ate by his side come hot from hell, / Shall in these confines with a monarch's voice / Cry 'Havoc,' and let slip the dogs of war...
by Paul Householder on Mar 25, 2009 10:50 PM EDT up reply actions 0 recs
Do you think their cycles would synch?
and how would they know which Valtrex bottle was theirs?
"I usually shoot it pretty straight."- a boating enthusiast
by Colin Auscapee on Mar 25, 2009 11:07 PM EDT up reply actions 0 recs
Math question
THT just ran a GIDP article this week. There’s either something I’m not getting or the two of you disagree about the negative value from a bases-loaded GIDP. You list -1.28, which if I understand correctly is your average between a 0-out and 1-out GIDP. But the article thinks that a bases-loaded GIDP is even worse than that:
For example, with the bases loaded and one out, the batting team on average will score around 1.6 runs before the inning is over. A double play, of course, ends the inning without any runs having scored, so the cost of the double play is 1.6 runs. Even more costly is a double play that occurs with the sacks full and no outs. In that case, the run expectancy is 2.35 runs. Let’s say the DP goes 5-2-3, leaving runners on second and third with two outs (and no runs scored), a situation with run expectancy of .61 runs. So, the cost of the double play is the difference between those two numbers, or 1.74 runs.
The article’s run expectancy number for 0-outs, bases-loaded could be a typo – it lists 2.35 instead of the 2.305 in BP’s matrix. But that shouldn’t account for this large of a difference, right? Maybe it has something to do with runner advancement assumptions? I don’t know. Great post, btw.
by ken on Mar 25, 2009 11:07 PM EDT reply actions 0 recs
Bases Loaded GIDP
I’ll have to read the article at work, but based on your excerpt, the difference in our Run Values is pretty easy. I assumed a 6-4-3 DP, while THT (probably more correctly) assumed 6-2-3. THT went Home to First, I went 2nd to First. So, my ending Runner State was a man on 3rd, 2 out, 1 run home. THT’s was men on 2nd and 3rd, 2 out, 0 runs home.
"Karma - there it was. The meaning of life, straight from Carson Daly's lips to my morphine-laced ears." -Earl Hickey
by BLee2525 on Mar 26, 2009 6:07 AM EDT up reply actions 0 recs
I'm on the road today
but I wanted to throw in that the difference may be because I think BPro’s RE matrix is only based on a single season. I’m guessing the THT article used multi-years, probably the ones at tangotiger’s site.
"It seems like we're not hitting because we're not getting hits." - Dusty being Dusty
by Slyde on Mar 26, 2009 6:07 AM EDT up reply actions 0 recs
rec'd
Great job, Blee.
"Sometimes I listen for Griffey’s infectious laugh or Dunn’s humor and wit. But they’re gone." - Dusty
by BK on Mar 26, 2009 12:48 AM EDT reply actions 0 recs
Awesome Post
Even though all that Math made my head hurt, you did great job. Baseball Prospectus has a similar article that looks at the correlation between K’s and runs production here (I’m pretty sure it’s a premium article), for those who are interested.
by D-Rock on Mar 26, 2009 10:00 AM EDT reply actions 0 recs
Nicely done.
Your conclusion, though, leaves out important considerations.
The question I leave to you, dear reader, is; Are the benefits of late-count hitting (walks, high opponents’ pitch counts, seeing more pitches) enough to offset those 5 runs a team gains by trading strikeouts for Ball-in-Play outs?
Because:
- You’re asking Edwin Encarnacion to do something he isn’t very good at. If he tries to bunt, or hit behind the runner, or whatever to be “productive”, he’s much more likely than average to fail and the result will be something less (maybe much less) than the numbers you present.*
- You’re completely ignoring the other outcomes. As you state earlier, guys who strike out alot are much better than average at other things (walking, hitting for power). Willie Taveras** is going to strike out much less often than Adam Dunn, but no one in their right mind would rather have him at bat in any situation.
Something else I noticed:
It actually turned into a really good discussion, with one poster making a great point after I launched into my usual tirade about strikeout rate correlating with Walks, Power, and ever so slightly, run production. The point was that we need to separate strikeouts from players who strike out. There’s an inherent selection bias in the correlation rates I cited, as players who strike out have to have those secondary skills in order to make the Big Leagues in the first place.
That’s true, but the opposite is also true: If a player had zero power and doesn’t walk, he better have a good BABIP or else he’s not going to make the big leagues, either. We sometimes think that if you’re not Pujols or Bonds or Williams, you have to “make up” for your deficiencies in other areas; the truth is, those guys are freaks and everyone else is just a good ballplayer.
I think it’s significant that strikeouts are only slightly correlated to run scoring; what it looks like to me is that there’s more than one way to score runs (Ks, BBs and power vs. putting the ball in play).
*What’s funny is that if he hits a sac fly to the warning track, that’s considered a “productive out”, when that wasn’t what he was trying to do at all.
**Willie Taveras must be traded now. His career PA/K? 6.66.
Often wrong, never uncertain.
by sidnancy on Mar 26, 2009 11:07 AM EDT reply actions 0 recs
This is a great way of putting it:
The point was that we need to separate strikeouts from players who strike out.
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
by Sky Kalkman on Mar 26, 2009 12:11 PM EDT reply actions 0 recs
Also, I'm surprised the guy you were arguing with didn't give up after "Yes, huh." Always a clincher.
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
by Sky Kalkman on Mar 26, 2009 12:12 PM EDT up reply actions 0 recs
I was going to Re: to sidnancy, but it may make more sense here
I think that there are folks who hit for power, walk a lot, but K a lot because if someone like Taveras k’ed a lot, he would never make it out of the minors. From a strictly traditionalist’s view, you “live with” the Ks because you get the power. Lots of crappy players strike out a lot. But someone with less power needs to hit with a higer average to get attention, otherwise he’s chaff.
This is, of course, leaving OBP, let alone super-metrics, out of it. But I’ll be honest, I’m not a numbers guy, so this is how I see how a traditional baseball guy would see it. Am I making sense?
...because there's already someone posing as Jacob Brumfield
by Cy Schourek on Mar 26, 2009 2:10 PM EDT up reply actions 0 recs
Another good view.
That reminds me of the conclusion that size doesn’t matter for NFL linemen because small linemen rate just as well as large ones in the NFL. But, well, you just don’t make it to the NFL in the first place if you’re on the small side unless you have mad skills. The non-size skills for small lineman are better than those other skills for big lineman.
In baseball, you’re right, you don’t make it to the big leagues as a high strikeout guy unless you can do other stuff. That other stuff can include hitting things very far, or playing a tough defensive position.
It’s a bit trickier, because some people don’t think that all the other stuff (like walks and defense) make up for the strikeouts, but yeah, I like it.
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
by Sky Kalkman on Mar 26, 2009 3:36 PM EDT up reply actions 0 recs
Wowza...
That’s some really intricate stuff… Nice!
I’ll have to come back and look at it more when I’m not on Spring Break and can actually think again…
People Don't Kill People. Burning Couches Kill People.
by crolfer on Mar 26, 2009 6:08 PM EDT reply actions 0 recs


























