Driveline Mechanics: An SB Nation Community

Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Around SBN: Has Kentucky Improved Since the Non-Conference Season?

Revisiting the Myth of the RBI Guy, Part One

1993_world_series_game_6_joe_carter_television_graphic_mediumIf that's one of my worst mistakes -- to sign somebody who has 20 home runs and 97 RBIs -- I'll take it.

-Royals' General Manager Dayton Moore, on Jose Guillen's 2008 performance.

Everyone knows that RBI are, if not worthless, way down the list of ways to judge a players offensive performance, right? There's tons of stuff written on it, but here is a really good, non-technical piece written by America's Greatest Living Sportswriter, Joe Posnanski.

Nonetheless, it is hard not to be impressed by RBI numbers, to see a player as an "RBI man" who produces runs. To get that man runs across the plate, he has to have some skills, right? Sure, he's getting more opportunities because he's in the middle of the order, but a truly terrible hitter wouldn't have driven in all those runs, right? Let's take a look-see.

Star-divide

In Baseball Prospectus' Baseball Between the Numbers, Jonah Keri takes an illuminating angle on this issue. He takes all the individual 100 RBI seasons since 1972 (I'm not sure why -- perhaps the introduction of the designated hitter rule?) and divides the RBI by the player's VORP (BP's Value Over Replacement Player stat) for that season. The higher a number of RBI/VORP a player has, the more overrated his value is by his RBI. The results are predictable -- famous "clutch hitter" Joe Carter having the highest number of most overrated 100 RBI seasons since 1972. It's a good piece and approach to the "RBI issue."

However, there are significant problems with VORP, both in its approach to positional adjustments and its incorrect weighting of walks, home runs, and doubles. Keri's approach is straightforward and illuminating, but can be improved by using a better measure of offensive value.

What would that measure be? Readers of this space arely likely anticipating wOBA as my answer, while while it's on the right track, we're looking for a counting stat -- runs created above some baseline. Fortuneatley, just like there's a relationship between batting average and total hits using at-bats, there's a relationship between wOBA and runs created above average (wOBA is basically  linear weights expressed as a rate stat). At FanGraphs, they call this "wRAA, here," I call it BRAA since I'm not sure if my results are exactly the same, and I use different park adjustments. The "1.15" is there because it is what forces wOBA onto an OBP scale, and to convert to runs above average, that needes to be undone (the custom lineare weights script used here and at FanGraphs uses a customized "wOBAScale" for each season).

Thanks to previous work by Tom Tango, Colin Wyers, and terpsfans' park factors , I was able to calculate BRAA for all seasons back to 1955 (Thanks, guys!).

To get my lists of "most overrated" and "most underrated" seasons by RBI, I simply followed Keri's lead and divided the players RBI by his BRAA (Keri used VORP, of course). The more RBI per BRAA a player has, the more his RBI overstate his offensive contribution. This is a "junk stat," of course, but while there might be more  sophisticated ways of doing it, I like it for its straightforward simplicity. The problem, of course, is that many players on the list are below average, which means I'll end up with a negative number for some bad players with lots of RBIs -- we want a positive number for them. So I had to add something proportionally do each player so that they would all end up "positive." I could have picked any number to add, but I decided instead to go with pseudo-"replacement level" (pseudo because there are no replacement level hitters or fielders, only replacement level players) by adding in 22.5 runs for every 700 PAs each player had that season. This also parallels Keri's piece, which works not from average, but from replacement level (defined differently, of course, and with positional adjustments with which I disagree). Still, there was one particularly "great" season that came up negative -- Keri ran into this problem with RBI/VORP as well, so I adopted his solution -- to simply add 10 to the number I got. So, the final version of the junk stat is RBI/(BRAA+(PA*(22.5/700))+10) = RBI per BRAR (Batting Runs Above "Replacement").

Here are the results. There are separate sheets for the Most Overrated RBI Guys 1973-, Most Underrated, and 2008's 90+ RBI Guys sorted by Overrated to Underrated. The columns are mostly obvious: wOBA is each players' unadjusted wOBA for that season using the custom weights, BRAA is his park-adjusted linear weights runs created above what a league-average hitter would have created in the same number of plate apperaances, and RBIperBRAR is RBIs per BRAA scaled as discussed in the previous paragraph. Again, the higher the RBIperBRAR number is, the more a players RBI numbers (generally) overstate his offensive ability.

There is much to "admire" here, particularly some wonderful performances from the Most Overrated list. Let's extract a few for discussion:

Player Year Team PA wOBA BRAA RBI RBI/BRAR
Tony Batista 2003 BAL 670 .283 -22.6 99 11.07
Joe Carter 1997 TOR 668 .297 -18.8 102 8.05
Pedro Feliz 2006 SFN 643 .299 -17.9 98 7.68
Tony Batista 2004 MON 646 .305 -15.0 110 6.98
Vinny Castilla 1999 COL 674 .344 -14.0 102 5.77
Joe Carter 1990 SDN 697 .299 -11.5 115 5.50
Jeff Francoeur 2006 ATL 686 .308 -12.4 103 5.24
Sammy Sosa 1997 CHN 694 .329 -3.8 119 4.17
Sammy Sosa 2007 TEX 454 .332 -1.5 92 3.98
Paul O'Neill 2000 NYA 628 .329 -4.1 100 3.83
Tino Martinez 2000 NYA 632 .325 -6.5 91 3.82

 

The first few are simply the "top"seasons, then I picked some that looked interesting. Starting at the bottom: I probably don't hate Paul O'Neill enough to be a real non-Yankees fan, but man, do he and Tino's ever illustrate  how much batting order effects RBI numbers: both were clearly below average, but O'Neill hit right behind Derek Jeter (.416 OBP in 2000) and Martinez behind Bernie Williams (.391 OBP).

I included Sammy Sosa on this chart not because I want to pick on him or anything, but I just found it interesting that he had two seasons this high up on the Overrated chart, and they pretty well bookend his incredible run from 1998-. Jeff Francoeur probably doesn't care about being on the list, since, unlike RBI (and OBP),  they don't put RBI per BRAR or wOBA on the scoreboard. Still, it's sort of sad (in a "ha-ha" way) to think that Frenchy's got a long way to go to even return to his .308 wOBA of 2006.

Vinny Castilla, well, there's a shocker. There's no need to point out that Pedro Feliz is a bad hitter. In fact, he's about an average player once you factor in his excellent defense. I just thought it was curious that so many third basemen were at the top of this list.

Finally, we come to one classic example and my persoinal favorite.

The "classic" overrated RBI guy is Joe Carter. Bill James tackled how overrated Carter was due to his reptuation for "clutch-ness" in the New Historical Baseball Abstract, and his take on it is in many ways unsurpassed. Still, even James may have overrated Carter as a player, as he doesn't even make Rally's Top 300 Position Players since 1955 by WAR list. Not that he didn't have some good seasons... but we aren't dealing with those. Here's a guy who drove in 102 runs hitting in the middle for the order for Toronto in 1997 with a .297 wOBA. You don't like wOBA? You think a .284 OBP might be okay? How about this -- he drove in 102 runs in 1997 and 115 in 1990 in  while not even slugging .400 either season.

While Carter makes the point about the "value" of a "guy who just knows how to drive in runs" the best because of his fame, my personal favorite is the guy I had to do the +10 thing to get on the list: Tony Batista. I loved the goofy batting stance, the willful all-or nothing approach, the nickname given to him by Batgirl. It is a fitting symbol of MLBl's treatment of Montreal's baseball fans* that in the Expos' last season there, their cleanup hitter had a .305 wOBA with a .241/.272/.455. I guess it was better than fellow Blue Jay alumn Joe Carter's 1997.

* And don't give me that "Montreal just wasn't into baseball" nonsense. They  had decent attendence through 1994 and were in first when the strike happened. Ater that, Jeffrey Loria and Bud Selig happened.

 

But Batista's 2004 greatness was nothing compared to what he had already achieved for the Orioles in 2003. This is the season for which I did the +10 modification, and the reason I lowered the bar from 100 to 90 RBI: 99 RBI in 670 PA,  .283 wOBA for -22.6 bRAA and a .235/.270/.393 line. Let's put that .283 wOBA in perspective: Adam Everett has a career wOBA of .290. If that isn't enough to ween a person of RBIs as a measure of offensive value, I don't know what is. Or maybe the Tigers should be hitting Everett cleanup this season...

Perhaps you still think that some players just have a "knack" for situational hitting, i.e., with runners in base or in scoring position, such that a high RBI/bRAA ration speaks in favor of their hitting abilities. I guess you'll have serious issues with the guys on the mmost underrated by RBI list, then. Look who appears 6 times in the top 10. 'Nuff said.

In conclusion, let's look at 2008's players with more than 90 RBI. Look at the bottom, first. Grady Sizemore, Lance Berkman... Albert Pujols. OK, obviously, Sizemore hits leadoff, and Berkman and Pujols had good years even for them, so they might be expected to "out-wOBA" their  RBI numbers. Still, any ranking that has them at the bottom has to make you think that being at the bottom is a good thing, right?

And who is that the top? The worst mistake Dayton Moore will take: Jose Guillen.

Still not convinced? I'll revisit this next week using a more contextual approach to situational hitting to see if there's anything there.

 

Thanks to Sky Kalkman for his stylistic and methodological suggestions on this piece, and to terpsfan for his park factors and instructions on how to implement them. Neither of them, of course, is responsible for my screw-ups.

 

Update, 5/18/2009, 11:40 p.m. EST: At the Book Blog, Tom Tango suggests that it would be better to just use RBI per absolute runs created rather than above average/replacement. I had considered doing so when working on this piece originally, but decided not to for various reasons.

However, Tango’s point is a good one, so I’ve quickly caculated Batting Runs Created. It’s the same formula as above average, but adds in the league average R/PA for each plate appearance:

(((wOBA-lgWOBA)/wOBAScale)/PA)+(lgR/PAPA)

The spreadsheet has been modified accordingly. The old ones are still there and are useful, I think, in that they reflect a similar methdology to that of the Jonah Keri piece that inspired this post. Those that are sorted by RBI/BRC are labeled with "BRC." I didn’t bother to park adjust those numbers, which causes some anomalies, but what I’m going for are the basic groups of players who are "better" or "worse" than their RBI numbers indicate. The players are basically the same. Batista's 2003 is the most RBI-riffic season on both lists. Joe Carter holds the #2 spot in BRC version, but this time its his 1990 season. For 2008, Jose Guillen has "dropped" to the fourth spot and is replaced by... wait for it... Mike Jacobs.

Thanks to Tom Tango and terpsfan for their suggestions, and there will be more on this topic next week (and hopefully in the comments this week!).

0 recs  |  Comment 17 comments |

Story-email Email Printer Print

Comments

Display:

Update

At the Book Blog, Tom Tango suggests that it would be better to just use RBI per absolute runs created rather than above average/replacement. I had considered doing so when working on this piece originally, but decided not to for various reasons.

However, Tango’s point is a good one, so I’ve quickly caculated Batting Runs Created. It’s the same formula as above average, but adds in the league average R/PA for each plate appearance:

(((wOBA-lgWOBA)/wOBAScale)PA)+(lgR/PAPA)

The spreadsheets have been modified accordingly. The old ones are still there and are useful, I think, in that they reflect s similar methdology to that of the Jonah Keri piece that inspired this post. Those that are sorted by RBI/BRC are labeled with “BRC.” I didn’t bother to park adjust those numbers, which causes some anomalies, but what I’m going for are the basic groups of players who are “better” or “worse” than their RBI numbers indicate. Those groups are basically the same.

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 18, 2009 8:38 PM PDT reply actions   0 recs

Jonah Keri probably picked 1972 as the cutoff...

…because at the time the book was published, I believe that’s as far back as you could go and get a compete season of Retrosheet data.

by cwyers on May 18, 2009 10:22 PM PDT reply actions   0 recs

huh... interesting

i thought it might be the introduction of the DH in 1973

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 18, 2009 10:55 PM PDT up reply actions   0 recs

that would make sense too

St. Louis Cardinals... defying win expectancy since 2008

by vivaelpujols on May 19, 2009 2:41 AM PDT up reply actions   0 recs

Mind you I am not trying to undermine your research

but none of those names surprise me. Just watching and knowing the game I could tell you those players were greatly over valued Basically you don’t see true Hall of Fame players on that list. Most or not all of those players were considered to be better players then they were in lineups that provided them ample opportunities thus the higher RBI totals. Put a better hitter in those situations and you might have produced an extra 20-30 per year. You can say what you will about Manny Ramirez but I am sure his numbers reflect him to be an above average producer.

Baseball is God's sport! All Truth Goes Through Three Stages 1.It is ridiculed 2.It is violently opposed 3.Finally, it is accepted as self-evident. kinesiologist

by E5 on May 18, 2009 10:23 PM PDT reply actions   0 recs

right

I wasn’t so much trying to “undermine” the good guys, as to bring up some guys who had horrendous seasons and still h ad a lot of RBI, like Batista’s 2003 and 2004 campaigns.

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 18, 2009 10:56 PM PDT up reply actions   0 recs

Though

I think all Phillies fan instantly tuned out of this post at the sight of that picture.

by RollingWave on May 19, 2009 2:59 AM PDT reply actions   0 recs

Re: 1972

It was indeed a data availability issue.

Good piece btw.

by Jonah Keri on May 19, 2009 5:34 AM PDT reply actions   0 recs

Thank you very much!

and thanks for stopping by

I hope it was clear from the above that this piece was intended as a “follow-up” on yours. I finally read BBTN this Christmas right after Kyle asked me to write for Driveline, and it was one of the first ideas that came to me… Then it took me a while to figure out how to do park adjustments to my satisfaction (terpsfan101 really came through).

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 19, 2009 7:03 AM PDT up reply actions   0 recs

great picture btw

ultimate proof that batter vs. pitcher graphics on tv are the worst things ever. i want to know if they talked about how carter clearly has trouble with williams…

"Sometimes Joe (morgan) doesn't like facts to get in the way of his opinions."- billy beane
"That was a great pick...if this was 2002" Me, to guy who selected Barry Zito in a fantasy draft
www.27ClubPeak.blogspot.com

by harendaman365 on May 19, 2009 10:22 AM PDT reply actions   0 recs

and ive always wondered about that article in BBTN

what have been the best non 100 RBi seasons ever?

"Sometimes Joe (morgan) doesn't like facts to get in the way of his opinions."- billy beane
"That was a great pick...if this was 2002" Me, to guy who selected Barry Zito in a fantasy draft
www.27ClubPeak.blogspot.com

by harendaman365 on May 19, 2009 10:23 AM PDT reply actions   0 recs

hmmm... good question. I might to a post about that later this week, although it wont' be "ever"

that query would take a long time to run

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 19, 2009 11:33 AM PDT up reply actions   0 recs

Well just eyeballing the top 300 on baseball projection

Rickey Henderson had a couple of 10+ WAR seasons with only ~70 RBIs. Joe Morgan was worth 11.8 WAR in 1975, despite only driving in 94 runs.

St. Louis Cardinals... defying win expectancy since 2008

by vivaelpujols on May 19, 2009 1:24 PM PDT up reply actions   0 recs

yeah, one way to do it would just be to get Rally's csv (which I'm considering anyway)

but I could also do it easily here when I take the time — just using lwts

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 19, 2009 1:28 PM PDT up reply actions   0 recs

not that there's a problem with what you suggest

but I just mean in terms of raw offensive value

I'm not a sabermetrician, but I do play one at Driveline Mechanics.

by devil_fingers on May 19, 2009 1:28 PM PDT up reply actions   0 recs

Oh

I see what you’re trying to do.

St. Louis Cardinals... defying win expectancy since 2008

by vivaelpujols on May 19, 2009 5:56 PM PDT up reply actions   0 recs

Comments For This Post Are Closed


User Tools

Welcome to Driveline Mechanics!
Start posting on Driveline Mechanics »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
Pitching Mechanics Retrospective
Small
Why the Blue Jays are a pitching mechanics train wreck.
Small
Brandon League's unusual arm action
Small
Summary: Aroldis Chapman's mechanics.
Small
The Blue Jays: A trainwreck of pitching mechanics.
Small
Re: Analysis of Justin Duchscherer's pitching mechanics.
Website_pic_small
Complete Hitting
Website_pic_small
Complete Hitting
Website_pic_small
CCC's to success in Baseball: Commitment to becoming a Complete Player = Confidence in your abilities.
Website_pic_small
The Eight Essential Pieces to a Smooth Swing

+ New FanPost All FanPosts >


Managers

Me_-_baseball_small Kyle Boddy

Editors

Photo_29_small hazel

Newavatar_small devil_fingers

1753738656_110919ebe9_o_small vivaelpujols