Platoon Split Projection Project 2009
Everyone knows that that left-handed hitters generally hit better against right-handed pitchers, and right-handed hittesr hit better against left-handed pitchers. Still, players actual career/last season splits are so varied that we sometimes latch onto those -- "hey, this righty actually hits RHP better than LHP! He's no platoon candidate," or, "this lefty-hitting rookie is so bad against LHP, he's a career platoon guy." As you might expect, much of this is generated from small sample sizes. But we also have the correct intuition that, although the general rules about platoon splits hold true, each player has their own particular split in terms of the "direction" and size of the split. With that in mind, I have taken a group of players and applying a simple method roughly following a suggestion from The Book in order to project what their "true" splits can be expected to be going forward. At the end of the season, we'll check in and see how the method did.
I initially planned on doing this in a very "precise" manner with weighted averages for each player, dividing by leagues, etc., but my first forays were such a pain, and the potential marginal "reward" so small, that I decided to go with something more simple, and in a sense, that's even more interesting, since we are dealing with regression to the mean. Here's the "clue" from The Book (page 163) that I'll be following:
A right-handed hitter needs around 2,000 plate appearances against left-handed pitchers before his measured platoon spltis can be reliable (in other words, using the measured platoon split is more accurate than assuming has an average split). From a practical standpoint, right-handers are best assumed to have average platoon skills, unless one is willing to make the calculations needed to accurately estimate a player's platoon skill. For lefties, the number is about 1,000, which means that only veteran starters have reliable platoon splits.
What I've done is simple enough. Taking a couple ideas of my own and some reader suggestions, I got a group of players with varying sorts of splits, and simply took their career splits from baseball-reference.com. I did this on Monday, so I'm assuming that no one will think that one week of 2009 stats is going to "contaminate" this oh-so-precise study. Next, I took a weighted average of the average split from the AL from 2005-2008 from the league pages. Why just the AL? Well, I first did all of MLB, but the large number of pitcher PAs from the NL makes the numbers really weird... If I knew a way to get "double splits" with position and handedness out of B-R.co or even my BDB database, I would have done that, but for now, well, we're going to have to use the AL splits for everyone... sorry, NL folks... The AL pitchers do have some PAs, but it's not that significant.
The stats I am going to use for this study, for each player, are on-base percentage (OBP) and weighted On-Base Average (wOBA --the original version, for ease and consistency of calculation). I will be projecting the percentage of each player's split, as 1) this isn't a column about projection in general, and 2) as noted in The Book, better hitters generally have larger splits.
Here are the weighted (10-9-8-7) averages I calculated for the 2005-2008 AL, expressed as percentage of the split relative to average OBP/wOBA.:
| RH OBP Split | 6.8% |
| LH OBP Split | 7.2% |
| RH wOBA Split | 5.2% |
| LF wOBA Split | 7.9% |
As I said before, the subjects for this study were suggested mostly by readers, with a couple of suggestions of my own. Alex Gordon is a young left-handed hitter with a typically large split -- it will be interesting to see how that works itself out as he (homerism alert) destroys the AL Central, especially Cleveland, over the next few years. Miguel Olivo hits like a catcher who doesn't hit very well, but he has mashed lefties so far in his career. Will that continue? Joe Mauer is pretty good, I guess. He was suggested because he had a reverse split in 2008, although that isn't typical of his career. Ichiro! has a "reverse" split for his career, although that hasn't persisted as much, lately. Ryan "Out of Nowhere" Ludwick and Evan Longoria are righty-hitters with very big reverse splits, as is Matt Holliday although his is not as large. Akinori Iwamura and Ian Stewart are lefties with reverse splits -- in Stewart's case, it is really big (although in a small sample), and this was also the case in the minors (I've only used major-league numbers here). Here's the data with an explanation of the columns following.
| Player | Bats | PA | OBPspl | OBP% | xOBP% | wOBAspl | wOBA% | xwOBA % | Rel. |
| Alex Gordon | L | 352 | .057 | 17.2% | 9.8% | .053 | 15.7% | 9.9% | .260 |
| Joe Mauer | L | 762 | .056 | 14.0% | 10.2% | .062 | 16.1% | 11.4% | .432 |
| Ichiro Suzuki | L | 1677 | -.014 | -3.7% | 0.4% | -.016 | -4.3% | 0.2% | .810 |
| Ryan Ludwick | R | 505 | -.044 | -12.7% | 2.9% | -.038 | -10.1% | 2.7% | .295 |
| Ian Kinsler | R | 424 | .005 | 1.4% | 5.9% | .012 | 3.2% | 5.4% | .380 |
| Matt Holliday | R | 670 | .002 | 0.5% | 5.2% | -.024 | -5.9% | 2.9% | .537 |
| Akinori Iwamura | L | 402 | -.014 | -3.9% | 4.0% | -.011 | -3.3% | 4.7% | .471 |
| Miguel Olivo | R | 620 | .057 | 20.7% | 10.1% | .090 | 29.7% | 11.5% | .446 |
| Ian Stewart | L | 72 | -.075 | -21.9% | 5.2% | -.112 | -32.3% | 5.2% | .225 |
| Evan Longoria | R | 146 | -.047 | -13.4% | 5.4% | -0.39 | -10.0% | 4.8% | .163 |
Table Key:
- PA is career plate apperances vs. LHP only.
- OBPspl and wOBAspl are the career OBP/wOBA split of the player. Negative numbers indicate a reverse split.
- OBP% and wOBA% are what the career split as a percentage. Negative numbers indicate a reverse split.
- xOBP% and xWOBA% are the expected "true" split percentage. Negative numbers indicate a reverse split.
- Rel. is a sort of like a Marcels reliability score, in that it is based on the number of PAs vs. LHP the player has as a percentage of his total Pas vs. LHP + PAs of lgAvg. They should be taken as purely "relative" to one another in this context however, not as anything like a "percentage chance this projection is right." I don't know how helpful they are. The reliability scores of lefties and righties should only be compared to one another, as well, since 1) the are regressed a different amount, and 2) there is a greater variance of platoon skills amoung left-handed hitters than right-handed hitters. I thought about doubling all the right-handed Rel. scores, but I'll let people figure it out. I thought about leaving them off entirely; I hope they don't distract from the main point of the piece.
There you have it. Those are some rough projections for the "true" platoon splits of these players based on available major league data. Note that most of them are "curious" cases, for what it's worth. Also note that all of these players coming very close or being way off of their projections doesn't either "prove" or "disprove" the theory -- 10 players over one season is a ridiculously small sample size. I'm just interested to see how it turns out. I didn't get as specific as I'd originally wanted, but it's a start. If most of them end up closer to the projected split rather than their career split, I'll be pleased.
Now I'll have to remember to check back in on these guys at the end of the season...
Update April 16: changed some labels to make things a bit less confusing.
1 recs |
6 comments
|
Comments
Great work
I fully agree that with the vast majority of players, we should expect platoon splits to regress to the average platoon split over a large sample of data. But I think there are some players who maintain consistent extreme platoon splits over many years. For instance, Miguel Olivo has 2232 career PA’s over 8+ seasons. He’s consistently had extreme platoon splits. Should we really expect him to start regressing to the mean? As I mentioned elsewhere, we should expect the vast majority of pitchers to have HR/FB rates around 11% over a large sample of data. But some few pitchers consistently have lower rates. Mariano Rivera, for instance has a career HR/FB of 5.6% and has never had a season over 7.5%.
Now the question is what is the threshold? How much data do you need before you know that Player X is an exception to the general rule. I have no answer.
The immoderate moderator
by NYRoyal on Apr 16, 2009 10:16 AM PDT reply actions 0 recs
The Book says
the threhold is ~2,000 PAs vs. LHP for righties, 1,000 vs. LHP for Lefties, and 600 PAs for switch hitters, give the relative variance of platoon skill
Bringing you more-or-less replacement level analysis and commentary to Driveline Mechanics and elsewhere since sometime in 2008.
by devil_fingers on Apr 16, 2009 11:27 AM PDT up reply actions 0 recs
So if Olivo does regress to the mean
Is he in for a TPJ-like season? Or will his righty numbers go up since they are so bad and sort of even it out? You say 1000 PAs for lefty-lefty, I assume that holds for righty-righty? If that’s so, Olivo has his “true” righty split, but could still regress on the lefties. I was scared watching him swing the bat this season already, but now I am terrified.
by AxDxMx on Apr 16, 2009 1:14 PM PDT reply actions 0 recs
no, it's 2000 vs. LHPs for RHPs
and keep in mind that I’m this just estimates the players platoon skill/split, not how they’ll do. In general, with a guy with a career split as large like Olivo’s, you’d expect him to be worse against LHPs and better against RHPs.
On other words, to “apply” this (and remember that a regression method like this will work better for larger oups of hitters on average than for one or two hitters in general — hence the reliability scores), you’d first project (or take someone else’s projection) for how Olivo will hit, then apply the estimated split to his projected numbers. That’s crude, but let me run you through a simplistic version of this. And I’m way out on a limb here, so please don’t quote me on this.
Marcel projects Olivo to have an OBP of .285 this year (believe it or not, this is by far the highest projection for his OBP I’ve seen). I’m guess you’d want to estimate what percentage of LHPs this is based on, but I’ll just go with 30% to make my point (I bet it’s higher for Olivo) -projection systems (wisely, for a number of reasons), generally don’t look at this stuff much, as far as I know. I’ll just go with the 10% figure (it’s 10.1%, but I’m lazy) .So then take 30% of 10% - or 3%. So against RHP, you can one woudl say he’d OBP 3% less than that .285 figure, or about .277 (coincidently, closer to what most projection systems have him at). Against LHP, he’s be 7% higher, for a Bondsian .305 OBP.
That’s really rough and quick (twss), but I hope it makes a bit of sense — that’s why I want to stay away from giving a specific projectoin for the “amount” of the split.
Feel free to keep asking, and I’ll try to clarify.
Bringing you more-or-less replacement level analysis and commentary to Driveline Mechanics and elsewhere since sometime in 2008.
by devil_fingers on Apr 16, 2009 2:29 PM PDT up reply actions 0 recs
sorry for the cross-out
Bringing you more-or-less replacement level analysis and commentary to Driveline Mechanics and elsewhere since sometime in 2008.
by devil_fingers on Apr 16, 2009 2:29 PM PDT up reply actions 0 recs
I get it now
I was reading the expected columns and noticing Olivo’s were lower than the actual by a significant amount while others weren’t so big. I misread that into a collapse at the plate, when all it meant was the “true” splits are estimated to not be as wide as they have been.
The application on the Marcel numbers cleared it up.
by AxDxMx on Apr 16, 2009 5:49 PM PDT up reply actions 0 recs














