Walk Like a Sabermetrician: November 2011

Tuesday, November 15, 2011

IBA Ballot: MVP

My position on pitchers as MVP candidates is pretty simple: I think they absolutely should be considered. However, that doesn’t mean it’s a common occurrence for me to conclude that a pitcher was the MVP of his league. In general, I think that given modern workloads, it is much more likely for a batter to be the MVP than a pitcher. Additionally, when I conclude that a pitcher and a position player are indistinguishable in terms of value, I will usually hedge my bets and go with the batter. A corollary to this is that I’d like the pitcher’s peripheral statistics to indicate that he is equally or more valuable than his batting rivals, not just his actual runs allowed. This is a higher hurdle to clear, since the best pitchers in terms of runs allowed are more likely than not to have outpitched their peripherals.

The end result of this thinking is that somewhere around 2-4 pitchers are sprinkled through my MVP ballot, but rarely is one listed at #1. I’ve been formally writing up my ballots for this blog since 2006, which gives me ten league-seasons with which to quantify my thought process:

As you can see, on average I list three pitchers on my ballot, with the leading pitcher placed fourth. Obviously I’m biased, but I think this is a very fair treatment of pitchers.

All of this bloviating and laughably in-depth analysis of my own previous ballots is necessary because, for the first time since I’ve been doing this, there is a popular movement to vote a starting pitcher as MVP. I want to make it clear that, and I think I have, that if I don’t feel that Justin Verlander was the AL MVP, it’s not because of some bias against pitchers, but simply that I felt other player(s) were more valuable in 2011.

Verlander has gained traction as a candidate for two reasons. One, he pitched for a playoff team, and heavens knows that mainstream types will bend over backwards to try to give the MVP to a player whose contributions were “actually valuable”, or whatever argument they’d like to use to dismiss players whose teammates just weren’t that good. It also helps that of the AL playoff teams, Detroit was something of a surprise (they were certainly the most surprising to me, although the voters would probably give that nod to Tampa Bay), and they made a strong surge in August and September to run away with their division. That’s a good narrative.

Secondly, Verlander’s W-L record is very impressive (24-5), and we all know that the mainstream still is easily distracted by a shiny W-L record. And oh yeah, third, he pitched very well by any measure.

That last point, though, is where I’m not as enthusiastic about Verlander. The mainstream view is that Verlander was obviously the AL’s best pitcher in 2011--my view is that he was a solid #1, but Jered Weaver can’t just be laughed off. Verlander’s season is not historic by any means when viewed through the lens of RAR--for last five seasons, the AL pitching RAR leaders totals have been:

72, 84, 95, 76, 84

Verlander’s 84 is very good, but the average of the previous four AL leaders was 82. It’s a fairly typical league-leading type of performance, a very solid Cy Young-type season, but not one for the ages either.

However, I have Jose Bautista at 82 RAR/63 RAA, I don’t see any compelling reason to penalize him for his defense or baserunning (UZR doesn’t think much of him, but Dewan’s DRS and Wyers’ FRAA don’t share that evaluation), and I don’t care that his team finished in fourth place. Verlander does not look nearly as good when evaluated by dRA, and so when there’s reasonable doubt that the pitcher was more valuable than the position player, I side with the position player.

I also have placed Verlander’s teammate Miguel Cabrera ahead of him, albeit with much less conviction. Cabrera’s offensive value is essentially indistinguishable from Bautista’s--I estimate that Cabrera created 137 runs in 376 outs while Bautista created 134 runs in 363 outs (9.3 to 9.4 RG, 71 to 70 HRAA). However, Cabrera played first base and there’s reason to believe he’s a below-average fielder, putting Bautista ahead. Compared to Verlander, though, I think the case can be made that he was a little more valuable.

Among the other position player candidates to fill out the ballots, Jacoby Ellsbury ranks first in RAR, plus fielding and baserunning would seem to work in his favor. Adrian Gonzalez was right behind his teammate in RAR, and has a good fielding reputation and a decent showing in fielding metrics.

The other three spots all go to second baseman. I suppose one can argue that the positional adjustments I use are too kind to second basemen, but I just happen to think there is a collection of very talented second basemen in the AL at this time. Dustin Pedroia was just behind Ellsbury and Gonzalez in RAR. Curtis Granderson (56 RAR) and Mike Napoli (56) rank ahead of the trio of Robinson Cano (53), Ben Zobrist (52), and Ian Kinsler (50), but Granderson’s fielding raises at least a little concern. Napoli’s RAR gives him a full catcher position adjustment, but he actually played nearly as many games between first base and DH (53) as he did as a catcher (61). While his 8.5 RG ranked third in the AL behind Bautista and Cabrera, he also logged just 427 PA.

Among the three remaining second basemen, the offensive differences are small enough to throw a bone to Kinsler’s well-regarded fielding (at least by the various metrics)and baserunning, while keeping in mind that Zobrist like Napoli also played a fair amount at less demanding positions. Evan Longoria will probably get a lot more love from others, but he ranks 16th on my RAR list and would require more fielding credit than I am comfortable with (or a repudiation of the position adjustment for 3B relative to 2B) to make the ballot:

1) RF Jose Bautista, TOR
2) 1B Miguel Cabrera, DET
3) SP Justin Verlander, DET
4) SP Jered Weaver, LAA
5) CF Jacoby Ellsbury, BOS
6) 1B Adrian Gonzalez, BOS
7) 2B Dustin Pedroia, BOS
8) SP James Shields, TB
9) SP CC Sabathia, NYA
10) 2B Ian Kinsler, TEX

In the National League, there is no need for philosophical reflection about the value of a pitcher versus a position player, or any need for intricate comparisons of multiple players. There is only one question that needs to be answered: Can you make a case against Matt Kemp?

Kemp led NL hitters in RAR by 12, and was in a tied Ryan Bruan for the league lead with a 8.5 RG. His fielding is probably not great, but since no one else was particular close in RAR, you’d have to think he was pretty bad and that Ryan Braun or Prince Fielder or Jose Reyes was really good in the field to close the gap. I don’t see any reason to believe that, so Kemp is my runaway choice as NL MVP.

Filling out the rest of the ballot, Ryan Braun is a very strong candidate for #2. The three pitchers (Halladay, Kershaw, and Lee) that were very close for the Cy Young are all strong mid-ballot choices. Prince Fielder was very good, but inferior to his teammate at the plate and he’s not a strong candidate for fielding and baserunning credit. Jose Reyes and Joey Votto are also in the mix.

As you can see, I’m having trouble finding much to say about the NL ballot. My RAR list actually makes it pretty straightforward; obviously small differences are not meaningful, but I don’t see a lot of compelling reasons to step in and make changes. The only player who drops far below his RAR is Lance Berkman, who obviously is not much of a fielder at this point and who I would be loathe to argue was more valuable than teammate Pujols. And that leaves him without a spot:

1) CF Matt Kemp, LA
2) LF Ryan Braun, MIL
3) SP Roy Halladay, PHI
4) 1B Prince Fielder, MIL
5) SS Jose Reyes, NYN
6) 1B Joey Votto, CIN
7) SP Clayton Kershaw, LA
8) SP Cliff Lee, PHI
9) 1B Albert Pujols, STL
10) SS Troy Tulowitzki, COL

Wednesday, November 09, 2011

IBA Ballot: Cy Young

In retroactively evaluating starting pitchers, I start with their actual runs allowed (crudely adjusted for bequeathed runners to produce what I call RRA). I consider peripherals, primarily what I call eRA (basically a component RA) and dRA (a DIPS RA). However, I do not start with either of those, and if there is a substantial difference in RRA, I usually don’t override it lightly. I’m not sure that this stance makes much of a difference in this year’s Cy Young vote, at least at the top of the ballot--the top guys fare well however you slice it, but it does put me at odds with anyone following what could be called the Fangraphs school of pitcher evaluation.

Everyone has handed the AL Cy Young to Justin Verlander, but consider this:

I don’t think that, looking at these categories, you can come to any sort of clear conclusion about who was the better pitcher. The first guy pitched sixteen more innings, but he allowed .15 more runs per game, so when you compare them to a baseline, they are just about even. The first pitcher had a better eRA, which is a positive, but the second pitcher didn’t grossly outpitch his peripherals. All things outside of this chart being equal, I’d give the edge to the first pitcher, but I would hardly consider it a landslide.

As you probably know, the first pitcher is Justin Verlander; the second pitcher is Jered Weaver. Weaver also trailed Verlander in dRA (3.44 to 3.75), which I purposely omitted in order to make a point, and obviously Verlander has the win-loss angle going for him in the mainstream. I have no qualms about putting Verlander first on my ballot, but Weaver ensured that he didn’t run away from the rest of the AL field.

James Shields has a fairly large lead for the third spot on the RAR list at 77, with Sababthia fourth at 66 and three pitchers tightly clustered just below (Romero 62, Haren 62, Beckett 61). Shields and Romero both benefitted from low BABIPs (.259 and .247). Shields’ Rays teammates did lead the majors in DER by a wide margin; it wasn’t just him who was getting great defensive support. Still, as discussed above, given Shields’ sizeable RAR lead over the others, I’m more comfortable giving him the nod. It is enough to drop Romero out of the running for the last spot on the ballot, which comes down to Haren and Beckett.

Haren worked 45 more innings, but Beckett’s RRA was .51 runs lower and his eRA was .42 runs lower. However, Haren’s dRA was .24 runs lower, and since the peripherals are a split decision, I’m more comfortable going with the guy who worked a lot more. Fried chicken was not a factor in this decision:

1) Justin Verlander, DET
2) Jered Weaver, LAA
3) James Shields, TB
4) CC Sabathia, NYA
5) Dan Haren, LAA

The NL Cy Young is very close. Consider these two pitchers:

It would be tough to get much closer than that, wouldn’t it? While it appears that Clayton Kershaw will win the award and that Roy Halladay is the consensus #2, the top line on that table is Kershaw and the second line is Cliff Lee. Lee’s season is nearly indistinguishable from Kershaw’s in the categories that drive my decision. Halladay’s same categories line: 234, 2.48, 2.70, 2.71, 43, 73.

This race is close enough that I decided to take a look at each pitcher’s performance on a game-by-game basis, using the relatively crude gW% I discussed in this post. However, looking at each game on its own does little more than verify that these pitchers were all very close: Lee leads the way at .685, but Halladay at .680 and Kershaw at .679 are right behind.

We could consider strength of schedule. On the team level, and considering just the opponent’s overall quality rather than isolating opposing offense as would be more useful for comparing pitchers, my crude team rankings indicate that PHI and LA played nearly the same caliber of opposition--PHI has a 95 SOS and LA a 94. Baseball Prospectus’ data on quality of opposing hitter reveals that Halladay’s average opponent hit .260/.330/.413, Kershaw’s .263/.327/.416, and Lee’s .266/.332/.423. Respectively, those lines translate to approximate runs/game of 4.69, 4.67, and 4.83. But over 233 innings, even the difference between the high (Lee, 4.83) and the low (Kershaw, 4.67) is just 4 runs, and those figures probably shouldn’t be applied without any sort of regression.

From the game-by-game analysis, I can also compute the pitcher’s personal park factor weighted by innings pitched in each park rather than assuming that each pitcher logged a 50/50 home/road innings split. My standard park factor for PHI is 101 versus 97 for LA. Halladay and Lee’s personal park factors are both 101, while Kershaw’s is 96, making any sort of deviation from just using the team PFs an exercise in futility.

I put next to zero stock in win-loss record, but Kershaw’s is 21-5 mark is obviously more impressive than Halladay’s 19-6 and Lee’s 17-8 when compared to their team’s winning percentage. The pitcher’s run support (from ESPN.com) were 5.89, 5.52, and 4.95 respectively, which helps explain why Lee’s record lags behind the other two, but does next to nothing to help us sort out how effective they all were.

In the end, I give the nod to Halladay--he led the three pitchers in all three run averages, and he does have a 5 RAR lead. That doesn't prove he was better, but I have no reason to override it. I think that a reasonable person could easily conclude that Kershaw or Lee deserved the award as well.

For the other two spots on the ballot, the RAR list highly recommends Ian Kennedy (61 RAR) and Cole Hamels (59), as Tim Lincecum is sixth on the list a full eight runs behind Hamels. Hamels has slightly better peripherals than Kennedy, and while batted ball metrics are of questionable value, he does much better in those categories than Kennedy. In this case it confirms my default position (Hamels > Kennedy), and so I’ll give in and fill out my ballot as follows:

1) Roy Halladay, PHI
2) Clayton Kershaw, LA
3) Cliff Lee, PHI
4) Cole Hamels, PHI
5) Ian Kennedy, ARI

Walk Like a Sabermetrician

Tuesday, November 15, 2011

IBA Ballot: MVP

Wednesday, November 09, 2011

IBA Ballot: Cy Young

Me, Elsewhere

Analysis Links

Reference Links

Blog Archive

OSU Baseball

End of Season Statistics

Win Shares Walkthrough

NL 1876-1881 Series

Labels

About Me