<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nerdbaseball.com &#187; pitching</title>
	<atom:link href="http://www.nerdbaseball.com/tag/pitching/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.nerdbaseball.com</link>
	<description></description>
	<lastBuildDate>Tue, 24 Aug 2010 20:16:12 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Walking the Walk Part II: Pitching and Defense</title>
		<link>http://www.nerdbaseball.com/2009/02/walking-the-walk-part-ii-pitching-and-defense/</link>
		<comments>http://www.nerdbaseball.com/2009/02/walking-the-walk-part-ii-pitching-and-defense/#comments</comments>
		<pubDate>Wed, 25 Feb 2009 14:00:30 +0000</pubDate>
		<dc:creator>Prof. Nerdtron 3000</dc:creator>
				<category><![CDATA[Nerdtron's computer]]></category>
		<category><![CDATA[defense]]></category>
		<category><![CDATA[Diamondbacks]]></category>
		<category><![CDATA[ERA is a bad stat and should die]]></category>
		<category><![CDATA[Errors are meaningless]]></category>
		<category><![CDATA[phillies]]></category>
		<category><![CDATA[pitching]]></category>
		<category><![CDATA[Rangers]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.nerdbaseball.com/?p=232</guid>
		<description><![CDATA[A baseball team has two jobs.  One is to score runs, and the other is prevent runs.  If we look at how teams score and prevent runs, then we can measure player&#8217;s values directly in the runs they contribute.  Nerdbaseball has a model for run scoring, now we need a model for run prevention. Regression [...]]]></description>
			<content:encoded><![CDATA[<p>A baseball team has two jobs.  One is to score runs, and the other is prevent runs.  If we look at how teams score and prevent runs, then we can measure player&#8217;s values directly in the runs they contribute.  Nerdbaseball has a model for run scoring, now we need a model for run prevention.</p>
<p><span id="more-232"></span><em>Regression in a nutshell</em></p>
<p>To look at run prevention, I went to the same regression approach as before.  I like this because it lets us use what Doc Nerd calls parsimonious stats.  They are stats easily counted and hopefully found in the box score.  The other nice thing about regression is that the output is pretty straightforward.  It&#8217;s the y = mx + b thing that comes up in high school.  In a regression, y is your predicted outcome, and x is your predictor.  There can be multiple predictors, leading to a formula like y = m1x1 + m2x2 + m3x3 + b.  The coefficients (m1, m2, m3, etc.) tell you how to weight each of the contributing factors.</p>
<p><em>The boring part</em></p>
<p>Looking at pitching and defense ended up opening up a lot of interesting paths for studying.  These paths are arranged in layers.  Just as the bottom tier of the standings is the least interesting, looking at the foundation is a lot like looking at the Nationals.  What I found was a lot of common sense: a team that wants to prevent runs needs to (A) not give up hits, (B) not give up walks, and (C) keep the ball in the yard (Nerdstradamus: this time we used best subsets to get the order).  This simple formula will explain over 90% of the variance in allowing runs, with a high degree of confidence.  That 90% is about as good as we are with offense, and is generally considered very high.</p>
<p>Here&#8217;s the raw output:</p>
<p></p>
<table class="wptable rowstyle-alt" id="wptable-7"  cellspacing="1">
	<thead>
	<tr>
		<th class="sortable" style="width:100px" align="left">Event</th>
		<th class="sortable" style="width:30px" align="center">Coeffecient</th>
	</tr>
	</thead>
	<tr>
		<td style="width:100px" align="left">(Intercept)</td>
		<td style="width:30px" align="center">-503.58</td>
	</tr>
	<tr class="alt">
		<td style="width:100px" align="left">Hits (no HR)</td>
		<td style="width:30px" align="center">0.66</td>
	</tr>
	<tr>
		<td style="width:100px" align="left">BB</td>
		<td style="width:30px" align="center">0.46</td>
	</tr>
	<tr class="alt">
		<td style="width:100px" align="left">HR</td>
		<td style="width:30px" align="center">0.90</td>
	</tr>
</table><p>
</p>
<p>Yes, that model looks like it&#8217;s missing some things to me too, but I want to hold that thought and first look at who over- and under- achieved last year.  The one major over-achiever was the Phillies.  That&#8217;ll be good news for the Mets fans that hang out around here.  The underachievers were the Rangers and the Diamondbacks.  The Rangers gave up 967 runs last year.  That is worst in MLB, and truly horrible.  The model predicts them at 913 runs, which would have been worst in MLB and truly horrible.  The model sees the Diamondbacks as a very good pitching and defense team; it&#8217;s too bad their offense is so unconvincing.</p>
<p><em>The three true outcomes?</em></p>
<p>When I look at this model, I&#8217;m surprised by the parts that aren&#8217;t there.  Namely, strikeouts.  Strikeouts show back up in the second level of the analysis, when I look specifically at what prevents hits.  But for now, teams that allowed fewer runs didn&#8217;t necessarily strike more players out; strikeouts are not a <em>direct</em> predictor of a teams prowess in preventing runs.</p>
<p>Second, errors aren&#8217;t there at all.  Team errors do not predict allowed runs at all (nor do they predict hits allowed, but that&#8217;s for another day).  This takes me to one of my favorite soapboxes.  Earned runs are not a useful statistic.  Any run is a product of the interaction between the pitcher and the defense.  The errors behind a pitcher do not give any meaningful information in how the pitcher fared.  There may be good ways to measure defense, but errors are not among them.  The case will get stronger when I look at hits.  The bottom line is that a pitcher should be uniformly punished for any run that is given up.  That penalty need not be as harsh as currently assumed for an earned run, but the pitcher bears a degree of responsibility for every run that crosses the plate.  A more enlightened approach is slightly more forgiving to pitchers for any run allowed, but indiscriminate on how that run came to pass.  Fantasy baseballers, take note.</p>
<p>Pitchers have direct control over the amount of HR they surrounder and the amount of free passes they give up.  The question now is how much control do pitchers have over the amount of hits that they give up?  That&#8217;s for next time.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.nerdbaseball.com/2009/02/walking-the-walk-part-ii-pitching-and-defense/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
