<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dear Science &#187; Stats</title>
	<atom:link href="http://dearscience.org/category/disciplines/stats/feed/" rel="self" type="application/rss+xml" />
	<link>http://dearscience.org</link>
	<description>Seattle's Only Scientist</description>
	<lastBuildDate>Tue, 29 Mar 2011 01:07:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Witness the Magic of Regression Analysis&#8230;</title>
		<link>http://dearscience.org/2008/11/05/witness-the-magic-of-regression-analysis/</link>
		<comments>http://dearscience.org/2008/11/05/witness-the-magic-of-regression-analysis/#comments</comments>
		<pubDate>Thu, 06 Nov 2008 00:07:55 +0000</pubDate>
		<dc:creator>Jonathan Golob</dc:creator>
				<category><![CDATA[Stats]]></category>

		<guid isPermaLink="false">http://dearscience.org/?p=570</guid>
		<description><![CDATA[&#8230; and some damn good statistics. FiveThirtyEight&#8217;s election-eve prediction, of 349 electoral votes for Obama: Reality this afternoon, of a projected 349 electoral votes for Obama: I might start caring about baseball, just to further appreciate the awesomeness of Nate Silver.]]></description>
			<content:encoded><![CDATA[<p>&#8230; and some damn good statistics.</p>
<p>FiveThirtyEight&#8217;s election-eve prediction, of 349 electoral votes for Obama:</p>
<p><center><a href="http://fivethirtyeight.com"><img src="http://dearscience.org/wp-content/uploads/2008/11/1105_bigmap.png" alt="" title="1105_bigmap" width="340" height="254" class="alignnone size-full wp-image-571" /></a></center></p>
<p>Reality this afternoon, of a projected 349 electoral votes for Obama:</p>
<p><center><a href="http://dearscience.org/wp-content/uploads/2008/11/evmap20081105.png"><img src="http://dearscience.org/wp-content/uploads/2008/11/evmap20081105-254x154.png" alt="" title="evmap20081105" width="254" height="154" class="alignnone size-medium wp-image-572" /></a></center></p>
<p>I might start caring about baseball, just to further appreciate the awesomeness of Nate Silver. </p>
]]></content:encoded>
			<wfw:commentRss>http://dearscience.org/2008/11/05/witness-the-magic-of-regression-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Read a Histogram</title>
		<link>http://dearscience.org/2008/10/20/how-to-read-a-histogram/</link>
		<comments>http://dearscience.org/2008/10/20/how-to-read-a-histogram/#comments</comments>
		<pubDate>Tue, 21 Oct 2008 00:36:43 +0000</pubDate>
		<dc:creator>Jonathan Golob</dc:creator>
				<category><![CDATA[Stats]]></category>

		<guid isPermaLink="false">http://dearscience.org/?p=555</guid>
		<description><![CDATA[Nate Silver, the wonky head of the mathematically rigorous election projection site FiveThirtyEight.com, has a computer model that uses all of the available polling, weighted for accuracy, demographics and the rest, to run through ten thousand possible elections every day. Each one of these simulated elections pops out an electoral vote total for Obama. What&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>Nate Silver, the wonky head of the mathematically rigorous election projection site <a href="http://FiveThirtyEight.com">FiveThirtyEight.com</a>, has a computer model that uses all of the available polling, weighted for accuracy, demographics and the rest, to run through ten thousand possible elections every day. Each one of these simulated elections pops out an electoral vote total for Obama. </p>
<p>What&#8217;s the best way to display all this data? A histogram.<br />
Here&#8217;s Nate&#8217;s:<br />
<center><a href="http://fivethirtyeight.com"><img src="http://dearscience.org/wp-content/uploads/2008/10/1019_evdist.png" alt="" title="1019_evdist" width="354" height="333" class="alignnone size-full wp-image-556" /></a></center></p>
<p>Along the bottom, on the horizontal, are the possible electoral vote counts for Obama. </p>
<p>For each one, from zero to five hundred thirty eight, on the vertical are the number of times this Obama electoral vote count happened during his ten thousand simulations. The tallest peaks are the most likely outcomes during the simulation. The low tails are things that are possible, but not very likely.</p>
<p>Many of the closer followers of FiveThirtyEight.com, like the Stranger&#8217;s own Anthony Hecht, tend to focus more on the big Obama victory pie chart. Over the past few days, Obama&#8217;s number has drifted down a bit, from a peak around 96% to the low nineties today. </p>
<p>Look at the histogram for today:<br />
<center><img src="http://dearscience.org/wp-content/uploads/2008/10/1020_evdist.png" alt="" title="1020_evdist" width="354" height="333" class="alignnone size-full wp-image-558" /></center></p>
<p>McCain is all tail, no peak. The peaks are still strongly skewed to an Obama blowout. </p>
<p>The histogram tells you, in much more detail than an number or a pie chart, the chances of the different outcomes in crisp (and this case comforting) detail. </p>
<p>I love histograms. </p>
]]></content:encoded>
			<wfw:commentRss>http://dearscience.org/2008/10/20/how-to-read-a-histogram/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to Read a Poll</title>
		<link>http://dearscience.org/2008/06/18/how-to-read-a-poll/</link>
		<comments>http://dearscience.org/2008/06/18/how-to-read-a-poll/#comments</comments>
		<pubDate>Thu, 19 Jun 2008 00:12:39 +0000</pubDate>
		<dc:creator>Jonathan Golob</dc:creator>
				<category><![CDATA[2008]]></category>
		<category><![CDATA[Featured Articles]]></category>
		<category><![CDATA[Stats]]></category>

		<guid isPermaLink="false">http://dearscience.org/?p=110</guid>
		<description><![CDATA[As we approach November, I anticipate a tidal wave of blog posts on polls. Reading the polling data improperly is hazardous to your health. The disconnect between the polling and the 2004 election results nearly resulted in my death. Avoid my mistakes. 1. Remember that polls are always of a population that may or may [...]]]></description>
			<content:encoded><![CDATA[<p>As we approach November, I anticipate a tidal wave of blog posts on polls. Reading the polling data improperly is hazardous to your health. The disconnect between the polling and the 2004 election results nearly resulted in my death. Avoid my mistakes.</p>
<p>1. Remember that polls are always of a <a href="http://en.wikipedia.org/wiki/Statistical_population">population</a> that may or may not resemble who actually goes to the polls. Only pay attention to polls that randomly select respondents. Consider how the poll selects the respondents.</p>
<p>For example, almost all polls used in the presidential race are based off random telephone surveys of landline telephones. I only have a cell phone. Therefore, I am not sampled in the statistical population surveyed.</p>
<p>Thus, even if the poll is perfect, it might not reflect the reality at the polls in the fall, as the populations might not match.</p>
<p>2. A poll only shows a statistically meaningful difference between two candidates if the difference between them is more than twice the margin of error. Most political polls in the United States are designed to have a margin of error of +/- 3%. Therefore, the difference between the candidates must be greater than 6% to be anything other than a tie.</p>
<p>A margin of error of 3% tells us that the true percentage in the population has a 95% chance of being somewhere between three percent above or below the number reported by the survey.</p>
<p>For example, the Rasmussen June 9 2008 poll of Michigan voters has Obama at 45%, McCain at 42%. Statistically, they are tied, as the actual percentage of the population for Obama ranges from 42% to 48%, McCain 39% to 45%. The ranges overlap, and therefore we cannot say that one is leading over the other.</p>
<p>Another fun thing to consider. 95% confidence means that for one in twenty polls, the true population percentage will not be in this range.</p>
<p>The practical meaning of all this? Beware selectively looking at the poll results! If you are selective enough, you can only see the error you want to see. Net result? Suicidal thoughts in November.</p>
<p>3. Often the real trends are smaller than the error ranges of the surveys. We can employ two math tricks to make things better.</p>
<p>First, we can aggregate many surveys together and get an average of percentages. Provided the surveys are independent of one another&#8211;that the results of one survey don&#8217;t affect another&#8211;this makes the error distribution closer to normal by the <a href="http://en.wikipedia.org/wiki/Central_limit_theorem">central limit theorem</a>.</p>
<p>The second trick is to use <a href="http://en.wikipedia.org/wiki/Moving_average">moving averages</a> as a mathematically safe way to sort out random ups-and-downs in the poll numbers from the real longer term changes in the sampled population.</p>
<p>Think of how much your weight changes each day, by when you&#8217;ve last gone to the bathroom, how much water you&#8217;ve drank and so on. The change on a day-by-day basis is far larger than what you&#8217;ll typically gain or lose in a week. So, if you measure your weight each day, and then average together the last seven days, you end up smoothing out all the variance. Left behind is the actual change on a week-long basis. We can use the same math on the polls.</p>
<p>Quite a few websites are around that basically do all of this for us, limiting themselves to polls with some statistical rigor, base their analysis on the confidence intervals, and aggregate multiple polls together in a moving average. None are perfect, but I&#8217;ve taken a shine to <a href="http://www.electoral-vote.com/">electoral-vote.com</a> for it&#8217;s non-commercial goodness and openness. I think the site is too aggressive in calling states&#8211;Michigan is listed as barely Obama, I think it should be a toss-up&#8211;but overall it&#8217;s a decent place to start.</p>
]]></content:encoded>
			<wfw:commentRss>http://dearscience.org/2008/06/18/how-to-read-a-poll/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

