Expected Win Percentage Topic

Also known as the Pythagorean Expectation, a formula concocted by Bill James, to estimate how many games a team should win based on the number of runs the team scores versus the number of runs the team allows over the course of a season.

How good is this formula, really?

Recently, I had way too much time on my hands for a couple of days, so I compiled some stats from one of my worlds for the total number of runs scored and total number of runs allowed by each franchise over the history of the world.  Probably not very meaningful, but I was looking for a way to pass some time.  After I compiled this information, I then decided to calculate the expected win percentage for each franchise over the history of the world and compare it to actual wins.  Was looking to see who the over-achievers were, and who the under-achievers were.

The results were kind of surprising.

Most surprising was the fact that the one team that had (a) won the most games over the history of the world, (b) scored the most runs, and (c) allowed the fewest runs, came in as the biggest under-achiever of all 32 franchises, clocking in at (-67) wins below expectations.

In fact, of the five winningest franchises in world history, four of them came in among the five biggest under-achievers.  The fifth one came in as a slight over-achiever, at +9 wins above expectations.

So this makes be wonder: is my approach flawed in my looking at too large of a sample size (19 seasons) with teams that may have gone through multiple ownership changes and many cycles of up and down seasons?  Or is there some sort of inherent flaw in the theory behind the formula itself that means that any correlation between what the formula "predicts" and what happens in reality is little more than coincidence?

3/25/2011 1:53 PM
Any model that uses a curve predicts the "middle" better than the "ends"

Apply the real life example I know - insurance.

More specifically, home insurance. Your suburban home is going to be picked up by any insurance company, but the ultra rich dude probably is going to need a specialty insurer because he gives the actuaries in your garden variety insurance company heart attacks.

ie. his or her home is at the end of the curve.
3/25/2011 2:00 PM
67 wins low out of 19 seasons at an assumed average .550 winning percentage (1693 total wins) is only 4% off.  I would say that is pretty close...

I'm guessing his formula ends up over-valuing good teams and under-valuing bad teams (good teams always fall short and bad teams always win more) but for a majority of teams it's pretty close.
3/25/2011 2:03 PM
pretty much what death said
3/25/2011 2:04 PM
The good teams are bitchslapping the bad teams by 6 and cranking up their EXP%.   The bad teams are sneaking by the good teams  by 1 and winning more than their EXP%.

I don't think that's very surprising.
3/25/2011 2:21 PM
All good points.  I guess I expected to see something flatter with what I thought was significantly large sample size.  Instead I did indeed get something more like a bell curve, with a very noticeable pattern at both ends (good teams under-achieving, poor teams over-achieving).  Wasn't really expecting that.
3/25/2011 2:39 PM
Well you should have asked us first.
3/25/2011 2:42 PM
Nothing like spending a couple of days figuring out what everyone else sees as obvoius.
3/25/2011 2:50 PM
Yea, dumbass.

Sorry, but I've got to go. I'm doing some research on the best way to get gasoline from the nozzle into my car.
3/25/2011 2:54 PM
They were REALLY slow days.
3/25/2011 2:59 PM
I'm guessing teams abandoned mid-season and playing tired players will alter the results a bit.   
3/25/2011 3:29 PM
With the new abandonment replacement rules, are we expecting more threads like this from tec?  Lots of time on his hands now...
3/25/2011 7:55 PM
I noticed that my 9 starters are getting more at bats than my bench players. Tec, can you do some analysis on the reason why?
3/25/2011 8:41 PM
I'll get on that as soon as I figure out why there's such a strong affinity between wins and scoring more runs than your opponent. I find that very odd.
3/25/2011 8:58 PM
This sounds like typical commentator "insights"..."let me tell you, everytime this team scores more runs than the opponent, they will win games!"
3/25/2011 9:39 PM
123 Next ▸
Expected Win Percentage Topic

Search Criteria

Terms of Use Customer Support Privacy Statement

© 1999-2026 WhatIfSports.com, Inc. All rights reserved. WhatIfSports is a trademark of WhatIfSports.com, Inc. SimLeague, SimMatchup and iSimNow are trademarks or registered trademarks of Electronic Arts, Inc. Used under license. The names of actual companies and products mentioned herein may be the trademarks of their respective owners.