However, a source of possible skewness would be if a city like Seattle was in a world where the other 31 ballparks were super hitter friendly. The data would possibly say that Seattle depresses offensive production more in that world compared to other worlds. The way to get around potential outliers like that would be to just get more data.
A definite source of skewness is observed in my actual
Seattle Somethings data where my home ERA has been consistent over time but my away ERA hops around based on what stadiums are in my NL as owners come and go over time. In the past, this league had many hitters parks such as Colorado and Albuquerque, but more recently there is a HEAVY concentration of extreme pitchers parks which has done crazy things to league ERA
season |
home ERA |
away ERA |
27 |
3.84 |
5.38 (boy those were the days) |
28 |
4.15 |
4.87 |
29 |
3.66 |
4.69 |
30 |
3.14 |
4.43 |
31 |
3.22 |
3.85 |
32 |
3.14 |
4.01 |
33 |
2.99 |
3.52 |
34 |
2.88 |
3.56 |
35 |
2.96 |
3.40 |
36 |
2.72 |
3.54 |
37 |
2.45 |
3.23 |
38 |
2.62 |
2.98 |
39 |
2.18** |
2.83 |
40 |
2.88 |
3.46 |
41 |
2.51 |
2.52 |
42* (109 games) |
2.47 |
2.86 |
**Side-note, I scrolled thru as many leagues as I could to see if this was an all-time HBD record but I found one team in Iowa City that had one season of 2.13 and another in Burlington that had a season of 2.15 so unfortunately no :(
But as you can see, over time the league's stadium pool is skewing heavily towards the negative, so much so that in S41 my home-away were almost identical which means the league-average-PF that season was just insane.
Therefore, as alluded to previously this variability of stadium-pool from league to league and even within individual leagues itself is always going to be your primary source of error with data-mining home/away on HBD. Real life data good, HBD data bad
1/25/2018 4:40 PM (edited)