one of the reasons I tried to convince selbe to run ratings adjusted to each world, was with one set of ratings, a substantial test could be done to normalize results - seble would have to do this - could not give it to testers becasue it would give a way too much of the games secrets.
I wanted all ratings normalized to the current d2 system.
as a test, a pair of identical dead avg teams could play each other 1000 times running identical offenses and defenses, then run zone vs man, zone vs fcp, etc until all the combinations were run with statistically identical results w/i a margin for error.
that would be half the project. The next half would be to tweek one team in the 'right' areas, so still identical, but so press would be enhanced. In this case, for 1000 sims, the pressing team should be better. Then the 'tweeked team should run zone and man, in this case the results should be worse.
the team should then be tweeked for zone enhancement, and the same tests should be done.
finally the team should be tweeked for man enhancement, and the same tests should be done.
At the end of the day, my guess is this would take nearly 3-6 months, but we realistically would have a better game.
feel free to rip this idea apart, not meant to be cannon, more as an exercise to stimulate thought