Post by kevin on Feb 26, 2019 16:32:04 GMT -6
As you may know, I've been critical of the power rating system used for selection and seeding of the playoff teams. While its intent is admirable--select and seed the teams in a fair and impartial manner--it has clear flaws.
It is true that the PR system gets most things right. But that's not a particularly high standard to meet. I looked over the results of the DI girls' playoffs. The better seed, according to the LHSAA system, won 19 of 23 games. That's an impressive-sounding record. But what if you had simply picked the team with the better W-L-T record? You'd also have picked 19 winners correctly.
Another problem is that too much of the system is based on your record and your opponents' records, without any regard for what those records actually mean. Not every 15-win team is the same. Some teams have 15 wins against a very tough 20-game schedule. Other teams have 15 wins against a weaker 25-game schedule. The current system does not control for the number of games played, nor does it look at how good your opponents' opponents really are.
Before I go any further, I'd like to thank 3balz, who was generous enough to send me the spreadsheet he had compiled in order to update the power ratings every day. His data had every single game played all year, with easy access to any team's schedule, results, etc. Without that there is no way I'd have been able to put this together.
I first tried some simple tweaks to the current system. To account for the different numbers of games played by different teams, I calculated each team's winning percentage and scaled it to a hypothetical 20-game schedule. It didn't make much difference. I also tried an RPI-style system (25% your record, 50% your opponents' records, 25% your opponents' opponents' records). That didn't make much difference either.
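For anyone curious what the RPI-style tweak looks like in practice, here's a minimal Python sketch. The `records` and `schedules` structures are made up for illustration, and note that the real RPI also excludes games against the rated team from its opponents' records, which this sketch skips:

```python
# Minimal sketch of an RPI-style rating (hypothetical data layout).
# Weights as in the post: 25% own record, 50% opponents', 25% opponents' opponents'.
# Note: the official RPI removes games against the rated team from its
# opponents' records; this simplified sketch skips that adjustment.

def win_pct(record):
    """record = (wins, losses, ties); a tie counts as half a win."""
    w, l, t = record
    games = w + l + t
    return (w + 0.5 * t) / games if games else 0.0

def rpi(team, records, schedules):
    """records: team -> (W, L, T); schedules: team -> list of opponents faced."""
    opps = schedules[team]
    wp = win_pct(records[team])                               # own record
    owp = sum(win_pct(records[o]) for o in opps) / len(opps)  # opponents' records
    oowp = sum(                                               # opponents' opponents'
        sum(win_pct(records[o2]) for o2 in schedules[o]) / len(schedules[o])
        for o in opps
    ) / len(opps)
    return 0.25 * wp + 0.50 * owp + 0.25 * oowp
```

The problem, as noted above, is that this still treats every opponent's record as equally meaningful regardless of who they played.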
It became clear that the schedules that teams play are simply too different for these methods to work well. What I wanted to do was create a method that would be much more similar to what a human would do in comparing teams, while still being completely objective and suitable for a computer to perform. I thought back to what I used to do during the days of coaches' seeding. At the end of the season, I would take the playoff teams and create a brief resume for each one, looking at record, key wins and losses, etc. When I then set out to compare teams, the first thing I looked for, of course, was head-to-head. If Team A beats Team B, Team A is probably better. I know that upsets happen, and that sometimes teams play more than once, but that's my first starting point.
Unfortunately, not all teams play head-to-head. So what's my next step? I look at common opponents. If Team A beats Teams C, D, and E, but Team B loses to C, D, and E, we can probably say that Team A is better than Team B. Obviously, it doesn't always work out quite that cleanly, but the basic idea should still work. If two teams play the same five opponents and one team goes 3W-1L-1T and the other goes 1W-2L-2T, the team that won three games is probably better.
What I decided to do was to compare every single team in a division against everyone else and award a team a point for each team it defeated, either in a head-to-head matchup (if applicable) or by comparing winning percentage vs. common opponents. If two teams had no head-to-head and no common opponents (or if the records vs. common opponents were identical), I called it a tie and did not award a point to either team.
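As a sketch of that comparison-points idea, here's one way it might be coded in Python. The `games` layout is made up for illustration, and ties are omitted to keep it short (the actual method counts T games in the records vs. common opponents):

```python
# Sketch of the pairwise comparison-points method (hypothetical data layout).
# games: list of (winner, loser) pairs; ties omitted for simplicity.
from collections import defaultdict

def build_results(games):
    """Return team -> opponent -> [wins, losses] vs. that opponent."""
    res = defaultdict(lambda: defaultdict(lambda: [0, 0]))
    for w, l in games:
        res[w][l][0] += 1
        res[l][w][1] += 1
    return res

def compare(res, a, b):
    """1 if a 'defeats' b in the comparison, -1 if b defeats a, 0 if tied."""
    # Step 1: head-to-head, if they met and the series wasn't split.
    if b in res[a]:
        wins, losses = res[a][b]
        if wins != losses:
            return 1 if wins > losses else -1
    # Step 2: winning percentage vs. common opponents.
    common = (set(res[a]) & set(res[b])) - {a, b}
    if not common:
        return 0
    def pct(t):
        w = sum(res[t][o][0] for o in common)
        l = sum(res[t][o][1] for o in common)
        return w / (w + l) if w + l else 0.0
    pa, pb = pct(a), pct(b)
    return 0 if pa == pb else (1 if pa > pb else -1)

def points(res):
    """One point per team 'defeated' in the pairwise comparisons."""
    teams = list(res)
    return {t: sum(1 for u in teams if u != t and compare(res, t, u) == 1)
            for t in teams}
```

The key design choice is that a comparison with no head-to-head result and no common opponents simply awards nothing to either side, rather than forcing a guess.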
I tested this method with the DI girls results. With 41 teams in the division, the most points a team could've earned would be 40. One drawback to this method is that it's quite easy for teams to end up tied. If two or more teams were tied, I looked at how they fared in their comparisons to each other. Here are the results of the method:
1. Byrd, C.E. 39
2. Mandeville 38
3. St. Scholastica 37
4. Dominican 36
5. Northshore 33
6. Mt. Carmel 32
7. Acadiana 29
8. Baton Rouge 28
9. St. Joseph's Academy 27
10. Barbe 24
11. Fontainebleau 22
12. Dutchtown 22
13T. West Monroe 21
13T. Lafayette 21
15. St. Amant 19
16T. Sulphur 15
16T. Walker 15
18. East Ascension 14
19. Thibodaux 14
20. Hahnville 13
21. Slidell 13
22. Captain Shreve 11
23. Comeaux 10
24. Denham Springs 10
25. Zachary 10
26T. Alexandria 8
26T. West Jefferson 8
28. Ehret, John 6
29. Chalmette 5
30. Bourgeois, H.L. 4
31. Bonnabel 4
32. Airline 4
33. Higgins, L.W. 4
34. Ponchatoula 3
35. New Iberia 2
36T. King, Grace 1
36T. Covington 1
36T. Pineville 1
39T. Southwood 0
39T. Hammond 0
39T. East St. John 0
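The tie-breaking step used for this list (re-running the comparisons among only the tied teams) could be sketched like this, assuming some compare(a, b) function that returns 1, 0, or -1 as described earlier:

```python
# Sketch of the tie-breaking step (hypothetical helper names).
# Among teams with equal points, award "mini-points" from comparisons
# restricted to the tied group; teams still tied after that share a rank.

def rank(teams, pts, compare):
    """teams: list of names; pts: team -> comparison points;
    compare(a, b) -> 1/0/-1. Returns teams in ranked order."""
    by_pts = {}
    for t in teams:
        by_pts.setdefault(pts[t], []).append(t)
    ordered = []
    for p in sorted(by_pts, reverse=True):       # highest point totals first
        group = by_pts[p]
        mini = {t: sum(1 for u in group if u != t and compare(t, u) == 1)
                for t in group}
        ordered.extend(sorted(group, key=lambda t: -mini[t]))
    return ordered
```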
You'll notice that this method correctly predicts the winners of 21 of the 23 playoff games, an improvement over the current PR system. In the next post I'll address some of the questions I think people may have and outline some areas for potential improvement.