Ice scope data shows zero correlation between height & GOE

cohen-esque

Final Flight
Joined
Jan 27, 2014
Do you have a link to this data? I would love to see it. If indeed there is a stronger correlation for 2As, that may lend some credence to the theory that in judges' minds, all 3As and quads check the height-and-distance box merely for being 3As and quads. I would disagree with that (clearly, Yuzuru's 3A is outstanding, and I would also give a mention to Boyang's 4Lz; I think both should be rewarded for that. In my opinion, even Samarin, whose skating is often validly criticized, deserves extra credit for the size of his quads), but it would be an interesting addition to our knowledge of how judges behave. I agree that in general more data would be helpful.

Well, the graph for the 2A distance/GOE is still around; the data is... somewhere, hopefully, but I can't find it at the moment. The data for all the clean jumps vs PCS is here.

The sample sizes are pretty variable; I've broken the data down by jump type since my first post. All the types of solo jumps I could do anything with showed a correlation between PCS and GOE, but the overall relationship is clearly being dragged down by the 3Lo, which for whatever reason showed a much weaker correlation than the other jumps.
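For anyone curious how a per-jump-type breakdown like this is computed, here's a minimal sketch. The records below are invented placeholders, not the actual spreadsheet data; the real dataset is in the link above.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean

def pearson_r(xs, ys):
    """Plain Pearson correlation coefficient between two sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Invented records: (jump type, skater's PCS, GOE received on that jump).
records = [
    ("3A", 90.1, 2.8), ("3A", 85.3, 2.1), ("3A", 78.0, 1.2), ("3A", 70.5, 0.9),
    ("3Lo", 88.0, 1.4), ("3Lo", 72.0, 1.5), ("3Lo", 81.0, 1.2), ("3Lo", 69.0, 1.6),
]

# Group by jump type, then correlate PCS against GOE within each group.
by_type = defaultdict(list)
for jump, pcs, goe in records:
    by_type[jump].append((pcs, goe))

for jump, pairs in sorted(by_type.items()):
    pcs_vals, goe_vals = zip(*pairs)
    print(f"{jump}: r = {pearson_r(pcs_vals, goe_vals):+.2f}")
```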
 

Casual

On the Ice
Joined
Jan 26, 2018
You know what is severely lacking in ISU? Post-event analysis of judging.

I remember an eye-opening video with a side-by-side comparison of Yuna vs. Adelina Sochi skate. Shocking, to say the least. Now, why doesn't ISU perform similar audits of the judging? (Unless they are shamelessly corrupt. :laugh:)

It would be illuminating to compare the actual results vs. what they should have been - and then train the judges accordingly.
 

Edwin

СделаноВХрустальном!
Record Breaker
Joined
Jan 5, 2019
You know what is severely lacking in ISU? Post-event analysis of judging.

I remember an eye-opening video with a side-by-side comparison of Yuna vs. Adelina Sochi skate. Shocking, to say the least. Now, why doesn't ISU perform similar audits of the judging? (Unless they are shamelessly corrupt. :laugh:)

It would be illuminating to compare the actual results vs. what they should have been - and then train the judges accordingly.

Are you certain there are no evaluations at all? No new judging videos, study courses, or examination materials produced from all the extra footage the ISU has access to?
 

Elucidus

Match Penalty
Joined
Nov 19, 2017
IceScope sham exposure:
https://www.youtube.com/watch?v=C5LhKLXddts

There is literally one random guy who just marks manually, by eye - on a strongly tilted TV image filmed from a ceiling camera - two points (presumably the beginning and end of the jump) and draws a straight line between them. The application calculates the rest after that. Apparently this app was used for baseball before. So... yeah - so much for "hi-tech" and "computer calculation" - it's just a human entering the data manually, with a huge margin for error (they don't even determine the exact area of the boot or blade from which the height calculation should begin - with a top-down angle it's just impossible) :drama::laugh:
 

Nilf

Rinkside
Joined
May 15, 2018
IceScope sham exposure:
https://www.youtube.com/watch?v=C5LhKLXddts

There is literally one random guy who just marks manually, by eye - on a strongly tilted TV image filmed from a ceiling camera - two points (presumably the beginning and end of the jump) and draws a straight line between them. The application calculates the rest after that. Apparently this app was used for baseball before. So... yeah - so much for "hi-tech" and "computer calculation" - it's just a human entering the data manually, with a huge margin for error (they don't even determine the exact area of the boot or blade from which the height calculation should begin - with a top-down angle it's just impossible) :drama::laugh:
tbh you don't know exactly how it works. Maybe he just marks an area of interest to find and start tracking the path of the blade. There are a lot of cameras and I think they use computer vision. With high-quality, high-frame-rate footage from calibrated cameras, they can compute the homography to the 2D ice surface and estimate height, length, and velocity. It's not rocket science.
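The homography idea can be sketched in a few lines: given four reference points whose rink positions are known (a standard rink is 60 m x 30 m), you can solve for the 3x3 matrix that maps camera pixels to rink coordinates, then measure jump length as the distance between the mapped takeoff and landing points. This is a toy numpy version with invented pixel coordinates, not how Ice Scope actually works - the real system presumably fuses multiple calibrated cameras:

```python
import numpy as np

def solve_homography(img_pts, world_pts):
    """Solve for H (with h33 fixed to 1) mapping image pixels to world
    coordinates, from exactly four point correspondences (DLT)."""
    A, b = [], []
    for (x, y), (u, v) in zip(img_pts, world_pts):
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y]); b.append(u)
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y]); b.append(v)
    h = np.linalg.solve(np.array(A, float), np.array(b, float))
    return np.append(h, 1.0).reshape(3, 3)

def to_rink(H, px):
    """Map one pixel coordinate to rink coordinates (metres)."""
    X = H @ np.array([px[0], px[1], 1.0])
    return X[:2] / X[2]

# Four reference pixels (invented) matched to the corners of a 60 x 30 m rink.
img_corners = [(100, 80), (1180, 95), (1100, 640), (150, 620)]
rink_corners = [(0, 0), (60, 0), (60, 30), (0, 30)]
H = solve_homography(img_corners, rink_corners)

# Jump length = rink-plane distance between takeoff and landing pixels.
takeoff, landing = to_rink(H, (600, 300)), to_rink(H, (655, 310))
jump_length = np.linalg.norm(landing - takeoff)
```

Height is the harder part, as Elucidus notes: a single overhead view only constrains the blade's position on the ice plane, so estimating vertical displacement needs a second view or strong assumptions.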
 

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
tbh you don't know exactly how it works. Maybe he just marks an area of interest to find and start tracking the path of the blade. There are a lot of cameras and I think they use computer vision. With high-quality, high-frame-rate footage from calibrated cameras, they can compute the homography to the 2D ice surface and estimate height, length, and velocity. It's not rocket science.

There still is inconsistency though - note how sometimes the curve of the jump matches where the skater's feet are, while in other cases there is a gap between the feet and the curve, and other times it overlaps the skater's feet. Ice Scope is a fun, frivolous viewer tool to enjoy, but its accuracy has yet to be proven. Also, people using Ice Scope results as the comparative be-all and end-all of how well a skater jumps is misleading, because skaters execute jumps bigger/better or smaller/worse from competition to competition. For example, Hanyu's 4L had an exit speed of only 9 m/s, but I'm guessing he's landed it with more speed/flow before. There needs to be a greater sample size, and the system needs to be used in various countries to see if there's a trend between the nationality of the Ice Scope operator and the nationality/popularity of the skater being assessed. It's a good step, but I'm still very skeptical about its accuracy.
 

NaVi

Medalist
Joined
Oct 30, 2014
While those other metrics are nice (height, distance, speed), time in the air should be an easier metric to get and is probably a better one to use.
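Air time and peak height actually carry the same information: ignoring air resistance, a jump that spends total time t in the air peaks at h = g(t/2)^2 / 2, i.e. t = 2*sqrt(2h/g). A quick check against the range of heights Ice Scope reports for men's triple axels:

```python
from math import sqrt

G = 9.81  # gravitational acceleration, m/s^2

def air_time(height_m):
    """Total flight time for a jump peaking at height_m (no air resistance)."""
    return 2 * sqrt(2 * height_m / G)

def peak_height(air_time_s):
    """Inverse relation: peak height reached for a given total air time."""
    return G * (air_time_s / 2) ** 2 / 2

# Heights in the range reported for men's 3As in this thread:
for h in (0.51, 0.59, 0.70):
    print(f"{h:.2f} m peak -> {air_time(h):.2f} s in the air")
```

So the measurement choice is really about which quantity the cameras can read more reliably, not about which metric is richer.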
 

macy

Record Breaker
Joined
Nov 12, 2011
kind of proves IJS didn't really get away from or "fix" the issues that 6.0 had...

:slink:
 

GF2445

Record Breaker
Joined
Feb 7, 2012
Ice scope is more for the audience and commentators. Something to continue the conversation.
Height and distance is only one of six bullet points for GOE (note that it is one of the mandatory bullets for receiving +4/+5 GOE).
Remember these are guidelines that inform the judges' decision.

Maybe it will become more correlated once AI is advanced enough that judges aren't needed to mark the technical elements score.
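A simplified model of the bullet arithmetic GF2445 describes - this is a sketch of the published guideline, not actual ISU software, and the cap logic is deliberately stripped down:

```python
def positive_goe(bullets_met):
    """bullets_met: six booleans in ISU order, where the first three are
    the 'mandatory' bullets (for jumps: very good height/length, good
    takeoff and landing, effortless throughout).  Simplified model:
    the positive GOE is roughly the number of bullets achieved, capped
    at +5, and +4/+5 require all three mandatory bullets."""
    goe = min(sum(bullets_met), 5)
    if goe >= 4 and not all(bullets_met[:3]):
        goe = 3  # missing a mandatory bullet caps the mark at +3
    return goe

# A jump hitting five bullets but lacking very good height/length caps at +3:
print(positive_goe([False, True, True, True, True, True]))
```

Which is exactly why height/distance matters disproportionately at the top of the scale: it gates +4/+5 regardless of how many other bullets are met.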
 

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
kind of proves IJS didn't really get away from or "fix" the issues that 6.0 had...

:slink:

Well, just as in 6.0 a skater with great amplitude on their jumps might not have been credited for it, in this system it's still subjective whether a skater gets the GOE bullet for amplitude. Unfortunately you can only lead the judges to water - you can't make 'em drink! :p
 

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
Ice scope is more for the audience and commentators. Something to continue the conversation.
Height and distance is only one of six bullet points for GOE (note that it is one of the mandatory bullets for receiving +4/+5 GOE).
Remember these are guidelines that inform the judges' decision.

Maybe it will become more correlated once AI is advanced enough that judges aren't needed to mark the technical elements score.

Maybe if judges knew they could get replaced by AI they would endeavour to be more accurate/fair. Of course, they're only human.
 

gkelly

Record Breaker
Joined
Jul 26, 2003
It's not so much that the rules require jump size to be the primary determinant of jump scoring and judges are either unable or unwilling to follow those rules.

Rather, the rules consider jump size to be one of many positive qualities to be rewarded. Perhaps there is an inverse relationship between jump size and some of the other positive qualities. Or a direct correlation between jump size and some of the negative qualities. For example, I'd expect bigger jumps, on average, to be more telegraphed.
 

Mathman

Record Breaker
Joined
Jun 21, 2003
What is intriguing to me in these discussions is the idea of judging by some sort of "artificial intelligence," versus just "using technology to obtain accurate measurements." The latter could easily be achieved if anyone wanted to put up the money.

What would be cool, though, would be a self-teaching program that could evaluate the GOE bullet point "jump matches the music." Then we could move on to PCS (Performance: "physical, emotional, and intellectual involvement").

A first step might be to tackle Transitions. It should be easy enough to teach a robot to recognize a Mohawk or a counter and evaluate the bullet points "variety and difficulty."

This would not be impossible (though again, someone would have to care enough to actually do it.) The problem would be, how to evaluate the success of the program's self-modifying output loops so that it could bolster its successes and devalue its failed attempts. In most projects of this type the goal would be to develop a robot judge whose scoring would be indistinguishable from the scoring of human judges. (What type of flexing of muscle groups do humans regard as esthetically pleasing, etc.) This alone would be interesting and I think would shed light on the "art versus sport" conundrum.

But then again, we already have judges who judge like humans do. ;)
 
Last edited:

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
Height, distance, and landing speed aren't everything. And looking at the data further, I saw a significant example of it:

Look at Slavik Hayrapetyan's 3A: https://www.youtube.com/watch?v=pXDXocE6QWw#t=4m15s

Obviously the lean/forward landing dropped the GOE to 0.11... although if he were a top-tier/more popular skater I'm sure it would have been higher - and you did mention that PCS/planned BV correlates with higher GOE. Generally, though, skaters with higher PCS also happen to have cleaner elements.

Slavik still had a 3A that was 3rd highest, 4th farthest and 2nd fastest on the landing speed. This example right here shows that a jump can have some of the highest "quantifiable" stats, and still not earn as high GOE as other attempts with lower quantifiable stats, due to other factors of the jump being considered (like the form breaks Slavik had).

i.e. it's not like the judges weren't considering the height/distance/speed - but they were also considering other jump aspects/issues when calculating their final GOE. In my opinion, the judges are more desensitized to lower-tier skaters with excellent height/distance, and focus more on the fact that he had form breaks and penalize him for it. Whereas a top-tier skater can have not as much height/distance and the same minor error but get higher GOE because the judge is predisposed to seeing that skater scored higher.
 

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
In essence, if you want to predict how much GOE someone will get for a successful jump, the jump height, jump distance, and landing speed will tell you virtually nothing, whereas the skater's PCS and planned jump BV are much more reliable indicators.

(Oh, and here are the average stats for all men.)

  • Average height: 0.59 m (standard deviation: 0.041 m)
  • Average distance: 2.87 m (standard deviation: 0.39 m)
  • Average landing speed: 14.71 m/s (standard deviation: 3.03 m/s)
  • Average GOE: 1.75 (standard deviation: 0.92)

You can find all the data for skaters who successfully completed (positive execution) their 3A in the spreadsheet. Notable figures (highest in each category bolded, second highest italicized):

Skater                  Height (m)   Distance (m)   Landing speed (m/s)   GOE
Yuzuru Hanyu            0.70         3.62           15.3                  3.43
Nathan Chen             0.58         2.66           17.1                  2.74
Shoma Uno               0.51         3.44           18.3                  3.09
Mikhail Kolyada         0.65         2.50           11.8                  2.97
Vincent Zhou            0.58         2.69           16.7                  1.60
Jason Brown             0.60         2.35           14.6                  2.51
Boyang Jin              0.57         2.55           16.0                  2.51
Morisi Kvitelashvili    0.60         3.51           17.0                  1.83
Slavik Hayrapetyan      0.64         3.13           17.9                  0.11

Here's the thing. The GOE bullet says "very good height and very good length (of all jumps in a combo or sequence)". BUT there is no actual number as to what constitutes "very good height" or "very good length".

Also important to note that it doesn't say SUPERIOR height or anything that implies the GOE bullet is awarded for better height/length relative to other skaters.

Now, what if, hypothetically, the minimum benchmarks to achieve this particular GOE bullet were 0.50 m for height and 2.35 m for distance?

Then all of these guys would be getting the height/distance GOE bullet on their triple axel, and minimal correlation between amplitude and GOE makes sense, since everyone's getting the minimum height/distance! :laugh:
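Taking the table above at face value, both claims in this post can be checked directly. Note the 0.50 m / 2.35 m benchmarks are CanadianSkaterGuy's hypothetical, not an ISU number:

```python
from math import sqrt

# (skater, height m, distance m, landing speed m/s, GOE) from the table above.
jumps = [
    ("Yuzuru Hanyu", 0.70, 3.62, 15.3, 3.43),
    ("Nathan Chen", 0.58, 2.66, 17.1, 2.74),
    ("Shoma Uno", 0.51, 3.44, 18.3, 3.09),
    ("Mikhail Kolyada", 0.65, 2.50, 11.8, 2.97),
    ("Vincent Zhou", 0.58, 2.69, 16.7, 1.60),
    ("Jason Brown", 0.60, 2.35, 14.6, 2.51),
    ("Boyang Jin", 0.57, 2.55, 16.0, 2.51),
    ("Morisi Kvitelashvili", 0.60, 3.51, 17.0, 1.83),
    ("Slavik Hayrapetyan", 0.64, 3.13, 17.9, 0.11),
]

# Hypothetical minimum benchmarks for the height/length bullet.
MIN_HEIGHT, MIN_DIST = 0.50, 2.35
clears_bullet = [name for name, h, d, _, _ in jumps
                 if h >= MIN_HEIGHT and d >= MIN_DIST]

def pearson_r(xs, ys):
    """Pearson correlation coefficient."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mx) ** 2 for x in xs)
                      * sum((y - my) ** 2 for y in ys))

heights = [h for _, h, _, _, _ in jumps]
goes = [g for *_, g in jumps]
r = pearson_r(heights, goes)
print(f"All 9 clear the hypothetical benchmarks: {len(clears_bullet) == 9}")
print(f"Height vs GOE correlation across these 3As: r = {r:+.2f}")
```

For this small sample, every skater clears both thresholds and the height/GOE correlation is close to zero, consistent with the thread title.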
 

Mathman

Record Breaker
Joined
Jun 21, 2003
Now, what if, hypothetically, the minimum benchmarks to achieve this particular GOE bullet were 0.50 m for height and 2.35 m for distance?

Then all of these guys would be getting the height/distance GOE bullet on their triple axel, and minimal correlation between amplitude and GOE makes sense, since everyone's getting the minimum height/distance! :laugh:

To me, this line of reasoning highlights the whole problem with positive bullet points and GOEs. You have 6 binary bullet points "yes" or "no," with no real explanation of what "yes" means. Then you add up the number of yeses. (Not only that, but you have to decide whether to write yeses or yesses.)

I doubt that the judges are actually doing this. I bet they are looking at the jump, saying to themselves "wow" or "meh" and putting down a number from 1 to 5.

IMHO, this makes sense. The base value is "what did you do?" and the GOE is "how well (qualitatively) did you do it?"
 
Last edited:

CanadianSkaterGuy

Record Breaker
Joined
Jan 25, 2013
To me, this line of reasoning highlights the whole problem with positive bullet points and GOEs. You have 6 binary bullet points "yes" or "no," with no real explanation of what "yes" means. Then you add up the number of yeses. (Not only that, but you have to decide whether to write yeses or yesses.)

I doubt that the judges are actually doing this. I bet they are looking at the jump, saying to themselves "wow" or "meh" and putting down a number from 1 to 5.

IMHO, this makes sense. The base value is "what did you do?" and the GOE is "how well (qualitatively) did you do it?"

To me, this is how judges do it. It's very difficult for a judge to consider every single rule/deduction/bullet/etc. as they watch a jump. It's not like they're watching every single jump multiple times to cover every detail. So they make a call - some might focus on a particular aspect of a jump which is why we see disparities in GOE.

The nature of the competition and politics and the popularity of the skater they're assessing all come into play. I'm fairly certain the judges who judge a skater at Nationals or a fluff competition like WTT would judge them more strictly at Worlds (which obviously explains GOE inflation we see at Nationals).

One thing I've always marvelled at is diving judges. Like, AFAIK, they don't get replays of a dive, so they have to make a split-second decision covering all aspects of the dive without slow-mo replay to help them out.

Interestingly (and unsurprisingly), their scoring is often influenced by the divers preceding them even if they're supposed to treat every dive as an absolute.
https://royalsocietypublishing.org/doi/full/10.1098/rsos.160812
 