Note: This post was published on January 27th and updated on February 22nd to include the 2×2 matrix. Thanks to Avinash Kaushik for providing guidance to do that.
It’s clear there is very little awareness of the diving problem in football and one of the main reasons is that nobody has measured how big a problem it is. We just see it happening in the moment, it’s an accepted part of the game, and given it’s part of the status quo very few people are interested in changing it. Measuring the problem will be a way to raise awareness around it but also a way to understand it better. Measurement is also the first step to identify relevant solutions and ultimately to improve the situation being studied. As one of the leading business management experts says:
“If you can’t measure it, you can’t improve it.”
It is not only about counting the number of dives in each game, the ultimate question we want to answer is: How much the course of games, leagues, World Cups are negatively affected by diving? How many times the winner of a game would have been different had there been no dishonesty on the field? What percentage of the results from games are polluted by divers? Could we calculate an honesty index in football?
An easy way to visualize what we want to accomplish is by looking at the following 2×2 matrix. On the vertical axis we separate dives from non-dives, on the horizontal axis we differentiate situations that produce an unjust impact on the game from those that don’t.
We want to study the two lower quadrants: dives that go unidentified and their impact on the course of games, plus dives that are identified and contained by referees. In a world were all players engage with honour and integrity there shouldn’t be unfair results and referees should not have to protect fair play from deceiving players. We hope that our effort will encourage players that dive to become the safeguards of fair play themselves.
Note: The top left quadrant is out of our scope, plays that don’t include dives but produce an unjust impact, like cases of mistaken identity. Those cases could be addressed by the Video Assistant Referee (VAR).
As a data analytics problem we need to take a few steps before arriving to a solution: (1) gather data, (2) give context to that data to enable analysis, (3) draw insights from the analysis, and finally (4) drive action to solve or improve the situation being analysed.
With all that in mind here we present our proposed measurement model to reveal the impact diving has on our beloved sport.
Metrics to count the type of dives that occur in a game.
Judgement is subjective and opinions will vary within 3 possible discrete values: dive, no dive or unclear. As there is no deterministic way to judge these plays, the option is to qualify them with a crowdsourcing mechanism asking the audience to assign the play one of those 3 values. The metric would then report what percentage of the audience thinks it was a dive, what percentage think it wasn’t and what percentage think it’s unclear. The data should always be paired with its statistical significance based on the number of crowdsourced responses. And it should be ensured that the participating audience (panel) is not biased towards one of the teams.
Wikipedia lists a 2009 study that found 4 recognisable traits (we propose a fifth one) that can often be observed when a player is diving. People can be educated in recognising these traits to increase the accuracy of the crowdsourced judgement. To simplify panel bias prevention, a panel of proven experts could be appointed by a neutral organization.
Even if an accurate way of judging cannot be implemented the value of this metric is not so much the actual percentages but the way it completes the overall picture of the diving practice. Counting inconclusive dives as an integer will reveal how many plays where there’s reasonable doubt there are in a game. To put it in visual terms see these simple Venn diagrams:
Metrics to understand the effect dives have over the game and the other team.
As a summary, here are all the proposed metrics:
Basically, add as many dimensions as you want so that you can slice and dice the data in interesting ways. Some examples are:
Metrics by themselves hardly produce any insights. Just counting dives will produce no greater good other than exposing divers a little more. There are many other interesting things to be discovered though. Some examples:
And the most important questions to answer:
By answering these questions we might be able to compute the honesty index. And then many debates can be started. What would you think if suddenly you discovered that 80% (just to say a number) of the results in you favourite sport come from dishonest acts? That only 20% of the results are real and just. We believe this would propel action and change.
Once enough data has been gathered and insights drawn, the maybe proper actions can be taken. Without all that, what we can do right now is to appeal to players sense of honesty & integrity and ask them to demonstrate these values on the field. Which is what we’re trying to do with #StopDiving.
Copyright notice for this post:
© 2018 StopDiving.org All rights reserved