The Belgian lab shaping modern soccer's data revolution

The Guardian, 2 days ago

If you hope to grasp why modern soccer looks the way it does, or the long strides we've made recently in understanding how it actually functions, it helps to know about what's been happening at one of the world's oldest universities, in Belgium.
That's where you'll find the Sports Analytics Lab at the Catholic University of Leuven, headed up by Jesse Davis, a Wisconsinite computer science professor. Davis grew up going to basketball and football games at the University of Wisconsin-Madison and didn't discover soccer until college, during the 2002 World Cup. When he was hired in Leuven in 2010 to research machine learning, data mining and artificial intelligence, a band of sports-besotted colleagues brought him back to soccer.
Before long, Davis was supervising a stable of post-docs, PhD and master's students working on soccer data. The richness and complexity of the data lent themselves well to the study of AI. The work they produced, and made available to anyone through open-source analytics tools, substantially advanced the science behind the sport, and changed the way some clubs thought about playing.
It may also serve as an example of how funding university research can benefit the public, including the businesses working within the field being studied; a potential parable for the value of academia at a time when it is being squeezed from all sides.
In the early days of the analytics movement in sports, it was broadly believed that soccer didn't lend itself very well to advanced statistical analysis because it was too fluid. Unlike baseball, or basketball, or gridiron football, it couldn't be broken down very easily into a series of discrete actions that could be counted and assigned some sort of value. Its most measurable action, shots, and therefore goals, make up a tiny fraction of the events in a given game, presenting a problem for quantifying each player's contributions – especially in the many positions where players tend not to shoot at all.
But while soccer was slow to adapt and adopt analytics, it got there eventually. Most big clubs now have an extensive data department, and there's now a disproportionately large genre of (eminently readable) books on this fairly esoteric subject.
The Sports Analytics Lab published findings on the optimal areas for taking long shots, and on whether, in some situations, it's more efficient to boot the ball long and out of bounds than to build out of the back. Some of those papers carried inscrutably academic-y titles like 'A Bayesian Approach to In-Game Win Probability' or 'Analyzing Learned Markov Decision Processes Using Model Checking for Providing Tactical Advice in Professional Soccer.'
Wisely, they also published a blog that broke all of it down in layperson's terms.
This fresh research led to collaborations with data analysts at clubs such as Red Bull Leipzig and Club Brugge, and at the German and United States federations. The lab also worked with its local pro club, Oud-Heverlee Leuven, and the Belgian federation.
But what's curious is that a decade and a half on, Davis and his team, which numbers about 10 at any given time, are still doing industry-leading and paradigm-altering research, like their recent work fine-tuning how ball possession is valued.
Now that the sport, at the top end, has fully embraced analytics and baked it into everything it does, you would expect it to outpace and then sideline the outsiders, as has happened in other sports. But it hasn't.
'Elite sport, and not just soccer, has an intense focus on what comes next,' says Davis. 'This is particularly true because careers are so fleeting both for players and staff. Consequently, the fact that you may not be around tomorrow does not foster the desire to take risks on projects that, A, may or may not work out or, B, will yield something useful but not in the next six-to-nine months.'
There is innovative work being done within soccer clubs that the outside world doesn't get to see, because what would be the point of sharing all that hard-won insight? The incentives of professional sports strain against the scientific process, which values taking risks and tinkering endlessly with the design of experiments, none of which might yield anything of use. What's more, it requires highly skilled practitioners, who can be tricky and pricey to recruit. The payoff of that investment may be limited. And if it arrives at all, the output of that work may not necessarily help a team win games, especially in the short term.
Meanwhile, most of the low-hanging soccer analytics fruit – like shot value, or which types of passes produce the most danger – has already been picked. What remains are far more complicated problems like tracking data and how to make sense of it.
You may find, for instance, that while expected goal models have become pretty good at quantifying and tabulating the chances a team created over the course of a game, they do not work well in putting a number on a certain striker's finishing ability because of biases in the training data.
Yes. Sure. Great. But now what? What are Brentford (or his potential new club Manchester United) supposed to do with the knowledge that Bryan Mbeumo's Premier League-leading xG overperformance of +7.7 – that is, Mbeumo's expected goals from the quality of his scoring chances was 12.3, but he actually scored 20 times this past season – doesn't actually suggest that he was the best or most efficient finisher in the Premier League?
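For readers who want the arithmetic behind that figure spelled out, here is a minimal sketch in Python. The function name is made up for illustration, and the only inputs are the two numbers quoted above; it says nothing about how an expected goals model arrives at 12.3 in the first place.

```python
def xg_overperformance(goals: int, expected_goals: float) -> float:
    """Goals scored minus expected goals, rounded to one decimal place.

    Hypothetical helper for illustration only; not taken from any
    club's or lab's actual tooling.
    """
    return round(goals - expected_goals, 1)


# Figures quoted in the article for Bryan Mbeumo's season.
mbeumo = xg_overperformance(goals=20, expected_goals=12.3)
print(f"{mbeumo:+.1f}")  # prints "+7.7"
```

The subtraction is the easy part. The harder question, as the paragraph above suggests, is what the resulting gap actually tells a club about a striker's finishing.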
What's more, when a club does turn up a useful tidbit, it has to find a way not only to implement that finding, but to track it over the long term. That means building some sort of system to accommodate it, which entails data engineering and software programming. On the club side, this kind of work can take up much, or most, of the labor in analytics work.
'For some of the deep learning models to work with tracking data takes months to code for exceptional programmers,' says Davis. 'Building and maintaining this is a big upfront cost that does not yield immediate wins. This is followed by a cost to maintain the infrastructure.'
Academics, on the other hand, have less time pressure and can move on to some new idea if a project doesn't work out or there is simply no more new knowledge to be gained from it. 'I don't have to worry about setting up data pipelines, building interactive dashboards, processing things in real time, etc,' says Davis.
The research itself is the point. The understanding that issues from it is the end, not the means. And then everybody else benefits from this intellectual progress.
There may be a useful lesson in this for how a federal government, say, may consider the value of investing in scientific inquiry.
Leander Schaerlaeckens is at work on a book about the United States men's national soccer team, out in 2026. He teaches at Marist University.
