Simulations for the 2026 FIFA World Cup

Introduction

This summer brings the 2026 FIFA World Cup, the most ambitious edition of the tournament yet. For the first time, 48 teams will compete — up from 32 — spread across 12 groups in a host trio of the United States, Canada, and Mexico. The expanded field means a new knockout format as well: the top two teams from each group advance directly to the Round of 32, joined by the eight best third-place finishers, setting up a 32-team bracket before the traditional Round of 16 begins. As I have done for previous soccer tournaments, I’ve modeled and simulated the World Cup 10,000 times to generate probabilistic predictions. For the curious reader, model details, limitations, and code links appear at the end of this article.

I’ll try to update these predictions throughout the tournament, so check my Twitter/X to follow along.

World Cup 2026

  • Brazil is my model’s slight favorite at 17.5%, edging out Spain (14.5%) and Argentina (11.5%). The South American heavyweights combine elite attacking and defensive ratings, and both drew manageable groups.
  • Spain enters as reigning European champions and actually carries the highest net team rating in the model, but a group containing Uruguay keeps their Group H from being a walkover, which pulls their championship probability slightly below Brazil’s.
  • Argentina (11.5%) are the defending World Cup champions and drew arguably the most favorable group on paper, facing Algeria, Austria, and Jordan in Group J. Don’t expect any surprises there.
  • England (9.2%) remain a top-four contender. Group L also contains Croatia, which sets up a potential rematch of the 2018 World Cup semifinal as early as the group stage.
  • My model is a bit lower on France (7.0%) than the betting markets. Group I — featuring Senegal and Norway — is trickier than it looks, and France’s path to the knockout stage is not a formality.
  • Portugal (6.5%) are rated very highly by the model, perhaps a touch underappreciated by the public. Their Group K draw alongside Colombia is one of the more unusual pairings of the tournament; one of those two sides will likely exit in the group stage.
  • Germany (4.5%), Belgium (4.0%), and Colombia (3.8%) form a credible next tier. All three have realistic paths to the quarterfinals and beyond.
  • Morocco (2.7%) are the model’s top African side but drew an extremely tough Group C alongside Brazil. Getting out of that group would be a significant achievement.
  • The United States (1.4%) has a genuine, if modest, shot at a deep run as one of the three host nations. Group D — with Paraguay, Australia, and Türkiye — is competitive but winnable, and home crowd support could provide an advantage the model only partially captures.

The Model

The model used in these simulations is a slight variation of the Bayesian bivariate Poisson model (BVP) outlined in Equation (2) of (Benz and Lopez, 2021). The BVP modeling framework is as follows. Let \((Y_{ij}, Y_{ik}) \sim BVP(\lambda_{1i}, \lambda_{2i}, \lambda_{3i})\) denote the goals for two teams \(j\) vs. \(k\) in match \(i\). Under this framework, \(Y_{ij} \sim \text{Pois}(\lambda_{1i} + \lambda_{3i})\) and \(Y_{ik} \sim \text{Pois}(\lambda_{2i} + \lambda_{3i})\), where \(\lambda_{3i}\) can be viewed as a correlation term between the expected goals for the two teams in game \(i\). In our previous work, we found that goals are better modeled with \(\lambda_3 = 0\), a finding similar to that of (Groll et al. 2018). Adopting this choice for this work, this reduces to a product of two independent Poisson random variables, which I model as follows:

\[ \begin{aligned} \log(\lambda_1) &= \mu + \alpha_{t[j]} + \delta_{t[k]} + \biggr(\tau_{h}\mathbb{I}(j = \text{home})~+~\tau_n\mathbb{I}(j = \text{neutral})\biggr) \\ \log(\lambda_2) &= \mu + \alpha_{t[k]} + \delta_{t[j]} + \biggr(\tau_{h}\mathbb{I}(k = \text{home})~+~\tau_n\mathbb{I}(k = \text{neutral})\biggr) \\ \end{aligned} \]

In the above,

  • \(\mu\) represents the log goal expectation of an average international team playing away from home
  • \(\alpha\) represents attacking team strengths
  • \(\delta\) represents defensive team strengths
  • \(\tau\) represents location parameters. Nearly all World Cup matches are played at neutral sites, though the three host nations get a small home boost.

Each team is assigned a posterior mean estimate for \(\alpha\) and \(\delta\) by fitting the model in Stan on international match data from 2016 onwards. Teams require a minimum of 20 matches in that window to receive a direct rating; otherwise a weak prior pulls them toward the mean.

As with prior iterations of this model, I use a weighted log-likelihood to account for the fact that international teams don’t play many games and not all games carry equal meaning. Each match is weighted as:

\[ w_i = b_i\exp\biggr(-\frac{T_\max - T_i}{T_\max-T_\min}\biggr) \]

  • \(b_i\) is the base weight for game \(i\): 1 for friendlies, 1.25 for Nations League matches, 2 for continental championships, and 2.5 for World Cup matches.
  • \(T_i\) is the date of game \(i\)
  • \(T_\min\) and \(T_\max\) are the earliest and latest games in the sample

The \(\exp(\cdot)\) term decays weights so that more recent games count more than older ones. The parameters \(T_\min\) and \(b_i\) are tunable and chosen somewhat arbitrarily here; cross-validation would be the right approach for optimizing them with more time. I fit the model in Stan — see the code linked below for prior choices and other fitting details.

Limitations

Modeling international soccer is genuinely hard. A few of the challenges worth keeping in mind when interpreting these predictions:

Limited sample sizes. International teams play far fewer matches than club sides — on the order of 10–15 competitive games per year, and many fewer for smaller nations. Teams like Curaçao, Cape Verde, or Haiti (all group stage participants) have limited data against quality opposition, so model uncertainty for these sides is high. The predictions treat posterior means as point estimates in simulation, which understates true uncertainty.

Friendly game devaluation is imperfect. The weighting scheme discounts friendlies, but the distinction between a “serious” friendly and a competitive warm-up is blurry in practice. Top teams routinely rest starters in friendlies regardless of the opponent or occasion, which means those results carry less signal than the framework assumes. There is no perfect solution here.

Home/neutral site advantage is coarse. The model estimates a single neutral site parameter (\(\hat{\tau}_n \approx 0.22\)) applied to all matches in the United States, Canada, and Mexico equally. In reality, the US team is likely to enjoy something closer to a genuine home advantage given the expected crowd composition, while Canada and Mexico may benefit similarly in games near their borders. A single parameter cannot distinguish these cases.

The 48-team format is unprecedented. This is the first World Cup with 48 teams and the first with a Round of 32. The rule that selects the eight best third-place teams involves a complex cross-group comparison of points, goal difference, and goals scored. There is no historical precedent to validate whether simulation of this stage works as expected at scale; the tiebreaker rules introduce combinatorial complexity that can produce counterintuitive results.

Tournament-specific form is unobserved. The model is fit on pre-tournament data and treats those ratings as fixed throughout group play. In reality, injuries, player availability, tactical adaptations, and squad peaking (or fatigue from a long club season) all matter enormously in tournament soccer — and none of it is captured here.

Within-game correlation is ignored. Setting \(\lambda_3 = 0\) treats each team’s goals as independent within a match. Empirically this is well-supported, but it misses real within-game dynamics: a team trailing by a goal tends to take more risks, which can inflate scoring for both sides. The practical effect on tournament-level predictions is likely small but non-zero.

Avatar
Luke Benz
Research Advisor, Statistics

Related