Arsenal vs West Ham

Arsenal vs West Ham 09/08/2015

Possibly the largest upset of the week came at the Emirates where Arsenal started their title campaign with a home loss to West Ham. A quick look at the game statistics:

In [1]:
import league_analysis
epl = league_analysis.year_201516.epl_league
league_analysis.display_given_matches([epl.get_game('Arsenal', 'West Ham', '09/08/15')])
Home Away
Team Arsenal West Ham
Goals 0 2
Shots 22 8
SOT 6 4

So Arsenal were pretty dominant in shots, 22 vs 8. They did seem to have trouble converting those shots to shots on target though. We can look at all of the matches in the leagues we have data from (Eng top four leagues and Scottish Premiership), to see how often the team that has fewer shots wins the match:

In [2]:
all_matches = list(league_analysis.get_all_matches(years=league_analysis.all_years))
num_matches = len(all_matches)
def shots_winner_wins(match):
    return ((match.FTR == 'H' and match.HS > match.AS) or
            (match.FTR == 'A' and match.AS > match.HS))
def shots_loser_wins(match):
    return ((match.FTR == 'H' and match.HS < match.AS) or
            (match.FTR == 'A' and match.AS < match.HS))
more_shots = league_analysis.count_matches(shots_winner_wins, all_matches)
fewer_shots = league_analysis.count_matches(shots_loser_wins, all_matches)
draws = league_analysis.count_matches(lambda m: m.FTR == 'D', all_matches)
league_analysis.display_pairs([('more shots', more_shots),
                               ('fewer shots', fewer_shots),
                               ('all wins', num_matches - draws),
                               ('draws', draws),
                               ('total number of matches', num_matches)])
more shots 4979
fewer shots 2982
all wins 8432
draws 3089
total number of matches 11521

So, unsurprisingly, the team that takes the larger number of shots is more likely to win, but it is only a ratio of approximately 5:3 (some games the teams have equal shot numbers). So it is not that uncommon for a team that is out-shot to win. However, many of these games may be cases in which the difference was small, so we check on how often a team is out-shot by the margin that Arsenal out-shot West Ham by, and how often such a team loses the match:

In [3]:
def get_win_shots(match):
    if match.FTR == 'H':
        return match.HS
    elif match.FTR == 'A':
        return match.AS
    else:
        return None
def get_lose_shots(match):
    if match.FTR == 'H':
        return match.AS
    elif match.FTR == 'A':
        return match.HS
    else:
        return None
In [4]:
out_shot_win = 0
out_shot_draw = 0
out_shot_lose = 0
for match in all_matches:
    if match.FTR == 'D':
        if match.HS - match.AS >= 14 or match.AS - match.HS >= 14:
            out_shot_draw += 1
    else:
        win_shots = get_win_shots(match)
        lose_shots = get_lose_shots(match)
        if win_shots - lose_shots >= 14:
            out_shot_win += 1
        elif lose_shots - win_shots >= 14:
            out_shot_lose += 1

all_out_shots = out_shot_win + out_shot_draw + out_shot_lose
league_analysis.display_pairs([('A team wins shots by 14 or more', all_out_shots),
                               ('Won shots by 14 but lost', out_shot_lose),
                               ('Won shots by 14 and won', out_shot_win),
                               ('Won shots by 14 but draw', out_shot_draw)])
A team wins shots by 14 or more 675
Won shots by 14 but lost 118
Won shots by 14 and won 416
Won shots by 14 but draw 141

So in only 675 of the 11521 games did one team out-shoot the other by 14 shots or more. In 118 of those cases did the team that took the most shots actually lose the game. By contrast there are 416 games in which such a dominant shots team wins the game. I'm a little surprised there are not a larger number of dominant displays where a team really bosses the shots and wins the game. Similarly to the teams that win the shots battle at all, when a team wins the shots battle by as much as Arsenal did at the weekend, they are favourites to win the game, the ratio of wins to losses is only approximately 4:1. Good, but not guaranteed.

To be clear, I'm not terribly surprised by the 118 number. When a significantly superior team finds themselves behind in a game they tend to ramp up their urgency and hence shots. So it is not surprising to find the odd game in which such a team just does not find away to score. The 118 number represents less than 1% of the games.

I am surprised that less 4% of games result in a team bossing the shots by 14 or more and reaping the benefits of this. I guess this demonstrates that once a team obtains a lead (or possibly a comfortable lead) they don't have much adventure about them. Or perhaps they pass up shooting opportunities in the hope of cultivating better chances, whilst hopefully whittling down the clock.

This might tell us something about score-effects. That is, that score effects are rather important.

Comments

Comments powered by Disqus