Instructions

The document provides guidelines and a problem statement for building a model to predict the outcome of football matches from past data. The goal is to predict whether the home team will win, lose, or draw each match in the 2017-18 season. It also outlines the data provided and the expectations for model submissions.

Uploaded by

rajdeep.jzs

Guidelines

At Sahaj, we strive to build high-quality software that has strong aesthetics (is
readable and maintainable), has an extensive safety net to safeguard quality, handles
errors gracefully, and works as expected without breaking down.

We are looking for people who combine data science knowledge with the pragmatism
needed to deliver an implementation to business production environments. The data
scientist should understand the domain and build models that can deal with
real-life constraints.

The following is a list of things to keep in mind before you submit your solution, to
ensure that your model reflects the attributes we are looking for:

● Have you understood all the variables in the data?
● Have you followed best practices to make sure your model is robust?
● Have you thought about how the model would evolve, over a period of time,
in production, when more data is available?
● Have you made an effort to make your code readable and robust?
● Have you thought about the biases that can be there in the data?

Problem Statement

Your goal is to build a model to predict the outcome of a football match, given data for the past 9
years. All the football matches from 2009 to 2017 are covered in the dataset.

Based on the dataset provided, your goal is to come up with an optimal solution to
predict whether the Home Team will win, lose, or draw a game (column name FTR) in the
2017-18 season.

We are looking to compare multiple approaches and choose the one that performs the best.
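
When comparing approaches, it can help to have a trivial reference point to beat. The following is a minimal sketch, assuming the FTR labels are available as plain lists of "H", "D", and "A"; the function names and the toy labels are illustrative and are not part of the provided dataset.

```python
from collections import Counter


def majority_class_baseline(train_labels):
    """Return the most frequent FTR label seen in the training labels."""
    return Counter(train_labels).most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of matches whose predicted FTR equals the actual FTR."""
    if not y_true:
        raise ValueError("y_true must not be empty")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Toy labels for illustration only; not taken from the real dataset.
train_ftr = ["H", "H", "D", "A", "H", "A"]
test_ftr = ["H", "D", "A", "H"]

# Always predict the label that was most common in training.
baseline_label = majority_class_baseline(train_ftr)
baseline_preds = [baseline_label] * len(test_ftr)
baseline_acc = accuracy(test_ftr, baseline_preds)
```

Any candidate model can then be scored with the same `accuracy` function on the same held-out matches, so the comparison between approaches stays like for like.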

Use of external data sources is encouraged, with one caveat. The actual results of
these matches can easily be downloaded from the web, but this problem statement is
intended for fun and learning, so using them would defeat the purpose. You can choose
to enrich the dataset with other publicly available European football league datasets,
e.g. http://www.football-data.co.uk/ or http://football-data.mx-api.enetscores.com/

The train and test datasets are not randomly sampled.
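
Because the split is not random, local validation will probably be more representative if it is also split by time rather than shuffled. The following is a minimal sketch, assuming each match is a dict whose "Date" value has already been parsed into a `datetime.date`; the cutoff date, the team names, and the function name are hypothetical.

```python
from datetime import date


def chronological_split(matches, cutoff):
    """Split matches by date: train is strictly before cutoff, test is on or after it."""
    train = [m for m in matches if m["Date"] < cutoff]
    test = [m for m in matches if m["Date"] >= cutoff]
    return train, test


# Toy rows for illustration only; team names and dates are made up.
matches = [
    {"Date": date(2016, 9, 10), "HomeTeam": "Team A", "AwayTeam": "Team B", "FTR": "H"},
    {"Date": date(2017, 3, 4), "HomeTeam": "Team B", "AwayTeam": "Team C", "FTR": "D"},
    {"Date": date(2017, 8, 12), "HomeTeam": "Team C", "AwayTeam": "Team A", "FTR": "A"},
    {"Date": date(2017, 12, 2), "HomeTeam": "Team A", "AwayTeam": "Team C", "FTR": "H"},
]

# Assumed cutoff: everything before July 2017 is used for training.
train, test = chronological_split(matches, cutoff=date(2017, 7, 1))
```

With this kind of split, no match in the training portion is later than any match in the held-out portion, which mirrors how the model would be used on a future season.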

About the Data
The data is collected from http://www.football-data.co.uk/ and covers several leagues.

Column Details

Name      Description
HomeTeam  Home Team
AwayTeam  Away Team
FTR       Full-Time Result (H=Home Win, D=Draw, A=Away Win)
HTHG      Half Time Home Team Goals
HTAG      Half Time Away Team Goals
HS        Home Team Shots
AS        Away Team Shots
HST       Home Team Shots on Target
AST       Away Team Shots on Target
AC        Away Team Corners
HF        Home Team Fouls Committed
AF        Away Team Fouls Committed
HC        Home Team Corners
HY        Home Team Yellow Cards
AY        Away Team Yellow Cards
HR        Home Team Red Cards
AR        Away Team Red Cards
Date      On which day the match was played
league    Under which league the match was played
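
Most of these columns describe what happened during a match, so they would not be known before kick-off for a game being predicted. One possible way to use them is to summarise each team's earlier matches instead. The following is a minimal sketch, assuming the matches are already sorted by Date and represented as dicts keyed by the column names above; the `add_prior_form` helper, the window size, and the output field names are hypothetical.

```python
from collections import defaultdict, deque


def _mean(values):
    """Mean of a sequence, or None when the team has no earlier matches."""
    return sum(values) / len(values) if values else None


def add_prior_form(matches, window=5):
    """Attach each team's mean shots on target over its previous `window` matches.

    `matches` must be sorted by date so that only earlier games feed each row.
    """
    history = defaultdict(lambda: deque(maxlen=window))
    rows = []
    for m in matches:
        home, away = m["HomeTeam"], m["AwayTeam"]
        rows.append({
            **m,
            "home_prior_sot": _mean(history[home]),
            "away_prior_sot": _mean(history[away]),
        })
        # Update the history only after this match's features are fixed.
        history[home].append(m["HST"])
        history[away].append(m["AST"])
    return rows


# Toy rows for illustration only; values are made up.
toy_matches = [
    {"HomeTeam": "A", "AwayTeam": "B", "HST": 4, "AST": 2},
    {"HomeTeam": "B", "AwayTeam": "A", "HST": 6, "AST": 3},
    {"HomeTeam": "A", "AwayTeam": "B", "HST": 5, "AST": 1},
]
featured = add_prior_form(toy_matches, window=5)
```

The same pattern could be applied to shots, corners, fouls, or cards; whether any of these summaries actually helps is something to check during validation.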

Expectations from the submission
1. Approach to the dataset.
2. Choice of the model(s).
3. Model validation approach (used locally).
4. Choice of features used for the model creation.
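
For the local validation approach, one option that respects time ordering is an expanding window over seasons. The following is a minimal sketch, assuming each match has been assigned a season label such as "2015-16" derived from its Date; the season labels and the function name are hypothetical, and this is only one of several reasonable schemes.

```python
def expanding_season_folds(seasons):
    """Yield (train_seasons, validation_season) pairs in chronological order.

    Each fold trains on all earlier seasons and validates on the next one.
    """
    ordered = sorted(seasons)
    for i in range(1, len(ordered)):
        yield ordered[:i], ordered[i]


# Hypothetical season labels; labels in this format sort chronologically as strings.
folds = list(expanding_season_folds(["2014-15", "2015-16", "2016-17"]))
```

Averaging a metric across the folds gives a rough idea of how the model might behave as more data becomes available over time.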
