0% found this document useful (0 votes)
65 views12 pages

Applied Modelling and Visualisation

Applied modeling visualization

Uploaded by

Anurag Verma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
65 views12 pages

Applied Modelling and Visualisation

Applied modeling visualization

Uploaded by

Anurag Verma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

BPP Business School

Coursework Cover Sheet

Please use this document as the cover sheet of for the 1st page of your assessment.

Please complete the below table – the grey columns

Module Name Applied Modelling and Visualisation


Programme Name
Student Reference Number
(SRN)
Assessment Title MAV – Marjanta AirlinesReport – CW3 [S]

Please complete the yellow sections in the below declaration:

Declaration of Original Work:

I hereby declare that I have read and understood BPP’s regulations on plagiarism and that this is my
original work, researched, undertaken, completed and submitted in accordance with the requirements
of BPP School of Business and Technology.

The word count, excluding contents table, bibliography and appendices, is ______ words.

Student Reference Number: __________ Date: ______

By submitting this coursework you agree to all rules and regulations of BPP regarding assessments
and awards for programmes.

Please note that by submitting this assessment you are declaring that you are fit to sit this
assessment.

BPP University reserves the right to use all submitted work for educational purposes and may
request that work be published for a wider audience.

MAV – CW3 [S] (Dec-23)


MSc Management with Data Analytics

Applied Modelling and Visualisation

Coursework Assessment Brief

Submission mode: Turnitin online access

MAV – CW3 [S] (Dec-23)


1. General Assessment Guidance

• Your summative assessment for this module is made up of this 2,500 words submission which
accounts for 100% of the marks
• Please note late submissions will not be marked.
• You are required to submit all elements of your assessment via Turnitin online access. Only
submissions made via the specified mode will be accepted and hard copies or any other digital
form of submissions (like via email or pen drive etc.) will not be accepted.
• For coursework, the submission word limit is 2,500 words. You must comply with the word
count guidelines. You may submit LESS than 2,500 words but not more. Word Count guidelines
can be found on your programme home page and the coursework submission page.
• Do not put your name or contact details anywhere on your submission. You should only put
your student registration number (SRN) which will ensure your submission is recognised in the
marking process.
• A total of 100 marks are available for this module assessment, and you are required to achieve
minimum 50% to pass this module.
• You are required to use only Harvard Referencing System in your submission. Any content which
is already published by other author(s) and is not referenced will be considered as a case of
plagiarism.
You can find further information on Harvard Referencing in the online library on the VLE. You can
use the following link to access this information: http://bpp.libguides.com/Home/StudySupport
• BPP University has a strict policy regarding authenticity of assessments. In proven instances of
plagiarism or collusion, severe punishment will be imposed on offenders. You are advised to
read the rules and regulations regarding plagiarism and collusion in the GARs and MOPP which
are available on VLE in the Academic registry section.
• You should include a completed copy of the Assignment Cover sheet. Any submission without
this completed Assignment Cover sheet may be considered invalid and not marked.

MAV – CW3 [S] (Dec-23)


2. Assessment Brief

Source: https://stock.adobe.com/uk/images/landing-at-sunset/82605693

For this assignment you are working as a Data Analytics Consultant for the Marjanta Airlines and
have been asked to prepare a Consultancy Report based on the airline’s passenger ‘satisfaction’ Data
Set. This report and your findings will be used in a ‘visually appealing’ presentation to the CEO,
Senior Flight personnel and Cabin Crew in the Annual Staff Conference and it has been proposed
some interactive elements will be placed securely on the company intranet.

Summative Submission

You are provided with a set of data MARJANTA_DATA_CW3.csv that summarises the levels of
passenger ‘satisfaction’. The file contains over 103,000 rows of information from the UK National
Airlines database system for the current calendar year. Your objective is to use machine learning
principles to model and visualise key data with a view to helping staff better understand what
factors impacted levels of ‘satisfaction’ for passengers using the airline. Each feature is listed below:

Field Data Description


Ref Number
id Number
Gender TEXT: Male/Female

Satisfied Y = Satisfied
N = Unsatisfied
Age Number

MAV – CW3 [S] (Dec-23)


Age Band

Type of Travel

Class

Flight Distance Number: Distance in Miles


Destination Text: Destination Country Name
Continent

Inflight Wi-Fi service Number rating:


0 to 5 (where 0 is low/poor)
Departure/Arrival time convenient Number rating:
0 to 5 (where 0 is low/poor)
Ease of Online booking Number rating:
0 to 5 (where 0 is low/poor)
Gate location Number rating:
0 to 5 (where 0 is low/poor)
Food and drink Number rating:
0 to 5 (where 0 is low/poor)
Online boarding Number rating:
0 to 5 (where 0 is low/poor)
Seat comfort Number rating:
0 to 5 (where 0 is low/poor)
Inflight entertainment Number rating:
0 to 5 (where 0 is low/poor)
On-board service Number rating:
0 to 5 (where 0 is low/poor)
Leg room service Number rating:
0 to 5 (where 0 is low/poor)
Baggage handling Number rating:
0 to 5 (where 0 is low/poor)

MAV – CW3 [S] (Dec-23)


Check-in service Number rating:
0 to 5 (where 0 is low/poor)
Inflight service Number rating:
0 to 5 (where 0 is low/poor)
Cleanliness Number rating:
0 to 5 (where 0 is low/poor)
Departure Delay in Minutes Number
Arrival Delay in Minutes Number

Your summative submission should be a written report in MSWord format (NOT a PDF file) and
should be at most 2,500 words. It should describe how applied modelling and visualisation can be
used to present summaries of passenger data. Your report will inform a corporate presentation so
should be appropriately tailored to a rich and varied audience consisting of CEO, Senior Flight
personnel and Cabin Crew. You are also required to carry out independent research into the
deferent categories of ‘satisfaction’ and techniques used to analyse and forecast data in your report.

You must complete all the following tasks:

(ILO1 - Formulate innovative data driven solutions to commercial problems)

TASK 1: Develop a data-driven solution to the given scenario (ILO1).

The solution must use two analytical models to predict the scale and accuracy of the airline’s data
using the Python programming language and relevant Python libraries taking into consideration the
following guidance notes.

Task 1 - Data-Driven Solution Guidance notes:


You should provide a data-driven solution that:

✓ Follows an established design methodology (e.g. PPDAC or CRISP-DM or SDLC), including


flowcharts and pseudocode
✓ Performs an Extract, Transform, and Load (ETL) process (including import, clean and prepare the
data for analysis, whilst ensuring that the relevant test, validation and training sets are created).
✓ Performs Exploratory Data Analysis (EDA) with appropriate visualisations
✓ Trains and tests TWO analytical models
✓ Evaluates the models based on your choice of loss function
✓ Produces appropriate visualisations of your results
✓ Describes the solution development process

You should choose two from the following models:

• Logistic regression
• Decision Tree
• Bagging
• Random Forest
• AdaBoost

MAV – CW3 [S] (Dec-23)


• XGBoost
• Artificial neural network
• Another appropriate state-of-the-art algorithm

(ILO2 – Critically evaluate the use of algorithms and model when developing analytical solutions)

Task 2: Critically analyse the two models chosen for your solution in Task 1 (ILO2)

Critically analyse the two models chosen for your solution in Task 1, and in particular, the strengths
and limitations of each model using the guidance notes provided below with references to the
relevant literature.

Task 2 Guidance notes:


Your critical analysis must also include:

✓ An explanation of your chosen loss function


✓ A short discussion of the accuracy metrics
✓ A summary table of the of the accuracy metrics of the two chosen models to support the
selection of the best model

(ILO3 – Critically appraise the concepts, tools and techniques for data visualisation)

Task 3: Communicate your findings supported by several outputs from Task 1 (ILO3)

Communicate your findings supported by several outputs from Task 1, including graphical outputs
such as correlation matrix, heat map, and confusion matrix using the guidance notes provided
below.

Task 3 Guidance notes:


Your critical appraisal should be based on your findings in Task 1, and must also include:

✓ An analysis of how the Exploratory Data Analysis (EDA) output guided your selection of the
analytical models
✓ An explanation of the justification for performing EDA and the use of appropriate descriptive
statistics and visualisations to understand the results of that analysis
✓ A recommendation of the use of one model for sustaining or increasing the rate of ‘satisfaction’

MAV – CW3 [S] (Dec-23)


3. Research and Referencing

Your report should include a list of references used to develop the report and research to support
the suggested approach. The list should use only the Harvard Referencing System as highlighted in
the General Assessment Guidance section of this document. All the figures/tables used in the report
must have captions and, wherever needed, properly referenced, and explained in your submission.

Suggested report format

Cover page (University cover sheet)

Table of Contents

List of Abbreviations (if appropriate)

Introduction (Scope and Background)

Key Factors that impact on passenger ‘satisfaction’

Tasks (with Technical Details and Independent Research)

Recommendations

Next steps

References

Appendix

The sections in bold contribute to the word count of 2,500 words

Adding your pre-run code to your report prior to uploading to TurnItIn

Locate the report file and embed your Pre-run Python notebook. If you are unable to
embed your python notebook in your MS Word document for any reason, you must provide
a shared link to the file. This is easily done within Google Colab by selecting the ‘Share
button’ in the top right-hand corner of the screen:

IMPORTANT: If you do not embed your notebook or provide a link you will lose marks

MAV – CW3 [S] (Dec-23)


4. Marking Guide

Modelling and Visualisation Fail Marginal Fail Pass Merit Distinction High Distinction
0-39% 40-49% 50-59% 60-69% 70-79% 80-100%
30% Formulate data-driven Notebook fails to Notebook correctly loads Notebook correctly loads Notebook correctly loads the Notebook correctly loads the Notebook correctly loads the input
execute, fails to the input data file into a the input data file into a input data file into a Python input data file into a Python data data file into a Python data structure
solutions (ILO1) display the options, or Python data structure. No Python data structure. data structure. Comments structure. The comments in a modular fashion. The comments
halts during execution. comments are given on Comments are given on and explanations are given provided cover technical details of provided cover exceptional technical
the method used. the approach taken. with detail on the extract the extract phase of the project, details of the extract phase of the
Guidelines: Inadequate and often Notebook uses a package phase of the project. demonstrating extensive project, demonstrating extensive
• Adopt an appropriate implicit knowledge to conduct EDA, as well as Notebook correctly knowledge on dataframe imports. knowledge on dataframe imports and
management framework base with some comparisons of the outputs handles duplicate values Notebook handles duplicate their peculiarities.
omissions and/or lack of the appropriate model as well as EDA. values, missing values as Notebook handles duplicate
( e.g. PPDAC or CRISP-DM of theory relating outcomes and metrics but Comments are given. well as descriptive statistics values, missing values and Notebook handles duplicate values,
or SDLC) to the use of ETL with no explanation or explaining the steps taken to explains in detail the steps taken handles missing values, correctly
processes. No comments. The script achieves reach the results. Notebook to reach the results. uses a package to achieve prediction
• Perform an Extract, discussion of prediction for the also achieves prediction for for the future trends and outputs the
Transform, and Load ambiguities, Weak and often implicit ‘satisfaction’ likelihood the ‘satisfaction’ Likelihood Correctly uses a package to appropriate model outcomes, metrics
assumptions or knowledge base with some and also correctly with good explanation and achieve prediction for the as well as an example of the
(ETL) process anomalies. ‘satisfaction’ likelihood and
omissions and/or lack of outputs appropriate comments about the method prediction in action for a new mock
• Perform Exploratory Data model outcomes and used. There are model outputs the appropriate model entries and scenarios. Comments
theory of the use of
Analysis (EDA) Notebook fails metrics with reasonable evaluation metrices outcomes and metrics. provided are profound in detail.
to produce any modelling and visualisation level of commentary and outputted alongside
• Use TWO analytical outputs which can be for a data project (and explanation. predictions. Explanations are detailed and Explain in detail the steps taken to
models for analysis used to communicate relevant code libraries) profound. reach the results with further
• Produce appropriate your findings Notebook correctly uses Notebook correctly uses a explanation of methods to expand
Notebook correctly uses a a package to produce package to produce Notebook correctly uses a the steps taken or process followed.
visualisations of results communication tools, communication package to produce
package to produce
with reasonable tools with good explanation communication tools, with very Also explains rationale behind the
communication tools explanations and and comments about the detailed explanation and methods used.
but does not contain any comments. method used. comments about the model
explanation or output and your chosen method Notebook correctly uses a package
commentary. of communication conveys this. to produce communication tools with
very detailed explanation and
comments about the method
used including examples of similar
practices and suggestions to further
enhance the communication of
results.

MAV – CW3 [S] (Dec-23)


Modelling and Visualisation Fail Marginal Fail Pass Merit Distinction High Distinction
0-39% 40-49% 50-59% 60-69% 70-79% 80-100%
30% Critically evaluate the use ofInadequate and often Weak and often implicit Satisfactory Good knowledge base that Excellent knowledge base that Outstanding knowledge base that
implicit knowledge knowledge base with some knowledge base that explores and analyses the explores and analyses the theory explores and analyses the theory
models, analysing the base with some omissions and/or lack of begins to explore and theory relating to the use of relating to the use of relating to the use of programming
strengths and weaknesses omissions and/or lack theory relating to the use analyse the theory programming for predictive programming for predictive for predictive modelling. Excellent
of theory relating of programming for relating to the use of modelling. Good explanation modelling. Excellent explanation explanation of loss functions,
(ILO2) to the use of predictive modelling. Weak programming for of loss functions, accuracy of loss functions, accuracy accuracy metrices and comparative
programming for explanation of loss predictive modelling. metrices and comparative metrices and comparative strengths of models based on ability
predictive modelling. function, accuracy metrics, Satisfactory explanation strengths of models based strengths of models based on to sustain or increase ‘satisfaction’
Guidelines: No explanation of loss or recommendation of of loss functions, on ability to sustain or ability to sustain or increase rate drawing on the academic
function, accuracy model for sustaining or accuracy metrices and increase ‘satisfaction’ rate ‘satisfaction’ rate drawing on the literature with outstanding originality
metrics, or increasing ‘satisfaction’ comparative strengths of drawing on the academic academic literature with and autonomy at the cutting edge of
• Analyse the strengths and recommendation of rate. models based on ability literature with originality and considerable originality and current scholarship.
limitations of each model model for sustaining to sustain or increase autonomy. autonomy.
or increasing ‘satisfaction’ rate
• Explain chosen loss ‘satisfaction’ rate. drawing on the
function academic literature.

• Discuss the models’


accuracy metrics
• Provide a comparative
table of the accuracy
metrics

MAV – CW3 [S] (Dec-23)


Modelling and Visualisation Fail Marginal Fail Pass Merit Distinction High Distinction
0-39% 40-49% 50-59% 60-69% 70-79% 80-100%
30% Critically using and appraising Inadequate and often Weak and often implicit Satisfactory Good knowledge base that Excellent knowledge base that Outstanding knowledge base that
implicit knowledge knowledge base with some knowledge base that explores and analyses the explores and analyses the theory explores and analyses the theory
data visualisation techniques base with some omissions and/or lack of begins to explore and theory relating to the use of relating to the use of data relating to the use of data
(ILO3). omissions and/or lack theory relating to the use analyse the theory data visualisation. visualisation techniques. visualisation.
of theory relating of data visualisation. relating to the use of
to the use of EDA, There isn’t sufficient data visualisation.
Guidelines: descriptive statistics evidence of useful data The student has presented The student has presented
and data visualisation. visualisations, neither in The student has several appropriate data several high-quality data The student has presented several
There are no data the notebook nor the presented several visualisations, visualisations, excellently outstanding data visualisations,
• Analyse how the EDA visualisations, neither report. appropriate data communicating insights communicating insights visually excellently communicating insights
in the notebook nor visualisations, visually both in the report and both in the report and the visually both in the report and the
output influenced choice the report. There is weak explanation communicating insights the notebook. notebook. notebook.
of the analytical models for performing EDA, visually both in the
The student did not coming up with appropriate report and the notebook. There is good explanation for There is excellent explanation for There is outstanding explanation for
• Justify performing EDA explain the descriptive statistics and performing EDA, appropriate performing EDA, appropriate performing EDA, appropriate
• Evaluate choice of justification for how EDA guides model
performing EDA, did selection.
There is satisfactory
explanation for
descriptive statistics and how descriptive statistics and how
EDA guides model selection. EDA guides model selection.
descriptive statistics and how EDA
guides model selection.
appropriate visualisations not present performing EDA,
appropriate appropriate descriptive There are examples of data
• Recommend one model descriptive statistics statistics and how EDA visualisation techniques at the cutting
based on performance and has not explained guides model selection. edge of industry using a variety of
how EDA guides methods.
model selection.

MAV – CW3 [S] (Dec-23)


Modelling and Visualisation Fail Marginal Fail Pass Merit Distinction High Distinction
0-39% 40-49% 50-59% 60-69% 70-79% 80-100%
10% Academic Research and Inadequate critical Limited critical analysisSatisfactory critical Good critical analysis and/or Excellent critical analysis and/or Outstanding critical analysis and/or
analysis or evaluation and/or evaluation with analysis and/or evaluation evaluation
Referencing Skills with some difficulties. reflection and evaluation. Demonstrates intellectual
evaluation. Good skills. Demonstrates intellectu
Largely imitative and broad evidence-based skills. Demonstrates intellectual originality, integrity,
reflection and solid, well- al originality and
descriptive. Some critique. Solid structure or originality, coherence, creativity and imagination
difficulty with reasoned judgements
argument including line of imagination
integrity, coherence and working consistently in the higher
structuring the line of logical reasoning and forming from evidence-
imagination. cognitive domains to a professional
logical argument and accuracy in expression ofbased critique. Assumptions are clearly
Follow the guidelines given in standard.
accuracy in argument. Consistent logical stated.
Section 3 Research and expression of structure of argument Assumptions are clearly stated.
argument. Limited and full and including the line of Assumptions are clearly stated.
Referencing Good with precise, full and
appropriate references and reasoning and accuracy Excellent with precise, full and
Inadequate reference notes with minor or appropriate references and
in expression of appropriate references and notes Outstanding with precise, full and
s and notes but may insignificant errors notes at a high standard.
argument. at near-publishing standard. appropriate references and notes at
contain publishing standard.
inconsistencies, errors
or omissions. Satisfactory with
precise, full and
appropriate references
and notes.

MAV – CW3 [S] (Dec-23)

You might also like