Statistics Sample
Level 7
Assessment title: Statistical Analysis and Interactive Dashboard Design
Weighting within module: This assessment is worth 100% of the overall module mark.
How to submit
Your assessment should be submitted through Blackboard in two parts: first, a single PDF
report (between 5,000 and 7,000 words), and second, a zipped file containing the R code
and the Power BI dashboard. Please check that:
1. Your report is named “your name.pdf”, for example “John Smith.pdf”.
2. The zip file is valid and can be opened.
3. The zip file contains clearly labelled, fully working versions of the R code and the
Power BI dashboard, with a clearly written description of each application and its use
in a “Read Me.txt” file. Your dashboard should be shared as a .pbix file.
Assessment task details and instructions:
Your task is to demonstrate your newly developed knowledge and understanding of data
handling, validation, statistical analysis, and visualisation by exploring and presenting data
from an extensive and complex data set.
The dataset for this assignment can be accessed from one of these two sources of your
choice:
https://databank.worldbank.org/source/world-development-indicators
http://data.un.org/Explorer.aspx
Once you have followed the above links, you can download the dataset by selecting the
countries and variables and years you want to work with.
Assessment Tasks
The assessment has two tasks: statistical analysis and interactive dashboard building. You
should complete both tasks and also carry out the other necessary steps set out
in the “Report Writing Frame” document.
For your first task at the organisation, you are given some research objectives (here, you will
propose these objectives yourself). You should apply appropriate statistical analysis
methods to fulfil these objectives.
For your second task at the organisation, you have been asked to select some indicators
(using the data sources above) which you believe tell a significant story, and to produce a
single-screen interactive dashboard to present this data. For example, it could be to
compare the trade situation of the least developed countries with developed countries.
Your dashboard is to be made publicly available on their website, so you should consider
how you can present the data to a general audience who may not have existing expertise in
the subject you choose.
4.2. Do a correlation analysis for the indicators and evaluate the results in the context
of your stated objectives.
4.3. As a researcher, define at least two hypotheses related to the objectives
and test them.
4.4. Do regression analysis. Explain why the selected regression techniques are
appropriate for the selected variables and defined objectives and show if you’ve
found any similar research in the literature.
4.5. Do time series analysis. Explain why the selected techniques are better for the
defined objectives and show if you’ve found any similar research in the literature.
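As a rough illustration only, and not a model answer, tasks 4.2 to 4.5 might be started in R along the following lines. The data frame, its column names (`year`, `gdp_growth`, `trade_share`) and all of its values are invented for this sketch and do not come from the World Bank or UN sources. The particular test and models shown are placeholders, and you are expected to choose and justify your own.

```r
# Hypothetical toy data: names and values are invented for illustration only.
indicators <- data.frame(
  year        = 2010:2019,
  gdp_growth  = c(2.1, 2.4, 2.2, 2.9, 3.1, 3.0, 3.4, 3.8, 3.7, 4.1),
  trade_share = c(40, 42, 41, 45, 47, 46, 49, 52, 51, 55)
)

# 4.2 Correlation analysis: Pearson correlation with a significance test.
cor_result <- cor.test(indicators$gdp_growth, indicators$trade_share,
                       method = "pearson")
print(cor_result$estimate)

# 4.3 Hypothesis test: one-sample t-test of H0 "mean GDP growth equals 3".
# The value 3 is an arbitrary assumption made for this sketch.
t_result <- t.test(indicators$gdp_growth, mu = 3)
print(t_result$p.value)

# 4.4 Regression: simple linear model of trade share on GDP growth.
fit <- lm(trade_share ~ gdp_growth, data = indicators)
print(summary(fit)$coefficients)

# 4.5 Time series: annual series and a random-walk ARIMA(0,1,0) baseline.
trade_ts <- ts(indicators$trade_share, start = 2010, frequency = 1)
arima_fit <- arima(trade_ts, order = c(0, 1, 0))
trade_forecast <- predict(arima_fit, n.ahead = 3)
print(trade_forecast$pred)
```

All functions used here belong to the base `stats` package, so no additional packages are needed to run the sketch. A random-walk model is only a baseline; the report should compare and justify more suitable regression and time series techniques.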
1. Clearly define the objectives of the dashboard based on the dataset you have selected.
2. Based on the objectives, select at least 10 suitable countries of your choice.
3. Produce a single-screen interactive dashboard of at least 10 countries’ data.
4. Clear, effective presentation of all factors in a coherent, intuitively comprehensible form,
reflecting the objectives you have set for your dashboard.
5. A design applicable to the full range of countries presented in the dataset without
modification to the dashboard form or structure (i.e., the dashboard should support a
side-by-side comparison of multiple countries and/or financial years).
Alongside the dashboard design, you should provide a full report which summarises:
• The objectives you have defined for your dashboard, indicating clearly what your
planned solution will communicate to your audience.
• The data visualisation principles which have informed your dashboard design, with
reference to literature and best practice in data visualisation.
• The steps you have taken to pre-process and prepare the data.
• An overview of your design with a full justification of the design rationale.
For extra credit you should also implement the following advanced features in your
dashboard design:
• Use of DAX
• Use of relationships in your data model
• Use of hierarchies, grouping or binning
• Use of in-built Power BI tools
To receive extra credit, these features must be fully documented in your accompanying
report.
Remarks:
1. Use similar datasets and objectives for both tasks, although you can select
different objectives for each task if you prefer.
2. You must use R programming language for the entire statistical analysis part (task 1).
3. Based on the data that you have downloaded from the given sources, you can mix
different data sets / variables to make your own data set in a meaningful and correct
format.
4. You must include a screenshot of the final dashboard in the PDF report (task 2).
Assessed intended learning outcomes:
1. Analyse a data science project to devise a structure for its implementation, analysis,
and evaluation, justifying any decisions made.
2. Critically assess the relative strengths and uses of a range of statistical analysis
techniques (including t-tests, ANOVA, various regression models and categorical data
analysis, test of hypothesis, and time series analysis).
3. Present and visualise the statistical results, analysing key findings.
4. Evaluate the quality of graphs according to their expressiveness and effectiveness.
1. Understand the history and context of data science ethics, skills, challenges, and
methodologies the term implies.
2. Learn how to work with a real-world dataset that may lie outside your domain
expertise and of which you have no prior knowledge or understanding.
3. Develop skills in presenting quantitative data using appropriate displays, tabulations,
and summaries.
4. Understand the nature of sampling variation and the role of statistical methods in
developing and testing hypotheses.
5. Select and use appropriate statistical methods in the analysis of complex datasets.
6. Present findings based on statistical analysis in a clear, concise, and understandable
manner.
7. Select the proper visualization methods for a given data analysis and presentation
problem.
Module Aims
The module is focused on the underpinning knowledge and practical skills needed for
working within the data sciences industry.
Feedback arrangements
You can expect to receive individual feedback in the form of an annotated marking
matrix with specific comments for each section, general comments for the work and up
to 3 specific areas for improvement.
Support arrangements
You can obtain support for this assessment by contacting Dr Kaveh Kiani or Nathan
Topping for the technical aspects of the module. Further support can be obtained from
the university as follows:
askUS
The University offers a range of support services for students through askUS.
Good Academic Conduct and Academic Misconduct
Students are expected to learn and demonstrate skills associated with good academic
conduct (academic integrity). Good academic conduct includes the use of clear and correct
referencing of source materials. Here is a link to where you can find out more about the
skills which students require: http://www.salford.ac.uk/skills-for-learning.
Academic Misconduct is an action which may give you an unfair advantage in your
academic work. This includes plagiarism, asking someone else to write your assessment
for you or taking notes into an exam. The University takes all forms of academic
misconduct seriously. You can find out how to avoid academic misconduct here
https://www.salford.ac.uk/skills-for-learning.
Assessment Information
If you have any questions about assessment rules, you can find out more here.
Personal Mitigating Circumstances
If personal mitigating circumstances may have affected your ability to complete this
assessment, you can find more information about the personal mitigating circumstances
procedure here.
Personal Tutor/Student Progression Administrator
If you have any concerns about your studies, contact your Personal Tutor or your
Student Progression Administrator.
Assessment Criteria
It would be best to look at the assessment criteria to determine what we are explicitly
looking at during the assessment.
Reassessment
If you fail your assessment and are eligible for reassessment, you will be allowed to re-do
the assignment based on the feedback given. The submission date for this will follow the
university’s reassessment calendar and routines.
Assessment Rubric
Statistical Analysis Description:
➢ ... series analysis has been done (2 advanced regression and 2 time series models).
➢ Correct predictions have been made based on 4 models.
➢ Outstanding comparative analysis of the hypothesis testing, and more than 2 hypothesis
tests have been included.
➢ All the results consist of highly precise and well-explained statements for both technical
and non-technical audiences.
Data Visualization Description:
• Detailed and thorough definition and justification of bespoke data representations
that define appropriate data-centric displays and features.
• Clear and consistent format and layout with a reasoned and justified perceptive and
cognitive feature set throughout.
• A highly objective-focused representation that presents all evidence and draws the
conclusion for the task objective.
• A detailed and thorough critical review of the proposed data visualization with
consideration of the task and matching of the ...
Distinction
80 - 89
Statistical Analysis Description:
o Correct correlation, regression and time series analysis has been done (2 advanced
regression models and 1 time series model).
o Correct predictions have been made based on 3 models.
o Excellent comparative analysis of the hypothesis testing, and more than 2 hypothesis
tests have been included.
o All the results consist of precise and well-explained statements for both technical and
non-technical audiences.
Data Visualization Description:
• Definition and justification of bespoke data representations that define appropriate
data-centric displays and features.
• Clear format and layout with a justified perceptive and cognitive feature set
throughout.
• An objective-focused representation that presents all evidence and draws the
conclusion for the task objective.
• A critical review of the proposed data visualization with consideration of the task
and matching of the form presented to the task objectives.
• Use of the additional features mentioned in the brief.
Assessment Information/Brief
Scale Mark Rank Statistical Analysis Description Data Visualization Description
Very Good
Pass
Statistical Analysis Description:
• Aim and objectives have been defined and adequately explained.
• More than minimum sample size has been used, and the approach for selecting this
sample has been justified.
• Advanced consideration of data preparation.
• Besides general preparation, handling missing data and outlier detection algorithms have
been utilized.
• In depth descriptive statistical analysis has ...
Data Visualization Description:
• Detailed consideration of comparative analysis presented within the visual
representation for a small selection of countries.
• Detailed consideration of a common perceptual model and justification of this in a
cognitive context.
• Refined representation of data using tailored representational forms that extend and
refine the basic offering of the packages
Unsatisfactory
40 - 49
Statistical Analysis Description:
• Aim and objectives have not been defined properly.
• Less than minimum sample size has been used.
• Minimum data preparation.
• Unsatisfactory descriptive statistical analysis has been provided.
• Unsatisfactory R analysis steps.
• Unsatisfactory correlation, regression and time series analysis has been done (some of
the models are wrong).
• Requested test of hypothesis has not been included.
• Results have not been explained.
Data Visualization Description:
• Functional representation of raw data based on standard representations with minimal
modification of the attributes.
• Indirect representation of data with minimal analysis or pre-preparation.
• Little justification of approaches and principles applied.
• A consistent but basic report that shows how general principles and approaches have been
used to define a coherent presentation.
• Task focused presentation that considers the objectives set but does not justify rationally.
• A basic report that considers the functional layout and data representation without
justification against human perception and/or cognition.
• Little or no supporting research of visual form decisions based on unstructured and
validated web-based presentations of data forms.
Fail
Inadequate
Statistical Analysis Description:
• Aim and objectives have not been defined.
• Less than minimum sample size has been used.
• Minimum data preparation.
• Inadequate statistical analysis has not been ...
• Inadequate R analysis steps.
• Inadequate correlation, regression and time series analysis has been done (most of the
models are wrong).
• Requested test of hypothesis has not been included.
• Results have not been explained.
Data Visualization Description:
• Less than the minimum number of countries have been visualised.
• Functional data representations using basic forms with minimum modification.
• Inconsistent/incoherent report that has little ...
• Little attempt to address the task focus or set clear objectives for the dashboard.
Poor / Very poor / Extremely poor
0 - 29
The submitted assessment report diverges significantly from the provided assessment brief,
showing a marked lack of alignment between the given guidelines and the content presented.
The report appears to lack even the basic elements outlined in the brief, deviating from the
specified criteria and objectives.