0% found this document useful (0 votes)
3 views

Data+visualisation+.pptx

The document is a lecture on data visualization, emphasizing its importance in making complex data understandable and actionable. It outlines key elements for successful visualizations, tips for creating effective visuals, and various types of charts along with their purposes. Additionally, it discusses popular Python libraries for data visualization, such as Matplotlib and Seaborn, and provides insights from a FIFA 2019 data analysis project.

Uploaded by

prasadbillade622
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Data+visualisation+.pptx

The document is a lecture on data visualization, emphasizing its importance in making complex data understandable and actionable. It outlines key elements for successful visualizations, tips for creating effective visuals, and various types of charts along with their purposes. Additionally, it discusses popular Python libraries for data visualization, such as Matplotlib and Seaborn, and provides insights from a FIFA 2019 data analysis project.

Uploaded by

prasadbillade622
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 30

Course : Python for Analysis

Edit Master text styles


Lecture On : Data visualisation
Tips

Instructor : Santosh Gupta

2
Session Agendas

In this session we will discuss How to OCTOBER


make data visualization more
interesting, dynamic, relevant, and
well-received by diverse audiences
31
3
4

What is Data Visualization?

● Data visualization is the graphical representation of information and data. It is the


process of translating large data sets and metrics into charts, graphs and other
visuals.

● The visual representation of data makes it easier to identify and share real-time
trends, outliers, and new insights about the information represented in the data.

● It allows analysts to properly interpret data.

● Data visualization has the ability to make complex information easily understood
by the audience.

upgrad.com
5

Why data visualization is important


See the big picture Make better decisions

Uncover insights and see patterns Understand and Quickly act


within complex data on decisions.

Present meaningful data Democratize your data

Share insights with others Provide one source of truth for


in an easy-to-understand form. your entire organization.

upgrad.com
6
Four elements of successful visualizations

-Venn diagram by David McCandless


upgrad.com
7

● Information (data): The information or data that you are trying to convey is a key
building block for your data visualization. Without information or data, you cannot
communicate your findings successfully.

● Story (concept): Story allows you to share your data in meaningful and interesting
ways. Without a story, your visualization is informative, but not really inspiring.

● Goal (function): The goal of your data visualization makes the data useful and
usable. This is what you are trying to achieve with your visualization. Without a goal,
your visualization might still be informative, but can’t generate actionable insights.

● Visual form (metaphor): The visual form element is what gives your data
visualization structure and makes it beautiful. Without visual form, your data is not
visualized yet.

All four of these elements are important on their own, but a successful data visualization
balances all four.
upgrad.com
8

Tips to create visualizations


Break down the complexity into steps

Define your intent Understand and Model data, check


with users clean data for visual validity

Identify who can benefit from Look at your data set structure Use basic visual models to see
data visualization. Rely on typologies. Analyze rows and and understand enormous data
simple charts up front. columns for inconsistencies. sets.

upgrad.com
9

Experiment with Test and iterate Refine and implement


structure and style

Draft different design Gather user impressions and Look for bugs and functional
versions. Try a variety of opinions. Use research to errors. Check your
charts. influence visualization method visualizations for
iterations and justify changes. inconsistencies.

upgrad.com
10

Some design principles for creating effective visuals

Principle Description

Choose the right visual Decide which visual will be the most effective for your audience.
Sometimes, a simple table is the best visualization. Other times,
you need a more complex visualization to illustrate your point.

Optimize the data-ink ratio The data-ink entails focusing on the part of the visual that is
essential to understanding the point of the chart. Try to minimize
non-data ink like boxes around legends or shadows to optimize the
data-ink ratio.

Use orientation effectively Make sure the written components of the visual, like the labels on a
bar chart, are easy to read. You can change the orientation of your
visual to make it easier to read and understand.

upgrad.com
11

Principle Description

Color There are a lot of important considerations when thinking about


using color in your visuals. These include:
● using color consciously and meaningfully, staying consistent
throughout your visuals
● being considerate of what colors mean to different people
● using inclusive color scales that make sense for everyone
viewing them.

Numbers of things Think about how many elements you include in any visual. If your
visualization uses lines, try to plot five or fewer. If that isn’t possible,
use color or hue to emphasize important lines. Also, when using
visuals like pie charts, try to keep the number of segments to less
than seven since too many elements can be distracting.

upgrad.com
12

Types of Charts
Charts are often divided into categories based on their goals, aesthetics or visual features. Here are
a few examples.

Trends

Trend charts represent data along with the


time dimension. Use them mainly to track
changes over periods of time of varying
duration and scale. Examples include line
charts, area charts, histograms and stream
charts.

upgrad.com
13

Comparison

Charts designed for comparison aim to


visualize differences between elements.
Examples include bar charts or bar
graphs, bubble charts and radar charts.

Part to whole

The goal of these charts is to show the inner


subdivision of a value among different categories
or groups. Examples include pie charts, stacked
bar charts, stacked area charts and treemaps.

upgrad.com
14

Correlations

These charts are used for


multidimensional data, for example,
correlation between phone-call duration
and customer satisfaction. Examples
include scatter plots and heatmaps.

Relationships and connections

Charts included in this category represent


hierarchies. They explain the role of an element
within an ecosystem or to observe the inner
nature of a subject in different phases and states
of a process. Examples include alluvial diagrams
and tree diagrams.

upgrad.com
15

Maps

These charts are used for geographical


data, for example, voters by county or
average wage by neighborhood. Examples
include choropleth maps and connecting
lines.

upgrad.com
16

Examples of some charts

Line Chart:

A line chart is used to track changes over short and long periods of time.

upgrad.com
17

Bar Chart:

Bar charts use size to contrast and compare two or more values, using height or lengths to
represent the specific values.

upgrad.com
18

Heatmap:

Similar to bar charts, heatmaps also use color to compare categories in a data set. They are mainly
used to show relationships between two variables and use a system of color-coding to represent
different values.

upgrad.com
19

Pie chart:

The pie chart is a circular graph that is divided into segments representing proportions corresponding
to the quantity it represents, especially when dealing with parts of a whole.

upgrad.com
20

Scatter plot:

Scatter plots show relationships between different variables. Scatter plots are typically used for two
variables for a set of data, although additional variables can be displayed.

upgrad.com
21

Distribution graph:

A distribution graph displays the spread of various outcomes in a dataset.

upgrad.com
Overview 22

upgrad.com
23
Example of an advanced data visualization

upgrad.com
24

Data visualization in Python

Python offers multiple great graphing libraries that come packed with lots of different features.

To get a little overview here are a few popular plotting libraries:

Matplotlib low level, provides lots of freedom

Pandas Visualization easy to use interface, built on Matplotlib

Seaborn high-level interface, great default styles

Plotly can create interactive plots

upgrad.com
25

Matplotlib :

● Matplotlib is the most popular python plotting library.


● It is a low-level library with a Matlab like interface which offers lots of freedom at the
cost of having to write more code.

Pandas Visualization :

● Pandas is an open source high-performance, easy-to-use library providing data


structures, such as dataframes, and data analysis tools.
● Pandas Visualization makes it really easy to create plots out of a pandas dataframe
and series.
● It also has a higher level API than Matplotlib and therefore we need less code for
the same results.

upgrad.com
26

Seaborn :

● Seaborn is a Python data visualization library based on Matplotlib.


● It provides a high-level interface for creating attractive graphs.
● You can create graphs in one line that would take you multiple tens of lines in
Matplotlib.
● Its standard designs are well equipped and it also has a nice interface for working
with pandas dataframes.

upgrad.com
27

FIFA 2019 Data Analysis and Visualization using


Python

We are given the dataset containing FIFA 2019 players information. Let us try to analyse this
dataset and derive some insights.

Dataset in excel format: FIFA_2019_Dataset

Jupyter notebook: fifa-data-visualization.ipynb

upgrad.com
28

FIFA 2019 Data Analysis and Visualization using


Python - Insights
● Most footballers prefer 'right' foot.

● More than about 80% of the football players are internationally very well reputed.

● For most players (about more than 50%) their weak foot is as good as their strong/prefered foot.

● The most played position is 'ST - Striker' followed by 'GK - Goalkeeper' and 'CB - Center Back'.

● Most players falls under the wage category of < 60000 euros. Very few players are paid more
than 100000 euros.

● Most players fall under 6'0 or 5'10 height categories and 150 lbs - 180 lbs weight category.

● Most players participating in FIFA 2019 are from 'England' followed by 'Germany and Spain'.
'Argentina, France and Brazil' also show a good amount of participation.

upgrad.com
29

● Most football players fall under the age group of 20-27 years.

● List of Best Players per each position

● Players from Brazil and Spain is shown to have higher overall scores

● Real Madrid is shown to have players with high average overall scores, followed by Chelsea and
FC Barcelona.

● List of 15 youngest Players from the FIFA 2019

● List of Top 10 Right footed footballers.

● Important features of each position.

● Player stats.

upgrad.com
30

Any Queries?

Thank You!

upgrad.com

You might also like