L7 - DS304 Visualization

The document discusses various methods for visualizing time series data, emphasizing the importance of maintaining the inherent order of time in data points. It covers different visualization techniques such as line graphs, connected scatterplots, and smoothing methods like moving averages, highlighting their advantages and potential pitfalls. Additionally, it addresses the significance of window size in moving averages and introduces weighted moving averages as a way to assign importance to recent data points.

Uploaded by

deamonking1234king

Visualization

Siddharth R
Visualizing Time Series Data
● Data points have an
inherent order of “time”
● Preferred chart type for
visualizing time series data?
● Data: Monthly submissions
to the preprint server
bioRxiv, from November
2013 until April 2018.
● How does this differ from a
scatterplot?
Visualizing Time Series Data
● Here, the dots are spaced evenly
along the x axis, and there is a
defined order among them.
● Each dot has exactly one left and
one right neighbor (except the
leftmost and rightmost points, which
have only one neighbor each).
● We can visually emphasize this
order by connecting neighboring
points with lines. The resulting
chart is called a line graph.
Visualizing Time Series Data
● Do we really need to draw lines
between the data points?
● Do the lines correspond to
made-up data?
● Which is more important, the points
or the lines?
Visualizing Time Series Data
● Using lines to represent time series
is nevertheless generally accepted
practice, and frequently the dots are
omitted altogether
● Without dots, the figure places more
emphasis on the overall trend in the
data and less on individual
observations
● In general, the denser the time
series, the less important it is to
show individual observations with
dots.
Visualizing Time Series Data
● We can also fill the area under the
curve with a solid color
● It visually separates the area above
the curve from the area below
● It is only valid if the y axis starts
at zero, so that the height of the
shaded area at each time point
represents the data value at that
time point
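To see why the zero baseline matters, the small sketch below compares how tall two shaded regions appear relative to each other under different baselines. The function name `shaded_height_ratio` and the values 90 and 100 are made up for illustration only.

```python
def shaded_height_ratio(a, b, baseline):
    """Ratio of the shaded heights drawn for values a and b above a given baseline."""
    return (a - baseline) / (b - baseline)

# With the y axis starting at zero, the shaded heights keep the true 90:100 proportion.
ratio_zero_baseline = shaded_height_ratio(90, 100, baseline=0)

# With the y axis starting at 80, the same values look like a 1:2 proportion.
ratio_truncated_axis = shaded_height_ratio(90, 100, baseline=80)

print(ratio_zero_baseline, ratio_truncated_axis)
```

With a truncated axis the filled area exaggerates the difference between the two values, which is why the fill is only appropriate when the axis starts at zero.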
Multiple Time Series
● If we want to show the monthly submissions to multiple preprint servers, a
scatterplot is not a good idea, because the individual time courses run into
each other
Multiple Time Series
Multiple Time Series - With more than one response variable
● For example, we may be interested
to find the change in house prices
from the previous 12 months as it
relates to the unemployment rate.
We may expect that house prices
rise when the unemployment rate is
low, and vice versa
● We can visualize such data as two
separate line graphs stacked on top
of each other
Twelve-month change in house prices (a) and
unemployment rate (b) over time, from January 2001
through December 2017.
Multiple Time Series
● As an alternative, we can plot the two
variables against each other, drawing a
path that leads from the earliest time
point to the latest
● This is called a “connected scatterplot”,
because we are technically making a
scatterplot of the two variables against each
other and then connecting neighboring points.
● What is missing here?
Multiple Time Series
● When drawing a connected scatterplot, it is
important that we indicate both the direction
and the temporal scale of the data.
● A gradual darkening of the color can indicate
direction. Alternatively, one could draw
arrows along the path
● In a connected scatterplot, you can find
correlated (positive and negative)
movement between the two variables.
● If the two variables have a somewhat cyclic
relationship, we will see circles or spirals in
the connected scatterplot.
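One way to prepare data for a connected scatterplot is sketched below: the observations are ordered by time, neighboring points are joined into segments, and each segment gets a shade that darkens toward the latest time point. The function name `connected_path`, the 0-to-1 shade scale, and the three data points are assumptions made for this illustration; the actual drawing would be handed to a plotting library.

```python
def connected_path(times, x, y):
    """Order (x, y) pairs by time and attach a shade to each connecting segment.

    Shade runs from 0.0 (earliest, lightest) to 1.0 (latest, darkest).
    """
    order = sorted(range(len(times)), key=lambda i: times[i])
    n = len(order)
    points = [(x[i], y[i]) for i in order]
    shades = [k / (n - 1) if n > 1 else 1.0 for k in range(n)]
    # Each segment joins two neighbors in time and carries the shade of its end point.
    segments = list(zip(points[:-1], points[1:], shades[1:]))
    return points, segments

# Made-up values, deliberately given out of time order.
years = [2003, 2001, 2002]
unemployment = [6.0, 4.2, 5.8]
house_price_change = [7.0, 9.5, 8.1]

points, segments = connected_path(years, unemployment, house_price_change)
for start, end, shade in segments:
    print(start, "->", end, "shade", shade)
```

Sorting by time before connecting is the step that distinguishes this from an ordinary scatterplot, where the order of the points carries no meaning.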
Multiple Time Series
● Separate line graphs tend to be easier to read, but patterns such as
cyclical relationships are very hard to spot in them.
● Once people are used to connected scatterplots, they may be able to extract
certain patterns (such as cyclical behavior with some irregularity) that can be
difficult to spot in line graphs.
● Research reports that readers are more likely to confuse order and direction
in a connected scatterplot than in line graphs
● Connected scatter plots seem to result in higher engagement, and thus such
plots may be effective tools to draw readers into a story
Visualizing Trends
● When making scatter plots or time series, we are often more interested in the
overarching trend of the data than in the specific detail of where each individual
data point lies.
● There are two fundamental approaches to determining a trend:
○ Smoothing is useful for uncovering general patterns in data without
assuming a predefined mathematical relationship, making it ideal for datasets
with a lot of noise or fluctuations.
○ Curve fitting, on the other hand, assumes that a specific function can
describe the trend and provides a mathematical model that can be used for
prediction and deeper analysis.
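As a minimal sketch of the curve-fitting approach, the code below assumes the trend is a straight line y = a + b·t and estimates a and b by ordinary least squares. The choice of a linear function, the name `fit_line`, and the data are assumptions for illustration; other functional forms are fitted in the same spirit.

```python
def fit_line(t, y):
    """Ordinary least-squares fit of y = intercept + slope * t."""
    n = len(t)
    t_mean = sum(t) / n
    y_mean = sum(y) / n
    covariance = sum((ti - t_mean) * (yi - y_mean) for ti, yi in zip(t, y))
    variance = sum((ti - t_mean) ** 2 for ti in t)
    slope = covariance / variance
    intercept = y_mean - slope * t_mean
    return intercept, slope

# Made-up noisy observations scattered around the line 1 + 2t.
t = [0, 1, 2, 3, 4]
y = [1.1, 2.9, 5.0, 7.1, 8.9]

intercept, slope = fit_line(t, y)
prediction = intercept + slope * 5  # the fitted model can extrapolate to t = 5
print(intercept, slope, prediction)
```

Unlike a smoothed curve, the fitted model is a formula, so it can be evaluated at time points that were never observed.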
Smoothing
● Captures key patterns in the data while removing irrelevant minor detail or
noise
● Reduces the impact of outliers or sudden changes
● Common approaches
○ Simple Moving Averages
○ Weighted Moving Averages
○ LOESS
Moving Averages
● This method relies on the notion that observations close in time are likely
to have similar values. Consequently, the averaging removes random
variation, or noise, from the data.
● Averages the values within a sliding window of fixed size
● Types:
○ One sided moving average
○ Centered moving average
Types of Moving Average: One sided vs Centered
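One-sided moving averages include the current and
previous observations in each average. For example, the
formula for a one-sided moving average (MA) of X at time t
with a length of 7 is the following:

$$MA_t = \frac{X_t + X_{t-1} + X_{t-2} + X_{t-3} + X_{t-4} + X_{t-5} + X_{t-6}}{7}$$

Centered moving averages include both the previous and
the future observations that surround the current one. They
are also known as two-sided moving averages. The
formula for a centered moving average of X at time t
with a length of 7 is the following:

$$MA_t = \frac{X_{t-3} + X_{t-2} + X_{t-1} + X_t + X_{t+1} + X_{t+2} + X_{t+3}}{7}$$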
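The two variants can be sketched in plain Python as follows. The function names, the use of `None` for positions where the window does not fit, and the toy series 1 to 10 are assumptions made for this illustration.

```python
from typing import List, Optional

def one_sided_ma(x: List[float], window: int = 7) -> List[Optional[float]]:
    """Average of the current value and the (window - 1) values before it."""
    out: List[Optional[float]] = []
    for t in range(len(x)):
        if t < window - 1:
            out.append(None)  # not enough history yet
        else:
            out.append(sum(x[t - window + 1 : t + 1]) / window)
    return out

def centered_ma(x: List[float], window: int = 7) -> List[Optional[float]]:
    """Average of the current value and (window - 1) / 2 values on each side."""
    if window % 2 == 0:
        raise ValueError("this sketch only handles odd window lengths")
    half = window // 2
    out: List[Optional[float]] = []
    for t in range(len(x)):
        if t < half or t + half >= len(x):
            out.append(None)  # window would run past the start or the end
        else:
            out.append(sum(x[t - half : t + half + 1]) / window)
    return out

series = [float(v) for v in range(1, 11)]  # made-up data: 1, 2, ..., 10
one_sided = one_sided_ma(series)
centered = centered_ma(series)
print(one_sided)
print(centered)
```

For the same seven observations, the one-sided version places the average at the last time point of the window, while the centered version places it at the middle one, so the one-sided curve lags behind the data.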
Example: Florida Covid Daily Deaths
● Here, a human-based scheduling factor
influences when causes of death
are recorded.
● Some activities must be less likely to
occur on weekends, because the
lowest day of the week is almost
always Sunday.
● Because of this seasonal pattern, the
number of recorded deaths for a
particular day depends on the day of
the week you’re evaluating.

Source: https://statisticsbyjim.com/time-series/moving-averages-smoothing/
Example: Florida Covid Daily Deaths (Contd.)
● Now we need to remove this seasonal
pattern to reveal the underlying trend
component.
● The graph displays one-sided moving
averages with a length of 7 days for these
data. Notice how the seasonal pattern is
gone and the underlying trend is visible.
● Each moving average point is the daily
average of the past seven days.
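The effect can be reproduced on synthetic data. The sketch below is not the Florida data: it assumes a made-up series with a steadily rising level plus a day-of-week effect that repeats every 7 days, and the names `WEEKLY_OFFSETS` and `seven_day_average` are hypothetical.

```python
# Made-up day-of-week effect; the seven offsets sum to zero.
WEEKLY_OFFSETS = [3, 2, 1, 0, -1, -2, -3]

def seven_day_average(values):
    """One-sided 7-day moving average, one value per day from day 6 onward."""
    return [sum(values[t - 6 : t + 1]) / 7 for t in range(6, len(values))]

# Four weeks of synthetic daily counts: rising level plus the weekly pattern.
daily = [50 + t + WEEKLY_OFFSETS[t % 7] for t in range(28)]
smoothed = seven_day_average(daily)

print(daily[:14])     # zig-zags with the weekly pattern
print(smoothed[:8])   # rises by exactly 1 per day
```

Because every 7-day window contains each day of the week exactly once, the weekly effect cancels in the average and only the underlying trend remains.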
Importance of Window Size
● The 20-day moving average
removes small, short-term spikes
but otherwise follows the daily data
closely
● The 100-day moving average, on
the other hand, removes even fairly
substantial drops or spikes that
play out over a time span of
multiple weeks.
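A rough way to see the role of the window size is to smooth the same synthetic spike with both windows. The series below is made up (a flat line with one 10-day spike of height 100), and `moving_average` is a hypothetical helper for this sketch.

```python
def moving_average(values, window):
    """One-sided moving average; returns values only where the full window fits."""
    return [sum(values[t - window + 1 : t + 1]) / window
            for t in range(window - 1, len(values))]

# Made-up data: 300 days at zero with a 10-day spike of height 100.
values = [0.0] * 300
for day in range(150, 160):
    values[day] = 100.0

peak_20 = max(moving_average(values, 20))    # spike is halved
peak_100 = max(moving_average(values, 100))  # spike is reduced to a tenth
print(peak_20, peak_100)
```

The wider the window relative to the duration of a spike, the more the spike is flattened, which is why the 100-day average hides events that the 20-day average still shows.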
Common pitfalls in moving averages
● What is the difference between the two
charts?
● Parts of the smoothed curve are missing at
the beginning, the end, or both.
● All data points in the window are weighted
equally
Weighted Moving Average (WMA)
● WMA is a moving average where each data
point is assigned a specific weight based
on its position in the series.
● Weights are assigned linearly or according
to a specific pattern chosen by the user.
● The most recent data point typically
receives the highest weight.
● The weights should sum to
one (or 100%).

(172.38×5/15)+(171.37×4/15)+(178.67×3/15)+(176.08×2/15)+(172.72×1/15)≈173.88
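The linear weighting used in this calculation can be sketched as follows. The function name `weighted_moving_average` and the oldest-to-newest ordering of the input are assumptions for this illustration; the five values are the ones from the worked example, with 172.38 taken as the most recent.

```python
def weighted_moving_average(values):
    """Linearly weighted average; `values` is ordered oldest to newest."""
    n = len(values)
    weights = range(1, n + 1)  # oldest gets weight 1, newest gets weight n
    total = sum(weights)       # dividing by this makes the weights sum to one
    return sum(w * v for w, v in zip(weights, values)) / total

prices = [172.72, 176.08, 178.67, 171.37, 172.38]  # oldest first
wma = weighted_moving_average(prices)
print(round(wma, 2))  # 173.88
```

For five points the weights are 1/15, 2/15, 3/15, 4/15 and 5/15, so the newest value counts five times as much as the oldest.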
Next Class
● Parametric vs Nonparametric Curve Fitting
● Locally estimated scatterplot smoothing (LOESS)
Thank You !!!
