Slide_1_ Introduction and Basic Statistics

Uploaded by

gyankelvin2023
Introduction: Data Analysis Concepts
GDAP PROJECT
GDAP PROJECT | Data Analysis Concepts 1
Course Outline For Power BI with SQL
1. Data Analytics Concepts
2. Basic Statistics
3. Power BI Installation
4. Power BI Infrastructure
5. Introduction to Power BI Desktop
6. Loading Data in Power BI
7. Using Power Query to Clean & Transform
8. Editing Records
9. Join Types
10. Enhancing Records
11. Sorting Null Values Out
12. Designing Data Model
13. Working with Tables
14. Breaking Down Tables
15. Adding Unique Columns
16. Creating a Hierarchy
17. Understand Cardinality and Cross Filtering
18. Creating Relationships
19. Creating Relationships in Model View
20. Row Level Security
21. Testing Roles
22. Data Analysis Expressions (DAX) Calculations
23. Creating Calculated Tables
24. Creating Calculated Columns
25. Using the Column Tools Tabs for Calculations
26. Measures
27. Creating Quick Measures
28. Creating New Measures from Scratch
29. Time Intelligence Functions and Key Performance Indicators
30. Calendar Auto
31. Report Design
32. Formatting Pie Chart
33. Using the Slicer to Filter Data in a Graph
34. Creating Dashboards
35. Creating Dashboard in Power BI Service
36. Power BI Service

Course Objectives

At the end of this course, participants will be able to:

1. Import, transform and cleanse data using Power Query Editor
2. Build a data model for insightful reporting
3. Manipulate the model with DAX
4. Publish and share visualizations

Setting Expectations

Target Audience
– Aspiring data analyst professionals
– Application developers

Prerequisites
– Knowledge and skills in SQL

Course Materials
– Online resources
– Downloadable labs

Suggested Approach
– Complete each module and lab in turn
– Engage with fellow students

Delivery Mode
Course Duration: 140 Hours / 2 Months

SQL
– Face to Face: 40 Hours (4 Hours per Day)

Soft Skills & Entrepreneurship
– Online: 20 Hours (4 Hours per Day)

Power BI
– Face to Face: 60 Hours (4 Hours per Day)
– Online: 20 Hours (4 Hours per Day)

Course Lab Environment

Labs are based on the Car Servicing.xls Excel file and the AdventureWorksLT sample
database in Azure SQL Database.
– Setup instructions are in Module 2

There is a lab for each module, consisting of:
– Challenges based on the techniques discussed in the module
– References to relevant documentation
– Suggested solution file

Module Overview

01 Introductions
• What is Data Analysis
• The Data Analysis Process
• Types Of Data Analysis
• Tools and Techniques

02 Basic Statistics
• Data and information
• Types of data
• Statistical concept
• Descriptive Statistics: Summarizing Your Data
• Inferential Statistics: Drawing Conclusions
• Common Statistical Techniques
• Statistics and Data Visualization

03 Power BI

Learning Outcomes

At the end of this module, the participant should be able to:

1. Explain Data Analysis
2. State the processes in data analysis
3. State the types of data analysis
4. List tools and techniques in data analysis
5. Explain basic statistics

Introduction to Basic Statistics

What Is Data Analysis

Data analysis is the process of collecting, cleaning, transforming, and interpreting data to extract meaningful insights.

It involves applying statistical and computational methods to uncover patterns, trends, and correlations within datasets.

These insights can be used to inform decision-making, solve problems, and identify opportunities.

The Data Analysis Process

Data analysis is a systematic approach to collecting, cleaning, transforming, and interpreting data to extract meaningful insights. It typically involves the following steps:


1. Data Collection
2. Data Cleaning
3. Data Exploration
4. Data Modeling
5. Data Interpretation
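
The five steps can be sketched as a small pipeline. This is only an illustration, not part of the course labs: the CSV data, the column names, and the helper functions `collect`, `clean`, `explore`, `model`, and `interpret` are all hypothetical, and the "model" here is simply a mean used as a baseline.

```python
import csv
import io
import statistics

# Hypothetical raw data: one missing cost (A2) and one duplicate row (A3).
RAW_CSV = """car_id,service_cost
A1,120
A2,
A3,95
A3,95
A4,210
"""

def collect(text):
    """Data collection: read raw rows from a CSV source."""
    return list(csv.DictReader(io.StringIO(text)))

def clean(rows):
    """Data cleaning: drop rows with missing costs and exact duplicates."""
    seen, cleaned = set(), []
    for row in rows:
        key = (row["car_id"], row["service_cost"])
        if row["service_cost"] and key not in seen:
            seen.add(key)
            cleaned.append({"car_id": row["car_id"],
                            "cost": float(row["service_cost"])})
    return cleaned

def explore(rows):
    """Data exploration: summarize the cleaned values."""
    costs = [r["cost"] for r in rows]
    return {"count": len(costs), "min": min(costs), "max": max(costs)}

def model(rows):
    """Data modeling: here, only the mean cost as a baseline estimate."""
    return statistics.mean(r["cost"] for r in rows)

def interpret(summary, baseline):
    """Data interpretation: turn the numbers into a readable statement."""
    return f"{summary['count']} services, typical cost about {baseline:.2f}"

rows = clean(collect(RAW_CSV))
summary = explore(rows)
baseline = model(rows)
report = interpret(summary, baseline)
print(report)
```

In practice each step is usually far larger than one function, but keeping the steps separate makes it easy to rerun cleaning or modeling when new data arrives.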

Types Of Data Analysis


Descriptive Analysis: Summarizing data using measures like mean, median, mode, and standard deviation.
Diagnostic Analysis: Exploring the reasons behind observed patterns or trends.
Predictive Analysis: Building models to forecast future outcomes based on historical data.

Prescriptive Analysis: Recommending actions to take, based on what the data and forecasts show.

Tools and Techniques

Statistical Software
SPSS: Statistical Package for the Social Sciences
SAS: Statistical software suite
R Software: R is a programming language
Python: Python is a programming language

Data Visualization
Tableau: Tableau can help anyone see and understand their data.
Power BI: Microsoft Power BI is an interactive data visualization software product developed by Microsoft with a primary focus on business intelligence.
Python: Python is a programming language.

Machine Learning Libraries
Scikit-Learn: scikit-learn is a free and open-source machine learning library for the Python programming language.
TensorFlow: An end-to-end open source machine learning platform for everyone.
PyTorch: PyTorch is a machine learning library based on the Torch library.

End of Data Analysis Section. Any Questions?

Basic Statistics
Statistics is the language through which we communicate with data.

It provides the tools and methods to collect, organize, analyze, interpret, and present data. In essence, statistics is the foundation of data analysis.

Data & Information
Data: Raw and unprocessed facts, figures, or symbols.
Information: Data that has been processed, organized, and interpreted.
Classification of statistical data

Statistical data
– Quantitative
  – Continuous
  – Discrete
– Qualitative
  – Ordinal
  – Nominal

Descriptive Statistics

Mean: The average value of a dataset. It's sensitive to outliers.
Median: The middle value when data is sorted. It's robust to outliers.
Mode: The most frequent value in a dataset.
Range: The difference between the highest and lowest values.
Variance: The average squared deviation from the mean.
Standard Deviation: The square root of the variance.
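
A short sketch of the six measures using Python's standard `statistics` module. The sample values are hypothetical and include one deliberate outlier (40) to show how the mean and the median react differently. The population forms `pvariance` and `pstdev` are used because they match the "average squared deviation" definition; `statistics.variance` and `statistics.stdev` divide by n - 1 instead and are the usual choice for a sample.

```python
import statistics

# Hypothetical sample: daily service counts, with one outlier (40).
data = [4, 5, 5, 6, 7, 8, 40]

mean = statistics.mean(data)            # pulled upward by the outlier
median = statistics.median(data)        # middle value of the sorted data
mode = statistics.mode(data)            # most frequent value
value_range = max(data) - min(data)     # highest minus lowest
variance = statistics.pvariance(data)   # average squared deviation from the mean
std_dev = statistics.pstdev(data)       # square root of the variance

print(f"mean={mean:.2f} median={median} mode={mode}")
print(f"range={value_range} variance={variance:.2f} std_dev={std_dev:.2f}")
```

The mean lands well above most of the values because of the single outlier, while the median stays at the centre of the sorted data.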

Inferential Statistics: Drawing Conclusions

Probability: The likelihood of an event occurring.
Hypothesis: Making educated guesses about a population based on sample data.
Regression Analysis: Modeling the relationship between variables.
Confidence Intervals: Estimating a population parameter with a certain level of confidence.
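
Two of these ideas, confidence intervals and regression analysis, can be illustrated with the standard library alone. This is a rough sketch under stated assumptions: the data is hypothetical, the interval uses a normal approximation (a t-distribution is more appropriate for a sample this small), and `fit_line` is a hand-written helper for ordinary least squares rather than a library function.

```python
import math
import statistics

# Hypothetical sample of service times in minutes.
sample = [42, 38, 45, 50, 41, 39, 47, 44, 43, 46]

# 95% confidence interval for the population mean (normal approximation).
n = len(sample)
sample_mean = statistics.mean(sample)
std_error = statistics.stdev(sample) / math.sqrt(n)
z = statistics.NormalDist().inv_cdf(0.975)   # roughly 1.96
ci_low = sample_mean - z * std_error
ci_high = sample_mean + z * std_error

def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    x_bar, y_bar = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

# Hypothetical data: car age in years against service cost.
ages = [1, 2, 3, 4, 5]
costs = [110, 130, 150, 170, 190]
slope, intercept = fit_line(ages, costs)

print(f"mean={sample_mean:.1f}, 95% CI=({ci_low:.1f}, {ci_high:.1f})")
print(f"cost = {slope:.1f} * age + {intercept:.1f}")
```

The interval describes the uncertainty in the estimated mean, not the spread of individual service times.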

Statistics and Data Visualization

Donut chart: A donut chart is essentially a pie chart with a hole in the middle.
Area chart: A visual representation of change over time.
Histogram: A visual representation of data distribution.
Bar chart: A bar chart, or bar graph, is a visual representation of data using rectangular bars.
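
To make the histogram idea concrete, the sketch below groups values into equal-width bins and draws one text bar per bin. It is only an illustration: the helper `text_histogram` and the age data are hypothetical, integer values are assumed, and bins with no values are simply skipped. Power BI and charting libraries do the same grouping before drawing.

```python
from collections import Counter

def text_histogram(values, bin_width):
    """Group integer values into equal-width bins; return one text bar per bin."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    lines = []
    for start in sorted(counts):
        label = f"{start:>3}-{start + bin_width - 1:<3}"
        lines.append(f"{label} {'#' * counts[start]}")
    return lines

# Hypothetical data: customer ages.
ages = [21, 23, 25, 31, 34, 35, 36, 38, 42, 47, 51]
lines = text_histogram(ages, 10)
for line in lines:
    print(line)
```

Unlike a bar chart, which compares separate categories, the bars here cover consecutive numeric ranges, so their heights show the shape of the distribution.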

Thank You
