Data Science Practical Manual

Uploaded by

Medha Bandi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views14 pages

Data Science Practical Manual

Uploaded by

Medha Bandi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

1. Write a program to find the mean absolute deviation for the given data set.

[26,46,56,45,19,22,24].

PROCEDURE:
Step 1: Calculate the mean.
Step 2: Calculate the distance of each data point from the mean. We need to find the
absolute value.
Step 3: Calculate the mean of the distances.
OUTPUT:
2. Write a program to find standard deviation for the following data set.

There are 39 plants in the garden. A few plants were selected randomly and their heights
in cm were recorded as follows: 1,2,3,5,8. Calculate the standard deviation of their
heights.

PROCEDURE:
Step 1: Calculate the mean by adding up all the data pieces and dividing it by the number
of pieces of the data.
Step 2: Subtract mean from every value.
Step 3: Square each of the differences.
Step 4: Find the average of squared numbers calculated in point number 3 to find the
variance.
Step 5: Lastly, find the square root of variance. That is the standard deviation.
OUTPUT:

There are 39 plants in the garden. A few plants were selected randomly and their heights in
cm were recorded as follows: 1,2,3,5,8. Calculate the standard deviation of their heights.

STEP3:
STEP1: STEP 2: CALCULATE STEP5:
CALCULATE CALCULATE SQUARE OF STEP 4: STANDARD
DATA SET MEAN DISTANCE DISTANCE VARIANCE DEVIATION
1 3.8 2.8 7.84 6.16 2.4819347
2 1.8 3.24
3 0.8 0.64
5 1.2 1.44
8 4.2 17.64
3. Write a program to collect data. Analyse it and interpret the result. Consider
the following data set for the statistical problem-solving process.

Consider that you have a food event in your residential society. Perform detailed
analysis and interpret what should be the top five cuisines that most people in the
society prefer for this event.

PROCEDURE:
Step 1: Formulate Statistical Investigative Questions
Step 2. Collect/Consider the Data
Step 3. Analyse the Data
Step 4. Interpret the Data
OUTPUT:
Data collected from each block of the apartment:

Consolidated data for analysis:

Data interpretation:
1. How many are interested in South Indian Cuisine?
26
2. How many people are interested in Chinese cuisine from block 3?
3
4. Write a program to find central limit theorem after observing the following data.
In a country in the middle east region, the recorded weights of the male population
follow a normal distribution. The mean and the standard deviations are 70 kg and 15
kg, respectively. If a person is eager to find the record of 50 males in the population,
then what would mean and the standard deviation of the chosen sample?
PROCEDURE:
Step 1: Draw groups of people at random from your area. We will call this a sample.
We will draw multiple samples in this case, each consisting of 30 people.
Step 2: Calculate the individual mean of each sample set.
Step 3: Calculate the mean of these sample means.
Step 4: To add up to this, a histogram of sample mean weights of people will
resemble a normal distribution.
The formula for the central limit theorem is:

μ = Population mean
σ = Population standard deviation
μx¯¯¯ = Sample mean
σx¯¯¯ = Sample standard deviation
n = Sample size
OUTPUT:
5. Write a program to find the quartile for the following odd dataset.
34 24 43 5 58 81 29 90 22 67 32 88 57 34 43 44 91 24 62
PROCEDURE:
Step 1: Sort in Ascending Order
Step 2: Find N
Step 3: Calculate Lower Quartile (Q1)
Lower Quartile (Q1) = (N+1)x1/4
Step 4: Calculate Middle Quartile (Q2)
Middle Quartile (Q2) = (N+1)x2/4
Step 5: Calculate Upper Quartile (Q3)
Upper Quartile (Q3)= (N+1)x3/4
OUTPUT:

N 19
SORTED
POSITION DATASET DATA
1 34 5 Q1 Q2 Q3
2 24 22 POSITION 5 10 15
3 43 24 DATA 58 67 43
4 5 24
5 58 29
6 81 32
7 29 34
INTER QUARTILE
90 34
8 RANGE: Q3-Q1 -15
9 22 43
10 67 43
11 32 44
12 88 57
13 57 58
14 34 62
15 43 67
16 44 81
17 91 88
18 24 90
19 62 91
6. Write a program to find the quartile for the following even dataset.
54 28 76 64 41 83 19 71 37 58
PROCEDURE:
Step 1: Sort in Ascending Order
Step 2: Find N
Step 3: Calculate Middle Quartile (Q2) or find the median of the dataset
Middle Quartile (Q2) =N/2 & (N+1)/2
Step 4: Split the Dataset into first half and second half
Step 5: Calculate Lower Quartile (Q1) for first half of the data set.
Lower Quartile (Q1) = (N+1)2
Step 6: Calculate Upper Quartile (Q3)
Upper Quartile (Q3) = (N+1)2
OUTPUT:

SORTED
POSITION DATASET DATA
1 54 19 N 10
2 28 28
3 76 37
4 64 41 Q2
5 41 54 POSTION 5.5 BETWEEN 5 AND 6
6 83 58 DATA 56 (54+58 )/2
7 19 64
8 71 71
9 37 76
10 58 83

SORTED First Last

POSITION DATASET DATA Half Half Q1 Q3
1 54 19 19 58 POSITION 3 3
2 28 28 28 64 DATA 37 71
3 76 37 37 71
4 64 41 41 76
5 41 54 54 83
56
6 83 58 INTERQUARTILE RANGE: Q3-Q1
7 19 64 34
8 71 71
9 37 76
10 58 83
7. Write a program to find the decile for the following data set.
4 9 10 10 12 13 88 90 91 96 99 100 16 49 49 52 55 58 60 60 63 64 65 65 65 73 75 81 83
84 86 17 26 27 33 38 42 43 46
PROCEDURE:
Step 1: Arrange the data set in ascending order.
Step 2: Give the position for each data points.
Step 3: Calculate the decile using the formula
Di = (N + 1) * i / 10
Step 4: Calculate the decile from D1 to D9
OUTPUT:
Position Data Set
1 4
2 9 n 39
3 10
4 10
5 12 Decile Data Position Data
6 13 D1 4 10
7 16 D2 8 17
8 17 D3 12 38
9 26 D4 16 49
10 27 D5 20 58
11 33 D6 24 64
12 38 D7 28 73
13 42 D8 32 84
14 43 D9 36 91
15 46
16 49
17 49
18 52
19 55
20 58
21 60
22 60
23 63
24 64
25 65
26 65
27 65
28 73
29 75
30 81
31 83
32 84
33 86
34 88
35 90
36 91
37 96
38 99
39 100

Point Measures
No ratings yet
Point Measures
21 pages
Business Statistics Practice Questions
No ratings yet
Business Statistics Practice Questions
8 pages
Engineering Data Analysis Part 1 23241stsem Notes
No ratings yet
Engineering Data Analysis Part 1 23241stsem Notes
108 pages
Measures of Variation
No ratings yet
Measures of Variation
149 pages
4. Exploring Numerical Data_students
No ratings yet
4. Exploring Numerical Data_students
97 pages
Pds Record Document Ds II
No ratings yet
Pds Record Document Ds II
36 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
Q4 Intro Quartile
No ratings yet
Q4 Intro Quartile
40 pages
Q4 - LESSON 1&2 - Median and Quartile of Ungrouped data
No ratings yet
Q4 - LESSON 1&2 - Median and Quartile of Ungrouped data
35 pages
Data Management
No ratings yet
Data Management
50 pages
Data Management Problems With Solution
No ratings yet
Data Management Problems With Solution
36 pages
Statistics Part 1 and 2
No ratings yet
Statistics Part 1 and 2
53 pages
Quantitative Methods For Management
No ratings yet
Quantitative Methods For Management
118 pages
Chapter 3 - Descriptive statistics (Ungrouped Data)
No ratings yet
Chapter 3 - Descriptive statistics (Ungrouped Data)
30 pages
Chapter 1.3 Data description (B)
No ratings yet
Chapter 1.3 Data description (B)
26 pages
Module On Measures of Variability
No ratings yet
Module On Measures of Variability
33 pages
Probability and Statistics (Tutorial 2)
No ratings yet
Probability and Statistics (Tutorial 2)
27 pages
Solutions For The Practice Sums
No ratings yet
Solutions For The Practice Sums
24 pages
Chapter 4 (Part2 - MMW)
No ratings yet
Chapter 4 (Part2 - MMW)
32 pages
TRISEM14-2021-22 BMT5113 TH VL2021220200049 Reference Material I 04-Aug-2021 Measures of Dispersion
No ratings yet
TRISEM14-2021-22 BMT5113 TH VL2021220200049 Reference Material I 04-Aug-2021 Measures of Dispersion
59 pages
R_-_III_UNIT[1]
No ratings yet
R_-_III_UNIT[1]
34 pages
V Nishal 24MID0281
No ratings yet
V Nishal 24MID0281
17 pages
Range, SD, QD, Variance
No ratings yet
Range, SD, QD, Variance
14 pages
MATH 10 4th Quarter LP
No ratings yet
MATH 10 4th Quarter LP
11 pages
1.3 Measure of Variability and Position
No ratings yet
1.3 Measure of Variability and Position
47 pages
27 (10A-DS) RAYYAN KHAN 1
No ratings yet
27 (10A-DS) RAYYAN KHAN 1
14 pages
MCA Mathematical Foundation For Computer Application 13
No ratings yet
MCA Mathematical Foundation For Computer Application 13
26 pages
NAOMI Assasment 2 BUS STATS
No ratings yet
NAOMI Assasment 2 BUS STATS
4 pages
Lecture-1-Numerical Representation of Data
No ratings yet
Lecture-1-Numerical Representation of Data
14 pages
Quantitative Methods in Business: Rhean T. Urbiztondo Student Saturday 8:00am-2:00pm
No ratings yet
Quantitative Methods in Business: Rhean T. Urbiztondo Student Saturday 8:00am-2:00pm
33 pages
1743 Chapter 2 Data Description (B)
No ratings yet
1743 Chapter 2 Data Description (B)
22 pages
Measures of Variability
No ratings yet
Measures of Variability
17 pages
Lecture IV Measures of relative positioning
No ratings yet
Lecture IV Measures of relative positioning
7 pages
Asset-V1 VIT+MBA001+2020+type@asset+block@Week 2 Content
No ratings yet
Asset-V1 VIT+MBA001+2020+type@asset+block@Week 2 Content
20 pages
11 4variationswithinadataset
No ratings yet
11 4variationswithinadataset
4 pages
Stat_Ques. Bank
No ratings yet
Stat_Ques. Bank
10 pages
Measures of variability
No ratings yet
Measures of variability
10 pages
Measures of Variability: Range
No ratings yet
Measures of Variability: Range
5 pages
gr10t3-statistics-five-number-system-and-box-and-whisker-diagram
No ratings yet
gr10t3-statistics-five-number-system-and-box-and-whisker-diagram
6 pages
8th PPT Lecture On Measures of Position
0% (1)
8th PPT Lecture On Measures of Position
19 pages
Descriptive statistics,Unit 1_34538ac4-b21d-4633-8ac6-d0ac88bf17a4
No ratings yet
Descriptive statistics,Unit 1_34538ac4-b21d-4633-8ac6-d0ac88bf17a4
4 pages
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
No ratings yet
Mathematics As A Tool (Descriptive Statistics) (Midterm Period) Overview: This Module Tackles Mathematics As Applied To Different Areas Such As Data
33 pages
Wk02 T01 ANS
No ratings yet
Wk02 T01 ANS
11 pages
FDS QB
No ratings yet
FDS QB
3 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
lab
No ratings yet
lab
14 pages
Statistics
No ratings yet
Statistics
23 pages
Fallsem2019-20 Mat2001 Eth Vl2019201000373 Reference Material I 19-Jul-2019 Measures of Variaation
No ratings yet
Fallsem2019-20 Mat2001 Eth Vl2019201000373 Reference Material I 19-Jul-2019 Measures of Variaation
14 pages
S.4 Notes Melanie Bulseco
No ratings yet
S.4 Notes Melanie Bulseco
5 pages
Module 6 Assessment: Solution: (A) Given, X̅ 75 Z X-X ̅ / S
No ratings yet
Module 6 Assessment: Solution: (A) Given, X̅ 75 Z X-X ̅ / S
4 pages
History of Gamu Isabela
No ratings yet
History of Gamu Isabela
6 pages
MAT112 CH 11 Ungrouped Data PDF
No ratings yet
MAT112 CH 11 Ungrouped Data PDF
4 pages
Data Science Practical Manual Printout[1]
No ratings yet
Data Science Practical Manual Printout[1]
4 pages
Measure of Variation
No ratings yet
Measure of Variation
50 pages
ss6th Grade Statistical Variability Chapter Questions
No ratings yet
ss6th Grade Statistical Variability Chapter Questions
7 pages
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
No ratings yet
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
4 pages
Lesson 5 Measure of Spread 1
No ratings yet
Lesson 5 Measure of Spread 1
9 pages
WGU C784 – APPLIED HEALTHCARE STATISTICS PRE ASSESSMENT TEST EXAM QUESTIONS AND VERIFIED ANSWERS GRADED A+ 2024 UPDATE
No ratings yet
WGU C784 – APPLIED HEALTHCARE STATISTICS PRE ASSESSMENT TEST EXAM QUESTIONS AND VERIFIED ANSWERS GRADED A+ 2024 UPDATE
17 pages
Mini 5 - Normal Distribution
100% (1)
Mini 5 - Normal Distribution
6 pages
305 Ex 5 Excel13 Downloading Census
100% (1)
305 Ex 5 Excel13 Downloading Census
12 pages
Identifying Parameter for Testing in Given Real-Life Problems
No ratings yet
Identifying Parameter for Testing in Given Real-Life Problems
12 pages
Math 1
No ratings yet
Math 1
24 pages
Measures of Dispersion: Profgrcnair
No ratings yet
Measures of Dispersion: Profgrcnair
22 pages
Chapter 7 Control and Coordination CBSE Class10 Science
No ratings yet
Chapter 7 Control and Coordination CBSE Class10 Science
24 pages
Sanskrit Activity
No ratings yet
Sanskrit Activity
20 pages
List of Metropolitan Areas Secret
No ratings yet
List of Metropolitan Areas Secret
7 pages
DOC-20240809-WA0078.
No ratings yet
DOC-20240809-WA0078.
7 pages
Lecture Week 3
No ratings yet
Lecture Week 3
9 pages
afc9e60f2649adca15efe3a3b0215989
No ratings yet
afc9e60f2649adca15efe3a3b0215989
6 pages
Document 1 (10)
No ratings yet
Document 1 (10)
9 pages
Document 1 (7)
No ratings yet
Document 1 (7)
9 pages
GCE As Level Representation of Data Histograms
No ratings yet
GCE As Level Representation of Data Histograms
10 pages
Final Exam - Sample Test
No ratings yet
Final Exam - Sample Test
6 pages
137195df5229391ce9ac2403186f2146 (1)
No ratings yet
137195df5229391ce9ac2403186f2146 (1)
1 page
Class 10 Biology Practicals
No ratings yet
Class 10 Biology Practicals
7 pages
996158f11d113b93a0f5367b78b69e7d
No ratings yet
996158f11d113b93a0f5367b78b69e7d
4 pages
modal paper maths
No ratings yet
modal paper maths
8 pages
7611_AADHAAR_CAMP_GUIDELINES
No ratings yet
7611_AADHAAR_CAMP_GUIDELINES
1 page
7844_Aadhar_Enrolment_Camp_circular
No ratings yet
7844_Aadhar_Enrolment_Camp_circular
1 page
141b39d4e744140a4279fa7388eea3fd (2)
No ratings yet
141b39d4e744140a4279fa7388eea3fd (2)
4 pages
2123_Grade_10_-NOV_PA3 (1)
No ratings yet
2123_Grade_10_-NOV_PA3 (1)
3 pages
cbse ganit challenge
No ratings yet
cbse ganit challenge
6 pages
Presentation 6
No ratings yet
Presentation 6
4 pages
9729_8108_Canteen_Circular_2024-25_(1)
No ratings yet
9729_8108_Canteen_Circular_2024-25_(1)
6 pages
4692_10th_PRE_BOARD_2&3
No ratings yet
4692_10th_PRE_BOARD_2&3
4 pages
Quantitative Genetics
No ratings yet
Quantitative Genetics
31 pages
dfc0c49aa827c8aac3bb567997610c49
No ratings yet
dfc0c49aa827c8aac3bb567997610c49
1 page
Rural Urban Census 2011
No ratings yet
Rural Urban Census 2011
40 pages
AS Module 2 The Demographic Transition Model
No ratings yet
AS Module 2 The Demographic Transition Model
10 pages
Ch2 - Population Forecasting
0% (1)
Ch2 - Population Forecasting
5 pages
2008 Early In-Person Voting in Franklin County, Ohio
No ratings yet
2008 Early In-Person Voting in Franklin County, Ohio
9 pages
Caraga Population 2010 Census
No ratings yet
Caraga Population 2010 Census
44 pages
Greek DNA
No ratings yet
Greek DNA
12 pages
Santa Maria Davao Occidental
100% (1)
Santa Maria Davao Occidental
6 pages
Chapter 7
No ratings yet
Chapter 7
15 pages
Day 1 - Census Research
No ratings yet
Day 1 - Census Research
2 pages
Kotak Misai
No ratings yet
Kotak Misai
4 pages
Clark Pockets of Belief
No ratings yet
Clark Pockets of Belief
4 pages
Aleksandra Vuletic, Censuse in 19th Century in Serbia
No ratings yet
Aleksandra Vuletic, Censuse in 19th Century in Serbia
25 pages
AP Statistics Study Guide
100% (1)
AP Statistics Study Guide
12 pages
Stat 332 Solutions To Assignment 1
No ratings yet
Stat 332 Solutions To Assignment 1
2 pages
Random Sampling (Stratified) Example
No ratings yet
Random Sampling (Stratified) Example
4 pages
Mms Testing of Hypothesis
No ratings yet
Mms Testing of Hypothesis
69 pages
Sample Size
No ratings yet
Sample Size
6 pages
B.Sc. Degree Examination, 2011
No ratings yet
B.Sc. Degree Examination, 2011
2 pages
Abra
No ratings yet
Abra
2 pages
Census of India 2011 - MCQ Questions On Current Affairs For Various Examinations
No ratings yet
Census of India 2011 - MCQ Questions On Current Affairs For Various Examinations
10 pages
Vedic Mathematics: Learn School Maths Easy Way-1: Students Books, #1
From Everand
Vedic Mathematics: Learn School Maths Easy Way-1: Students Books, #1
Dr. Yogesh Chandna
No ratings yet
Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques: A Guide to Data Science for Fraud Detection
From Everand
Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques: A Guide to Data Science for Fraud Detection
Bart Baesens
No ratings yet

Data Science Practical Manual

Uploaded by

Data Science Practical Manual

Uploaded by

1. Write a program to find the mean absolute deviation for the given data set.

Consolidated data for analysis:

SORTED First Last

You might also like