Basic Statistics Concepts For Data Science
Basic Statistics Concepts For Data Science
Science
1. Descriptive Statistics
It is used to describe the basic features of data that provide a summary of the
given data set which can either represent the entire population or a sample of
the population.
average.
Mode: It refers to the value that appears most often in a data set.
Median: It is the middle value of the ordered set that divides it in exactly half .
2. Variability
set as compared.
numbers in a data set. In general terms, it means the difference from the
mean. A large variance indicates that numbers are far apart from average
value. Small variance indicates that the numbers are closer to the average
values. Zero variance indicates that the values are identical to the given set.
Range: This is defined as the difference between the largest and smallest
value of a dataset.
Percentile: It refers to the measure used in statistics that indicates the value
Quartile: It is defined as the value that divides the data points into quarters .
3. Correlation
between two variables. The correlation coefficient indicates the strength of the
relationship.
relationship.
It specifies of all possible events. In simple terms, an event refers to the result
probability.
5. Regression
types:
Linear regression: It is used to fit the regression model that explains the
variables.
relationship between the binary response variable and one or more predictor
variables.
6. Normal Distribution
variables is unknown, the normal distribution is used. The central limit theorem
7. Bias
analysis, the selection in such a way that data is not randomized resulting in