basic statistics
Statistics
concepts
- central tendency
- mean (average): the sum of the data divided by the number of data points
- median: the 50% quantile, i.e. the middle value of the sorted data
- mode: the value that occurs most frequently in the data
- quantile: a cut point that splits the sorted data at a given fraction, e.g. the quartiles at 25%, 50%, and 75%
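
The central tendency measures above can be sketched with Python's standard `statistics` module. The sample values in `data` are made up for illustration, and `statistics.quantiles` uses its default interpolation method here, so other tools may report slightly different 25% and 75% values.

```python
import statistics

# Hypothetical sample values, chosen only for illustration.
data = [2, 3, 3, 5, 7, 10]

# mean: sum of the data divided by the number of data points
mean = statistics.mean(data)

# median: middle value of the sorted data (average of the two
# middle values when the count is even)
median = statistics.median(data)

# mode: the most frequently occurring value
mode = statistics.mode(data)

# quartiles: the 25%, 50%, and 75% cut points
quartiles = statistics.quantiles(data, n=4)

print("mean:", mean)
print("median:", median)
print("mode:", mode)
print("quartiles:", quartiles)
```

Note that the second quartile is the same value as the median, which matches the definition of the median as the 50% quantile.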
- measures of distribution (spread)
- variance: how widely the data are spread around the mean (the average of the squared deviations from the mean)
- std (standard deviation): the square root of the variance
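
A minimal sketch of variance and standard deviation, assuming the population form (dividing by n). The sample form divides by n - 1 instead and is available as `statistics.variance` and `statistics.stdev`. The values in `data` are made up for illustration.

```python
import math
import statistics

# Hypothetical sample values, chosen only for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Mean of the data.
mu = sum(data) / len(data)

# Population variance: average squared deviation from the mean.
variance = sum((x - mu) ** 2 for x in data) / len(data)

# Standard deviation: square root of the variance.
std = math.sqrt(variance)

print("variance:", variance)
print("std:", std)

# The standard library gives the same population values.
print("pvariance:", statistics.pvariance(data))
print("pstdev:", statistics.pstdev(data))
```

The standard deviation is often easier to interpret than the variance because it is expressed in the same unit as the data.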
- coefficient
- correlation coefficient: the strength and direction of the linear relationship between two variables, i.e. how much one variable tends to change as the other changes / the value must be within [-1, 1]
- covariance: the degree to which two variables deviate from their respective means together / when one variable is far from its mean, how far the other variable is from its own mean
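
The relation between covariance and the correlation coefficient can be sketched as follows: the correlation is the covariance divided by the product of the two standard deviations, which is what confines it to [-1, 1]. The helper names `covariance` and `correlation` and the sample values are assumptions made for this illustration, and the population form (dividing by n) is used.

```python
import math

def covariance(xs, ys):
    """Population covariance: mean product of deviations from each mean."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n

def correlation(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    std_x = math.sqrt(covariance(xs, xs))  # covariance of x with itself is its variance
    std_y = math.sqrt(covariance(ys, ys))
    return covariance(xs, ys) / (std_x * std_y)

# Hypothetical paired observations, chosen only for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

cov = covariance(x, y)
r = correlation(x, y)

print("covariance:", cov)
print("correlation:", r)
```

Unlike the covariance, the correlation coefficient does not depend on the units of the two variables, so values from different data sets can be compared.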
- regression
- Linear regression => regression coefficient (the size of the effect that the independent variable has on the dependent variable)
important: the coefficients are chosen to minimize the errors between the fitted line and the data
- used for prediction
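
A minimal sketch of simple linear regression with one independent variable, assuming the errors are minimized in the ordinary least squares sense (the notes do not name a specific error measure). The function names `fit_line` and `predict` and the sample values are assumptions made for this illustration.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) minimizing the sum of squared errors."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations and sum of squared deviations of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx                    # regression coefficient
    intercept = mean_y - slope * mean_x  # the line passes through the two means
    return slope, intercept

def predict(slope, intercept, x):
    """Predict the dependent variable for a new value of x."""
    return intercept + slope * x

# Hypothetical paired observations, chosen only for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

slope, intercept = fit_line(x, y)
print("regression coefficient (slope):", slope)
print("intercept:", intercept)
print("prediction for x = 6:", predict(slope, intercept, 6))
```

The slope is the covariance of x and y divided by the variance of x, so it ties the regression coefficient back to the covariance defined earlier in these notes.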