Quasi-variance formula and equations, examples, exercise

Jonah Lester

The quasi-variance, also called the unbiased variance, is a statistical measure of the dispersion of the data in a sample with respect to the mean. The sample, in turn, consists of a series of data taken from a larger universe, called the population.

It is denoted in various ways; here the symbol $s_c^2$ is used, and it is calculated with the following formula:

$$s_c^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{X})^2}{n-1}$$

Figure 1. The definition of quasi-variance. Source: F. Zapata.

Where:

- $s_c^2$ = the quasi-variance, or sample variance

- $x_i$ = each of the sample data

- $n$ = number of observations

- $\bar{X}$ = the sample mean

Because the unit of the quasi-variance is the square of the unit of the sample data, results are usually interpreted using the quasi-standard deviation, or sample standard deviation.

This is denoted $s_c$ and is obtained by taking the square root of the quasi-variance:

$$s_c = \sqrt{s_c^2}$$
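The two definitions above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `quasi_variance` and `quasi_std` and the sample values are made up for the example. The standard-library function `statistics.variance` uses the same n - 1 denominator, so it serves as a cross-check.

```python
import math
import statistics


def quasi_variance(data):
    """Sum of squared deviations from the sample mean, divided by n - 1."""
    n = len(data)
    if n < 2:
        raise ValueError("at least two observations are required")
    mean = sum(data) / n
    return sum((x - mean) ** 2 for x in data) / (n - 1)


def quasi_std(data):
    """Quasi-standard deviation: square root of the quasi-variance."""
    return math.sqrt(quasi_variance(data))


sample = [2, 4, 4, 4, 5, 5, 7, 9]    # made-up illustrative data, mean 5
print(quasi_variance(sample))        # 32 / 7
print(statistics.variance(sample))   # standard-library cross-check
print(quasi_std(sample))
```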

The quasi-variance is similar to the variance $s^2$; the only difference is that the denominator of the former is $n-1$, while the variance divides by $n$. When $n$ is very large, the two values are nearly the same.

When you know the value of the quasi-variance, you can immediately know the value of the variance.
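The conversion follows directly from the two definitions, since both share the same sum of squared deviations and differ only in the denominator. Writing $s^2$ for the variance computed with denominator $n$:

```latex
s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{X})^2}{n}
    = \frac{n-1}{n} \cdot \frac{\sum_{i=1}^{n} (x_i - \bar{X})^2}{n-1}
    = \frac{n-1}{n}\, s_c^2
```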

Article index

  • 1 Examples of quasi-variance
  • 2 Why divide by n-1?
    • 2.1 Alternative way to calculate the quasi-variance
    • 2.2 The standard score
  • 3 Solved exercise
    • 3.1 Solution a
    • 3.2 Solution b

Examples of quasi-variance

It is often necessary to know the characteristics of a population: people, animals, plants or, in general, any type of object. But analyzing the entire population may not be easy, especially if the number of elements is very large.

Samples are therefore taken, in the hope that their behavior reflects that of the population, so that inferences can be made about it while using resources efficiently. This is known as statistical inference.

Here are some examples in which the quasi-variance and the associated quasi-standard deviation serve as statistical indicators of how far the results obtained lie from the mean.

1.- The marketing director of a company that manufactures automotive batteries needs to estimate, in months, the average life of a battery.

To do this, he randomly selects a sample of 100 purchased batteries of that brand. The company keeps a record of the buyers' data and can interview them to find out the life of the batteries.

Figure 2. Quasi-variance is useful for making inferences and quality control. Source: Pixabay.

2.- The academic administration of a university needs to estimate next year's enrollment by analyzing the number of students who are expected to pass the courses they are currently taking.

For example, from each of the sections currently taking Physics I, the administration can select a sample of students and analyze their performance in that course. In this way it can infer how many students will take Physics II in the next period.

3.- A group of astronomers focuses their attention on a part of the sky, where a certain number of stars with certain characteristics are observed: size, mass and temperature for example.

One wonders whether stars in another similar region will have the same characteristics, or even stars in other galaxies, such as the neighboring Magellanic Clouds or Andromeda.

Why divide by n-1?

The quasi-variance divides by $n-1$ rather than by $n$ because, as noted at the beginning, it is an unbiased estimator.

Many different samples can be drawn from the same population. The variances of these samples can be averaged, but the average of the variances computed with denominator $n$ does not turn out to equal the variance of the population.

In fact, the mean of the sample variances tends to underestimate the population variance unless $n-1$ is used in the denominator. It can be verified that the expected value of the quasi-variance, $E(s_c^2)$, is precisely the population variance $\sigma^2$.

The quasi-variance is therefore said to be unbiased, and it is a better estimator of the population variance $\sigma^2$.
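The claim about the expected value can be checked exactly on a toy case. The sketch below is only an illustration under stated assumptions: the four-element population is hypothetical, samples of size 2 are drawn with replacement, and every ordered sample is treated as equally likely. Averaging over all possible samples, the n - 1 denominator recovers the population variance, while the n denominator underestimates it.

```python
from fractions import Fraction
from itertools import product

population = [2, 4, 6, 8]   # hypothetical toy population
n = 2                       # sample size

# Population mean and population variance (denominator N), kept exact.
mu = Fraction(sum(population), len(population))
pop_variance = sum((x - mu) ** 2 for x in population) / len(population)


def sum_sq_dev(sample):
    """Sum of squared deviations of a sample from its own mean."""
    mean = Fraction(sum(sample), len(sample))
    return sum((x - mean) ** 2 for x in sample)


# Every equally likely ordered sample of size n, drawn with replacement.
samples = list(product(population, repeat=n))

mean_div_n = sum(sum_sq_dev(s) / n for s in samples) / len(samples)
mean_div_n_minus_1 = sum(sum_sq_dev(s) / (n - 1) for s in samples) / len(samples)

print(pop_variance)         # 5
print(mean_div_n)           # 5/2  (underestimates)
print(mean_div_n_minus_1)   # 5    (matches the population variance)
```

Exact fractions are used instead of floats so that the equality holds without rounding tolerance.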

Alternative way to calculate the quasi-variance

It is easily shown that the quasi-variance can also be calculated as follows:

$$s_c^2 = \frac{\sum x_i^2}{n-1} - \frac{n\bar{X}^2}{n-1}$$
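The identity follows from expanding the square in the numerator of the definition and using $\sum x_i = n\bar{X}$. A short derivation, with all sums running over $i = 1, \dots, n$:

```latex
\begin{aligned}
\sum (x_i - \bar{X})^2
  &= \sum x_i^2 - 2\bar{X}\sum x_i + n\bar{X}^2 \\
  &= \sum x_i^2 - 2n\bar{X}^2 + n\bar{X}^2 \\
  &= \sum x_i^2 - n\bar{X}^2
\end{aligned}
```

Dividing both sides by $n-1$ gives the alternative expression for the quasi-variance.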

The standard score

Once the sample standard deviation is known, it is possible to tell how many standard deviations a particular value $x$ lies above or below the mean.

For this, the following dimensionless expression is used:

$$\text{Standard score} = \frac{x - \bar{X}}{s_c}$$

Solved exercise

Calculate the quasi-variance and quasi-standard deviation of the following data, consisting of monthly payments, in dollars, made by an insurance company to a private clinic.

863 903 957 1041 1138 1204 1354 1624 1698 1745 1802 1883

a) Use the definition of quasi-variance given at the beginning and also check the result using the alternative form given in the previous section.

b) Calculate the standard score of the second value in the data set.

Solution a

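The problem can be solved by hand with the help of a simple or scientific calculator, provided the work is done in order. The best way to do this is to organize the data in a table like the following:

| $x_i$ | $\bar{X}$ | $x_i - \bar{X}$ | $(x_i - \bar{X})^2$ | $x_i^2$ |
|---|---|---|---|---|
| 863 | 1351 | -488 | 238,144 | 744,769 |
| 903 | 1351 | -448 | 200,704 | 815,409 |
| 957 | 1351 | -394 | 155,236 | 915,849 |
| 1041 | 1351 | -310 | 96,100 | 1,083,681 |
| 1138 | 1351 | -213 | 45,369 | 1,295,044 |
| 1204 | 1351 | -147 | 21,609 | 1,449,616 |
| 1354 | 1351 | 3 | 9 | 1,833,316 |
| 1624 | 1351 | 273 | 74,529 | 2,637,376 |
| 1698 | 1351 | 347 | 120,409 | 2,883,204 |
| 1745 | 1351 | 394 | 155,236 | 3,045,025 |
| 1802 | 1351 | 451 | 203,401 | 3,247,204 |
| 1883 | 1351 | 532 | 283,024 | 3,545,689 |
| Sum: 16,212 | | | 1,593,770 | 23,496,182 |

Thanks to the table, the information is organized and the quantities needed in the formulas are at the end of the respective columns, ready to use. The sums appear in the last row, and the mean is the sum of the first column divided by the number of observations: 16,212 / 12 = 1351.

The mean column repeats the same value in every row, but it is worth including because it keeps the value in view while filling in each row of the table.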

Finally, the equation for the quasi-variance given at the beginning is applied. Only the values need to be substituted, since the summation has already been calculated:

$$s_c^2 = \frac{1{,}593{,}770}{12-1} = \frac{1{,}593{,}770}{11} = 144{,}888.2$$

This is the value of the quasi-variance, and its units are "dollars squared", which does not make much practical sense. The quasi-standard deviation of the sample is therefore calculated, which is simply the square root of the quasi-variance:

$$s_c = \sqrt{144{,}888.2} \approx 380.64 \text{ dollars}$$

The same value is obtained with the alternative form of the quasi-variance. The required sum, $\sum x_i^2$, is at the end of the last column of the table:

$$s_c^2 = \frac{\sum x_i^2}{n-1} - \frac{n\bar{X}^2}{n-1} = \frac{23{,}496{,}182}{11} - \frac{12 \times 1351^2}{11}$$

$$= 2{,}136{,}016.55 - 1{,}991{,}128.36 \approx 144{,}888.2 \text{ dollars squared}$$

It is the same value obtained with the formula given at the beginning.
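The arithmetic of solution a can also be checked with a short script. This is only a verification sketch: the variable names are chosen for illustration, and the data are the twelve payments from the exercise statement.

```python
import math

payments = [863, 903, 957, 1041, 1138, 1204,
            1354, 1624, 1698, 1745, 1802, 1883]

n = len(payments)
mean = sum(payments) / n                       # 16212 / 12 = 1351.0

# Definition: sum of squared deviations divided by n - 1
sum_sq_dev = sum((x - mean) ** 2 for x in payments)
by_definition = sum_sq_dev / (n - 1)

# Alternative form: (sum of squares - n * mean^2) / (n - 1)
sum_sq = sum(x ** 2 for x in payments)
by_alternative = (sum_sq - n * mean ** 2) / (n - 1)

print(sum_sq_dev, sum_sq)                      # 1593770.0 23496182
print(round(by_definition, 1))                 # 144888.2
print(round(by_alternative, 1))                # 144888.2
print(round(math.sqrt(by_definition), 2))      # 380.64
```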

Solution b

The second value in the data set is 903; its standard score is:

$$\text{Standard score of } 903 = \frac{x - \bar{X}}{s_c} = \frac{903 - 1351}{380.64} = -1.177$$
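As a cross-check, the same score can be reproduced with Python's standard `statistics` module, whose `stdev` function uses the n - 1 denominator and therefore returns the quasi-standard deviation. The helper name `standard_score` is an assumption made for this sketch.

```python
import statistics

payments = [863, 903, 957, 1041, 1138, 1204,
            1354, 1624, 1698, 1745, 1802, 1883]

mean = statistics.mean(payments)    # sample mean, 1351
s_c = statistics.stdev(payments)    # quasi-standard deviation (n - 1 denominator)


def standard_score(x, mean, s_c):
    """Number of quasi-standard deviations that x lies from the mean."""
    return (x - mean) / s_c


z = standard_score(payments[1], mean, s_c)   # second value, 903
print(round(z, 3))                           # -1.177
```

The negative sign indicates that 903 lies below the mean, a little more than one quasi-standard deviation away.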

