The random sampling it is the way to select a statistically representative sample from a given population. Part of the principle that every element in the sample should have the same probability of being selected.
A draw is an example of random sampling, in which each member of the participant population is assigned a number. To choose the numbers corresponding to the raffle prizes (the sample) some random technique is used, for example extracting from a mailbox the numbers that were recorded on identical cards.
In random sampling, it is essential to choose the sample size appropriately, because a sample that is not representative of the population can lead to erroneous conclusions, due to statistical fluctuations..
Article index
There are formulas for determining the proper size of a sample. The most important factor to consider is whether or not the population size is known. Let's look at the formulas to determine the sample size:
When the population size N is unknown, it is possible to select a sample of adequate size n, to determine if a certain hypothesis is true or false.
For this, the following formula is used:
n = (Ztwo p q) / (Etwo)
Where:
-p is the probability that the hypothesis is true.
-q is the probability that it is not, therefore q = 1 - p.
-E is the relative margin of error, for example an error of 5% has a margin E = 0.05.
-Z has to do with the level of confidence required by the study.
In a standardized (or normalized) normal distribution, a confidence level of 90% has Z = 1.645, because the probability that the result is between -1.645σ and + 1.645σ is 90%, where σ is the standard deviation.
1.- 50% confidence level corresponds to Z = 0.675.
2.- 68.3% confidence level corresponds to Z = 1.
3.- 90% confidence level is equivalent to Z = 1,645.
4.- 95% confidence level corresponds to Z = 1.96
5.- 95.5% confidence level corresponds to Z = 2.
6.- 99.7% confidence level is equivalent to Z = 3.
An example where this formula can be applied would be in a study to determine the average weight of pebbles on a beach.
Clearly it is not possible to study and weigh all the pebbles on the beach, so it is advisable to extract a sample as random as possible and with the appropriate number of elements..
When the number N of elements that make up a certain population (or universe) is known, if you want to select by simple random sampling a statistically significant sample of size n, this is the formula:
n = (Ztwop q N) / (N Etwo + Ztwop q)
Where:
-Z is the coefficient associated with the confidence level.
-p is the probability of success of the hypothesis.
-q is the probability of failure in the hypothesis, p + q = 1.
-N is the size of the total population.
-E is the relative error of the study result.
The methodology to extract the samples depends a lot on the type of study that needs to be done. Therefore, random sampling has infinite applications:
For example, in telephone surveys, the people to be consulted are chosen using a random number generator, applicable to the region under study..
If you want to apply a questionnaire to the employees of a large company, then you can resort to the selection of the respondents through their employee number, or identity card number.
This number must also be chosen randomly, using for example a random number generator.
In the case that the study is on parts manufactured by a machine, parts must be chosen randomly, but from batches manufactured at different times of the day, or on different days or weeks..
Simple random sampling:
- It allows reducing the costs of a statistical study, since it is not necessary to study the total population to obtain statistically reliable results, with the desired confidence levels and the level of error required in the study.
- Avoid bias: as the choice of the elements to be studied is completely random, the study faithfully reflects the characteristics of the population, although only part of it was studied.
- The method is not adequate in cases where you want to know the preferences in different groups or population strata.
In this case, it is preferable to previously determine the groups or segments on which the study is to be carried out. Once the strata or groups have been defined, then if it is appropriate to apply random sampling to each one of them..
- It is very unlikely that information will be obtained on minority sectors, of which it is sometimes necessary to know their characteristics.
For example, if it is a question of making a campaign on an expensive product, it is necessary to know the preferences of the wealthiest minority sectors.
We want to study the population's preference for a certain cola drink, but there is no previous study in this population, of which its size is unknown..
On the other hand, the sample must be representative with a minimum confidence level of 90% and the conclusions must have a percentage error of 2%..
-How to determine the sample size n?
-What would the sample size be if the margin of error is relaxed to 5%??
Since the size of the population is unknown, the formula given above is used to determine the sample size:
n = (Ztwop q) / (Etwo)
We assume that there is the same probability of preference (p) for our brand of soft drink as of non-preference (q), then p = q = 0.5.
On the other hand, as the result of the study must have a percentage error less than 2%, then the relative error E will be 0.02.
Finally, a Z value = 1,645 produces a confidence level of 90%.
Summarizing, we have the following values:
Z = 1,645
p = 0.5
q = 0.5
E = 0.02
With these data, the minimum sample size is calculated:
n = (1,645two 0.5 0.5) / (0.02two) = 1691.3
This means that the study with the required margin of error and with the chosen level of confidence, must have a sample of respondents of at least 1692 individuals, chosen by simple random sampling..
If you go from a margin of error of 2% to 5%, then the new sample size is:
n = (1,645two 0.5 0.5) / (0.05two) = 271
Which is a significantly lower number of individuals. In conclusion, the sample size is very sensitive to the desired margin of error in the study..
Yet No Comments