Coefficient of Dispersion: Definition, Formula and Examples

Coefficient of Dispersion: Definition, Formula and Examples

Edited By Komal Miglani | Updated on Jul 02, 2025 07:52 PM IST

Collecting data and expressing it in the form of measures of data is an essential concept for us. The measure of the spread shows how much variation is there in data. It shows how the data is spread, and scattered, and what is the deviation, and variance of the data. These values describe the data in a better way and help the analyst to analyze the data in a better way and take out the insights from it. This is one of the fundamentals of statistics which has numerous applications in various domains like data analysis, weather forecast, business, etc.

This Story also Contains
  1. Measures of the Dispersion of the Data
  2. Coefficient of Dispersion
  3. Solved Examples Based on Coefficient of Dispersion
  4. Summary
Coefficient of Dispersion: Definition, Formula and Examples
Coefficient of Dispersion: Definition, Formula and Examples

This article is about the concept Coefficient of Dispersion. This is an important concept which falls under the broader category of Statistics. This is not only important for board exams but also for various competitive exams.

Measures of the Dispersion of the Data

An important characteristic of any set of data is the variation in the data. The degree to which the numerical data tends to vary about an average value is called the dispersion or scatteredness of the data.

The following are the measures of dispersion:

  1. Range

  2. Mean Deviation

  3. Standard deviation and Variance

Range

Range is the difference between the highest and the lowest value in a set of observations.

The range of data gives us a rough idea of variability or scatter but does not tell about the dispersion of the data from a measure of central tendency.

Mean Deviation

Mean deviation measures the deviation of the average mean to the given set of data.

Mean deviation for ungrouped data

Let $n$ observations are $\mathrm{x}_1, \mathrm{x}_2, \mathrm{x}_3, \ldots ., \mathrm{x}_{\mathrm{n}}$.

Mean deviation about 'a', M.D. $(a)=\frac{1}{n} \sum_{i=1}^n\left|x_i-a\right|$
Mean deviation about mean, M.D. $(\bar{x})=\frac{1}{n} \sum_{i=1}^n\left|x_i-\bar{x}\right|$
Mean deviation about median, M.D.(Median $) \left.=\frac{1}{n} \sum_{i=1}^n \right\rvert\, x_i-$ Median $\mid$

Standard Deviation

The standard deviation is a number that measures how far data values are from their mean.
The positive square root of the variance is called the standard deviation. The standard deviation is usually denoted by $\sigma$ and it is given by

$
\sigma=\sqrt{\frac{1}{n} \sum_{i=1}^n\left(x_i-\bar{x}\right)^2}
$

Variance

The mean of the squares of the deviations from the mean is called the variance and is denoted by $\sigma^2$ (read as sigma square). Variance is a quantity that leads to a proper measure of dispersion.

The variance of $n$ observations $x_1, x_2, \ldots, x_n$ is given by

$
\sigma^2=\frac{1}{n} \sum_{i=1}^n\left(x_i-\bar{x}\right)^2
$

Coefficient of Dispersion

The measure of variability which is independent of units is called coefficient of dispersion.

An important characteristic of any set of data is the variation in the data. The degree to which the numerical data tends to vary about an average value is called the dispersion or scatteredness of the data.

The following are the measures of dispersion:

  1. Range

  2. Mean Deviation

  3. Standard deviation and Variance

1. Coefficient of Range

The coefficient of the range equals $\frac{x_{\max }-x_{\min }}{x_{\max }+x_{\min }}$
Where $x_{\max }$ is the highest observation, and $x_{\min }$ is the lowest observation

2. Coefficient of Mean Deviation

The coefficient of mean deviation is $=\frac{M D}{\bar{x}}$

where $MD$ is the mean deviation and $\bar{x}$ is the mean of the data.

3. Coefficient of Standard Deviation

The coefficient of standard deviation is $\frac{\sigma}{\bar{x}}$
where $\sigma$ and $\bar{x}$ are the standard deviation and mean of the data respectively.

4. Coefficient of Variance

The mean of the squares of the deviations from the mean is called the variance and is denoted by $\sigma^2$ (read as sigma square).
Variance is a quantity that leads to a proper measure of dispersion.
The variance of $n$ observations $x_1, x_2, \ldots, x_n$ is given by

$
\sigma^2=\frac{1}{n} \sum_{i=1}^n\left(x_i-\bar{x}\right)^2
$


The coefficient of variation is defined as

$
\text { C.V. }=\frac{\sigma}{\bar{x}} \times 100, \bar{x} \neq 0
$

where $\sigma$ and $\bar{x}$ are the standard deviation and mean of the data. It is consistent than the other and thus is considered better.

Recommended Video Based on Coefficient of Dispersion

Solved Examples Based on Coefficient of Dispersion

Example 1: Coefficient of variation of two distributions are $50$ and $60$ , and their arithmetic means are $30$ and $25$ respectively. Difference of their $S.D.$ is?
1) $0$
2) $1$
3) $1.5$
4) $2.5$

Solution
$
\begin{aligned}
& \frac{\sigma_1}{\bar{x}_1} \times 100=50 \Rightarrow \sigma_1=\frac{\bar{x}}{2}=\frac{30}{2}=15 \\
& \frac{\sigma_2}{\bar{x}_2} \times 100=60 \Rightarrow \sigma_1=\frac{60}{4}=15 \\
& \left|\sigma_1-\sigma_2\right|=0
\end{aligned}
$
Hence, the answer is the option 1 .

Example 2: The following are the weights (in kg ) of $8$ students in a class: $62,65,58,60,70,55,68$, and $72$ . What is the coefficient of variation $(CV)$ of the weights?
1) $10.3 \%$
2) $9.9 \%$
3) $23.1 \%$
4) $27.5 \%$

Solution
To calculate the coefficient of variation $(CV)$, we first need to calculate the mean weight and standard deviation of the weights:

$
\begin{aligned}
& \text { Mean weight }=\frac{62+65+58+60+70+55+68+72}{8}=64.5 \\
& \begin{aligned}
\text { Standard deviation } & =\sqrt{\frac{(62-64.5)^2+(65-64.5)^2+(58-64.5)^2+(60-64.5)^2+(70-64.5)^2+(55-64.5)^2+(68-64.5)^2+(72-64.5)^2}{7}} \\
& =6.39
\end{aligned}
\end{aligned}
$
Now we can calculate the coefficient of variation $(CV)$:

$
\frac{6.39}{64.5} \times 100 \%=9.9 \%
$
Hence the answer is option (2)

Example 3: If the coefficient of variation $(CV)$ of a set of data is $0.5$ , what can be said about the variability of the data compared to the mean?
1) The data is highly variable compared to the mean.
2) The data is moderately variable compared to the mean.
3) The data is slightly variable compared to the mean.
4) The variability of the data cannot be determined from the coefficient of variation.

Solution mean, which indicates that the data is highly variable compared to the mean. In general, a $CV$ greater than $1$ indicates high variability, while a $CV$ less than $1$ indicates low variability.
Hence, the answer is option (1).

Example 4: Which of the following is a drawback of using the coefficient of quartile deviation as a measure of dispersion?
1) It does not take into account extreme values.
2) It is sensitive to changes in the units of measurement.
3) It is difficult to calculate.
4) It is affected by the size of the dataset.

Solution

The coefficient of quartile deviation $(CQD)$ is a measure of relative dispersion that is based on the quartiles of a dataset. However, a drawback of $CQD$ is that it does not take into account extreme values, such as outliers, which can have a significant impact on the dispersion of the data. Therefore, $CQD$ may not accurately reflect the dispersion of the dataset if it contains extreme values.
Hence, the answer is option (1).

Example 5: If the mean of the data: $7,8,9,7,8,7, \lambda, 8$ is $8$, then the variance of this data is :
1) $\frac{7}{8}$
2) $1$
3) $\frac{9}{8}$
4) $2$

Solution

$
\begin{aligned}
& \text { mean of data }=\frac{7+8+9+7+8+7+7+8}{8}=8 \\
& \Rightarrow \lambda=10
\end{aligned}
$
Variance

$
\begin{aligned}
& V^2=\frac{(7-8)^2+(8-8)^2+(9-8)^2+(7-8)^2+0^2+(7-8)^2+(10-8)^2+(8-}{8} \\
& =\frac{8}{8}=1
\end{aligned}
$

Variance $=1$
Hence, the answer is the option 2.

Summary

The measure of variability which is independent of units is called coefficient of dispersion. The standard deviation is a number that measures how far data values are from their mean.
$
\sigma=\sqrt{\frac{1}{n} \sum_{i=1}^n\left(x_i-\bar{x}\right)^2}
$
The mean of the squares of the deviations from the mean is called the variance.
The coefficient of variation is defined as

$
\text { C.V. }=\frac{\sigma}{\bar{x}} \times 100, \bar{x} \neq 0
$

where $\sigma$ and $\bar{x}$ are the standard deviation and mean of the data.


Frequently Asked Questions (FAQs)

1. What are the measures of dispersion?

The degree to which the numerical data tends to vary about an average value is called the dispersion or scatteredness of the data. The measures of dispersion are Range, Mean deviation, Variance and Standard deviation.

2. What is the coefficient of dispersion?

The measure of variability in a data which is independent of units is called coefficient of dispersion.

3. What is coefficient of variance?

The mean of the squares of the deviations from the mean is called the variance. The coefficient of variation is defined as

$
\text { C.V. }=\frac{\sigma}{\bar{x}} \times 100, \bar{x} \neq 0
$

where $\sigma$ and $\bar{x}$ are the standard deviation

4. What is coefficient of standard deviation?

The standard deviation is a number that measures how far data values are from their mean. $
\sigma=\sqrt{\frac{1}{n} \sum_{i=1}^n\left(x_i-\bar{x}\right)^2}
$

5. What is the Coefficient of Dispersion (COD) in statistics?
The Coefficient of Dispersion (COD) is a measure of relative variability in a dataset. It quantifies how spread out the data points are relative to the central tendency, typically the mean. The COD is particularly useful for comparing the dispersion of datasets with different units or scales.
6. How does the Coefficient of Dispersion differ from other measures of variability?
The Coefficient of Dispersion is a relative measure, unlike absolute measures like standard deviation or variance. It expresses variability as a percentage of the mean, making it useful for comparing datasets with different scales or units. This allows for easier interpretation and comparison across diverse datasets.
7. What is the formula for calculating the Coefficient of Dispersion?
The formula for the Coefficient of Dispersion is:
8. Why is the Coefficient of Dispersion expressed as a percentage?
The Coefficient of Dispersion is expressed as a percentage to provide a standardized, easy-to-interpret measure of relative variability. This percentage representation allows for quick comparisons between different datasets, regardless of their original units or scales.
9. Can the Coefficient of Dispersion be negative?
No, the Coefficient of Dispersion cannot be negative. Since it's calculated using the absolute values of standard deviation and mean, and then multiplied by 100, the result is always a positive percentage.
10. Can the Coefficient of Dispersion be negative?
No, the Coefficient of Dispersion cannot be negative. It's always expressed as a positive percentage because it's based on the absolute values of the standard deviation and mean.
11. How is the Coefficient of Dispersion interpreted?
The Coefficient of Dispersion is interpreted as the percentage of the mean that the standard deviation represents. For example, a COD of 25% means that the standard deviation is 25% of the mean, indicating the relative spread of the data.
12. In what situations is the Coefficient of Dispersion particularly useful?
The Coefficient of Dispersion is particularly useful when comparing datasets with different units or scales, or when the absolute values of the data are less important than their relative spread. It's commonly used in fields like finance, economics, and quality control.
13. How does the Coefficient of Dispersion relate to the Coefficient of Variation?
The Coefficient of Dispersion and the Coefficient of Variation (CV) are essentially the same measure. The COD is typically expressed as a percentage, while the CV is usually expressed as a decimal. To convert CV to COD, simply multiply by 100.
14. Can the Coefficient of Dispersion be used for all types of data?
While the Coefficient of Dispersion can be calculated for many types of data, it's most meaningful for ratio-scale data (data with a true zero point). It may not be appropriate for nominal or ordinal data, or for data with negative values or a mean close to zero.
15. How does the Coefficient of Dispersion handle outliers in a dataset?
The Coefficient of Dispersion can be significantly affected by outliers because it uses both the mean and standard deviation in its calculation. Outliers can inflate both these measures, potentially leading to an overestimation of the true dispersion in the data.
16. How is the Coefficient of Dispersion used in quality control?
In quality control, the Coefficient of Dispersion can be used to monitor the consistency of production processes. A lower COD indicates more consistent output, while a higher COD might signal increased variability that needs investigation.
17. How can the Coefficient of Dispersion be used in hypothesis testing?
While not directly used in hypothesis testing, the Coefficient of Dispersion can inform the choice of statistical tests. For instance, if two groups have very different CODs, it might suggest the need for tests that don't assume equal variances.
18. Can the Coefficient of Dispersion be used to detect outliers?
While the Coefficient of Dispersion itself doesn't detect outliers, an unexpectedly high COD might suggest the presence of outliers. However, other methods like box plots or z-scores are more commonly used for outlier detection.
19. How does the Coefficient of Dispersion relate to the concept of effect size in meta-analysis?
In meta-analysis, effect sizes are often standardized measures, similar to the Coefficient of Dispersion. While the COD itself isn't typically used as an effect size, understanding relative variability (which the COD measures) can be important in interpreting and comparing effect sizes across studies.
20. What does a low Coefficient of Dispersion indicate?
A low Coefficient of Dispersion indicates that the data points are clustered closely around the mean. This suggests that the dataset has relatively low variability or dispersion compared to its central tendency.
21. What does a high Coefficient of Dispersion signify?
A high Coefficient of Dispersion signifies that the data points are spread out widely from the mean. This indicates high variability or dispersion in the dataset relative to its central tendency.
22. How does sample size affect the Coefficient of Dispersion?
Sample size doesn't directly affect the calculation of the Coefficient of Dispersion. However, larger sample sizes generally provide more reliable estimates of population parameters, including the COD. Smaller samples may lead to less stable or representative COD values.
23. What are the limitations of using the Coefficient of Dispersion?
Some limitations of the COD include: it's sensitive to outliers, it can be misleading for data with a mean close to zero, it doesn't provide information about the shape of the distribution, and it assumes that the standard deviation is a good measure of spread for the data.
24. Can the Coefficient of Dispersion be used to compare datasets with different means?
Yes, one of the main advantages of the Coefficient of Dispersion is that it can be used to compare datasets with different means. By expressing dispersion as a percentage of the mean, it provides a standardized measure that allows for meaningful comparisons across different scales or units.
25. What's the relationship between the Coefficient of Dispersion and data normality?
The Coefficient of Dispersion doesn't assume or require data normality. However, for normally distributed data, the COD can be related to the proportion of data falling within certain ranges. For non-normal distributions, the interpretation may be less straightforward.
26. How is the Coefficient of Dispersion used in finance?
In finance, the Coefficient of Dispersion is often used to compare the volatility of different investments or financial instruments. It helps investors assess the relative risk of various options, regardless of their absolute price levels or returns.
27. What's the difference between the Coefficient of Dispersion and the Gini coefficient?
While both measure dispersion, they serve different purposes. The Coefficient of Dispersion measures relative variability around the mean, while the Gini coefficient specifically measures income or wealth inequality within a population. The Gini coefficient ranges from 0 to 1, whereas the COD is expressed as a percentage.
28. How does the Coefficient of Dispersion relate to the concept of risk in statistics?
The Coefficient of Dispersion is closely related to the concept of risk in statistics. A higher COD indicates greater variability relative to the mean, which often translates to higher risk in many contexts, such as financial investments or quality control processes.
29. Can the Coefficient of Dispersion be used for multivariate data?
The Coefficient of Dispersion is typically used for univariate data (single variable). For multivariate data, other measures like the coefficient of multiple variation or multivariate coefficient of variation are more appropriate.
30. How does changing the units of measurement affect the Coefficient of Dispersion?
One of the key advantages of the Coefficient of Dispersion is that it's invariant to changes in units. Whether you measure in meters or feet, dollars or euros, the COD will remain the same, making it useful for comparing datasets with different units.
31. What's the relationship between the Coefficient of Dispersion and the standard score (z-score)?
While both relate to variability, they serve different purposes. The COD gives an overall measure of relative dispersion, while z-scores standardize individual data points. However, a dataset with a low COD will tend to have z-scores closer to zero.
32. Can the Coefficient of Dispersion be used with skewed distributions?
Yes, the Coefficient of Dispersion can be calculated for skewed distributions. However, interpretation should be cautious as the mean (used in the COD calculation) may not be the best measure of central tendency for highly skewed data.
33. How does the Coefficient of Dispersion compare to the Interquartile Range (IQR) as a measure of variability?
The COD and IQR both measure variability, but differ in approach. The COD uses the mean and standard deviation, making it sensitive to all data points including outliers. The IQR focuses on the middle 50% of the data, making it more robust to outliers.
34. What's the relationship between the Coefficient of Dispersion and effect size in statistics?
The Coefficient of Dispersion is related to effect size measures like Cohen's d. Both provide standardized measures of variability, but Cohen's d typically compares two groups, while the COD describes variability within a single dataset.
35. How does the Coefficient of Dispersion relate to the concept of precision in measurements?
The Coefficient of Dispersion is closely related to precision in measurements. A lower COD indicates higher precision (less variability relative to the mean), while a higher COD suggests lower precision (more variability relative to the mean).
36. What's the difference between the Coefficient of Dispersion and the Range?
The Coefficient of Dispersion and Range both measure spread, but differ significantly. The Range only considers the extreme values and is an absolute measure. The COD uses all data points, is relative to the mean, and is expressed as a percentage, making it more informative for many comparisons.
37. How does the Coefficient of Dispersion behave with datasets that have a bimodal distribution?
For bimodal distributions, the Coefficient of Dispersion may not provide a complete picture of the data's variability. It will still give a measure of overall spread, but it won't capture the unique characteristics of the bimodal shape. In such cases, additional measures or graphical representations should be considered.
38. Can the Coefficient of Dispersion be used with time series data?
Yes, the Coefficient of Dispersion can be applied to time series data. It can help quantify the relative variability of the series over time. However, it doesn't account for the temporal order of the data, so other time series-specific measures might be needed for a complete analysis.
39. How does the Coefficient of Dispersion relate to the concept of signal-to-noise ratio?
The Coefficient of Dispersion is inversely related to the signal-to-noise ratio. A lower COD suggests less "noise" (variability) relative to the "signal" (mean), which would correspond to a higher signal-to-noise ratio.
40. What's the relationship between the Coefficient of Dispersion and the concept of reliability in psychometrics?
In psychometrics, a lower Coefficient of Dispersion could indicate higher reliability of a measure, as it suggests more consistent results relative to the mean. However, reliability is typically assessed using more specific measures like Cronbach's alpha or test-retest correlation.
41. How can the Coefficient of Dispersion be used in environmental science?
In environmental science, the Coefficient of Dispersion can be used to compare the variability of different environmental parameters, such as pollution levels or species abundance, across different locations or time periods, regardless of their absolute values.
42. What's the difference between the Coefficient of Dispersion and the Index of Dispersion?
While both measure variability, they differ in calculation and interpretation. The Coefficient of Dispersion is (Standard Deviation / Mean) × 100, while the Index of Dispersion is Variance / Mean. The COD is always expressed as a percentage, while the Index of Dispersion is not.
43. How does the Coefficient of Dispersion behave with datasets that include zero values?
The Coefficient of Dispersion can handle datasets with zero values, as long as the mean is not zero. However, if there are many zero values, it may inflate the COD and potentially misrepresent the true variability of the non-zero values.
44. How does the Coefficient of Dispersion relate to the concept of heteroscedasticity?
The Coefficient of Dispersion can be useful in identifying heteroscedasticity (unequal variability across a range of values). If the COD varies significantly when calculated for different subsets of the data, it might indicate heteroscedasticity.
45. What's the relationship between the Coefficient of Dispersion and the concept of entropy in information theory?
While both relate to variability or uncertainty, they're quite different. The COD measures relative spread in a dataset, while entropy quantifies the average amount of information in a message or dataset. A higher COD doesn't necessarily imply higher entropy, or vice versa.
46. How can the Coefficient of Dispersion be used in marketing research?
In marketing research, the Coefficient of Dispersion can be used to compare the variability of different metrics (e.g., customer satisfaction scores, purchase amounts) across different market segments or time periods, regardless of differences in scale or average values.
47. What's the difference between the Coefficient of Dispersion and the Coefficient of Quartile Deviation?
Both measure relative dispersion, but use different calculations. The Coefficient of Dispersion uses the standard deviation and mean, while the Coefficient of Quartile Deviation uses the first and third quartiles. The latter is less affected by extreme values.
48. How does the Coefficient of Dispersion behave with datasets that have negative values?
The Coefficient of Dispersion can be calculated for datasets with negative values, as long as the mean is not zero. However, interpretation may be less intuitive, and caution should be exercised, especially if there's a mix of positive and negative values.
49. Can the Coefficient of Dispersion be used to compare variability across different sample sizes?
Yes, the Coefficient of Dispersion can be used to compare variability across different sample sizes. However, it's important to note that estimates from smaller samples may be less reliable, so such comparisons should be made cautiously.
50. What's the relationship between the Coefficient of Dispersion and the concept of statistical power?
The Coefficient of Dispersion indirectly relates to statistical power. Higher variability (indicated by a higher COD) generally reduces statistical power, making it harder to detect significant effects. However, power calculations typically use more specific measures of variability.
51. How can the Coefficient of Dispersion be used in epidemiology?
In epidemiology, the Coefficient of Dispersion can be used to compare the variability of disease rates or other health metrics across different populations or time periods, regardless of differences in the absolute rates or population sizes.
52. What's the difference between the Coefficient of Dispersion and the Fano factor?
Both measure variability relative to the mean, but in different ways. The Coefficient of Dispersion is (Standard Deviation / Mean) × 100, while the Fano factor is Variance / Mean. The COD is always expressed as a percentage, while the Fano factor is not.
53. How does the Coefficient of Dispersion relate to the concept of measurement uncertainty?
The Coefficient of Dispersion can be seen as a measure of relative uncertainty. A higher COD suggests greater uncertainty in measurements relative to their average value. However, formal measurement uncertainty calculations often involve more specific procedures.
54. Can the Coefficient of Dispersion be used with ordinal data?
While the Coefficient of Dispersion can be calculated for ordinal data, its interpretation may be less meaningful than for interval or ratio data. For ordinal data, measures like the coefficient of quartile deviation might be more appropriate.

Articles

Back to top