2.1.2 Algorithm for Tolerance IntervalsAlgorithm-Tolerance-Intervals
Tolerance intervals are a statistical interval, and with some confidence level, a specified minimum sampled proportion of a population is expected to fall within this range. For a given minimum percentage of population () in interval and confidence level (), tolerance interval provides the limits, and at least a certain proportion () of the population falls between the limits with the confidence level ().
The type of tolerance interval or bound includes Two-Sided and One-Sided (Lower Bound and Upper Bound).
- Two-Sided: Tolerance interval with both lower and upper bound.
- One-Sided: Include only lower bound and only upper bound, and the former is the value that the minimum percentage of population is likely to be greater than, but be less than for the later.
Both parametric and nonparametric tolerance intervals are computed. For nonparametric tolerance intervals, it only assumes that the parent distribution is continuous, and it is a distribution free method, that does not depend on the parent population of the sample. Meanwhile, parametric tolerance intervals are calculated assuming the parent distribution is one of the following ones:
- Normal
- Lognormal
- Gamma
- Exponential
- Smallest Extreme Value
- Weibull
- Largest Extreme Value
- Logistic
- Loglogistic
Normal
For confidence level of and minimum percentage of population in the interval ( is also called the coverage of the tolerance interval), the exact tolerance intervals, lower limit and upper limit , are calculated by the following equations:
-
-
where
- the mean of the samples.
- the tolerance factor (k-factor).
- the standard deviation of the samples.
Tolerance Factor for One-Sided Intervals
The exact tolerance factor for one-sided interval is computed by:
- , and
where
- the percentile of the noncentral t-distribution with degrees of freedom, and is the noncentrality parameter.
- the percentile of the standard normal distribution.
- the number of observations.
Tolerance Factor for Two-Sided Intervals
The exact tolerance factor for two-sided interval is root () of the following equation:
-
where
- the number of observations.
- the cumulative distribution function for a chi-square distribution with degrees of freedom.
- the percentile of the noncentral chi-square distribution with degree of freedom, and is the noncentrality parameter.
Lognormal
The tolerance interval for the lognormal distribution is calculated following the process below:
- Take the natural logarithm of the original data.
- Compute the tolerance intervals for the transformed data by the same procedure for the Normal distribution.
- Exponentiate the limits of the tolerance intervals obtained in the previous step, so to get the tolerance intervals of the original data.
Gamma
The tolerance interval for the gamma distribution is calculated following the process below:
- Take the cubic root of the original data.
- Compute the tolerance intervals for the transformed data by the same procedure for the Normal distribution.
- Cubic power the limits of the tolerance intervals obtained in the previous step, so to get the tolerance intervals of the original data.
Exponential
For confidence level of and minimum percentage of population in the interval ( is also called the coverage of the tolerance interval), the tolerance intervals, lower limit and upper limit , are calculated differently between one-sided and two-sided.
One-Sided Tolerance Intervals
-
-
where
- the number of observations.
- the mean of the samples.
- the percentile of the chi-square distribution with degrees of freedom.
Two-Sided Tolerance Intervals
-
-
where
-
- the number of observations.
- the mean of the samples.
And is the solution to the system of equations below:
-
-
-
where
- the cumulative distribution function of the chi-square distribution with degrees of freedom.
Smallest Extreme Value
For confidence level of and minimum percentage of population in the interval ( is also called the coverage of the tolerance interval), the tolerance intervals, lower limit and upper limit , are calculated as followed.
One-Sided Tolerance Intervals
-
-
where
- the estimated location parameter of smallest extreme value distribution.
- the estimated scale parameter of smallest extreme value distribution.
- is the lower tolerance factor. And is the unique root of the following function.
- , and
-
- the upper tolerance factor. It is computed by replacing with and with in the functions for calculating .
where
-
- a normalizing constant, and
- the incomplete gamma function.
- the centered observations based on the estimated location and scale parameters of the smallest extreme value distribution.
- the number of observations.
Two-Sided Tolerance Intervals
Replace by and by in the formulas for calculating the one-sided tolerance intervals above to get the two-sided smallest extreme value tolerance intervals.
Weibull
The tolerance interval for the weibull distribution is calculated following the process below:
- Take the natural logarithm of the original data.
- Compute the tolerance intervals for the transformed data by the same procedure for the Smallest Extreme Value distribution.
- Exponentiate the limits of the tolerance intervals obtained in the previous step, so to get the tolerance intervals of the original data.
Largest Extreme Value
The tolerance interval for the largest extreme value distribution is calculated following the process below:
- Multiply the original data by .
- Compute the tolerance intervals for the transformed data by the same procedure for the Smallest Extreme Value distribution.
- Multiply the limits of the tolerance intervals obtained in the previous step by again, so to get the tolerance intervals of the original data.
Logistic
For confidence level of and minimum percentage of population in the interval ( is also called the coverage of the tolerance interval), the tolerance intervals, lower limit and upper limit , are calculated as followed.
One-Sided Tolerance Intervals
-
-
-
-
where
- the lower tolerance factor.
- the upper tolerance factor.
- the estimated location parameter of logistic distribution.
- the estimated scale parameter of logistic distribution.
- the upper percentile of the standard normal distribution.
- the lower percentile of the standard logistic distribution.
-
Two-Sided Tolerance Intervals
Replace by and by in the formulas for calculating the one-sided tolerance intervals above to get the two-sided logistic tolerance intervals.
Loglogistic
The tolerance interval for the loglogistic distribution is calculated following the process below:
- Take the natural logarithm of the original data.
- Compute the tolerance intervals for the transformed data by the same procedure for the Logistic distribution.
- Exponentiate the limits of the tolerance intervals obtained in the previous step, so to get the tolerance intervals of the original data.
Nonparametric
For confidence level of and minimum percentage of population in the interval ( is also called the coverage of the tolerance interval), the tolerance intervals, lower limit and upper limit , by nonparametric method, which is a distribution free method, do not depend on the parent population of the samples, and are calculated as followed.
Let be the ordered statistics based on random sample from some continuously distributed population . Then
-
where
- the cumulative distribution function of the beta distribution with parameters and .
Because the coverage of the intervals have a beta distribution with known parameter values, and these values are not dependent of the distribution of the parent population, . Thus is a distribution-free tolerance intervals.
One-Sided Intervals
Consider the following:
-
where
- the largest integer that satisfies the inequality.
- a binomial random variable, and parameters are and .
- the number of observations.
Then the lower tolerance bound () and the upper tolerance bound () are given by:
-
-
The actual or effective coverage is given by .
Two-Sided Intervals
Consider the following:
-
where
- the smallest integer that satisfies the inequality.
- a binomial random variable, and parameters are and .
- the number of observations.
Thus,
-
where
- the inverse cumulative distribution function of .
Choose , , then the tolerance intervals are given by:
-
-
Note that both and are rounded down to the nearest integer.
The actual or effective coverage is given by .
Goodness-of-fit Test
The Anderson-Darling statistics is used to perform the goodness-of-fit test. For each distribution, the modified Anderson-Darling goodness-of-fit test statistics is computed.
|