Measurement Uncertainty in Conformity Assessment
Measurement Uncertainty in Conformity Assessment
net/publication/50196781
CITATIONS READS
26 1,313
2 authors:
Some of the authors of this publication are also working on these related projects:
Development of nano-based sensors for rapid food physical-chemical analysis View project
All content following this page was uploaded by Desimoni Elio on 16 July 2014.
REVIEW
Received: 27 October 2010 / Revised: 1 February 2011 / Accepted: 4 February 2011 / Published online: 25 February 2011
# Springer-Verlag 2011
Abstract The uncertainty of measurement is the key indica- a quantitative indication of its quality, so that the user of the
tor of the quality of any experimental result. Proper result can assess its reliability [1]. Such an indication is
consideration of this uncertainty is imperative when testing a represented at best by the measurement uncertainty (MU),
sample against legal/compositional limits. This task can be the value associated with the result of a measurement that
quite challenging when the entity measured in the investigated characterises the dispersion of the values that could
sample is so close to the limit that its uncertainty, however reasonably (e.g. with a given probability/confidence level)
estimated, critically affects decision-making. This explains be attributed to the measurand [1]. As emphasized by the
the many literature contributions discussing the problem. ISO Guide to the Expression of Uncertainty in Measure-
Even though some of the most authoritative organisations ment (GUM), without a clear indication of their uncertainty,
have issued specific guidelines aimed at assisting the staff measurement results cannot be compared either among
involved in such measurements, several aspects of conformity themselves or with reference values given in a specification
testing are still debated in the literature. In this review, after a or standard [1].
short outline of existing information, somewhat more detailed Unfortunately, when dealing with measurements aimed
insight is given into the guidelines of ASME, ISO, and at evaluating conformity with some specification, the
Eurachem/CITAC, because they are the most useful tools for matter becomes quite complex when the measured entity
operators of testing and calibration laboratories. Some aspects in the sample under investigation (e.g. the concentration
of Council Directive 96/23/EC are also discussed. Insight into in chemical analyses) is so close to the specification that
the contents of the mentioned documents enables emphasis of the MU, anyhow estimated, critically affects decision-
analogies and discrepancies. making. This explains the uninterrupted appearance of
contributions devoted to discussion of the multi-faceted
Keywords Conformity testing . Decision rules . Limiting aspects of considering the MU when assessing confor-
values . Specification limits . Guard band . Uncertainty of mity to legal or compositional limits [2–32]. As can be
measurement seen, these papers, listed in chronological order, span the
last fifteen years. Noticeably, a few of them appeared even
after some of the most authoritative organisations issued
Introduction specific guidelines aimed at assisting the staff involved in
such measurements [33–36]. This is probably indicative of
It is well known that, when reporting the result of a a still ongoing debate.
measurement of a physical quantity, it is mandatory to give In this review, after a short outline of existing literature
information a somewhat more detailed insight is given into
the guidelines of ASME [33], ISO [34], and Eurachem/
E. Desimoni (*) : B. Brunetti CITAC [35], because they are the most useful tools for
DiSTAM, Università degli Studi di Milano,
operators of testing and calibration laboratories. Of course,
Via Celoria 2,
20133 Milano, Italy this paper is not aimed exhaustively at presenting the three
e-mail: [Link]@[Link] standards, a task that would obviously require much more
1730 E. Desimoni, B. Brunetti
accordingly increased. Some authors have suggested that, in The coverage factor is a multiplier chosen on the basis of
cases such as B and C, stating conformity or non-conformity the desired level of confidence to be associated with the
with a level of confidence lower than 95% is better than interval defined by U=k·uc. Most frequently, k is in the
nothing [7, 10, 37]. However, such a possibility does not range 2 to 3 [1]. When the normal distribution applies and u
always appear realistic, as in the case of court cases in which is a reliable estimate of the standard deviation of the
the conformity or non-conformity statement must be measurand, U=2·uc defines an interval having a level of
“beyond reasonable doubt” [3]. confidence of approximately 95% (more exactly, a level of
Of course, the discussion about a lower limiting value confidence of 95.45%), and U=3·uc defines an interval
(half bottom of Fig. 1) mirrors that detailed above. having a level of confidence of approximately 99% (more
It has also been reported that, even at present, it is exactly, a level of confidence of 99.73%). However, some
possible that some specification makes no reference to aspects relevant to the concept of uncertainty still deserve
properly considering the effects of MU on the assessment specific comments.
of conformity. In these cases “… it may be appropriate for Notwithstanding the detailed and authoritative docu-
the user to make a judgement of conformity, based on ments intended to explain the meaning of uncertainty, it
whether the test result is within the specified limits with no was noticed that surprisingly invalid MUs were sometimes
account taken of the uncertainty. This is often referred to as provided, often labelled as “standard deviation” [18].
a shared risk, since the end-user takes some of the risk that Examples of such erroneous estimates are an uncertainty
the product may not meet the specification after being resulting from calibration only, a repeatability standard
tested with an agreed measurement method” [37]. deviation and a linearity of some calibration curve.
Finally, recent papers, when examining uncertain cases Moreover, using k=2 or 3 can no longer be accepted if
such as B and C in Fig. 1, also had the objective of the combined uncertainty has too few degrees of freedom
evaluating the effect of MU on producer’s and user’s risk [1, 19, 26]. If the effective number of degrees of freedom,
(usually associated to type I and, type II errors, respectively) νeff, is too low (for example, fewer then six according to
in classification and conformity assessments [32]. Ref. [43], fewer than 30 according to Ref. [19]) the
Student-t distribution is the most appropriate (approximate)
choice for associating a level of confidence with U. It is
Which uncertainty? known that νeff can be estimated by use of the Welch–
Satterthwaite formula [1]. As underlined by the GUM, the
Nowadays, the term “uncertainty of measurement” is experimental standard deviation of the mean of as many as
definitely used to mean the expanded uncertainty, U, thirty repeated observations of a measurand described by a
obtained by multiplying the combined standard uncer- normal distribution has itself an uncertainty of approxi-
tainty, uc (sometimes reported as uc(y), where y is the mately 13% [1].
estimate of the measurand Y), by the coverage factor, k [1, It is also possible that the reported uncertainty data look
2, 7, 16, 22, 33–35, 37–39]. The intended purpose of U is questionable. In these situations, the Horwitz equation [44–
to provide an interval around the result of a measurement 46] can sometimes provide a more realistic view [18, 19].
that may be expected to encompass a large fraction of the The equation is usually presented as:
distribution of values that could reasonably be attributed
to the measurand [1]. RSD% ¼ 2ð10:5logðmf ÞÞ ð1Þ
The combined standard uncertainty is the total uncer-
tainty of a measurement result estimated by properly Where RSD% is the among-laboratory relative standard
combining all the uncertainty components affecting the deviation and mf is the analyte mass fraction (e.g. mf=10−6
whole experimental procedure. Whenever the procedure means mg kg−1). The equation describes the precision
includes the sampling of the material under examination, it performances of a measurement method without regard to
is mandatory considering the uncertainty of sampling the nature of the analyte, the type of test material, the
among all the other uncertainty components (see for applied analytical technique, and the complexity of the
example Refs. [13–15, 34, 35, 39–42]). However, it should procedure. Equation 1 is presented in Fig. 2. Acceptable
be also noted that the combined standard uncertainty is an performances usually provide variability values within one-
estimated standard deviation relying on the assumption that half to twice the predicted RSD% [45]. Within-laboratory
no source of uncertainty has been neglected or overlooked values are expected to be one-half to two-thirds of the
and that, consequently, is itself affected by a more or less among-laboratory values [46]. Even if significant devia-
significant uncertainty. Also the GUM (section G 1.2) tions from the estimates obtained by the Horwitz equation
emphasizes that the value of the expanded uncertainty is at are possible (Ref. [47] and references cited therein)
best only approximate [1]. nevertheless, the equation is still an acceptable basis for
1732 E. Desimoni, B. Brunetti
When confronting the problem of testing for conformity as “undetected” results falling below the limit of
with a concentration LV, some authors tried to develop detection [9].
alternative approaches enabling decision-making to be The rules for stating when a limiting value is exceeded
performed directly in the signal domain (SD), that is by were introduced in a paper where, after discussing the one-
comparing the signal of the sample under investigation with stage and two-stage procedures detailed in Ref. [12] and
the signal relevant to a sample containing exactly the subsequently adopted by the ISO guidelines [34] (see the
specified LV (e.g. to a suitable certified reference material, section about the ISO guidelines, below), the authors
CRM). Then, the conclusion of the comparison had simply suggested two procedures for estimating the number of
to be translated to the CD by a proper calibration constant. measurements necessary for appropriate reliability of the
It is well known that the physical quantities of interest results [12, 13].
(concentration in this case) cannot be measured directly but Another paper was intended to highlight some experi-
are connected to the measured signals through a calibration mental problems preventing uniform implementation of
constant [48]. This view is at the basis of some accepted legislative standards in the EU and Codex [22]. The authors
approaches suitable for estimating the limit of detection emphasized that often there is no common interpretation of
(LOD), where proper statistical tests must enable evaluation analytical results across the EU in the food sector, so that
of whether the concentration of the analyte in the sample significantly different decisions may be taken after analy-
under investigation is higher than zero (see, for example, sing the same sample. Particular attention directed at the
among the most authoritative, Refs. [49, 50]). Of course, consequences of reporting and using the experimental
estimating the LOD is a problem quite similar to that of results in different ways, considering, or not, the recovery
assessing conformity, where proper statistical test must and using results including a different number of significant
enable evaluation of whether the concentration of the figures [22].
analyte in the sample under investigation is higher (or Economic aspects affecting conformity assessment were
lower) than a limiting value. also considered by some authors. One paper discussed
But comparing signals requires proper consideration economic terms of common rules in conformity assessment
of both false-positive (type I) and false-negative (type II) based on measurement by extending tools of sampling
errors: see for example the ISO approach to the LOD when using inspection by variable and inspection by
[50]. Approaches have been tentatively proposed for attribute [24]. Another paper discussed decision-making
assessing conformity to some limiting values by working in conformity assessment in terms of effective cost
in the SD and considering both types of errors [4, 6, 8, associated with measurement, testing and incorrect
11]. Unfortunately, they suffer from disadvantages, name- decision-making [31].
ly the actual availability of the CRM containing exactly Attention was also drawn to the need to provide an
the concentration of the analyte specified by the limiting unambiguous and simple procedure for assessing confor-
value, [6, 8, 11] (quite an improbable case) and, if mity by designing really appropriate decision rules for
available, the uncertainty of the analyte concentration in conformity tests (see the following sections). This should
the analysis certificate of the CRM [20]. In subsequent require the knowledge of the acceptable level of the
papers, approaches were proposed for performing a test probability of making a wrong decision. The author
for conformity, or a test for non-conformity, by working in emphasized that, at least in principle, the acceptable level
the concentration domain but, at least, by taking into of the probability of making a wrong decision can be
account both false-positive and false-negative errors [20, determined if the cost/consequence of taking a wrong
23]. In particular, an approach was proposed based on an decision is known [25]. Unfortunately, as in the case of
existing model of the limit of detection [28]. measurements concerning contaminants in foods, little or
no information is usually available [25].
Finally, when introducing the methods used by
Interpretative problems accredited calibration laboratories, for example within the
Deutscher Kalibrierdienst (e.g. the German accreditation
Several papers were intended to deal with some peculiar body for calibration laboratories, DKD), it was also
aspects of conformity tests. Attention was focused at reported that when measuring a characteristic for confor-
cases in which legislative limits were set below the mity with a tolerance zone, e.g. in the case of a set of LVs, a
detection capability of the existing analytical techniques statement of conformity should only be made if the ratio of
[9]. The examined case study showed that legislation the width of the tolerance zone to the standard uncertainty
based on limiting values may sometimes be beyond associated with the estimate of the characteristic is
analytical capability, and that the limited analytical sufficiently large [27]. Such a condition can be quantified
capability can be exacerbated by the practice of reporting by the measurement capability index, Cm, e.g. the ratio of
1734 E. Desimoni, B. Brunetti
the width of permissible values to some multiple of the when a measurement is within this zone. A transition zone
standard uncertainty associated with the estimate of the is the set of values of a characteristic, for a specified
characteristic [27] or, analogously, to some multiple of the measurement process and decision rule, that is neither in
standard deviation representing the variability of the process the acceptance nor rejection zone.
or product [32]. This aspect is also managed in the ASME A guard band is the magnitude of the offset from the
document [33] (see the N:1 decision rule in the relevant specification limit to the acceptance or rejection zone
section). boundary.
Simple acceptance means a situation when the accep-
tance zone equals and is identical with the specification
International standards zone. Simple rejection means a situation when the rejection
zone consists of all values of the characteristic outside the
As anticipated in the Introduction, at present the guidelines specification zone. Relaxed acceptance means a situation
of ASME [33], ISO [34], and Eurachem/CITAC [35] are when the acceptance zone is increased beyond the
among the most useful tools for the operators of testing and specification zone by a guard band (Fig. 3a). In a binary
calibration laboratories involved in assessing conformity or decision rule, relaxed acceptance is accompanied by
non-conformity with given specification. This because they stringent rejection. Stringent acceptance, in contrast ,
describe procedures sufficiently simplified to be widely means a situation when the acceptance zone is reduced
interpretable and managed. from the specification zone by a guard band(s) (Fig. 3b). In
The ASME document providing guidance for assessment a binary decision rule, stringent acceptance is accompanied
of electrical and mechanical products, is briefly discussed by relaxed rejection.
here below, because: It should be kept in mind that conformity tests including
the choice of the guard band are based on limitations
1. its principles inspired the Eurachem/CITAC guide [25,
stemming from economic, health, or other fields of
26, 35]; and
interest. The tests performed rely on scientific criteria
2. it is a convenient introduction to the concept of guard
and limitations, but the final decision is from the outside
bands and decision rules.
world.
The terminology adopted below can differ from the original The ASME guidelines identify different cases of
terminology used in the three documents because of the acceptance and rejection zones by decision rules. In
need to use the same symbols for the same object/ particular, it reports that the most common form of
quantities. acceptance and rejection in industry is that performing
simple acceptance and rejection using an N:1 decision rule.
The ASME B89.7.3.1-2001 guidelines N:1 means that the measurement interval, result±U, cannot
be larger than the fraction 1/N of the specification zone.
The objective of the ASME B89.7.3.1-2001 standard Usually N is taken equal to 3 or 4. Of course, using this
“Guidelines for decision rules: considering measurement decision rule can lead to decision making problems when
uncertainty in determining conformance to specifications” the measurement result is too close to the specification limit
(the ASME guidelines from now on) is to facilitate the (s). Because the N:1 decision rule is not applied by the ISO
development of understanding between suppliers and and Eurachem/CITAC guidelines [34, 35], it is not
customers regarding proper consideration of MU in considered in the following paragraphs.
conformity tests [33]. It was prepared by the American To increase confidence that a rejected product is
Society of Mechanical Engineers, but it can greatly help in actually out of specification, e.g. by choosing a low risk
understanding the other guidelines because it lists some for the producer, ASME guidelines apply a stringent
basic definitions. rejection and relaxed acceptance. This means that the
According to the ASME guidelines [33], a decision rule relaxed acceptance zone is obtained by increasing the
is a documented rule that describes how MU will be specification zone by a Z% guard band at the specifica-
allocated with regard to accepting or rejecting a product tion limit or at both specification limits. Z% is the size
according to its specification and the results of a measure- of the guard band expressed as a percentage of the
ment. An acceptance zone is the set of values of a expanded uncertainty (a 100% guard band has the
characteristic, for a specified measurement process and magnitude of U) (Fig. 3a).
decision rule, that results in product acceptance when a Similarly, to increase confidence in product quality by
measurement is within this zone. A rejection zone is the set reducing the probability of accepting an out-of-specification
of values of a characteristic, for a specified measurement product, e.g. by choosing a low risk for the consumer,
process and decision rule, that results in product rejection ASME guidelines apply a stringent acceptance and relaxed
Uncertainty of measurement and conformity assessment: a review 1735
Lower Upper
specification specification
limit limit
Specification zone
g In g In
rejection. Here the acceptance zone is obtained by reducing are not permissible (Fig. 4). The intervals are based on
the specification zone by the guard band(s) amount. Again accepted and required probabilities.
the size of the guard band is expressed as a percentage of U The ISO standard also details the requirements for defining
(Fig. 3b). limiting values. The entity and the quantifiable characteristic
of the entity shall be clearly and unambiguously specified, the
The ISO 10576–1 international standard test procedure should be a standardised one and, as already
cited in the section Which uncertainty?, the MU shall neither
The objective of the ISO 10576–1 international standard explicitly nor implicitly be referred to in the designation of
(the ISO standard from now on) is to provide assurance of the LVs. The ISO standard reports examples of single and
conformity or assurance of non-conformity, either in the double LVs and specifies that the uncertainty interval shall be
form of supplier’s declaration, or of a third party certifica- determined according to the GUM [1].
tion. In its introduction, the ISO standard, also, provides The principal feature of the ISO standard is the
some definitions. Conformity testing is defined as a recommendation that the conformity test be performed as
systematic examination of the extent to which an entity a two-stage procedure, in agreement with a previous
conforms to a specified criterion [34]. The limiting values suggestion [12, 13]. The advantage of the two-stage
(LV) or specification limits (SL) are the specified values of procedure is a substantially higher probability of declaring
the characteristic giving upper and/or lower bounds of the conformity for entities with permissible values of the
permissible values. The region of permissible values is the quantity of interest (the concentration, in chemical analy-
interval or intervals of all permissible values of the ses) which are closer to the LV. The two-stage procedure is
characteristic. The region of non-permissible values is the represented in Fig. 5. By the wording “appropriate
interval or intervals of all values of the characteristic that combination of the two (set of) measurement results”
1736 E. Desimoni, B. Brunetti
YES
Is IMU inside RPV?
NO
Does I MU include NO
a SL?
YES
Stage 2
Perform the measurement procedure once more
Determine an appropriate combination of the two
measurement results and calculate the new IMU
YES
Is IMU inside RPV?
NO
NO
Does I MU include
a SL?
Inconclusive test
Non-conformity
may be assured
the acceptance zone in Fig. 5a corresponds to the relaxed Appendix A of the Guide reports some examples of how
acceptance zone according to ASME (reported in Fig. 3a). the guard bands can be determined. In general the size of
In the case presented above, the value of the guard band,g, the guard band is k·u (see the section Which uncertainty?).
is chosen so that, for a measurement result greater than or A point deserving some comment is relevant to Case 1a, in
equal to LV+g, the probability of false rejection is less than which only the standard uncertainty, u, is available [35]. It
or equal to α. A typical value of α (the probability of false is reported that in many cases, current practice is to use k=
positive errors) is 5%. 2. As stated by the Guide, on the assumption that the
Analogously, in Fig. 6b the acceptance zone corre- distribution is approximately normal, this choice gives a
sponds to the stringent acceptance zone according to level of confidence of approximately 95% that the value of
ASME guidelines (reported in Fig. 3b). Some potentially the measurand lies in the interval y±2·u. On this basis, the
unclear aspects of the Eurachem/CITAC definition of Guide states that “the probability that the value of the
acceptance and rejection zones were recently discussed measurand is less than y+2·u is approximately 97.5%.” It is
[29, 30]. likely that this last sentence can be quite perplexing to
1738 E. Desimoni, B. Brunetti
LV
g: guard band
2.3% 2.3%
g: guard band A
c
g
b)
Fig. 7 a Explanation of the sentence “If the size of the guard band is
2·u, then the probability that y is less than y+2·u is approximately
Z c
y2u Z1 97.5%”. b same as a but the size of guard band is u
f ðcÞdc ffi 0:023 ffi f ðcÞdc ð3Þ
0 yþ2uc
About Directive 96/23/EC
and by the following equations for Fig. 7b:
The Commission Decision of 12 August 2002 (the Decision
Z
yþuc
from now on) was aimed at implementing Council
f ðcÞdc ffi 0:683 ð4Þ Directive 96/23/EC concerning the performance of analyt-
yuc ical methods and the interpretation of results obtained in the
monitoring of specific substances and residues thereof in
live animals and animal products, when they affect public
Z
yuc Z1
health (available online) [36]. The Decision provides rules
f ðcÞdc ffi 0:159 ffi f ðcÞdc ð5Þ for the analytical methods to be used in the testing of
0 yþuc official samples and specifies common criteria for the
Uncertainty of measurement and conformity assessment: a review 1739
This uncertainty interval is all below the LVu. This last References
result enables the statement of conformity with the
specified limit at the given confidence level. 1. Guide to the expression of Uncertainty in Measurement (1995) 1st
Ed., and corrected and reprinted Ed., ISO Secretariat, Switzerland
Example 2 2. Analytical Methods Committee (1995) Uncertainty of measure-
ment: implications of its use in analytical science. Analyst
120:2303–2308
The second example deals with the Eurachem/CITAC 3. Barwick V (1996) Setting and enforcing legislative limits. Anal
approach [35]. In this case, one should define the size of Commun 33:123–124
the guard band and choose to perform a test enabling high 4. de Ruig WG (1996) Characterizing and optimizing analytical
confidence of correct rejection (as in Fig. 6a) or high method quality. Trends Food Sci Technol 7:234–240
5. Ellison S, Wegscheider W, Williams A (1997) Measurement
confidence of correct acceptance (as in Fig. 6b). uncertainty. Anal Chem 69:607A–613A
Commission regulation (EU) No 105/2010 of 5 February 6. Desimoni E, Mannino S (1998) Uncertainty of measurement and
2010 amending Regulation (EC) No 1881/2006 setting legislative limits. Accredit Qual Assur 3:335–337
maximum levels for specific contaminants in foodstuffs 7. King B (1999) Assessment of compliance of analytical results
with regulatory or specification limits. Accredit Qual Assur 4:27–
with regard to ochratoxin A specifies the upper limit value 30
LVu =80 μg kg−1 for the concentration of ochratoxin A in 8. Buzoianu M (2000) Measurement uncertainty and legal limits in
liquorice extract [53]. Suppose that the combined uncer- analytical measurements. OIML Bull XLI:5–12
tainty of a measurement method, evaluated before 9. Ellison SRL, Barwick VJ, Williams A (2000) Legislative limits
below detection capability. Accredit Qual Assur 5:308–313
performing the necessary measurements, is 3.5 μg kg−1. 10. Hibbert DB (2001) Compliance of analytical results with
By using the above data, choosing α=0.05 and performing regulatory or specification limits: a probabilistic approach.
the test enabling high confidence of correct acceptance, the Accredit Qual Assur 6:346–351
guard band is equal to g=1.65·3.5=5.775 μg kg−1. The 11. Desimoni E, Mannino S, Brunetti B (2001) On the assessment of
compliance with legal limits: Part 1: signal and concentration
guard band is subtracted from the upper limit value. The domains. Accredit Qual Assur 6:452–458
acceptance zone then extends to 80–5.775 = 12. Holst E, Thyregod P, Wilrich P-Th (2001) On conformity testing
74.225 μg kg−1. Suppose that the concentration of and the use of two stage procedures. Int Stat Rev 69:419–432
ochratoxin A in two samples are 86.07 μg kg−1 and 13. Christensen JM, Holst E, Olsen E, Wilrich PT (2002) Rules for
stating when a limiting value is exceeded. Accredit Qual Assur
72.33 μg kg−1. The first sample is rejected and the second 7:28–35
is accepted. In contrast, when performing the test is to 14. Andersson H (2002) The use of uncertainty estimates of test
enable high confidence of correct rejection, the guard band results in comparison with acceptance limits. Accredit Qual Assur
is added to the upper limit value. The acceptance zone then 7:228–233
15. Glaeser H (2002) Compliance with legal requirements: definition
extends to 80+5.775=85.775 μg kg−1. In this second case, of limits, sampling and measurement uncertainty. Accredit Qual
both samples are accepted. Assur 7:388–391
16. Kessel W (2002) Measurement uncertainty according to ISO/
BIPM-GUM. Thermochim Acta 382:1–16
17. Källgren H, Lauwaars M, Magnusson B, Pendrill L, Taylor P
Conclusions (2003) Role of measurement uncertainty in conformity assessment
in legal metrology and trade. Accredit Qual Assur 8:541–547
This literature information confirms that some aspects of 18. van der Veen AMH (2003) Measurement uncertainty and doping
the assessment of conformity with legal or compositional control in sport. Accredit Qual Assur 8:334–339
19. van der Veen AMH (2004) Measurement uncertainty and doping
limiting values deserve further developments. A general control in sport. Part 2: Metrological traceability aspects. Accredit
agreement exists about the need to properly take into Qual Assur 9:311–316
account the uncertainty of measurement in decision 20. Desimoni E, Brunetti B, Cattaneo R (2004) The assessment of
making, and use of the correct MU (including the sampling compliance with legal limits. Part 2. How to consider both type I
and type II errors when working in the concentration domain.
uncertainty component and estimated according to GUM). Accredit Qual Assur 9:445–449
But problems still exist, especially concerning the need for 21. Desimoni E, Brunetti B (2004) About CCα and CCβ as
unification and/or unequivocal formulation of the wording introduced by the commission decision of 12 August 2002
of prescriptions by the regulatory Authorities. Finally, implementing council directive 96/23/EC. Accredit Qual Assur
9:724–725
decision making could be greatly facilitated by issuing 22. Wood R (2006) Compliance decision criteria—problems encoun-
really unified and, consequently, generally agreeable and tered and actions taken in the EU and Codex. Accredit Qual Assur
usable guidelines. 11:83–88
23. Desimoni E, Brunetti B (2006) Considering uncertainty of
measurement when assessing compliance or non-compliance
Acknowledgements Financial support by the Fondo Interno per la with reference values given in compositional specifications
Ricerca Scientifica e Tecnologica 2008 (Università degli Studi di and statutory limits: a proposal. Accredit Qual Assur 11:363–
Milano) is acknowledged. 366
Uncertainty of measurement and conformity assessment: a review 1741
24. Pendrill LR (2008) Operating ‘cost’ characteristics in sampling by 37. ILAC-G8 (1996) Guidelines on assessment and reporting of
variable and attribute. Accredit Qual Assur 13:619–631 conformity with specification, International Laboratory Accredi-
25. Williams A (2008) Compliance with specifications. Accredit Qual tation Cooperation, ILAC, Australia
Assur 13:617–618 38. NMKL Procedure No. 5 (1997) Estimation and expression of
26. Williams A (2008) Principles of the EURACHEM/CITAC guide measurement analysis in chemical analysis, NMKL, Finland
“Use of uncertainty information in compliance assessment”. 39. UKAS LAB 12 (2000) The expression of uncertainty in testing,
Accredit Qual Assur 13:633–638 UKAS, UK
27. Czaske M (2008) Use of uncertainty measurement by accredited 40. Wunderli S (2003) Uncertainty and sampling. Accredit Qual
calibration laboratories when stating compliance. Accredit Qual Assur 8:90
Assur 13:645–651 41. Heydorn K, Esbensen K (2004) Sampling and metrology. Accredit
28. Desimoni E, Brunetti B (2008) About considering both false Qual Assur 9:391–396
negative and false positive errors when assessing compliance and 42. Ramsey MH, Thompson M (2007) Uncertainty from sampling, in
non-compliance with reference values given in compositional the context of fitness for purpose. Accredit Qual Assur 12:503–
specifications and statutory limits. Accredit Qual Assur 13:653– 513
662 43. Ellison SRL, Rosslein M, Williams A (eds.) (2000) EURACHEM/
29. Desimoni E, Brunetti B (2010) About Acceptance and Rejection CITAC Guide “Quantifying uncertainty in analytical measure-
zones as defined in the EURACHEM/CITAC Guide (2007) “Use ments” QUAM: 2000.P1. [Link] Accessed 17
of uncertainty information in compliance assessment”. Accredit September 2010
Qual Assur 15:45–47 44. Horwitz W, Kamps LR, Boyer KW (1980) J Assoc Off Anal
30. Ellison SLR, Williams A (2010) Response to “About acceptance Chem 63:1344–1354
and rejection zones”. Accredit Qual Assur 15:49–51 45. Horwitz W (1982) Evaluation of analytical methods used for
31. Beges G, Drnovsek J, Pendrill LR (2010) Optimising calibration regulation of foods and drugs. Anal Chem 54:67A–76A
and measurement capabilities in terms of economics in conformity 46. Albert R, Horwitz W (1997) A Heuristic derivation of the horwitz
assessment. Accredit Qual Assur 15:147–154 curve. Anal Chem 69:789–790
32. Hinrichs W (2010) The impact of measurement uncertainty on the 47. Linsinger TPJ, Josephs RD (2006) Limitations of the application
producer's and user's risks, on classification and conformity of the Horwitz equation. Trends Anal Chem 25:1125–1130
assessment: An example based on tests on some construction 48. Currie LA (1995) Nomenclature in evaluation of analytical
products. Accredit Qual Assur 15:289–296 methods including detection and quantification capabilities
33. ASME B89.7.3.1-2001 (2002) Guidelines for decision rules: (IUPAC Recommendations 1995). Pure Appl Chem 67:1699–
considering measurement uncertainty in determining conformance 1723
to specifications, ASME, New York 49. Currie LA (1968) Limits for qualitative detection and quantitative
34. ISO 10576–1:2003 (2003) Statistical methods – Guidelines for the determination. Application to radiochemistry. Anal Chem 40:586–
evaluation of conformity with specified requirements, Part 1: 593
General principles, ISO, Geneva 50. International Organization for Standardization (ISO) ISO (1997)
35. Ellison SRL, Williams A (eds.) (2007) EURACHEM/CITAC 11843-1 Capability of detection – Part 1:L Terms and definitions
Guide “Use of uncertainty information in compliance assessment” 51. ISO 11843: 1997 (1997) Capability of detection-part 1: terms and
first edn 2007. [Link] Accessed 17 September definitions, part 2: methodology in the linear calibration case,
2010 ISO, Geneva
36. Commission Decision of 12 August 2002 implementing Council 52. Council Directive 98/83/EC of 3 November 1998 on the quality of
Directive 96/23/EC concerning the performance of analytical water intended for human consumption, OJ C, 5.12.98, p. 32
methods and the interpretation of results (2002/657/EC). [Link] 53. Commission regulation (EU) No 105/2010 of 5 February 2010
[Link]/smartapi/cgi/sga_doc?smartapi!celexplus!prod!Doc amending Regulation (EC) No 1881/2006 setting maximum levels
Number&lg=en&type_doc=Decision&an_doc=2002&nu_doc=657. for certain contaminants in foodstuffs as regard ochratoxin A OJ
Accessed 17 September 2010 L, 6.2.2010, p. 7
ISO methodologies for evaluating measurement uncertainty, such as those defined in GUM, provide a standardized approach to quantifying and expressing uncertainty. This consistency ensures that results are comparable across different contexts and stakeholders, enhancing the reliability of conformity assessments. By incorporating uncertainty into the process, assessors can make more informed decisions regarding compliance, thus reducing the risk of erroneous conclusions about conformity .
The ISO two-stage procedure offers higher probability of declaring conformity for entities near the permissible limits due to its iterative assessment, which refines the decision-making based on averaged and uncertainty-expanded results across two stages. This provides a structured approach that reduces the risk of erroneous conformity claims, compared to the one-stage method which relies solely on initial results and might overlook borderline cases .
Both ASME and Eurachem/CITAC guidelines emphasize the use of guard bands and decision rules in conformity assessment. ASME focuses on stringent or relaxed zones, whereas Eurachem/CITAC defines acceptance and rejection zones without these descriptors. Both rely on statistical certainty to minimize errors in compliance testing, aligning in their foundational approach toward using measurement uncertainty as a critical determinant in establishing conformity .
The ISO 10576–1 standard differentiates permissible from non-permissible values using intervals defined by limiting values (LVs) or specification limits (SLs). Permissible values fall within these defined upper and lower bounds, ensuring conformity with specified criteria. Non-permissible values reside outside these intervals, indicating a lack of conformity. This distinction helps systematically categorize test results based on compliance with predetermined acceptable boundaries .
The ISO 10576–1 standard provides assurance of conformity or non-conformity through systematic examination matching specified criteria. It defines the assurance of non-conformity as a situation where the conformity test demonstrates beyond reasonable doubt that the characteristic does not adhere to requirements. The standard involves using limiting values and specifies that uncertainty should not be cited in defining these values. This guidance helps ascertain conformity through accepted probabilities and specified limits .
Decision rules are crucial as they help delineate acceptance and rejection zones, which are essential in compliance assessment of measurements against specified limits. By considering measurement uncertainty, decision rules ensure that measurements within these zones follow a degree of statistical certainty, thus reducing Type I and Type II errors. The zones impart structure to the interpretation of measurement results, accommodating for uncertainties inherent in testing and thus improving the reliability of conformity assessments .
ASME guidelines apply stringent rejection criteria and relaxed acceptance zones to increase confidence that a rejected product is actually out of specification. They achieve this by increasing the specification zone with a Z% guard band at the specification limit, reducing the risk for the producer. For acceptance, a stringent criteria and relaxed rejection is applied to reduce the probability of accepting an out-of-specification product, lowering the risk for the consumer .
Considering both false negatives and false positives is crucial because it addresses the two primary risks in compliance assessment: accepting non-compliant products (false negatives) that pose safety or quality issues, and rejecting compliant products (false positives) that lead to unnecessary cost and resource losses. Balancing these errors through statistical methods ensures that decision-making in conformity assessment is robust and economically efficient .
Guard bands create binary decision rules by establishing acceptance and rejection zones. In ASME guidelines, guard bands are used to create stringent acceptance and relaxed rejection zones, thereby delineating the areas where products can be confidently accepted as conforming or rejected as non-conforming with set specification limits. These zones are determined by adjusting the specification limits through a percentage of expanded uncertainty, enhancing decision-making confidence regarding product conformity .
The Eurachem/CITAC guide emphasizes using decision rules for determining acceptance and rejection zones without explicitly mentioning simple, stringent, or relaxed zones like ASME. The guide outlines scenarios for compliance assessment, including setting decision limits based on statistical assurance levels. It aligns with ISO and ASME principles, addressing compliance when measurement results are measured against upper limits .