truncated regression model
TRANSCRIPT
-
8/13/2019 Truncated Regression Model
1/14
This article was downloaded by: [190.80.134.42]On: 14 October 2013, At: 17:27Publisher: Taylor & FrancisInforma Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House37-41 Mortimer Street, London W1T 3JH, UK
Quality EngineeringPublication details, including instructions for authors and subscription information:
http://www.tandfonline.com/loi/lqen20
A Truncated Logistic Regression Model in Probability o
Detection EvaluationYan Guo
a, Kai Yang
b& Adel Alaeddini
c
aMaterials and Technology Department , Siemens Energy Inc. , Orlando, Florida
bDepartment of Industrial and System Engineering , Wayne State University , Detroit,
MichigancDepartment of Industrial and Operations Engineering , University of Michigan , Ann Arbo
Michigan
Published online: 29 Aug 2011.
To cite this article:Yan Guo , Kai Yang & Adel Alaeddini (2011) A Truncated Logistic Regression Model in Probability ofDetection Evaluation, Quality Engineering, 23:4, 365-377, DOI: 10.1080/08982112.2011.603664
To link to this article: http://dx.doi.org/10.1080/08982112.2011.603664
PLEASE SCROLL DOWN FOR ARTICLE
Taylor & Francis makes every effort to ensure the accuracy of all the information (the Content) containedin the publications on our platform. However, Taylor & Francis, our agents, and our licensors make norepresentations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of t
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, andare not the views of or endorsed by Taylor & Francis. The accuracy of the Content shouldnot be relied upon ashould be independently verified with primary sources of information. Taylor and Francis shall not be liable forany losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoeveor howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use ofthe Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematicreproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in anyform to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms-and-conditions
http://dx.doi.org/10.1080/08982112.2011.603664http://www.tandfonline.com/action/showCitFormats?doi=10.1080/08982112.2011.603664http://www.tandfonline.com/page/terms-and-conditionshttp://www.tandfonline.com/page/terms-and-conditionshttp://dx.doi.org/10.1080/08982112.2011.603664http://www.tandfonline.com/action/showCitFormats?doi=10.1080/08982112.2011.603664http://www.tandfonline.com/loi/lqen20 -
8/13/2019 Truncated Regression Model
2/14
A Truncated Logistic Regression Model in
Probability of Detection EvaluationYan Guo1,
Kai Yang2,
Adel Alaeddini3
1Materials and Technology
Department, Siemens Energy
Inc., Orlando, Florida2Department of Industrial and
System Engineering, Wayne
State University, Detroit,Michigan3Department of Industrial and
Operations Engineering,
University of Michigan, Ann
Arbor, Michigan
ABSTRACT In nondestructive evaluation (NDE) studies, the probability of
detection curve (POD) is an important performance metric. The traditional
POD estimation is to conduct NDE inspections for artificially fabricated spe-
cimens with known flaws. This approach is often challenges because not
only do fabricated flaws not adequately represent the flaws found in the
field, but the cost and time of fabricating artificial specimens can also be
very high. In practice, field samples and components in service withnaturally occurring defects are readily available and much less expensive
to test. However, the disadvantage of this field approach is that the exact
number and sizes of the flaws in a sample (especially for flaws with small
sizes) are unknown. As a result, serious bias in estimating the POD can
occur. In this article, a truncated logistic regression method is developed
that can estimate POD accurately and consistently with field samples based
on multiple inspections. A case study illustrates the successful application of
the proposed approach in a leading manufacturing company. The simula-
tion studies also show that the proposed methods estimate the POD on field
samples with quality comparable to that of the traditional approach on
artificial specimens.
KEYWORDS field inspection, logistic regression, nondestructive evaluation,
POD, simulation
INTRODUCTION
Nondestructive evaluation (NDE) techniques are widely used to evaluate
the states or properties of many kinds of components and structures without
damaging them. The common NDE methods include liquid penetrant, radio-graphic, eddy current, and ultrasonic testing. Effective NDE systems are very
essential for industries such as semiconductor manufacturing, aircraft, and
power generation in which the costs and effects of failures are much higher
than that of inspections and replacements. For example, in semiconductor
manufacturing, ultra-high-frequency ultrasound is extensively employed to
make visible images of the internal voids and defects on the silicon die,
die paddle, and lead frame of the integrated circuits (ICs). Figure 1a illus-
trates the structure of the particle-filled plastic mold compound of an IC,
as well as the circular mold marks at the top surface of the component.
Address correspondence to Kai Yang,Professor, Department of Industrial
and System Engineering, Wayne StateUniversity, 4815 Fourth St., Detroit,
MI 48201. E-mail: [email protected]
Quality Engineering, 23:365377, 2011
Copyright# Taylor & Francis Group, LLC
ISSN: 0898-2112 print=1532-4222 online
DOI: 10.1080/08982112.2011.603664
365
-
8/13/2019 Truncated Regression Model
3/14
The small white features are voids (trapped bubbles)
in the mold compound. Figures 1b and 1c also show
a large crack that could lead to a potential failure of a
connecting rod and several tiny cracks on the surface
of a gear using liquid-penetrant method.
A key advantage of NDE systems is their ability to
detect flaws or abnormal conditions well before the
occurrence of catastrophic failures of key compo-
nents or structures. Specifically, the capability of an
NDE system is measured by its ability to detect aparticular size of flaw, a, which is proportional to
the potential hazard. The larger the flaw size, the
higher chance of failure.
For an ideal NDE system, it is desirable to catch all
of the potential harmful flaws in an inspection run as
illustrated in Figure 2a. In Figure 2a, ath denotes a
threshold value of flaw size. If the flaw size a is
greater thanath, the corresponding flaw is a potential
concern and should be detected. On the other hand,
ifa is smaller thanath, the corresponding flaw is not
potentially critical to the applications. POD(a) inFigure 2 represents the probability of detection as a
function of flaw size a. Therefore, for an ideal NDE
system, all flaws with sizes a greater than ath would
be detected with probability one; that is, POD(a) 1
for all a ath. In contrast, flaws with sizes a smallerthan ath would not be reported; that is, POD(a) 0for all a
-
8/13/2019 Truncated Regression Model
4/14
assess the POD curve, artificial specimens are usually
designed and fabricated carefully with artificial flaws
whose sizes and locations are known on specimens
verified by destructive analysis or advanced labora-
tory methods after the inspection (MIL-HDBK-1823
1999). The obvious advantage of using artificial
specimens is that we will know all of the true infor-
mation about flaws and their sizes, so we can know
exactly what flaws are detected and what flaws aremissed by an NDE system.
The aforementioned POD curve estimation pro-
cedure based on the evaluation of artificial speci-
mens is often challenged because the inspection
conditions in service are quite different from the
simulated in-service inspection environment. In such
trials, artificial flaws (electrical discharge machining
notches for eddy current inspections) or fabricated
flaws (fatigue cracks in coupons for penetrant
inspections) do not adequately represent the flaws
found in the field (Thompson et al. 2006). Further-more, the cost and time of fabricating these artificial
specimens required for a POD study is very high,
which is a significant obstacle to the routine assess-
ments of POD. These problems may even delay or
prevent a new inspection process or new technology
from being implemented.
In practice, field samples, components in service
with naturally occurring defects, are readily available
and much less expensive. Field samples are real com-
ponents or structures that are picked for routine non-
destructive evaluations. The flaws in field samples arealso real flaws that we are trying to catch. However,
one major disadvantage of field samples in POD esti-
mation is that we may not know the number and sizes
of the flaws in a sample exactly, in particular for flaws
with small sizes, because currently there is no NDE
system that can guarantee detection of all of the flaws.
These undetected flaws in field samples will create
complications in POD curve estimation. Specifically,
the traditional standard approach in estimating the
POD curve, such as the ones described in MIL-HDBK-
1823 (1999), will not give the best POD curve esti-mation results when we are using field samples
instead of artificial specimens. Brewer (1994) pro-
posed that a POD curve is attainable if crack growth
data are available, which use the basic principle that
the size of a previously missed flaw can be estimated
from currently observed flaw size. One approach pro-
posed by Meeker et al. (2001) utilized the analytical
model of the physical system to estimate the signal
of system response and the probability of detection.
Smith et al. (2007) proposed combining the empirical
data and the analytical model to reduce the
complexity of determining POD.
This article will describe an approach for applying
a truncated logistic regression (TLR) model to adjust
the bias introduced by the undetected flaws when
the NDE inspection techniques are applied in the fieldsamples. In particular, the proposed method uses TLR
to combine different NDE measurements into one
optimal POD curve that is substantially superior to
the POD curve from standard logistic regression. This
approach will overcome the uncertainties introduced
by naturally occurring flaws and will cancel out the
influence of lacking knowledge of undetectable flaws
to increase the accuracy and reliability. Such method-
ology is particularly appropriate for situations where
there are multiple NDE methods available and the
methods detect different defects.In the next section, the basic probability of detec-
tion models are reviewed and the effect of unde-
tected flaws in field samples on models are
discussed. Next, the truncated logistic regression
model for probability of detection is presented. We
further show the effects of bias adjustment of the
truncated logistic regression model on POD esti-
mation and compare these results with the POD
estimation model without considering truncation.
PROBABILITY OF DETECTION
Data from NDE inspections can be classified into
two categories according to the specific responses
from an NDE technique to a defect. The first category
is calledadata, whereais a continuous variable that
represents the intensity measurement data from NDE
system signals when it measures a flaw with size a.
The flaw size,a, could be measured by an advanced
measurement system or destructive analysis. In this
case, the expected magnitude of a will be pro-
portional to the flaw size a. The second categoryinvolves binary data (also called hit=miss data)
representing the presence or absence of a flaw. In
this case, when a flaw is detected by an NDE system,
it is a hit. When the NDE system fails to detect a flaw,
it is a miss. The hit=miss category includes NDE data
produced by systems that generate visual images of
targets and their backgrounds, because these images
367 A Truncated Logistic Regression Model for POD
-
8/13/2019 Truncated Regression Model
5/14
will be analyzed by pattern recognition techniques to
be translated into binary results.
The statistical estimation methods of POD models
are different for these two categories of NDE data.
The model for a data is called the a vs. a model,
and the model for the second category is a binary
response regression model for hit=miss data.
The a vs. a ModelIn NDE testing process, the NDE device will create
a signal when a sample is tested. The signalstands
for the output response that can be used to charac-
terize flaws. In detection theory, the probability of
detection with the given probability density function
fsignalassuming the signal amplitudes as random vari-
ables can be expressed as (Rummel and Matzkanin
1997; Yang and Donath 1983)
POD a P aa aadec Z1
aadecfsignal aas;a daas 1
whereadecis the critical NDE output response magni-
tude above which an indication is considered relevant
to the flaw and is reported. The signal measure of the
NDE system is given by a and a is the real size of
flaw. The signal amplitudes are often approximately
normally distributed (Berens 1987) with respective
characteristics N ls a ;r2s a
. As a result, the prob-
ability of detection in Eq. [1] takes the form
PODa Z1a
1ffiffiffiffiffiffi2p
p rs a
exp aasls a 2
2r2s a !
daas
1 U a ls a rs a
2
From empirical studies (Berens 1987), it has been
shown that a logarithmic transformation on a vs. a
data often allows the use of normal theory regression
model. LetY log(a) andx log(a), then the simpleversion of theavs.amodel is
Y b1 b2x e 3
Equations [2] and [3] are the basis for the statistical esti-
mation of POD model for avs.a data when artificial
specimens with known flaws and flaw sizes are used.
However, when service samples are used, where the
number of flaws and flaw sizes are unknown, the
POD model estimated by using Eqs. [2] and [3] may
have serious bias resulting in incorrect estimation of
the regression parameters (Meeker and Thompson
2007). Figure 3a illustrates an example of such bias,
which is due to the existence of unknown missed
flaws, concentrated at the low end ofavs.a and cor-
responding to small undetectable flaws. Meeker and
Thompson (2007) developed a left-truncated
linear regression model to estimate POD curves for
a vs. a data with field samples, and this truncated
linear regression model is reported to achieve good
results.
Binary Regression Model
for Hit/Miss Data
Binary responses are most common for NDE
methods such as liquid penetrant inspection, radio-graphic methods, and the newly developed sonic
infrared methods (Han and He 2006). For hit=miss
testing, the whole set of binary response data is
binomial in nature with detection probability given
by POD(a). Maximum likelihood is used to estimate
FIGURE 3 Effect of undetectable flaws on the bias of the regression: (a)a vs.a model and (b) binary regression model for hit/miss data(the dashed line is the regression fit without undetectable flaws and the solid line is the true regression model).
Y. Guo et al. 368
-
8/13/2019 Truncated Regression Model
6/14
the parameters of a POD curve (Chambers and
Hastie 1992). The likelihood of detection probability
using a single observation I is
LiPi; ai;xi Pxii1 Pi1xi 4
where xi is a Bernoulli distributed random vari-
able, PiP(ai) is the probability of detection atflaw size ai, and Li is the maximum likelihood ofPi. The log odds function is often suggested to
model binary data (Chambers and Hastie 1992)
and is given as
Pi expb1 b2log ai1 expb1 b2log ai
5
The parameter to be estimated is b b1 b2Y.Similar to the avs. a data, Eqs. [4] and [5] are the
bases for the statistical estimation of the POD model
for hit=miss data when artificial specimens with
known flaws and flaw sizes are used. Again, whenservice samples are used in which the number of
flaws and flaw sizes are unknown, the POD model
estimated using Eqs. [4] and [5] may also have serious
bias due to unknown missed flaws, an example of
which is shown in Figure 3b. In this article, we
develop a left-truncated logistic regression model
to estimate POD curves for hit=miss data. This pro-
posed truncated logistic regression model will be
thoroughly presented, discussed and evaluated in
subsequent sections.
TRUNCATED LOGISTIC REGRESSION
MODEL
There are two approaches to POD evaluation
according to how the samples are collected. The
first approach is conducted in laboratories, whereas
the other approach is conducted in the field. In the
laboratory approach, the samples are fabricated
with artificial flaws whose sizes and locations
are designed to simulate the real flaw and are
under control while fabricating. For these artificialflaws, the sizes are known before the inspections.
The binary inspection results (hit=miss) are
collected by comparing the results to the actual
flaws.
The second approach is conducted in the field and
is the approach discussed in this article. The samples
are collected in the field inspection stage, and the
flaws are unknown before inspection. The unde-
tected flaws are also unknown. In order to obtain
information on undetected flaws, multiple NDE
methods are used to determine missed flaws by
individual NDE methods.
Without loss of generality, we illustrate the field
approach with two combined NDE methods as
shown in Figure 4. The second NDE method
(method B) is assumed to have different capabilitiesfor detecting the flaws that were missed by method
A. In addition, we assume that the total number of
detected flaws by multiple NDE methods is greater
than the number of detected flaws by each method.
This assumption is easily met, and the purpose of
setting this assumption is to avoid an unbounded
likelihood function while curve fitting. In addition
to flaw size, many other factors, such as flaw orien-
tation, flaw type, flaw location, etc., will also affect
the probability of detection. Given the same flaw
size, some NDE methods such as penetrant testingare more capable of detecting flaws open to the sur-
face, whereas others are more effective in detecting
tight-closed flaws, such as sonic infrared (IR). There-
fore, even if the flaws are similar in size, different
NDE methods usually pick up different subsets of
flaws.
Based on the characteristics of NDE inspections,
the logic operation is used for inspecting results of
multiple NDE methods, as illustrated in Figure 4,
where the frame represents the whole set of flaws,
but the details of this set are unknown. The area con-sisting of zones I, II, and III is the set of detected
flaws by either one of the NDE methods in use.
FIGURE 4 Illustration of inspections by multiple NDE methods.
369 A Truncated Logistic Regression Model for POD
-
8/13/2019 Truncated Regression Model
7/14
Though multiple methods are used for detection,
there are still undetected flaws, shown as zone
IV in Figure 4. The bias introduced by the unde-
tected flaws will be adjusted by the truncated
logistic regression model. Truncated logistic
regression is an improvement of the standard
regression by using multiple NDE methods in the
field.
In NDE inspection, there are two tasks: (1) detec-tion and (2) sizing. Once an indication is detected,
other advanced methods, such as those involving
microscopes or computed tomography, for
example, will be used for sizing. Some NDE meth-
ods have the ability to measure the size at the time
of detection. After inspection, the detected flaws
(shown in zones I, II, and III) are sized for POD
estimation.
The regression model assumes that the sizes of
detected flaws (zones I, II and III) are measured
but not the sizes of the undetected flaws (zone IV).For zones I, II, and III, because at least one of the
NDE methods detected the flaw, sizing can be con-
ducted after detection. Therefore, the POD evalu-
ation model outlined previously could be applied
to field inspections. As a result, the accuracy of esti-
mation will be improved by applying a truncated
logistic regression model.
We assume that Nflaws are to be inspected and
the number of inspection systems is M. Define a
binary random variable yij represent the ith flaw
inspected by the jth inspection system, where
yij 0; undetected flawmiss fori 0;1; . . . ;N;1; detected flawhit j 1; . . . ;M
Letairepresent the size of flaw i. Furthermore, we
define bj bj1 bj2T as the vector of POD curveparameters of the jth inspection system for
j 1, . . .,M, where b bT1 bT2 . . . bTMT is the vectorof POD curve parameters for all inspection systems.
From Eq. [5], the probability of detection for theindividual flaw by an inspection system is
Pyij1 expbj1 bj2log ai
1 expbj1 bj2log aipbj; ai 1 qbj;ai 6
The likelihood ofP, based on a single observation, is
then
Lib; ai;yij YMj1
pbj; aiyijqbj;ai1yij 7
and the estimator bbj is the optimal solution for
maximizing log(Li(b;ai,yij)). Equations [6] and
[7] are the conventional regression models for PODevaluation.
With the multiple NDE inspection strategy in the
field, only the detected flaws are recorded. The set
of flaws for POD curve fitting that contains the set
of flaws that were detected by at least one NDE
inspection system is given by
< iXM
j1yij1; i1; 2; . . . ;N
( ) 8
The flaws that were missed by all inspections
are truncated. This set of truncated data is expressed
as
I iXM
j1yij0; i1; 2; . . . ;N
( ) 9
Because the truncated data are not known in prac-
tice, the conventional logistic regression continu-
ously applied for POD estimation would introduce
a bias because the effect of truncated flaws would
not be taken into consideration. An adjustment is
then made for the conventional regression model
to be conditioned given that a flaw is detected by
at least one NDE inspection system. The resulting
TLR model would be
Lb; ai;yij Yi2