statistical analysis and adaptive technique for dynamical process monitoring

Sp

YK1

1

OdMTow1Lawt

yd

0d

chemical engineering research and design 8 8 ( 2 0 1 0 ) 1381–1392

Contents lists available at ScienceDirect

Chemical Engineering Research and Design

journa l homepage: www.e lsev ier .com/ locate /cherd

tatistical analysis and adaptive technique for dynamicalrocess monitoring

ingwei Zhang ∗, Zhiming Li, Hong Zhouey Laboratory of Integrated Automation of Process Industry, Ministry of Education, Northeastern University, Shenyang, Liaoning10004, PR China

a b s t r a c t

Multivariate statistical process monitoring (MSPM) methods based on two-dimensional dynamic kernel PCA (2-D-

DKPCA) and two-dimensional dynamic kernel Hebbian Algorithm (2-D-DKHA) are proposed. First, a nonlinear batch

process monitoring scheme based on 2-D-DKPCA is proposed. Its basic idea is to use KPCA to depict the both within-

batch dynamics and batch-to-batch dynamics. However, the proposed 2-D-DKPCA needs to store the whole kernel

matrix and calculate all nonlinear components. Kernel matrix will thus become extremely huge when the numbers

of successive batches and samples are large. Then, kernel Hebbian Algorithm (KHA) is introduced to 2-D-DKPCA to

construct 2-D-DKHA. KHA can extract adaptively nonlinear principal components without storing and manipulating

the whole kernel matrix and only calculate the principal components. Thus, proposed 2-D-DKHA has the ability of

monitoring complex batch processes. The 2-D-DKPCA and 2-D-DKHA are first proposed in this article. Also, from the

proposed 2-D method, it is easily to obtain the 1-D algorithm.

The proposed method 2-D-DKPCA is applied to the fault detection in a nonlinear dynamic system and compared

with 2-D dynamic PCA (2-D-DPCA). The simulation results show that 2-D-DKPCA is more suitable for nonlinear

dynamic process than DPCA. Then the proposed method 2-D-DKHA is applied to penicillin process. The monitoring

results show 2-D-DKHA can detect the faults of complex batch process.

© 2010 The Institution of Chemical Engineers. Published by Elsevier B.V. All rights reserved.

Keywords: Process monitoring; Two-dimensional dynamic kernel PCA (2-D-DKPCA); Two-dimensional dynamic kernel

Hebbian Algorithm (2-D-DKHA)

dynamic characteristics, PCA exhibits bad behavior because

. Introduction

n-line monitoring of processes is crucial to the safe pro-uction of high-quality products (Qin, 2003; Nomikos andacGregor, 1995a,b; Chiang et al., 2001; Lieftucht et al., 2006;

ates et al., 1998). The development of effective methods forn-line monitoring and fault diagnosis of batch processesould significantly improve product quality (Westerhuis et al.,

999; Gregersen and Jorgensen, 1999; Albert and Kinley, 2001;ennox et al., 2001; Kourti et al., 1996). However, nonlinearitiesnd dynamics are inherent characteristics of batch processes,hich may exist not only within a batch but also from batch

o batch.Several approaches based on multivariate statistical anal-

sis have been proposed for on-line monitoring and faultetection in the batch processes (Nomikos and MacGregor,

∗ Corresponding author.E-mail address: [email protected] (Y. Zhang).Received 27 December 2009; Received in revised form 3 March 2010; A

263-8762/$ – see front matter © 2010 The Institution of Chemical Engioi:10.1016/j.cherd.2010.03.002

1994; Chen and Chen, 2006; Gallagher and Wise, 1996; Yue etal., 2000; Qin et al., 2006; Nomikos and MacGregor, 1995a,b; Luet al., 2005). Multiway PCA (MPCA) has extended the multivari-ate statistical method of principal component analysis (PCA)to batch processes (Nomikos and MacGregor, 1994; Chen andChen, 2006; Gallagher and Wise, 1996). Qin proposed multiwaypartial least squares (MPLS) with RLS for monitoring processesfor which both the process data and the product quality areavailable (Yue et al., 2000; Qin et al., 2006). Nomikos and Mac-Gregor proposed estimating method of the missing data ontrajectory deviations (Nomikos and MacGregor, 1995a,b). How-ever, for some complicated cases in industrial chemical andenvironmental processes, which especially have nonlinear

ccepted 3 March 2010

of its linearity assumption. The applications of dynamic PCA(DPCA) for multivariate process and quality control can be

neers. Published by Elsevier B.V. All rights reserved.

http://www.sciencedirect.com/science/journal/02638762

mailto:[email protected]

dx.doi.org/10.1016/j.cherd.2010.03.002

and d
1382 chemical engineering research
found in references (Ku et al., 1995; Russell et al., 2000; Li andQin, 2001). Ku et al. (1995) proposed dynamic PCA (DPCA) thatuses an augmenting matrix with time-lagged variables. DPCAcan extract the time-series model from the eigenvectors of thecovariance of process control and monitoring; these character-istics include finite duration, nonlinearities, non-steady statebehavior and batch-to-batch variation. Russell et al. (2000)used scores and residual space PCA, DPCA, and CVA to theTennessee Eastman process simulator, which was designed tosimulate a wide variety of faults occurring in a chemical plantbased on a facility at Eastman Chemical. However, the dynam-ics are one order and the relation representation cannot begiven clearly. Li and Qin (2001) showed that when process vari-ables are corrupted by measurement noise DPCA fails to give aconsistent estimate of the process model in general whetheror not process noise is present. Then propose an indirect DPCAapproach for the consistent estimate of process model. How-ever, the methods mentioned above perform poorly when it isapplied to industrial process data having nonlinearities.

Several techniques based on multivariate statistical anal-ysis have been proposed for on-line monitoring and faultdetection in the nonlinear processes. Dong and McAvoy (1996)used nonlinear principal component analysis (NLPCA) basedon principal curves and neural networks to monitor batch pro-cesses. The associated scores and the corrected data points fortraining samples are obtained by the principal curve method,then the neural network model is used to map original datainto the corresponding scores. Rännar et al. (1998) investigatedan adaptive batch monitoring method using hierarchical PCA.To coupe with nonlinear system monitoring, kernel densityand kernel PCA (KPCA), has been in development in recentyears (Mika et al., 1999). KPCA can efficiently compute non-linear PCs in high-dimensional feature spaces by means ofintegral operators and nonlinear kernel functions. The basicidea of KPCA is to first map the input space into a fea-ture space via nonlinear mapping and then to compute thenonlinear PCs in that feature space. Lee et al. proposed anonlinear process monitoring technique using KPCA to mon-itor continuous process and demonstrated its superiority tothe PCA monitoring method (Lee et al., 2004a,b). The meth-ods are very powerful for analyzing historical data from pastproduction, and diagnosing operating problems (Lee et al.,2004a,b; Romdhani et al., 1999; Choi et al., 2005; Cho et al.,2005; Lee et al., 2004a,b). The multiway kernel principal com-ponent analysis (MKPCA) and multiway kernel independentcomponent analysis (MKICA) have been emerging to solvethe nonlinear problem (Lee et al., 2004a,b; Zhang and Qin,2007, 2008; Zhang et al., 2010). The KPCA monitoring tech-nique was extended from continuous processes to batchprocesses. Off-line analysis and on-line batch monitoring aredeveloped on the basis of MKPCA and MKICA, considering vari-ance scaling in the feature space. And to cope with dynamicsystem monitoring, process dynamics should also be takeninto account when developing a monitoring model based onKPCA.

Batch process has more nonlinear dynamic characteristicscompared to continuous process since it is usually operated indifferent stages. So, it is necessary to develop a more efficientmonitoring technique for batch processes.

The KICA technique was extended (whitened KPCA plusICA) from continuous processes to batch processes andimproved KPCA and KICA monitoring technique for fault
detection and diagnosis was proposed (Zhang and Qin, 2007,2008). In this article, the nonlinear dynamic process data
esign 8 8 ( 2 0 1 0 ) 1381–1392

are double indexed by the sampling time within a batchand the batch number of successive batch runs. A novelbatch monitoring scheme, a 2-D nonlinear dynamic kernelprincipal component analysis (2-D-DKPCA) is proposed tomodel and monitor simultaneously the time- and batch-wisenonlinear dynamics. There exists a method extending non-linear dynamic method for capturing within-batch dynamics:dynamic kernel principal component analysis (DKPCA), whichconcerns only within-batch dynamics (Choi and Lee, 2004).Compared with the linear method, the proposed 2-D-DKPCAis a truly 2-D nonlinear dynamic model; it can capture bothwithin-batch and batch-to-batch nonlinear dynamics at thesame time using a parsimonious model structure. In order tomanipulate the large volume of data generated by batch pro-cess, an iterative algorithm called kernel Hebbian Algorithm(KHA) is combined with the proposed 2-D-DKPCA to computenonlinear principal components iteratively, which is called2-D-DKHA. The proposed 2-D-DKHA can deal with more com-plex situations since it does not need to store and manipulatethe whole kernel matrix. Also, from the proposed 2-D method,we can obtain the 1-D algorithm easily.

The organization of the article is as follows. ConventionalKPCA monitoring and KHA is briefly reviewed in the next sec-tion, followed by 2-D-DKPCA and 2-D-DKHA. The performanceof process monitoring using the proposed methods is illus-trated through examples. Simulations show that the proposed2-D-DKPCA based batch monitoring can be very effective fordetecting even small changes in correlation structure or non-linear dynamic process drifts. Then 2-D-DKHA is applied to thepenicillin process. The simulation results show that the pro-posed 2-D-DKHA has the ability of monitoring complex batchprocesses. Conclusions are given at the end of the article.

2. KHA

The size of the kernel matrix is the square of the numberof samples. The computation time may increase with thenumber of samples. So kernel Hebbian Algorithm (KHA), aniterative algorithm has been proposed by Kim, Franz andSchölkopf to cope with this problem (Kim et al., 2005). KHAis Generalized Hebbian Algorithm (GHA) in space F, so let usintroduce GHA first.

GHA is a widely used iterative PCA method, and it is a train-ing algorithm for a neural network of the form y = Wx actingon the training data xk. It combines the Oja learning rule andGram-Schmidt orthogonalization. While Oja learning rule canfind only the first eigenvector, GHA can find all of the eigen-vectors. After training, the weight matrix W ∈ Rr×m contains reigenvectors of the covariance matrix of input data correspondthe largest r eigenvectors, and the output y is the projectionon the eigenvectors, i.e. y is the principal component in PCA.The update rule of W is

W(t + 1) = W(t) + �(t)(y(t)x(t)T − LT[y(t)y(t)T]W(t)) (1)

where t denotes a discrete moment in time, �(t) is a learningrate, and LT[·] extracts lower triangular part of a matrix. Thereare some theorems show that when t → ∞ W will converge tothe eigenvectors of the covariance matrix of input data.

Let

y = W(t)˚(x(t)) (2)

desi

ws

W

ii

wW

W

wr

A

˚

wJ

A

˛

w

y

s˚

˚

y

w

k

a

˛

chemical engineering research and

ith W(t) ∈ Rr×∞. The GHA update rule in space F is repre-ented as

(t + 1) = W(t) + �(t)(y(t)˚(x(t))T − LT[y(t)y(t)T]W(t)) (3)

Now, the rows of weight matrix W are eigenvectors v of CF

n Eq. (4). Assume that there is a function J(t) which maps t to∈ {1, . . . , N} ensuring ˚(x(J(t))) = ˚(xi). The eigenvectors of CF

ith nonzero eigenvalues lie in the span of ˚(xk), k = 1, . . . , N,(t) can represented by

(t) = A(t)˚T (4)

here ˚ = (˚(x1) . . . ˚(xN)), A(t) = (˛1, . . . , ˛r)T. Eq. (12) can be

ewritten as

(t + 1)˚T = A(t)˚T + �(t)(y(t)˚(x(t))T − LT[y(t)y(t)T]A(t)˚T) (5)

The mapped data ˚(x(t)) can be got by

(x(t)) = ˚b(t) (6)

ith a canonical unit vector b(t) = (0 . . . 1 . . . 0)T ∈ RN (only the(t)th element is 1). Thus, the recursive rule becomes

(t + 1) = A(t) + �(t)(y(t)b(t)T − LT[y(t)y(t)T]A(t)) (7)

Representing this in component-wise form,

ij(t + 1)={

˛ij(t)+�yi(t)−�yi(t)∑i

k=1˛kj(t)yk(t), if J(t) = j

˛ij(t) − �yi(t)∑i

k=1˛kj(t)yk(t), otherwise(8)

here

i(t) =N∑

k=1

˛ik(t)˚(xk) · ˚(x(t)) =N∑

k=1

˛ik(t)k(xk, x(t)) (9)

Consider the centering in the feature space, one shouldubtract the mean of data from each sample point, i.e. use

˜ (x(t)) = ˚(x(t)) − ¯ (x(t)) to replace ˚(x(t)) in Eq. (9), where

¯ = 1N

N∑k=1

˚(xk) (10)

Eq. (10) is the sample mean. Then, y can be gotten by

¯ i(t) =N∑

k=1

˛ik(t)(k(x(t), xk) − k(xk)) − ¯ i(t)

N∑k=1

(k(x(t), xk) − k(xk))

(11)

ith

¯ (xk) = 1N

N∑l=1

k(xl, xk) (12)

nd

¯ i(t) = 1N

N∑l=1

˛il(t) (13)

gn 8 8 ( 2 0 1 0 ) 1381–1392 1383

Kim, Franz and Schölkopf has proven that W will approachthe first r normalized eigenvectors of the correlation matrix inthe feature space, ordered by decreasing eigenvalue.

3. 2-D-DKHA for batch process

3.1. 2-D-DKPCA

Two-dimensional linear dynamic model structures can be cat-egorized into two groups (Lu et al., 2005): two-dimensionalstate-space models and lagged regression models such astwo-dimensional autoregressive (AR), moving average (MA),or autoregressive moving average (ARMA) models. Batch pro-cess dynamics can be viewed as a kind of two-dimensional(2-D) dynamics including time- and batch-wise dynamics andcan be described by a 2-D model structure (Yao and Gao,2008). KPCA is applied on the two-dimensional augmenteddata matrix for capturing auto-correlations in the two direc-tions and cross-correlations of process variables in this article.This approach is named as 2-D-DKPCA.

Consider process data generated by I number of succes-sive batches, J variables and K sampling intervals. Let ¯xj(i, k)be process measurement of variable j at sampling interval kin batch run i (i = 1, . . . , I; j = 1, . . . , J; k = 1, . . . , K), which canbe arranged in a two-dimensional field with two directionsi and j standing for batch and time, respectively (Lu et al.,2005). In the case of the batch process with two-dimensionaldynamics, the current values of the variables will dependon not only the past values in time direction, ¯xj(i, k − 1),¯xj(i, k − 2), . . ., ¯xj(i, k − �); but the past values in batch direc-tion, ¯xj(i − 1, k), ¯xj(i − 2, k), . . ., ¯xj(i − ˇ, k), and even in the crossdirection, ¯xj(i − 1, k − 1), . . ., ¯xj(i − �, k − �), where � and ˇ arethe autoregressive orders in the two directions. The regioncovering the above lagged values is defined as region of sup-port (ROS), for statistical monitoring of the two-dimensionaldynamic batch processes. KPCA can be applied to an aug-mented data matrix defined below to extract simultaneouslytwo-dimensional auto-correlated and cross-correlated rela-tionships

˚( ¯X) =

⎡⎢⎢⎢⎢⎢⎢⎢⎢⎣

˚( ¯xm,n)

...

˚( ¯xi,k)

...

˚( ¯xI,K)

⎤⎥⎥⎥⎥⎥⎥⎥⎥⎦

=

⎡⎢⎢⎢⎢⎢⎢⎢⎢⎣

˚( ¯y1)

...

˚( ¯yl)

...

˚( ¯yL)

⎤⎥⎥⎥⎥⎥⎥⎥⎥⎦

(14)

where

l = (i − m)(K − n + 1) + k − n + 1, l = 1 . . . L (15)

L = (I − m + 1)(K − n + 1) (16)

¯xi,k = ¯yl = [ ¯x(i, k), ¯x(i, k − 1), . . . , ¯x(i, k − � + 1), ¯x(i − 1, k),

¯x(i − 1, k − 1), . . . , ¯x(i − 1, k − � + 1), . . . ,

¯x(i − ˇ + 1, k), ¯x(i − ˇ + 1, k − 1), . . . , ¯x(i − ˇ + 1, k − � + 1)] (17)

¯x(i, k) = [ ¯x1(i, k), ¯x2(i, k), . . . , ¯xJ(i, k)] (18)

and d
The columns of ˚( ¯X) are composed of the current measure-ments and the lagged process measurements in its ROS. ¯xi,k

represents for the current value of all variables at samplinginterval k in batch run i and the lagged process measure-ments in its ROS. Using ˚( ¯yl) to replaces ˚( ¯xi,k) can simplifythe calculation.

The covariance matrix can be expressed in a linear featurespace F, i.e.

¯CF

= 1L

L∑l=1

˚( ¯yl)˚( ¯yl)T

(19)

To diagonalize the covariance matrix, one has to solve theeigenvalue problem in the feature space

¯� ¯v = ¯CF

¯v (20)

where eigenvalues ¯� ≥ 0 and ¯v ∈ F\{0}. Here, ¯CF

¯v can beexpressed as follows:

¯CF

¯v =(

1L

L∑l=1

˚( ¯yl)˚( ¯yl)T

)¯v = 1

L

L∑l=1

〈˚( ¯yl), ¯v〉˚( ¯yl) (21)

where operator 〈x, y〉 denotes the product between x and y.

Hence, ¯� ¯v = ¯CF

¯v is equivalent to

¯�〈˚( ¯yl), ¯v〉 = 〈˚( ¯yl),¯C

F¯v〉 (22)

And there exist coefficients ¯l (l = 1, . . . , L), such that

¯v =

⎡⎢⎢⎢⎢⎢⎣

¯v1

¯v2

...

¯vL

⎤⎥⎥⎥⎥⎥⎦ =

L∑l=1

¯ l˚( ¯yl) (23)

Combining (22) and (23),

¯�

L∑l=1

¯ l〈˚( ¯yg), ˚( ¯yl)〉

= 1L

L∑l=1

¯ l

⟨˚( ¯yg),

L∑h=1

˚( ¯yh)

⟩⟨˚( ¯yh), ˚( ¯yl)

⟩(24)

where g = 1, . . . , L.In general, it is difficult to calculate the value of the value of

the dot product in the feature space since the mapping func-tion � is unknown. The kernel trick allow us to compute thevalue of the dot product without having to carry out the map�.

Now, let us define an L × L matrix�K, where

L = (I − m + 1)(K − n + 1) (25)

The mean centred kernel matrix ˜�K can be easily obtained

from

˜�K = �

K − 1L�K − �

K1L + 1L�K1L (26)

esign 8 8 ( 2 0 1 0 ) 1381–1392

where

1L = 1L

⎡⎢⎢⎣

1 . . . 1

.... . .

...

1 . . . 1

⎤⎥⎥⎦ ∈ RLL (27)

So Eq. (26) can be converted to

¯�

L∑l=1

¯ l˜�kg l = 1

L

L∑l=1

¯ lL∑

h=1

˜�kg h ˜

�kh l (28)

where

kg l = 〈˚( ¯yg), ˚( ¯yl)〉 (29)

kg h = 〈˚( ¯yg), ˚( ¯yh)〉 (30)

kh l = 〈˚( ¯yh), ˚( ¯yl)〉 (31)

The left side of the equation is

¯�

L∑l=1

¯ l˜�kg l = ¯� ˜

�K ¯ (32)

The right side is

1L

L∑l=1

¯ lL∑

h=1

˜�kg h ˜

�kh l =

(1L

)˜�K2 ¯ (33)

The right side is equal to the left side.

¯�L ˜�K ¯ = ˜

�K2 ¯ (34)

To find solutions of Eq. (34), we should solve the eigenvaluesproblem

L ¯� ¯ = ˜�K ¯ (35)

Now, performing KPCA in F is equivalent to resolving theeigen-problem of Eq. (35). The dimensionality of the prob-lem can be reduced by retaining only the first r eigenvectors,r ≤ L. Normalize the first r eigenvectors by requiring that thecorresponding vectors in F be normalized. The nonlinear com-ponents are as follows:

¯tp = 1√¯�p

〈 ¯vp, ˚( ¯Xnew)〉 = 1√

¯�p

L∑l=1

¯ p

l 〈˚( ¯yl), ˚( ¯Xnew)〉

= 1√¯�p

L∑l=1

¯ p

l k( ¯yl,¯Xnew) (36)

where p = 1, . . . , r. ¯vp

is the eigenvector of the corresponding

covariance matrix. ¯�p is the corresponding eigenvalue. ¯Xnew

represents for the new on-line data., Let ¯� be the on-line sam-ple time,

¯Xnew = [ ¯x(i, ¯�), ¯x(i, ¯�−1), . . . , ¯x(i, ¯� − �+1), ¯x(i−1, ¯�), ¯x(i−1, ¯� − 1),

. . . , ¯x(i−1, ¯�−�+1), . . . , ¯x(i − ˇ + 1, ¯�), ¯x(i − ˇ + 1, ¯� − 1),

. . . , ¯x(i − ˇ + 1, ¯� − � + 1)] (37)

desi

x

nttmmIiwitDru

3

2K

acairp

aspta2

dt

t

c

�

�

o

3

Itcccc


¯ (i, ¯�) = [ ¯x1(i, ¯�), ¯x2(i, ¯�), . . . , ¯xJ(i, ¯�)] (38)

Two-dimensional autoregressive orders (ˇ, �), and theumber of retained principal components r. Implicitly, thewo-dimensional DKPCA may be considered as an equivalento a multivariate 2-D-AR model; therefore, order estimation

ethods for 2-D-AR models can be borrowed for the deter-ination of the two-dimensional autoregressive orders (ˇ, �).

n this article, two-dimensional Akaike information criterions selected, as it is simple and effective even for processes

ith significant cross-correlations. The order pair consist-ng of the maximum in the two directions is selected as thewo-dimensional autoregressive orders of two-dimensionalKPCA. As for the number of principal components to be

etained in DKPCA, cross-validation method can be directlysed.

.2. 2-D-DKHA

-D-DKPCA can solve an eigenvalues problem of a L × L matrix˜�

. In batch process, when the numbers of successive batchesnd samples are large, L will become extremely huge. But KHAan extract nonlinear principal components without storingnd manipulating the whole kernel matrix. One can use ¯yl asnput to train the neural network. Then the matrix A whoseows are coefficients ¯ will be got and the output y is nonlinearrincipal component ¯t.

In order to normalize ¯v, the eigenvectors of the covari-nce matrix in feature space, the principal eigenvalues ¯�1 . . . ¯�r

hould be estimated, where r is the number of principal com-onents. Wang has proposed a method which can estimatehe eigenvalues in KPCA (Wang et al., 2007). In this section,method is proposed which can compute the eigenvalues in

-D-DKHA as follows:The variance of the projection of ˚(x(t)) on ith eigenvalues

irection can give the estimation of ¯�i. From ¯ak = ¯ak/|| ¯ak||√

¯�k,he projection can be written as

¯k =

N∑k=1

¯ai

k(t)

|| ¯ak||√

¯�k

(˚(xk) − ˚) · (˚(x(t)) − ¯ ) = yk(t)

|| ¯ak||√

¯�k

(39)

The variance of ¯tk, i.e. the estimation of kth eigenvalue ¯�k

an be represented as

¯k = var{¯tk} = var{yk(t)}

¯�k|| ¯ak||2(40)

So the estimation is

ˆk =√

var{yk(t)}|| ¯ak||

(41)

Thus, the estimation of eigenvalues in 2-D-DKHA can bebtained.

.3. 2-D-DKPCA based fault detection

n the earlier two-dimensional DKPCA modeling, with a properwo-dimensional orders, and a proper number of principalomponents retained, the transformed nonlinear principal
omponent scores can extract all nonlinear auto- and cross-orrelated relationships in process data. The T2 statistic,alculated from the principal component scores, are not inde-
gn 8 8 ( 2 0 1 0 ) 1381–1392 1385

pendent in time- and batch-to-batch sense, which violatesstatistical assumption for process monitoring. ˆ (x) representsthe reconstructed input vector with v ∈ F\{0} in the featurespace. The SPE in the feature space is defined as

SPE = ||˚(x) − ˆ (x)||2 (42)

It can be calculated by

SPE = k(x, x) − tTt (43)

where t is the nonlinear PC calculated by Eq. (10) and k(x, x) isthe centered kernel function

k(x, y)=k(x, y)− 1N

N∑k=1

(k(x, xk) + k(y, xk)) + 1N2

N∑i,j=1

k(xi, xj) (44)

The T2 is defined as

T2 = tT�−1t (45)

where � denotes the diagonal matrix of nonzero or significanteigenvalues.

Since the number of dominant principal components mayvary in different batches, it is important to determine thenumber of PCs adaptively. There are some methods to deter-mine the number, including cumulative percent variance, thescree test, average eigenvalues, and the variance of recon-struction error (Liu et al., 2009). In this article, cumulativepercent variance method is used to determine the number ofnonlinear PCs. The control limits are given according to Lee etal. (2004a,b). Assuming that the scores follow a multivariatenormal distribution, the confidence limits for T2 are obtainedusing the F-distribution,

T2∼p(I2 − 1)I(I − p)

Fp,I−p,˛ (46)

where I is the number of batches in the model, p the num-ber of principal components, and ˛ is the significance level.If the assumption is not valid, the confidence limits can beapproximated from the kernel density estimation (Martin etal., 1996).

Assuming that the prediction errors are normally dis-tributed, the confidence limits for the SPE are calculated fromthe �2 distribution and are given by

SPE∼g�2h,˛; g = v

2m, h = 2m2

v

where m and v are the estimated mean and variance, respec-tively, of the SPE from the reference batches (Nomikos andMacGregor, 1995a,b). This approximating distribution is foundto work well even in cases where the errors are not normallydistributed (Van Sprang et al., 2002).

For on-line monitoring, the distribution of the T2k

is approx-imated by Eq. (46) and that of the SPKk can be approximatedby a weighted �2 distribution of SPEk∼(vk/2mk)�2

2m2k/vk

, where

mk and vk are the mean and variance of the SPEk obtained forthe data set used for the model development an time instantk (Nomikos and MacGregor, 1995a,b).

3.3.1. Outline of off-line 2-D-DKPCA monitoring

(1) Get the normal operating data.

1386 chemical engineering research and d

Fig. 1 – (a and b) The contrast of 2-D-DKPA and 2-D-DKHA.

to model and monitor simultaneously the time- and batch-

(2) Normalize the data subset using the mean and the stan-dard deviation of each variable.

(3) For the scaled data xl ∈ RmnJ, l = 1, . . . , L, calculate thekernel matrix and the mean centered kernel matrix ˜

�K

according to Eq. (26).(4) Compute eigenvectors of the covariance matrix according

to Eq. (35) and determine the number of eigenvectors toremain.

(5) Extract the nonlinear principal components according toEq. (36).

(6) Calculate the monitoring statistics (SPE and T2) of the nor-mal operating data according to Eqs. (43) and (45).

(7) Determine the control limits of SPE and T2 charts.

3.3.2. Outline of on-line 2-D-DKPCA monitoring

(1) Get new data.(2) Normalize the data subset using the mean and the stan-

dard deviation of each variable.(3) Compute the kernel vector kf ∈ R1×L of the new data,

[kf ]l= [kf (xf , xl)] (47)

where xf ∈ RmnJ is the new data vector and xl ∈ RmnJ is thenormal operating data vector for l = 1 . . . L.

esign 8 8 ( 2 0 1 0 ) 1381–1392

(4) Compute the mean centered kernel vector use the follow-ing equation:

kf = kf − 1f K − kf 1L + 1f K1L (48)

where 1f = (1/L)[1, . . . , 1] ∈ R1×L, ˜�K and 1L are calculated by

off-line algorithm steps 2 and 3.(5) Extract the nonlinear principal components of the new

data use the following equation:

tnew,p =L∑

l=1

˛plkt(xl, xt) (49)

(6) Calculate SPE and T2 of the new data according to Eq. (46)to monitor the process.

3.4. 2-D-DKHA based fault detection

3.4.1. Build the normal operating condition model

(1) Acquire the data and normalize them via the mean andstandard deviation of each variable.

(2) For the scaled data xl ∈ RmnJ, l = 1, . . . , L, compute k(xk) =(1/N)

∑N

l=1k(xl, xk) with N = L, initialize matrix A and setiteration counter i = 1 to prepare iteration.

(3) For each input xl(i = 1 . . . L), compute yk (k = 1 . . . r) andthen update A.

(4) Check the convergence of A. If not converged, let iterationcounter increase 1 and go back to Step 3); if converged, out-put the matrix A and the nonlinear principal componenty.

(5) Estimate eigenvalues � use Eq. (41) and then re-computethe nonlinear principal components t use Eq. (39).

(6) Calculate SPE and T2 by using Eqs. (43) and (45).(7) Compute the control limits for SPE and T2.

3.4.2. Online monitoring scheme

(1) Acquire a new operating data and normalize it using thenormal operating data’s mean and standard deviation.

(2) Calculate nonlinear principal component.(3) Calculate SPE and T2 by using Eqs. (43) and (46).(4) Compute the control limits for SPE and T2.

Compared with 2-D-DKPCA, as shown in Fig. 1, the pro-posed 2-D-DKHA does not need to store and manipulate thewhole kernel matrix but only needs to calculate the corre-sponding principal components.

4. Case studies

4.1. Nonlinear dynamic modeling

In this article, 2-D nonlinear dynamic algorithms are proposed

wise nonlinear dynamics. The proposed 2-D-DKPCA basedprocess monitoring is tested with a batch process having the

chemical engineering research and design 8 8 ( 2 0 1 0 ) 1381–1392 1387

F

f

wlstntx

5

(o[

te

4SpttDfpbtittfi

4Tm

Fig. 3 – Monitoring for a process drift from batch 5 by2-D-DPCA.

Fig. 4 – Monitoring for a process drift from batch 5 by

ig. 2 – Faulty trajectories of variable x2 with process drift.

ollowing 2-D nonlinear dynamics.

x1(i, k) = 0.5 sin(x1(i, k − 1)) + 0.8x1(i − 1, k)

− 0.33x1(i − 1, k − 1) + w1

x2(i, k) = 0.67 sin(x2(i, k − 1)) + 0.44x2(i − 1, k)

− 0.11 cos(x2(i − 1, k − 1)) + sin(

k�

0.3K

)+ w2

x3(i, k) = 0.65x1(i, k) + 0.35x2(i, k) + w3

x4(i, k) = −1.26x1(i, k) + 0.33x2(i, k) + w4

here x1 and x2 are two independent variables, x3 and x4 areinear combinations of x1 and x2; wj (j = 1, 2, 3, 4) are Gaus-ian random variables with variance 0.02. For the first batch,he trajectories of x1 and x2 are initialized by two first-orderonlinear dynamic signals. For the first sample of each batch,he values of x1 and x2 are set as x1(i, 1) = x1(i − 1, 1) + w1 and

2(i, 1) = x2(i − 1, 1) + w2. There are totally 10 batch runs with0 samples in each batch, that is, I = 10, J = 4, K = 50.

The 2-D autoregression orders of the variables are all1, 1), thus, the 2-D augmented data vector is madef [x(i, k), x(i, k − 1), x(i − 1, k), x(i − 1, k − 1)], where x(i, k) =x1(i, k), x2(i, k), x3(i, k), x4(i, k)].

Two kinds of fault will be discussed, a process drift andhe change of correlation structure. The 99% control limits arexpressed as a dotted line in the example.

.1.1. Fault 1: small process driftome faulty batches have been generated to simulate a smallrocess drift on variable x2 from batch 5 by adding a signalhat increases slowly with time and batch. Fig. 2 shows faultyrajectories of variable x2 with process drift. Then apply 2--DPCA and 2-D-DKPCA respectively. For the linear method,

our principal components are retained. Let the RBF kernelarameter c = 6000. Figs. 3 and 4 show the monitoring resultsy using 2-D-DPCA method and Figs. 5 and 6 show the moni-oring results by using 2-D-DKPCA method. From Figs. 3 and 4,t is difficult to find which batch is faulty. From Figs. 5 and 6,he SPE and T2 values are increasing with time which reflectshe small process drift and we found the fault is introducedrom batch 5. Figures showed us that the proposed 2-D-DKPCAs available for nonlinear processes.

.1.2. Fault 2: structure changehe other fault used for testing the proposed 2-D-DKPCAethod is simulated by changing the correlation structure.

2-D-DPCA.

Variable x2 is changed to the following from batch 5

x2(i, k) = 0.8 cos(x2(i, k − 1)) + 0.67x2(i − 1, k)

− 0.47 cos(x2(i − 1, k − 1)) + sin(

k�

0.35K

)

Fig. 5 – Monitoring for a process drift from batch 5 by2-D-DKPCA.

1388 chemical engineering research and design 8 8 ( 2 0 1 0 ) 1381–1392

Fig. 6 – Monitoring for a process drift from batch 5 by2-D-DKPCA.

Fig. 9 – Monitoring for a changed correlation structure frombatch 5 by 2-D-DPCA (batch 1 to batch 10).

Fig. 10 – Monitoring for a changed correlation structurefrom batch 5 by 2-D-DKPCA (batch 1 to batch 10).

Fig. 7 – Faulty trajectories of variable x2 with changedcorrelation structure.

Fig. 7 shows both normal and faulty trajectories of x2 frombatch 5 to batch 8. Applying 2-D-DPCA with four PCs retained,the result are represented in Figs. 8 and 9. Then using 2-D-DKPCA with parameter c = 500, SPE and T2 charts are shownin Figs. 10 and 11 which clearly reflects the changed correla-
tion structure. From Figs. 10 and 11, the SPE and T2 values areincreasing with time which reflects the structure change andwe found the fault is introduced from batch 5. This indicates
Fig. 8 – Monitoring for a changed correlation structure frombatch 5 by 2-D-DPCA (batch 1 to batch 10).

Fig. 11 – Monitoring for a changed correlation structurefrom batch 5 by 2-D-DKPCA (batch 1 to batch 10).

that linear approach has difficult in dealing with nonlinearobject.

From these two kinds of fault, it can be seen that the advan-tage of 2-D-DKPCA in the nonlinear dynamic processes.

design 8 8 ( 2 0 1 0 ) 1381–1392 1389

4

Imft

albpaiocbbtsmriI


.2. Penicillin process

n this section, the proposed method is applied to theonitoring of a well-known benchmark process, penicillin

ermentation process. A flow diagram of the penicillin fermen-ation process is given in Fig. 12.

Trajectories of nine variables from a nominal batch runre shown in Fig. 13. The production of secondary metabo-ites such as antibiotics has been the subject of many studiesecause of its academic and industrial importance. Here, therocess to produce penicillin, which has nonlinear dynamicsnd multiphase characteristics, is focus on. In typical operat-ng procedure for the modeled fed-batch fermentation, mostf the necessary cell mass is obtained during the initial pre-ulture phase. When most of the initially added substrate haseen consumed by the microorganisms, the substrate feedegins. The penicillin starts to be generated at the exponen-ial growth phase and continues to be produced until thetationary phase. A low substrate concentration in the fer-entor is necessary for achieving a high product formation

ate due to the catabolite repressor. Consequently, glucoses fed continuously during fermentation at the beginning.n the present simulation experiment, a total of 60 refer-

Fig. 13 – Trajectories of nine variab

Fig. 12 – Penicillin fermentation process.

ence batches are generated using a simulator (PenSim v2.0simulator). Detail process description is well explained from
http://www.chee.iit.edu/∼cinar/software.htm. These simula-tions are run under closed-loop control of pH and temperature,while glucose addition is performed open-loop. Small vari-
les from a nominal batch run.

http://www.chee.iit.edu/~cinar/software.htm

1390 chemical engineering research and design 8 8 ( 2 0 1 0 ) 1381–1392

Fig. 14 – Monitoring result of batch 2 (solid line: SPE chart;dotted line: control limit).

Fig. 15 – Monitoring result of batch 2 (solid line: T2 chart;

Fig. 17 – Monitoring result of batch 3 (solid line: T2 chart;dotted line: control limit).

dotted line: control limit).
ations are automatically added to mimic the real normaloperating conditions which are under the default initial set-ting conditions. The duration of each batch is 400 h, consisting
of a pre-culture phase of about 45 h and a fed-batch phase ofabout 355 h.

2
Fig. 19 – Monitoring result of batch 5 (solid line: T chart;dotted line: control limit).
To test the proposed 2-D-DKHA method, 5 normal batchesare used to build the model in this study. Sample period isset to 1 h. Then, another five batches are generated to be the
test data. Substrate feed rate fault is added on each batchfrom sample 100 to the end except the first and the fourthbatch and test them by the proposed method with 99% con-

desi

fifms2prw5pcSsc

5

I2btcceslcthdovimbavnfpw

R

A

C

C

C

C

C

D

G


dence limit. For batch 2, the substrate feed rate decreasedrom sample 100 with slope −0.003, Figs. 14 and 15 show the

onitoring result. The T2 chart detected the fault from aboutample 229 whereas the SPE chart detected it from sample01. For batch 3, the substrate feed rate decreased from sam-le 100 with slope −0.005, Figs. 16 and 17 show the monitoringesult. The T2 chart detected the fault from about sample 221hereas the SPE chart detected it from sample 202. For batch

, the substrate feed rate decreased to 80% of normal at sam-le 100, Figs. 18 and 19 show the monitoring results. The T2

hart detected the fault from about sample 122 whereas thePE chart detected it from sample 100. The monitoring resultshow 2-D-DKHA can detect the faults of complex batch pro-esses.

. Conclusions

n this article, two algorithms are proposed: 2-D-DKPCA and-D-DKHA. The proposed 2-D-DKPCA can capture both within-atch and batch-to-batch nonlinear dynamics at the sameime and it needs to store the whole kernel matrix and cal-ulates all nonlinear components. The proposed 2-D-DKHAan capture both within-batch and batch-to-batch nonlin-ar dynamics at the same time and it does not need totore the whole kernel matrix and only calculates the non-inear principal components. 2-D-DKPCA and 2-D-DKHA areombined with statistical process monitoring technique inhis article. However, the proposed 2-D-DKPCA method stillas shortcomings. The disadvantages of DPCA have beeniscussed (Treasure et al., 2004; Yao and Gao, 2008). It isutlined that dynamic extensions to conventional multi-ariate statistical process control models may lead to thenclusion of large numbers of variables in the condition

onitor. To prevent this, a dynamic monitoring scheme,ased on subspace identification, is introduced (Treasure etl., 2004). In 2-D-DPCA modeling, the utilization of stateariables instead of lagged process variables reduces theumber of variables and provides a clearer contribution plot

or fault diagnosis (Yao and Gao, 2008). This time-laggedroblem in 2-D-DKPCA will be solved in further researchork.

eferences

lbert, S. and Kinley, R.D., 2001, Multivariate statisticalmonitoring of batch processes: an industrial case study offermentation supervision. Trends Biotechnol, 19: 53–62.

hen, J.H. and Chen, H.-H., 2006, On-line batch processmonitoring using MHMT-based MPCA. Chem Eng Sci, 61:3223–3239.

hiang, L.H., Russell, F.L. and Braatz, R.D., (2001). Fault Detectionand Diagnosis in Industrial Systems. (Springer, London).

ho, J.-H., Lee, J.-M., Choi, S.W., Lee, D. and Lee, I.-B., 2005, Faultidentification for process monitoring using kernel principalcomponent analysis. Chem Eng Sci, 60: 279–288.

hoi, S.W. and Lee, I.-B., 2004, Nonlinear dynamic processmonitoring based on dynamic kernel PCA. Chem Eng Sci, 59:5897–5908.

hoi, S.W., Lee, C., Lee, J.-M., Park, J.H. and Lee, I.-B., 2005, Faultdetection and identification of nonlinear processes based onKPCA. Chemometr Intell Lab Syst, 75: 55–67.

ong, D. and McAvoy, T.J., 1996, Batch tracking via nonlinearprincipal component analysis. AIChE J, 42: 2199–2208.

allagher, N.B. and Wise, B.M., 1996, Application of multi-wayprincipal component analysis to nuclear waste storage tankmonitoring. Comput Chem Eng, 20: 739–744.

gn 8 8 ( 2 0 1 0 ) 1381–1392 1391

Gregersen, L. and Jorgensen, S.B., 1999, Supervision of fed-batchfermentations. Chem Eng J, 75: 69–76.

Kim, K.I., Franz, M.O. and Schölkopf, B., 2005, Iterative kernelprincipal component analysis for image modeling. IEEE TransPattern Anal Mach Intell, 27(9): 1351–1364.

Kourti, T., Lee, J. and MacGregor, J.F., 1996, Experiences withindustrial applications of projection methods for multivariatestatistical process control. Comput Chem Eng, 20: 745–750.

Ku, W.F., Storer, R.H. and Georgakis, C., 1995, Disturbancedetection and isolation by dynamic principal componentanalysis. Chem Intell Lab Syst, 30: 179–196.

Lee, J.M., Yoo, C.K. and Lee, I.B., 2004, Fault detection of batchprocesses using multiway kernel principal componentanalysis. Comput Chem Eng, 28: 1837–1847.

Lee, J.M., Yoo, C.K., Choi, S.W., Vanrolleghem, P.A. and Lee, I.-B.,2004, Nonlinear process monitoring using kernel principalcomponent analysis. Chem Eng Sci, 59: 223–234.

Lennox, B., Montague, G.A., Hiden, H.G., Kornfeld, G. andGoulding, P.R., 2001, Process monitoring of an industrialfed-batch fermentation. Biotechnol Bioeng, 74: 125–135.

Li, W. and Qin, S.J., 2001, Consistent dynamic PCA based onerrors-in-variables subspace identification. J Process Control,11: 661–678.

Lieftucht, D., Kruger, U. and Irwin, G.W., 2006, Improved reliabilityin diagnosing faults using multivariate statistics. ComputChem Eng, 30(5): 901–912.

Liu, X., Kruger, U., Littler, T., Xie, L. and Wang, S.-Q., 2009, Movingwindow kernel PCA for adaptive monitoring of nonlinearprocesses. Chemometr Intell Lab Syst, 96: 132–143.

Lu, N., Yao, Y., Gao, F. and Wang, F., 2005, Two-dimensionaldynamic PCA for batch process monitoring. AIChE J, 51:3300–3304.

Martin, E.B., Morris, A.J., Papazoglou, M.C. and Kiparissides, C.,1996, Batch process monitoring for consistent production.Comput Chem Eng, 20: S599–S604.

Mika, S., Schölkopf, B., Smola, A.J., MJuller, K.-R., Scholz, M. andRJatsch, G., 1999, KPCA and de-noising in feature spaces. AdvNeural Inform Process Syst, 11: 536–542.

Nomikos, P. and MacGregor, J.F., 1994, Monitoring batch processesusing multiway principal component analysis, AmericanInstitute of Chemical Engineers. AIChE J, 40: 1361–1375.

Nomikos, P. and MacGregor, J.F., 1995, Multivariate SPC charts formonitoring batch processes. Technometrics, 37: 41–59.

Nomikos, P. and MacGregor, J.F., 1995, Multi-way partial leastsquares in monitoring batch processes. Chem Intell Lab Syst,30: 97–108.

Qin, S.J., 2003, Statistical process monitoring: basics and beyond. JChemometr, 17: 480–502.

Qin, S.J., Cherry, G., Good, R., Wang, J. and Harrison, C., 2006, Asemiconductor manufacturing process control andmonitoring: a fab-wide framework. J Process Control, 16:179–191.

Rännar, S., MacGregor, J.F. and Wold, S., 1998, Adaptive batchmonitoring using hierarchical PCA. Chem Intell Lab Syst, 41:73–81.

Romdhani, S., Gong, S. and Psarrou, A., 1999, A multi-viewnonlinear active shape model using KPCA, In Proceedings ofBMVC Nottingham, UK, , pp. 483–492.

Russell, E.L., Chiang, L.H. and Braatz, R.D., 2000, Fault detection inindustrial processes using canonical variate analysis anddynamic principal component analysis. Chem Intell Lab Syst,51: 81–93.

Tates, D.J., Louwerse, A.A., Smilde, A.K., Koot, G.L.M. and Berndt,H., 1998, Monitoring a PVC batch process with multivariatestatistical process control charts. Ind Eng Chem Res, 38:4769–4776.

Treasure, R., Kruger, U. and Cooper, J.E., 2004, Dynamicmultivariate statistical process control using subspaceidentification. J Process Control, 14: 279–292.

Wang, H.Q., Song, Z.H. and Li, P., 2007, Modified kernel Hebbian
Algorithm with application to modeling ofhydro-dearomatization process. J Chem Ind Eng, 58(6):1518–1522.

and d

fault diagnosis of large-scale processes using multiblock


Westerhuis, J.A., Kourti, T. and MacGregor, F.J., 1999, Comparingalternative approaches for multivariate statistical analysis ofbatch process data. J Chemometr, 13: 397–413.

Van Sprang, E.N.M., Ramaker, H.-J., Westerhuis, J.A., Gurden, S.P.and Smilde, A.K., 2002, Critical evaluation of approaches foron-line batch process monitoring. Chem Eng Sci, 57:3979–3991.

Yao, Y. and Gao, F., 2008, Subspace identification fortwo-dimensional dynamic batch process statistical
monitoring. Chem Eng Sci, 63: 3411–3418.
Yue, H.H., Qin, S.J., Markle, R.J., Nauert, C. and Gatto, M., 2000,Fault detection of plasma etchers using optical emission

esign 8 8 ( 2 0 1 0 ) 1381–1392

spectra. IEEE Trans Semiconductor Manuf, 13: 374–385.

Zhang, Y. and Qin, S.J., 2007, Fault detection of nonlinearprocesses using multiway kernel independent analysis. IndEng Chem Res, 46: 7780–7787.

Zhang, Y. and Qin, S.J., 2008, Improved nonlinear fault detectiontechnique and statistical analysis. AIChE J, 12: 3207–3220.

Zhang, Y., Zhou, H., Qin, S.J. and Chai, T.Y., 2010, Decentralized

kernel partial least squares. IEEE Trans Ind Inform, 6:3–10.

statistical analysis and adaptive technique for dynamical process monitoring

Documents