166 ieee transactions on semiconductor manufacturing,...

14
166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004 Optimized Overlay Metrology Marks: Theory and Experiment Mike Adel, Mark Ghinovker, Boris Golovanevsky, Pavel Izikson, Elyakim Kassel, Dan Yaffe, Alfred M. Bruckstein, Roman Goldenberg, Yossi Rubner, and Michael Rudzsky Abstract—In this paper, we provide a detailed analysis of overlay metrology mark and find the mapping between various properties of mark patterns and the expected dynamic precision and fidelity of measurements. We formulate the optimality criteria and sug- gest an optimal overlay mark design in the sense of minimizing the Cramer–Rao lower bound on the estimation error. Based on the developed theoretical results, a new overlay mark family is pro- posed—the grating marks. A thorough testing performed on the new grating marks shows a strong correlation with the underlying theory and demonstrate the superior quality of the new design over the overlay patterns used today. Index Terms—Box-in-box marks, Cramer–Rao lower bound, dynamic precision, Fisher information matrix, grating marks, overlay mark, overlay mark fidelity, overlay metrology. I. INTRODUCTION A CCURATE and precise overlay metrology is a critical re- quirement in order to achieve high product yield in micro- electronic manufacturing. New challenges become evident as microlithography processes are developed for each new design rule node. A critical link in the overlay metrology chain is the metrology mark which is chosen to be included on the reticle, printed on the wafer, subsequently processed and which is ulti- mately imaged in the metrology tool in the metrology process. In this publication a theoretical and experimental study is de- scribed that shines new light on the limitations of existing mark designs while proposing and validating new designs of superior performance. In Fig. 1 a standard overlay (BiB) 1 mark is shown schemat- ically. It consists of two “boxes” printed on two subsequent layers—top (grey) and bottom (black)—between which the overlay is measured. By design the centers of symmetry of the inner (grey) and outer (black) boxes coincide. The actual Manuscript received December 22, 2003; revised January 20, 2004. This work was supported by the Israel Ministry of Industry and Trade in the framework of the “Magneton” program. M. Adel, M. Ghinovker, B. Golovanevsky, P. Izikson, E. Kassel, and D. Yaffe are with KLA-Tencor, Optical Metrology Division, Migdal HaEmek 23100, Israel (e-mail: [email protected]; [email protected]; [email protected]; [email protected]; [email protected]; [email protected]). A. M. Bruckstein, R. Goldenberg, Y. Rubner, and M. Rudzsky are with the Computer Science Department, Technion, Haifa 32000, Israel (e-mail: [email protected]; [email protected]; [email protected]; [email protected]). Digital Object Identifier 10.1109/TSM.2004.826955 1 In the present paper we unify all conventional overlay mark types—box-in-box, bar-in-bar, frame-in-frame, etc.—under the generic abbreviation “BiB.” Fig. 1. Standard BiB mark (schematically). overlay appears as misregistration between the centers of symmetry of the “black” and “grey” layers. There are two major use cases in overlay metrology for microlithography. The first and the most obvious is termed lot dispositioning. If measured overlay exceeds some allowable threshold, the lot cannot proceed to the next process step. This generally results in rework, that is the lot is returned to the previous lithography step after the resist is stripped. This is provided the overlay measurements were done immediately after development. Under some circumstances the overlay measurements after development are not viable, and are done after etch. In this case, there is no option for rework, and lots outside of allowable thresholds are scrapped. The second use case of overlay metrology is for correction of the exposure tool. Usually, the overlay is measured at four cor- ners of the field and over several fields on the wafer, which pro- vides the necessary statistical sampling to enable stepper cor- rections model to be calculated. This model includes intra-field and inter-field correctibles, such as offset, rotation and scale. These correctibles are fed back to the exposure tool to improve performance on subsequent lots. Conventional BiB based metrology has been the standard overlay metrology for almost two decades. However, as the overlay budget shrinks together with the lithographic design rules, a number of performance limitations are becoming evi- dent. These shortcomings are addressed in the section below. Application of grating structures to lithography and metrology fields is being extensively studied. One of such applications 0894-6507/04$20.00 © 2004 IEEE

Upload: others

Post on 06-Apr-2020

22 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

Optimized Overlay Metrology Marks:Theory and Experiment

Mike Adel, Mark Ghinovker, Boris Golovanevsky, Pavel Izikson, Elyakim Kassel, Dan Yaffe, Alfred M. Bruckstein,Roman Goldenberg, Yossi Rubner, and Michael Rudzsky

Abstract—In this paper, we provide a detailed analysis of overlaymetrology mark and find the mapping between various propertiesof mark patterns and the expected dynamic precision and fidelityof measurements. We formulate the optimality criteria and sug-gest an optimal overlay mark design in the sense of minimizing theCramer–Rao lower bound on the estimation error. Based on thedeveloped theoretical results, a new overlay mark family is pro-posed—the grating marks. A thorough testing performed on thenew grating marks shows a strong correlation with the underlyingtheory and demonstrate the superior quality of the new design overthe overlay patterns used today.

Index Terms—Box-in-box marks, Cramer–Rao lower bound,dynamic precision, Fisher information matrix, grating marks,overlay mark, overlay mark fidelity, overlay metrology.

I. INTRODUCTION

ACCURATE and precise overlay metrology is a critical re-quirement in order to achieve high product yield in micro-

electronic manufacturing. New challenges become evident asmicrolithography processes are developed for each new designrule node. A critical link in the overlay metrology chain is themetrology mark which is chosen to be included on the reticle,printed on the wafer, subsequently processed and which is ulti-mately imaged in the metrology tool in the metrology process.In this publication a theoretical and experimental study is de-scribed that shines new light on the limitations of existing markdesigns while proposing and validating new designs of superiorperformance.

In Fig. 1 a standard overlay (BiB)1 mark is shown schemat-ically. It consists of two “boxes” printed on two subsequentlayers—top (grey) and bottom (black)—between which theoverlay is measured. By design the centers of symmetry ofthe inner (grey) and outer (black) boxes coincide. The actual

Manuscript received December 22, 2003; revised January 20, 2004. Thiswork was supported by the Israel Ministry of Industry and Trade in theframework of the “Magneton” program.

M. Adel, M. Ghinovker, B. Golovanevsky, P. Izikson, E. Kassel, and D. Yaffeare with KLA-Tencor, Optical Metrology Division, Migdal HaEmek 23100,Israel (e-mail: [email protected]; [email protected];[email protected]; [email protected];[email protected]; [email protected]).

A. M. Bruckstein, R. Goldenberg, Y. Rubner, and M. Rudzsky are withthe Computer Science Department, Technion, Haifa 32000, Israel (e-mail:[email protected]; [email protected]; [email protected];[email protected]).

Digital Object Identifier 10.1109/TSM.2004.826955

1In the present paper we unify all conventional overlay marktypes—box-in-box, bar-in-bar, frame-in-frame, etc.—under the genericabbreviation “BiB.”

Fig. 1. Standard BiB mark (schematically).

overlay appears as misregistration between the centers ofsymmetry of the “black” and “grey” layers.

There are two major use cases in overlay metrology formicrolithography. The first and the most obvious is termed lotdispositioning. If measured overlay exceeds some allowablethreshold, the lot cannot proceed to the next process step. Thisgenerally results in rework, that is the lot is returned to theprevious lithography step after the resist is stripped. This isprovided the overlay measurements were done immediatelyafter development. Under some circumstances the overlaymeasurements after development are not viable, and are doneafter etch. In this case, there is no option for rework, and lotsoutside of allowable thresholds are scrapped.

The second use case of overlay metrology is for correction ofthe exposure tool. Usually, the overlay is measured at four cor-ners of the field and over several fields on the wafer, which pro-vides the necessary statistical sampling to enable stepper cor-rections model to be calculated. This model includes intra-fieldand inter-field correctibles, such as offset, rotation and scale.These correctibles are fed back to the exposure tool to improveperformance on subsequent lots.

Conventional BiB based metrology has been the standardoverlay metrology for almost two decades. However, as theoverlay budget shrinks together with the lithographic designrules, a number of performance limitations are becoming evi-dent. These shortcomings are addressed in the section below.Application of grating structures to lithography and metrologyfields is being extensively studied. One of such applications

0894-6507/04$20.00 © 2004 IEEE

Page 2: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 167

is for scatterometry based critical dimension (CD) metrology([9], [16], [15], [11]). Gratings are also used for phase shiftmonitoring ([7]). ASML is using grating patterns as alignmentmarks ([14]). In the current paper, we introduce grating marksfor overlay metrology.

A. Device Correlation

As design rules shrink to 100 nm and below, difference in BiBfeature size and device feature size have become significant.Both lithographic pattern placement errors (PPE) [4], [10] andinfluences of other processes (like chemical-mechanical pla-narization—CMP) are known to be feature size and density de-pendent [12]. Therefore, overlay metrology results based on BiBmarks may suffer from discrepancies compared with device fea-ture overlay.

B. In-Chip to Scribe-Line Discrepancy

Another source of BiB-to-device overlay discrepancy orig-inates from their different spatial location in the exposure toolfield. Typically, BiB marks are printed in the scribe lines near thefield corners. Optical conditions (aberrations, focus deviations,etc.) near the field edges may differ from those in the field in-terior, where the device features are printed. Since both overlaybudget and process window (as defined by allowed exposuretool focus and exposure) shrink together with the design rules,in-chip to scribe-line discrepancies are becoming critical.

C. Process Robust Marks

Conventional BiB marks are frequently considered designrule violations in modern IC manufacturing processes. BiBmarks are generically built of wide lines and require emptysurrounding spaces (exclusion zones; see Fig. 1) for successfulmeasurement. Both these facts usually contradict patterndensity and feature size design rule requirements commonly inpractice today. Such violations make handling of BiB marks bythe layout engineer problematic and more importantly have anegative impact on process robustness of the metrology marksince the process is optimized for features and patterns ofsignificantly different dimensions [6].

D. Tool Induced Shifts

Currently overlay measurements are performed on opticalimaging based tools. Optical aberrations and illuminationimperfections are an unavoidable reality of optical metrologysystem design and manufacture. A simple and quantitativemetric of the quality of the optical metrology tool is ToolInduced Shift (TIS). TIS is defined as the average of the overlaymeasurements performed on a given overlay mark before andafter rotation by 180 :

(1)

Nonzero TIS is an indication that the metrology tool has in-duced a systematic discrepancy in the overlay result due to theabove system imperfections. TIS is however, by definition, acalibratable error, if measurements are performed at both orien-tations on a subset of representative marks. A more important

Fig. 2. Schematic arrays of densely printed overlay marks used for OMFcalculation.

metrology uncertainty contributor is TIS variability, defined asthree times the standard deviation of the TIS measured oversites across the wafer.

E. Information Content

In spite of the large space occupied by the conventional BiBmark, it contains a relatively sparse amount of information foroverlay measurement. Generically, each BiB mark consists offour inner and four outer bars only, usually utilizing less than20% of the occupied real estate. By increasing the informa-tional content of the overlay mark one can minimize the effectof random (both spatial and temporal) noise on overlay mea-surement. There are two measurable parameters representingoverlay measurement uncertainty due to temporal and spatialnoise: dynamic precision and overlay mark fidelity (OMF) re-spectively.

F. Dynamic Precision

Dynamic Precision is defined as three times the standard de-viation of the results of a series of measurements of the sameoverlay mark, when these measurements are done in a dynamicloop (including wafer alignment, mark acquisition and measure-ment itself). This parameter quantifies temporal noise in themeasurement of a given overlay mark.

G. Overlay Mark Fidelity (OMF)

Suppose, one can eliminate the temporal noise in the overlaymeasurement (by means of averaging over many dynamic loopsof measurements on the same mark). This still does not ensurethat by measuring two nominally identical marks one will ob-tain identical results. OMF is defined as three times the standarddeviation of the measurement results of the densely printedidentical overlay marks after compensating for dynamic preci-sion [1] (see Fig. 2).

In the present paper, we dwell on the last three shortcomings,that is information content, precision and mark fidelity. We firstcompare different mark design options from a theoretical per-spective, focusing on the sampling, temporal noise, and spatialnoise aspects. We then introduce an optimized grating overlay

Page 3: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

Fig. 3. BiB overlay mark with right-outer and top-inner regions of interest.

mark, which demonstrates superior performance over the con-ventional BiB marks. We present experimental data on dynamicprecision (as a measure of temporal noise) and OMF (as a mea-sure of spatial noise) for the new grating overlay marks as com-pared with conventional BiB marks.

II. THEORY: DESIGNING PATTERNS FOR OPTIMAL OVERLAY

REGISTRATION AND POSITION ESTIMATION

In this section, we analyze the dependence of the dynamicprecision and fidelity of the overlay measurement on variouspattern parameters. The overlay measurement is based on mea-surements of horizontal and vertical positions of known pat-terns.

Fig. 3 shows an example of the BiB overlay mark with right-outer and top-inner regions of interest.

The frame’s edges are corrupted by noise whose character de-pends on various factors. We shall deal with two types of noise:additive Gaussian noise at the wafer level and additive Gaussiannoise at the camera level. The first type of noise is spatial noisewhose source is the manufacturing process, and the second typeis a dominant source of the temporal noise in the measurementprocess.

In this paper, we explore how the position estimation error isaffected by various pattern characteristics and by the parametersof the measurement process, by deriving the Cramer–Rao lowerbound on the estimation error for arbitrary patterns and, thenaddress the question of designing patterns that are optimal inthe sense of minimizing the location error.

A. Problem Definition

We shall first deal with the measurement of horizontal po-sition of a known one-dimensional (1-D) pattern in atwo-dimensional (1-D) image. Vertical position estimation canbe done in a similar way. We assume in this case that the mea-surement is performed for every image row independently, andthen an average pattern location estimate is returned as a result.

A periodic pattern of lines may be represented by

(2)

where is a 1-D pattern repeated on every line andis the “spatial noise” on the wafer.

Then the pattern at each row of the image acquired by thecamera can be described by

(3)

where is an overall point spread function, composed of the op-tical and camera point spread functions, is the convolution op-erator, and is the temporal noise at the camera output. All sig-nals are assumed band limited and the noise terms are assumedto be filtered and hence, band-limited, white, and Gaussian.

The task is to find best match locations of the designed patterngiven the signals measured over all image rows. In

the analysis below, we derive the distribution of the pattern loca-tion estimates over all measurements. The statistical analysis isbased on the Cramer–Rao bound, a well-known statistical tool[8], [5].

We can now define the following three problems

1) Find the dependence of the dynamic precision and overlaymark fidelity (OMF) metrics on the general parameterscharacterizing an one dimensional pattern and the mea-surement method without going into the detailed structureof the pattern .

2) Given the pattern , what is a lower bound on theunbiased estimation of the pattern location?

3) What is the optimal pattern in the sense of mini-mizing the Cramer–Rao lower bound on the estimationerror?

B. Dynamic Precision and Fidelity Estimations

In this section, we evaluate the precision and fidelity of themeasurement process based on some general physical parame-ters of the measured signal and the measurement process, suchas optical system aperture, wave length, pattern size, signal-to-noise ratio (SNR), and others, without considering the detailedstructure of the pattern .

1) Single Line Measurement: On every image row we esti-mate the pattern location by using the optimal Matched Filter orcorrelation method. Let be the estimator of the pattern loca-tion of 1–D signal immersed in white Gaussian noise. Itis known on the basis of very general statistical principles thatthe variance of is bounded below by the Cramer–Rao bound,which is given by (see, e.g., [8], [5]).

(4)

where is the signal’s energy, is the unilateralspectral density of the noise, and

is the square of the effective bandwidth of the signal, whereis a Fourier transform of . Using the definitions

and where is the averagesignal power, is the signal length, is the noise power and

is the noise bandwidth, we get

This formula shows that the precision is a function of the signaland noise bandwidths, signal to noise ratio, and the overalllength of the signal.

Page 4: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 169

We shall deal with two kinds of noise.

• Spatial noise originating from the mark itself. This noiseundergoes convolution with the point spread functionand therefore its band width is dictated by the opticalsystem. In this case we assume .

• Temporal noise originating mainly from the camera. Thisnoise is of higher bandwidth since we work in condi-tions of over sampling and the pixel size is assumed to beroughly 5 times smaller than the optical resolution length.In this case we assume .

In both cases, the effective bandwidth of the signal is dic-tated by the optical system. The upper bound for is dictatedby diffraction and can be approximated as the reciprocal of theRayleigh resolution distance which is given by

where is the optical wavelength and is the numerical aper-ture of the optical system.

Using we get the fol-lowing expression for the standard deviation of the pattern lo-cation:

With all other factors equal, the standard deviation of tem-poral noise case is smaller because the noise bandwidth is larger.By increasing while keeping all other factors constant we de-crease the variance.

2) Multiple Line Measurement: When we repeat the mea-surement for multiple image rows we should get the same re-sults with different random jitter or estimation noise. When wetake the average of the measurements the precision is increasedby the square root of the number of independent measurements.Let us measure rows with the same method. We shall dealwith two cases: Temporal Noise and Spatial Noise.

3) Temporal (Measurement) Noise: The number of indepen-dent measurements is , and therefore, the standard deviationwill obey

This gives us the dynamic precision of the measurement. Ifwe put the following numbers as a concrete (real life) example

m, m,m we get

nm

4) Spatial (Pattern) Noise: In this case the number of inde-pendent measurements is not since the optical point spreadfunction blurs the noise. Let us call the vertical sampling in-terval on the wafer. The number of independent measurementsis approximated by

In addition the noise bandwidth is roughly the inverse ofthe optical spatial resolution . The standard deviation of the

measurement, interpreted as the Statistical Accuracy (OMF)will hence obey

If we use the following numbers for a concrete (real life) ex-ample m, m,

m, we obtain

nm

The expressions for the statistical precision and fidelity(OMF) are very similar, except the dependence on and

. In order to increase both precision and fidelity we needto decrease the wavelength, and to increase the informationcontent of the signal (the effective bandwidth), the signal’sspatial region ( and ) and the signal to noise ratio. In orderto reach the bound we have to use a signal whose effectivebandwidth achieves . This means that the signal fullyexploits the frequency band up to the limit of diffraction.

C. Lower Bound Estimation—Exploring the Detailed Structureof the Pattern

In this section, we derive a lower bound on the estimationerror for locating a known one dimensional pattern . Unlikethe previous section, this time we wish to get an expression forthe Cramer–Rao bound for a known pattern, in order to gainsome intuition on how optimal patterns should look like.

From (2) and (3), the observed signal is given by

(5)

where is an overall noise added to the designedpattern . We assume that is a gaussian point spread function(PSF), with standard deviation of and is azero-mean white Gaussian noise.

Our measurement vector, is constructed by sam-pling the on a uniform pixel grid. We assume that the firstsample, , has an offset of from the origin.

Since consists of statistically independent, due to thewhiteness of the noise, measurements , and since the addi-tive noise is Gaussian, we can write the probability function as

(6)

where is the number of measurements (pixels), and is theset of samples of , taken at the same positions as .

For convenience we use another form of the (scalar)Cramer–Rao bound [8], [5]:

(7)

where is the (scalar) Fisher information matrix

Page 5: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

170 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

For our Gaussian case, using (6) it is easy to derive that

(8)

where . Notice that the Fisher information term, andtherefore the Cramer–Rao bound depend on .

Sometimes, more intuition regarding the properties of the pat-tern can be gained from the following way to rewrite ofthe Fisher information expression in (8)

By limiting our discussion to binary input patterns, we canobtain even simpler expression for the Cramer–Rao bound. As-suming the pattern is composed of rectangular blocks,whose left end right edges coordinates are ,the smoothed signal can be easily computedusing the erf function as

where

Then the Fisher information vector will be given by

(9)

D. Design of an Optimal Pattern

In [2](see also [3]) Bruckstein et al. showed how to designa overlay mark pattern of pixels that achieves an estimationof the position with exponential lower bound accuracy.Optimality of this pattern was shown in the sense of informationtheory. Unfortunately, in our model that uses the Cramer–Rao

bound as optimality criteria, and includes PSF smoothing of thepattern and additive Gaussian noise, the BO&O overlay markdesigned in [2] is not optimal.

The information theory approach in [2] uses the actual valuesof the pattern, i.e., zeros and ones. For every additional pixel, theuncertainty of the estimation that was achieved with the previouspixels is improved by a factor of two as the pattern is designedso that the last pixel has a value of “0” for half of the uncer-tainty area and a value of “1” for the other half. The Cramer–Raobound on the other hand uses the changes in the pattern, i.e., itsderivative. This means that the optimal pattern should have highderivative for all values of . The BO&O patternof [2] is not optimal in the Cramer–Rao bound sense because oftwo main reasons: first, for the lower frequency part of the pat-tern, the derivative of the pattern is zero over a large “wasted”areas. Second, for the high frequency part, the changes in thepattern are too dense and will be completely smoothed out bythe imaging PSF.

1) Designing an Offset Invariant Bound Pattern: As wehave already mentioned, the Cramer–Rao bound is not a scalar(number), but rather a function of the offset . Hence the samepattern that undergoes different transformations (translations)yields different Cramer–Rao lower bounds for the differentvalues of the transformation parameters. Therefore we wouldbe interested in an integral measure that guarantees that for anytransformation, i.e., for all the possible range of translations,we would be able to recover the transformation parameterswith high accuracy.

This implies that the pattern is to be designed in such a waythat its derivatives are as large as possible over all the -supportof the pattern (other than a small number of points where thederivative must change sign). Shih and Yu [13] deal with similarproblem for the case of continuous value signal by solving theCramer–Rao minimization problem using quadratic program-ming. In our case of binary signal the optimality is achieved byusing a rectangular pulse signal where the distance between thepulses is . This distance brings adjacent pulses in thesmoothed pattern to touch each other at points where they reachzero.

In order to maximize the minimum of the Fisher informationover all the possible values, we shall need to distribute thebinary blocks edges as evenly as possible.

2) Uniform Fractional Parts Distribution: The problem ofeven distribution of binary blocks over a given interval can beformulated in the following way: Given a constant , place amaximal possible number of points within an interval of length

, in such a way that the minimal dis-tance between adjacent points is greater than , and the frac-tional parts of the points coordinates are uniformlydistributed in the interval [0, 1].

By uniform distribution, we mean that if the number of pointsis , their fractional parts should be apart from each other.

Without loss of generality we can place the first point at po-sition zero. Indeed, if there exists a better mapping that startsfrom another position , we can always shift all the points to theleft by thus retaining the same number of points and not dis-turbing the uniform distribution of the fractional parts of pointscoordinates.

Page 6: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 171

Fig. 4. Fractional parts distribution on unit circle.

If the minimal distance between the adjacent points is , thenthe maximal number of points that can be placed within theinterval of length is . Following our definitionof uniformity, the fractional parts of points positions should be

apart from each other.

We suggest the following greedy algorithm for choosingpoints in the interval such that their fractional parts coverthe set and the points are at leastapart from each other.

The algorithm:

1) Set PointsSet UsedFractions

2) Repeat times (steps 3–6)3) Advance4) Advance to the nearest position , whose fractional

part is a multiple of andUsedFractions.

5) Points Points6) UsedFractions UsedFractions

Let us analyze the performance of the algorithm above andcheck whether it can really place points as required.

We denote the fractional part of the by , i.e., .Advancing by from the first point placed at 0 position brings

us, in general, to a position, whose factional part is not a multipleof . To get to the nearest multiple of we have to skip aninterval of length (see Fig. 4). This situationemerges every time we advance by a step of size from a pointaligned to an -multiple fraction. Hence, the overhead of this

-alignment integrated for points is bounded by

Now let us estimate the overhead of the search for the positionwith fractional part that has not been visited yet. As we limitourselves to fixed positions on the unit circle—the multiplesof —we can now switch to the discrete domain and express

everything in terms of steps of size on the unit circle. Letbe the number of steps traversed on unit circle when advancingby on the interval. The is given by

For the example depicted in Fig. 4, the . Let. Then by advancing every time by we visit

-aligned locations on unit circle before returning to 0.Ideally, when the and are co-prime numbers allfractions will be visited before returning to 0 position on unitcircle, thus yielding a valid placement of points with zerooverhead (besides the mentioned above). In generalcase, for , every time we return to a position that has al-ready been visited, one -step is to be skipped in order to switchto another fractions chain of length . Clearly, there will be

such chains, which means that the overhead is given byThus, the total overhead of the algorithm

is

Therefore, the guaranteed lower bound on the number ofpoints that can be placed by the algorithm within the interval oflength is given by

and the actual maximal number of points can be found as

E. The Dense Pattern

Using the binary blocks positions obtained by the algorithmdescribed above we can build an optimal pattern of any lengthfor any given value of .

Fig. 5 shows a comparative analysis of the newly designed“Dense Pattern” and the BO&O patterns. The left part of thefigure shows the BO&O pattern and the right part shows thedense pattern. The original signal, the blurred signal, the blurredsignal derivative and the Cramer–Rao bound are shown from topto bottom. Notice that for the BO&O pattern the Cramer–Raolower bound is different for different values of , while for thedense pattern the bound is much lower and almost indepen-dent of the offset. Fig. 6 presents the simulation results con-firming the theoretical bounds. The simulation was performedby shifting the patterns by various offsets from the interval [0,1], adding Gaussian noise and looking for maximal correlationscore with the original signal. For every offset the experimentwas performed 100 times and the average estimation error iscalculated. Both BO&O and the dense patterns are of the samelength pixels, the noise variance is , and thePSF variance . As predicted, the dense pattern yields

Page 7: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

172 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

Fig. 5. BO&O and dense (optimal) overlay mark for eight pixels. First row: Original signals. Second row: Blurred signals. Third row: Derivatives. Fourth row:Cramer–Rao bound.

Fig. 6. Position estimation error for BO&O—solid line, and dense (optimal) pattern—dotted line.

lower estimation error, which is more uniformly distributed overthe offset range.

III. EXPERIMENT: GRATING MARK—A NEW OPTIMIZED

OVERLAY MARK

Motivated by the theoretical results presented above we de-signed and tested a new family of overlay marks—the gratingmarks.

Based on the same general principals as the optimal densepattern described above, grating marks however differ in thefollowing aspects:

• Grating marks, unlike the dense pattern, are periodic. Thissignificantly simplifies the manufacturing process and itturns out that, at the current level of technology, the im-provement achieved by uniform edge placement of thedense pattern is negligible compared to other sources oferror.

Page 8: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 173

Fig. 7. Grating marks (schematically): (a) clockwise (CW) and (b) counterclockwise (CCW).

Fig. 8. Zoom to one grating mark octant. It is built of periodic series of lines.

• Grating marks, very much like the BO&O overlay mark,are designed as a multiscale structures. This facilitates ap-plication of multiresolution algorithms for faster patternregistration and position disambiguation (anti-aliasing).

In Fig. 7, a grating mark is shown schematically. Similarlyto conventional BiB marks, the grating mark consists of inner(grey) and outer (black) structures printed on top and bottomlayer. Each of these structures is symmetric with respect to 90rotation. The grating mark consists of eight octants (four “grey”and four “black”) and fills an area of , where is the“grating mark size”. Unlike conventional BiB marks, gratingmarks do not require any exclusion zone around them, i.e., anyother structures can be printed in the immediate vicinity of thegrating marks. Grating marks are comprised of structures atthree scales of design:

• On the largest scale (of the order ) the grating markconsists of two layers (“grey” or “inner” and “black” or“outer”) and eight octants. At this scale the grating markis characterized by its size , inner and outer layers, andgrating mark chirality. There are two possible chiralitiesof the grating marks: clockwise (CW)—as shown in Fig.7(a), and counterclockwise (CCW)—see Fig. 7(b).

• On the next spatial scale (termed the “metrology interac-tion scale”) one can see that each octant comprises a pe-riodic series of lines and spaces with a characteristic scaleabout 1 m (see Fig. 8). This enhances information con-tent and enables new image processing techniques due to

Fig. 9. Zoom to two “coarse” lines from the previous figure. Such “coarse”lines may be finely segmented.

the periodicity of the signal. This concept of a periodicmark is conceptually different from the conventional BiBapproach. At this scale grating mark is characterized bypitch (i.e., period of the line series) and duty cycle (orline-to-period ratio).

• Finally, on the third spatial scale (termed “lithography in-teraction scale”), when we zoom in to a single line fromFig. 8, it appears that this line is finely segmented (withthe design rule line and space pattern; see Fig. 9). Thisfine segmentation is typically below the optical resolu-tion limit of the optical metrology tool, and therefore onlythe “coarse” lines create contrast in the acquired image.However, solid and finely segmented “coarse” lines areknown to behave differently both lithographically and inprocess-related areas. At this scale, the grating mark ischaracterized by the fine segmentation pattern chosen.

A. Grating Mark Overlay Measurement

Similarly to the conventional BiB, the overlay of the gratingmark is calculated as a misregistration between the centers of

Page 9: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

174 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

Fig. 10. Dynamic precision of grating marks as function of the area of the ROIs.

symmetry of the inner (“grey”) and outer (“black”) patterns. Anoptimized measurement algorithm may be designed to utilizethe pre-defined periodic nature of the grating mark patterns.

B. Improved Dynamic Precision of the Grating Mark

Contrary to conventional BiB marks, grating marks utilize themajority of the mark area for the overlay measurement. Thusinformation content of the grating mark is significantly higherthan that of BiB marks. This increased information content re-sults in improved dynamic precision and overlay mark fidelity.

In the experimental verification of the theory, we present re-sults of measurements performed on two nonsegmented (NS)grating marks with two different grating periods (pitches). Wehave measured these marks in 10 dynamic loops and calculateddynamic precision as a function of the area of the mark utilizedfor the overlay calculation. We varied the width and length

in a way preserving the same proportion , and observedthe dynamic precision of the overlay measurements as a func-tion of .

Fig. 10 shows graphs of the precision in and directions,for grating marks with pitches of 2400 and 1500 nm. Includedare best-fit power law curves of the graphs. It is observed thatthe experimental data closely follow a hyperbolic relationshipbetween precision and mark area. This is in a good agreementwith theory.

The theory provides a lower bound for precision, assumingthe maximal effective bandwidth, but it does not give the pitchdependence. The better precision of the 1500-nm pitch markrelative to the 2400-nm pitch mark is in a qualitative agreementwith the theory, indicating that the precision improves with thespatial frequency of the grating.

In addition we have performed extensive measurements ofthe dynamic precision on many different (both conventional BiBand new grating) marks on various layers and wafers, run underdifferent process conditions. Wafers were run in three different

semiconductor manufacturing fabs, identified as Fab1, Fab2 andFab3, on four different processes described as follows.

• Poly/Active: the first patterning step was an active layer,followed by STI processing and an oxide CMP step. Thiswas followed by a gate oxide process and polysilicon de-position. The second patterning step was at Poly.

• Etched Si: this process is a simplified version of the samesequence of patterning as above. Silicon was etched withthe Active pattern. Then a layer of photoresist was spunover etched Silicon and patterned with the Poly reticle.

• Via/Metal: the first patterning step was on a dielectricstack and was processed as a Cu single-damascene metallayer. This was followed by Cu-CMP and deposition of anintermetallic dielectric stack. The second patterning stepwas at Via.

• Metal/Via: On a dielectric stack intended for Cu-dualdamscene, the first patterning step was via. After via etch,photoresist was spun on the same stack and patternedwith Metal trenches.

Fig. 11 summarizes dynamic precision performance of thegrating overlay marks comparatively to conventional BiB. Non-segmented and design rule segmented marks are grouped sepa-rately.

C. Improved Overlay Mark Fidelity (OMF) of theGrating Mark

In order to verify the anticipated reduction in spatial noise,we have measured an array of closely printed identical overlaymarks (both BiB and grating marks). All the marks were mea-sured 10 times in a dynamic loop to separate spatial noise fromtemporal noise. Fig. 12 shows the dependence of the overlaymark fidelity (OMF; which is experimental measure of spatialnoise) on the area of the ROIs.

It can be seen that the graphs do not behave as hyperbolas. Byincreasing the kernel size we improve OMF up to some limit

Page 10: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 175

Fig. 11. Dynamic precision of the old (BiB) and new (grating) overlay marks on many different process layers.

Fig. 12. OMF of grating marks as function of the area of the ROIs.

(around 4 microns). Beyond this point OMF nearly saturates.This is believed to indicate that spatial noise does not behave aswhite noise. There are several possible explanations for this hy-pothesis. Firstly, there are some systematic errors such as reticleerrors, which are different from one mark to another, which dohowever repeat themselves field to field over the wafer [1]. Fur-thermore, there are some frequency dependent sources of spatialnoise due to the nature of wafer processing. Fig. 13 presents thesummary of the OMF results of measurements made on manydifferent (both conventional BiB and new grating) marks on var-ious layers and wafers.

Although OMF is an effective metric to estimate the impactof process noise on the metrology uncertainty, there are addi-

tional factors that influence overlay metrology performancewhich should be mentioned. The impact of film stack thicknessand composition on overlay metrology performance is twofold.Firstly, they are the key factors determining image contrast. Al-though there is no fundamental difference in the physics whichdetermines the contrast in images of isolated (BiB like) versusgrating structures, as discussed above, multiple edges increaseinformation content and reduce the contrast threshold abovewhich minimum metrology performance is achieved. Secondly,significant topographical differences between process layersmay impact metrology tool performance. In this area, nosignificant differences in performance were detected betweenBiB and grating targets in the current study.

Page 11: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

176 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

Fig. 13. OMF of the old (BiB) and new (grating) overlay marks on many different process layers

Fig. 14. Tested patterns. From top to bottom: 1-D BiB, nonsegmented grating, segmented grating, segmented grating with interleaving, and the dense (optimal)pattern.

D. Grating Versus Dense Patterns—Simulation

Finally, let us look once again at the optimal dense patternand estimate its advantage over the suboptimal grating patterns.Since we do not have a real dense pattern, we may try to answer

this question by running a simulation as described in SectionII-E. Fig. 14 presents the general form of the tested patterns: the1-D BiB pattern, three grating patterns and the dense optimalpattern. Three grating patterns were tested using two differentlow pitches: 1500 and 2400 nm. The high pitch was 300 nm

Page 12: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 177

Fig. 15. Average position estimation error for BiB pattern, three grating patterns with low pitch 1500- and 2400-nm and the dense (optimal) pattern.

in both cases. The simulation was performed in a 100 pixelswindow, where the pixel size is assumed to be 80 nm.

Simulation results are shown in Fig. 15. One can see that thedense pattern indeed achieves the best accuracy results and theerror is approximately the same for the whole range of possibleoffsets. Another observation is that the precision achieved bythe segmented grating mark is very close to that of the densepattern.

IV. CONCLUDING REMARKS

In this paper, we conduct a thorough analysis of patternsused for overlay metrology and establish the dependencebetween various pattern properties and the expected dynamicprecision and fidelity of the measurements. We show how theCramer–Rao lower bound on the estimation error can be foundfor a given pattern.

We formulate a criteria for the offset invariant bound patternand develop a uniform fractional parts distribution algorithm,which can be used to design an optimal pattern in a minimalCramer–Rao lower bound sense. We suggest such an optimaldesign—the dense pattern—and provide a comparative, simu-lation based performance analysis with the commonly acceptedBiB mark.

We then present a new family of overlay mark patterns—thegrating marks and provide a detailed analysis of their propertiesbased on real measurements. The measured dynamic precisionand overlay mark fidelity of the new grating marks are closeto the theoretically predicted values and demonstrate the supe-riority of the new overlay mark family over the existing BiB

marks. The measurements performed using a computer simula-tion also show that the grating marks have performance close tothat of the optimal dense patterns.

ACKNOWLEDGMENT

The authors would like to thank C. Mack from KLA-Tencor,FINLE Division, Austin, TX, for helpful discussions on themanuscript. They would also like to thank the anonymous re-viewers for useful suggestions.

REFERENCES

[1] M. E. Adel, M. Ghinovker, J. Poplawski, E. Kassel, P. Izikson, I. Pol-lentier, P. Leray, and D. Laidler, “Characterization of overlay mark fi-delity,” in Proc. SPIE, Metrology, Inspection and Process Control forMicrolithography XVII, vol. 5, 2003, pp. 5038–5043.

[2] A. M. Bruckstein, L. O’Gorman, and A. Orlitsky, “Design of Shapes forPrecise Image Registration,” AT&T Bell Labs, Tech. Rep, 1989.

[3] , “Design of shapes for precise image registration,” IEEE Trans.Inform. Theory, vol. 44, pp. 3156–3162, Nov. 1998.

[4] T. A. Brunner, “Impact of lens aberrations on optical lithography,” IBMJ. Res. Dev., vol. 41, no. 1/2, 1997.

[5] T. M. Cover and J. A. Thomas, Elements of Information Theory, ser.Telecommunications. New York: Wiley, 1991.

[6] P. Dirksen, C. A. Juffermans, A. Leeuwestein, C. Mutsaers, T. A. Nuijs,R. J. Pellens, R. Wolters, and J. Gemen, “Effect of processing on theoverlay performance of a wafer stepper,” in Proc. SPIE, Metrology, In-spection and Process Control for Microlithography XI, vol. 3050, 1997,pp. 102–113.

[7] B. La Fontaine, M. Dusa, J. Krist, A. Acheta, J. Kye, H. Levinson, C.Luijten, C. Sager, J. Thomas, and J. van Praagh, “Analysis of focus errorsin lithography using phase-shift monitors,” in Proc. SPIE, vol. 4691,2002, pp. 315–324.

[8] C. W. Helstrom, Elements of Signal Detection and Estima-tion. Englewood Cliffs, NJ: Prentice Hall, 1994.

Page 13: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

178 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 17, NO. 2, MAY 2004

[9] J. M. Holden, T. Gubiotti, W. A. McGaham, M. Dusab, and T. Kiersb,“Normal incidence spectroscopic ellipsometry and polarized reflectom-etry for measurement of photoresist critical dimensions,” in Proc. SPIE,vol. 4989, 2002, pp. 1110–1121.

[10] A. Luci and E. G. Ballarin, “Optimization of alignment markers to limitthe measurement error induced during exposure by lens aberration ef-fects,” in Proc. SPIE, Metrology, Inspection, and Process Control forMicrolithography XVI, vol. 4690, 2002, p. 374.

[11] M. Littau, J. Ch.-Raymond, Ch. Gould, and Ch. Gambill, “Novel imple-mentations of scatterometry for lithography process control,” in Proc.SPIE, vol. 4689, 2002, pp. 506–516.

[12] B. F. Plambeck, N. Knoll, and P. Lord, “Characterization of chem-ical-mechanical polished overlay targets using coherence probemicroscopy,” in Proc. SPIE, Integrated Circuit Metrology, Inspectionand Process Control IX, vol. 2439, 1995, pp. 298–308.

[13] S.-W. Shih and T.-Y. Yu, “On Designing an Isotropic Fiducial Mark,”Multimedia Man-Machine Interface Laboratory, Department of Com-puter Science and Information Engineering, National Chi Nan Univer-sity, Tech. Rep. TR-M3LAB-2002-002.

[14] J. Staecker, S. Arendt, K. Schumacher, E. Mos, R. van Haren, M. vander Schaar, R. Edart, W. Demmerle, and H. Tolsma, “Advances inprocess overlay on 300 mm wafers,” in Proc. SPIE, vol. 4689, 2002,pp. 927–936.

[15] Y. Toyoshima, I. Kawata, Y. Usami, Y. Mitsui, A. Sezginer, E. Maiken,K.-C. Chan, K. Johnson, and D. Yonenaga, “Complementary use of scat-terometry and SEM for photoresist profile and CD determination,” inProc. SPIE, vol. 4689, 2002, pp. 196–205.

[16] V. Ukraintsev, M. Kulkarni, C. Baum, K. Kirmse, M. Guevremont, S.Lakkapragada, K. Bhatia, P. Herrera, and U. Whitney, “Spectral scat-terometry for 2D trench metrology of Low-K dual-damascene intercon-nect,” in Proc. SPIE, vol. 4689, 2002, pp. 189–195.

Mike Adel received the B.Sc. degree in physicsfrom the University of New South Wales, Sydney,Asutralia, in 1983 and D.Sc. degree in solid statephysics from the Techion-Israel Institute of Tech-nology, Haifa, in 1989.

Since graduation, he has held positions as an in-dustrial physicist in areas ranging from electroopticalremote sensing, biomedical diagnostics and morerecently semiconductor metrology. Since 1999, hehas been employed by KLA-Tencor Israel, MigdalHaEmek, where he currently holds the position of

Director of Advanced Development for the Optical Metrology Division. He isalso an adjunct lecturer to the Technion Industrial Engineering faculty wherehe teaches hi-tech product development in the MBA program.

Mark Ghinovker received the M.Sc. degree in radio-physics and electronics from the Nizhnii Novgorod(formerly Gorkii) University, Russia, in 1990 and thePh.D. degree in solid state physics from Bar-Ilan Uni-versity, Ramat Gan, Israel, in 2000.

He dealt with theoretical and numerical study ofsuperconductivity. Since graduation, he has been em-ployed by KLA-Tencor Israel, Migdal HaEmek, as analgorithm and system engineer in the Advanced De-velopment Group for the Optical Metrology Division.His current research interests span over the fields of

superconductivity, signal processing, pattern recognition, numerical methods ofsolving PDE, optics, microlithography, and semiconductor physics.

Boris Golovanevsky received the M.Sc. degreein industrial heat technology from the PolytechnicInstitute of Odessa, Odessa, U.S.S.R. (now Russia),in 1989 and M.Sc. degree in aerospace engineeringfrom the Techion-Israel Institute of Technology,Haifa, in 1995.

Since 2000, he has been employed by KLA-TencorIsrael, Migdal HaEmek, where he currently holds theposition of Research Scientist in Advanced Develop-ment group for the Optical Metrology Division.

Pavel Izikson received the M.A. degree in statisticsfrom the University of Haifa, Haifa, Israel, in 2000.

Since 2000, he has been employed by KLA-TencorIsrael, Migdal HaEmek, where he currently holds theposition of Statistician in Advanced Development forthe Optical Metrology Division.

Elyakim Kassel graduated in 1986 from Ecole Na-tionale Superieure des Mines de Paris (France) ma-joring in materials science and engineering. He re-ceived the Ph.D. degree in physics from the Univer-sity of Illinois at Chicago om 1991.

He joined National Semiconductor’s Fairchild Re-search Center, Santa Clara, CA, in 1991. He workedthere as a senior process engineer in etch. From 1992to 1995, he was a senior process engineer in lithog-raphy at National Semiconductor (later Tower Sem-inconductor) in Migdal HaEmek, Israel. From 1995

to 2000, he worked at SemiConductor Devices, Misgav, Israel, as a quality en-gineer and subsequently as acting Quality Engineering manager and ReliabilityGrowth team leader. Since 2000, he has been employed by KLA-Tencor Corpo-ration, Optical Metrology Division, Migdal HaEmek, Israel, where he currentlyholds the position of project leader.

Dan Yaffe received the Ph.D. degree in physics fromthe Weizmann Institute of Science, Rehovoth, Israel,in 1971.

He worked at the Israeli Armament Authority(RAFAEL) from 1974 to 1985 on image and signalprocessing algorithms. Since 1985, he works asan independent consultant for Israeli companieson Computer Vision subjects. His current researchinterests include Pattern Recognition and ObjectRecognition.

Alfred M. Bruckstein was born in Transylvania, Ro-mania, on January 24, 1954. He received the B.Sc.and M.Sc. degrees in electrical engineering, from theTechnion-Israel Institute of Technology, in 1977 and1980, respectively, and the Ph.D. degree in electricalengineering from Stanford University, Stanford, CA,in 1984.

From October 1984, he has been with the Tech-nion. From 1987 to 2000, he was a frequent visitorat AT&T Bell Laboratories, Murray Hill, NJ. Hispresent research interests are in computer vision,

ant robotics, pattern recognition, image processing, and computer graphics.He has also done work in estimation theory, signal processing, algorithmicaspects of inverse scattering, point processes and mathematical models inneurophysiology.

Prof. Bruckstein is a member of SIAM, MAA, and AMS, and presently he isthe Dean of the Technion Graduate School.

Roman Goldenberg received the B.A. (summa cumlaude) and Ph.D. degrees in computer science, fromthe Technion-Israel Institute of Technology, Haifa, in1995 and 2003, respectively.

From 1994 to 1996 and 1999, he was with IBMResearch Lab in Haifa. Currently, he is a researchfellow at the Technion R&D Foundation and theSamuel Neeman Institute for Advanced Studiesin Science and Technology. His research interestsinclude video analysis, tracking, motion basedrecognition, PDE methods for image processing,

and medical imaging.

Page 14: 166 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL…robotics.stanford.edu/~rubner/papers/Adel_2004.pdf · 2004-05-10 · 168 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING,

ADEL et al.: OPTIMIZED OVERLAY METROLOGY MARKS: THEORY AND EXPERIMENT 179

Yossi Rubner received the B.Sc. degree (summa cumlaude) in computer science from the Technion-Insti-tute of Technology, Haifa, in 1991, and the M.S. andPh.D. degrees in computer science (with Ph.D. minorin electrical engineering) from Stanford University,Stanford, CA, in 1997 and 1999, respectively.

He is currently an entrepreneur and the principal ofRubner Technology Consulting where he is engagedin projects and consulting to various companies andventure capital firms in the areas of image processingand computer vision. From 2002 to 2003, he was

with Applied Materials, Israel, where he managed the Image Processing groupsof ETEC’s Mask Inspection Product Group (MIPG). Between 1999–2001, heserved as Vice President R&D and CTO of Jigami Corp., Israel, where he ledthe research and development of multimedia acceleration platforms for cellularoperators. His previous experience includes consulting for Xerox Corporationin the Palo Alto Research Center (PARC) in the area of image understanding,development of desktop publishing systems at the Scitex Corporation, andleading R&D projects in an Israel Defense Force Technical Unit from which hewas honorably discharged with the rank of Captain.

Michael Rudzsky received the Ph.D. degree inphysics and mathematics from the Institute of SpaceResearsh, Moscow, U.S.S.R. (now Russia), in 1980.

He worked in the Scientific and Industrial Associ-ation for Space Research, Baku, Azerbajan, till 1990.Since 1991, he was a research fellow in the PhysicsDepartment and since 1995 at the Computer ScienceDepartment of the Technion, Haifa, Israel. His cur-rent recearch interests include computer vision, pat-tern recognition, and compression of images.