isscc 2009 / session 3 / microprocessor technologies / 3

3
68 2009 IEEE International Solid-State Circuits Conference ISSCC 2009 / SESSION 3 / MICROPROCESSOR TECHNOLOGIES / 3.7 3.7 Dual-DLL-Based CMOS All-Digital Temperature Sensor for Microprocessor Thermal Monitoring Kyoungho Woo 1 , Scott Meninger 2 , Thucydides Xanthopoulos 2 , Ethan Crain 2 , Dongwan Ha 1 , Donhee Ham 1 1 Harvard University, Cambridge, MA, 2 Cavium Networks, Marlborough, MA Today’s microprocessors increasingly need on-chip temperature sensors for thermal and power management [1]. Since these sensors do not take part in the main computing activity but rather play the auxiliary, albeit important, role of temperature monitoring, their presence in terms of area, power, and design effort should be minimal, thus, all-digital sensors are desired. Temperature sens- ing based on temperature-dependent delays of inverters [2] could be suited for microprocessor applications, as it lends itself to digital implementation: by using a time-to-digital converter (TDC), an inverter delay can be compared to an absolute delay reference and converted to a digital temperature output [2] (Fig. 3.7.1). We report on an all-digital CMOS temperature sensor for microprocessor application, which also exploits temperature-dependent inverter delays within the TDC-based framework of Fig. 3.7.1. It, however, has two improvements over prior art of [2]. First, it removes the effect of process variation on inverter delays via calibration at one temperature point (instead of 2-point calibration of [2]), thus, reducing high volume production cost. Second, we use two fine-precision DLLs, one to synthesize a set of temperature-independent delay references in a closed loop, the other as a TDC to compare temperature-dependent inverter delays to the references. The use of DLLs simplifies sensor operation and yields a high measurement bandwidth (5kS/s) at 7b resolution, which could enable fast temperature tracking. This is in contrast to [2], where a counter-based cyclic TDC with an open-loop single delay-reference has a longer measurement time for a similar resolution. Inverter delays vary with both temperature and process. Our sensor calibrates out the process dependency with a delay measurement at one temperature. This 1-point calibration is enabled by our observation that the inverter delay can be separated into a temperature-only-dependent part and a process-only-dependent part. The delay, D, of a CMOS inverter with equal PMOS and NMOS strengths may be expressed as Temperature-dependent parameters are the mobility, μ, and threshold voltage, V th , but V th weakly varies with temperature, so to the first order, only μ T α (α: constant) accounts for temperature dependence of D. To highlight this, eq. (1) can be rewritten as D(T, P) = T -α G(P) , where G(P) is what is left in D after factoring out T -α , and P collectively denotes various process variations. G(P) varies only with process variation. T -α varies only with temperature (α varies slowly with doping level, so it can be regarded as a constant in a given technology). This separation of variables enables 1-point calibration. First we perform calibra- tion once by measuring inverter delay D(T C ,P) = T c -α G(P) at a known temperature T C . Then we measure delay D(T,P) = T -α G(P) at an unknown temperature T we seek to know. Division of D(T, P) by D(T C , P) yields a normalized delay at T: D norm (T) D(T,P) / D(T C ,P) = (T/T C ) -α . Process dependency G(P) disappeared due to the separation of variables. D norm (T) is a function of T only, so it is reproducible across all process corners, serving as a good temperature representation. This is the foundation of our sensor operation with 1-point calibration. Simulations validate this approach. Using 0.13μm CMOS technology, we simu- lated delays of two minimum-size inverters in series, with FO-2 load. Figure 3.7.2 shows simulated D(T, P) and D norm (T) where T C = 50°C. While D(T, P) varies across process corners, D norm (T) remains almost the same across corners. A dc- supply shift during measurement, which was not included in the above analysis, can cause significant temperature errors. Techniques to reduce this error are open to further study. In contrast, ac-supply variations are averaged out and cause insignificant error, as seen later. We execute 1-point calibration and delay normalization using the circuit of Fig. 3.7.3. It contains an open-loop delay line, and a DLL that synthesizes tempera- ture-independent-delay references. This reference-DLL (R-DLL) is locked to a crystal oscillator x(t): each delay cell in the R-DLL has constant delay Δ 0 . MUX- 1 taps a node in the R-DLL delay line: if the N-th cell’s output is tapped, the delay from input x(t) to output d(t) of the R-DLL is D DLL = NΔ 0 . This is our delay refer- ence independent of temperature and process. N can be altered to produce dif- ferent reference delays. In the open-loop line, if the M-th cell’s output is tapped by MUX-2, the delay between input x(t) and output c(t) is D OL (T,P)= T -α G(P)M (eq. (2)), which varies with temperature and process. In calibration mode at temperature T C , we set N = N C to fix the reference delay at D DLL = N C Δ 0 . We then increase M (MUX-2 setting) until D OL equals D DLL at M = M C . This comparison of D OL to D DLL to find their lock at M = M C is done via the bang- bang phase detector in the middle of Fig. 3.7.3. So the entire architecture of Fig. 3.7.3 may be viewed as another DLL, which we call the measuring DLL (M-DLL; Fig. 3.7.4, top). At M = M C , we have: As N C Δ 0 and T c -α are constant, G(P)M C is constant across process corners (M C was so chosen), which is used later for delay normalization. This is the end of calibration, done in production line. MUX-2 has a hardwired setting, M C , from now on. Once 1-point calibration is complete, the sensor enters measurement mode. Temperature T is unknown, thus, D OL of the hardwired open-loop line is an unknown delay, which the M-DLL measures by varying the reference delay D DLL of the R-DLL (bottom of Fig. 3,6,4). MUX-1 setting N is varied until D DLL equals D OL at N = N m . At this point, we have: As G(P)M C was fixed in calibration mode (eq. (3)), we eliminate it in eq. (4) to obtain: N m is a function of T only. Process dependency has been removed by the 1-point calibration that fixed G(P)M C . N m is a digital output that faithfully represents T. N m corresponds to the normalized delay seen earlier. Figure 3.7.5 shows the implemented architecture. It is essentially the same as Fig. 3.7.3, but has phase interpolators [3] to produce a 7 more phases between the input and output phase of a delay cell chosen by MUX-1 or MUX-2. The inter- polators provide a fine-delay step of 20ps to map delay variations to 7b outputs to give sub-°C resolution. To attain D DLL = D OL (calibration or measurement), we first alter MUX setting (N or M) for coarse tuning, then, interpolator index k (0 to 7) for fine tuning. Figure 3.7.7. shows our sensor (0.12mm 2 ) fabricated in 0.13μm 1.2V digital CMOS. The clock x(t) is 30MHz. Five chips, which represent process variation, were tested. In the first experiment, 1-point calibration (T C = 50°C) was done only in one chip, yielding M C for the chip. M in each chip was fixed at this M C , that is, the rest of the chips were not calibrated according to their own process variation. The top left of Fig. 3.7.6 shows R-DLL’s delay index N m measured at from 0 to 100°C in steps of 10°C. Due to lack of proper calibration, N m vs. T is not consistent among chips. In the second experiment, each chip received 1- point calibration appropriate to its process variation. Measured N m vs. T is con- sistent among chips (Fig. 3.7.6, top right), validating our approach. We obtain a master curve from Fig. 3.7.6 top right via 3rd-order fitting. The res- olution estimated from the master curve is 0.66°C/LSB for 7b outputs. Chip-to- chip variations of N m (Fig. 3.7.6 top right) are converted to errors (Fig. 3.7.6 bot- tom), which fall within -1.8 to 2.3°C. The errors are largely due to the approxi- mate nature of the separation of variables of eq. (2). Each measured N m above is a number averaged over 100 data to remove jitters and supply fluctuations. The 100 data were sampled at 500kS/s for an effective conversion rate of 5kHz, taking 0.2ms to obtain an averaged N m , with 1.2mW power dissipation (0.24μJ per averaged sample). The 1-point calibration, per- formed once at production test, takes about 0.5ms. References: [1] C. Poirier, R. McGowen, C. Bostak, et al, “Power and Temperature Control on a 90nm Itanium ® -Family Processor,” ISSCC Dig. Tech. Papers, pp. 304-305, Feb. 2005. [2] P. Chen, C-C. Chen, C-C. Tsai, W-F. Lu, “A Time-to-Digital-Converter-Based CMOS Smart Temperature Sensor,” IEEE J. Solid-State Circuits, vol. 40, no. 8, pp. 1642-1648, Aug. 2005. [3] S. Sidiropoulos and M. Horowitz, “A semidigital dual delay-locked loop,” IEEE J. Solid- State Circuits, vol. 32, no. 11, pp. 1683-1692, Nov. 1997. 978-1-4244-3457-2/09/$25.00 ©2009 IEEE D = L W C L C OX 1 μ ln(3 4V th V DD ) V DD (1 V th V DD ) T c -α G(P)M C = N C Δ 0 T ⎯α G(P)M C = N m Δ 0 N m = (T/T C ) ⎯α N C (1) (2) (3) (4) (5)

Upload: others

Post on 01-Jun-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ISSCC 2009 / SESSION 3 / MICROPROCESSOR TECHNOLOGIES / 3

68 • 2009 IEEE International Solid-State Circuits Conference

ISSCC 2009 / SESSION 3 / MICROPROCESSOR TECHNOLOGIES / 3.7

3.7 Dual-DLL-Based CMOS All-Digital Temperature Sensor for Microprocessor Thermal Monitoring

Kyoungho Woo1, Scott Meninger2, Thucydides Xanthopoulos2, Ethan Crain2, Dongwan Ha1, Donhee Ham1

1Harvard University, Cambridge, MA, 2Cavium Networks, Marlborough, MA

Today’s microprocessors increasingly need on-chip temperature sensors forthermal and power management [1]. Since these sensors do not take part in themain computing activity but rather play the auxiliary, albeit important, role oftemperature monitoring, their presence in terms of area, power, and designeffort should be minimal, thus, all-digital sensors are desired. Temperature sens-ing based on temperature-dependent delays of inverters [2] could be suited formicroprocessor applications, as it lends itself to digital implementation: by usinga time-to-digital converter (TDC), an inverter delay can be compared to anabsolute delay reference and converted to a digital temperature output [2] (Fig.3.7.1). We report on an all-digital CMOS temperature sensor for microprocessorapplication, which also exploits temperature-dependent inverter delays withinthe TDC-based framework of Fig. 3.7.1. It, however, has two improvements overprior art of [2]. First, it removes the effect of process variation on inverter delaysvia calibration at one temperature point (instead of 2-point calibration of [2]),thus, reducing high volume production cost. Second, we use two fine-precisionDLLs, one to synthesize a set of temperature-independent delay references in aclosed loop, the other as a TDC to compare temperature-dependent inverterdelays to the references. The use of DLLs simplifies sensor operation and yieldsa high measurement bandwidth (5kS/s) at 7b resolution, which could enable fasttemperature tracking. This is in contrast to [2], where a counter-based cyclicTDC with an open-loop single delay-reference has a longer measurement timefor a similar resolution.

Inverter delays vary with both temperature and process. Our sensor calibratesout the process dependency with a delay measurement at one temperature. This1-point calibration is enabled by our observation that the inverter delay can beseparated into a temperature-only-dependent part and a process-only-dependentpart. The delay, D, of a CMOS inverter with equal PMOS and NMOS strengthsmay be expressed as

Temperature-dependent parameters are the mobility, µ, and threshold voltage,Vth, but Vth weakly varies with temperature, so to the first order, only µ ∝ Tα (α:constant) accounts for temperature dependence of D. To highlight this, eq. (1)can be rewritten as

D(T, P) = T-αG(P) ,

where G(P) is what is left in D after factoring out T-α, and P collectively denotesvarious process variations. G(P) varies only with process variation. T-α variesonly with temperature (α varies slowly with doping level, so it can be regardedas a constant in a given technology).

This separation of variables enables 1-point calibration. First we perform calibra-tion once by measuring inverter delay D(TC,P) = Tc

-αG(P) at a known temperatureTC. Then we measure delay D(T,P) = T-αG(P) at an unknown temperature T weseek to know. Division of D(T, P) by D(TC, P) yields a normalized delay at T:Dnorm(T) ≡ D(T,P) / D(TC,P) = (T/TC)

-α. Process dependency G(P) disappeared dueto the separation of variables. Dnorm(T) is a function of T only, so it is reproducibleacross all process corners, serving as a good temperature representation. Thisis the foundation of our sensor operation with 1-point calibration.

Simulations validate this approach. Using 0.13µm CMOS technology, we simu-lated delays of two minimum-size inverters in series, with FO-2 load. Figure 3.7.2shows simulated D(T, P) and Dnorm(T) where TC = 50°C. While D(T, P) variesacross process corners, Dnorm(T) remains almost the same across corners. A dc-supply shift during measurement, which was not included in the above analysis,can cause significant temperature errors. Techniques to reduce this error areopen to further study. In contrast, ac-supply variations are averaged out andcause insignificant error, as seen later.

We execute 1-point calibration and delay normalization using the circuit of Fig.3.7.3. It contains an open-loop delay line, and a DLL that synthesizes tempera-ture-independent-delay references. This reference-DLL (R-DLL) is locked to a

crystal oscillator x(t): each delay cell in the R-DLL has constant delay Δ0. MUX-1 taps a node in the R-DLL delay line: if the N-th cell’s output is tapped, the delayfrom input x(t) to output d(t) of the R-DLL is DDLL = NΔ0. This is our delay refer-ence independent of temperature and process. N can be altered to produce dif-ferent reference delays. In the open-loop line, if the M-th cell’s output is tappedby MUX-2, the delay between input x(t) and output c(t) is DOL(T,P) = T-αG(P)M(eq. (2)), which varies with temperature and process.

In calibration mode at temperature TC, we set N = NC to fix the reference delay atDDLL = NCΔ0. We then increase M (MUX-2 setting) until DOL equals DDLL at M = MC.This comparison of DOL to DDLL to find their lock at M = MC is done via the bang-bang phase detector in the middle of Fig. 3.7.3. So the entire architecture of Fig.3.7.3 may be viewed as another DLL, which we call the measuring DLL (M-DLL;Fig. 3.7.4, top). At M = MC, we have:

As NCΔ0 and Tc-α are constant, G(P)MC is constant across process corners (MC

was so chosen), which is used later for delay normalization. This is the end ofcalibration, done in production line. MUX-2 has a hardwired setting, MC, fromnow on.

Once 1-point calibration is complete, the sensor enters measurement mode.Temperature T is unknown, thus, DOL of the hardwired open-loop line is anunknown delay, which the M-DLL measures by varying the reference delay DDLL

of the R-DLL (bottom of Fig. 3,6,4). MUX-1 setting N is varied until DDLL equalsDOL at N = Nm. At this point, we have:

As G(P)MC was fixed in calibration mode (eq. (3)), we eliminate it in eq. (4) toobtain:

Nm is a function of T only. Process dependency has been removed by the 1-pointcalibration that fixed G(P)MC. Nm is a digital output that faithfully represents T.Nm corresponds to the normalized delay seen earlier.

Figure 3.7.5 shows the implemented architecture. It is essentially the same asFig. 3.7.3, but has phase interpolators [3] to produce a 7 more phases betweenthe input and output phase of a delay cell chosen by MUX-1 or MUX-2. The inter-polators provide a fine-delay step of 20ps to map delay variations to 7b outputsto give sub-°C resolution. To attain DDLL = DOL (calibration or measurement), wefirst alter MUX setting (N or M) for coarse tuning, then, interpolator index k (0to 7) for fine tuning.

Figure 3.7.7. shows our sensor (0.12mm2) fabricated in 0.13µm 1.2V digitalCMOS. The clock x(t) is 30MHz. Five chips, which represent process variation,were tested. In the first experiment, 1-point calibration (TC = 50°C) was doneonly in one chip, yielding MC for the chip. M in each chip was fixed at this MC,that is, the rest of the chips were not calibrated according to their own processvariation. The top left of Fig. 3.7.6 shows R-DLL’s delay index Nm measured atfrom 0 to 100°C in steps of 10°C. Due to lack of proper calibration, Nm vs. T isnot consistent among chips. In the second experiment, each chip received 1-point calibration appropriate to its process variation. Measured Nm vs. T is con-sistent among chips (Fig. 3.7.6, top right), validating our approach.

We obtain a master curve from Fig. 3.7.6 top right via 3rd-order fitting. The res-olution estimated from the master curve is 0.66°C/LSB for 7b outputs. Chip-to-chip variations of Nm (Fig. 3.7.6 top right) are converted to errors (Fig. 3.7.6 bot-tom), which fall within -1.8 to 2.3°C. The errors are largely due to the approxi-mate nature of the separation of variables of eq. (2).

Each measured Nm above is a number averaged over 100 data to remove jittersand supply fluctuations. The 100 data were sampled at 500kS/s for an effectiveconversion rate of 5kHz, taking 0.2ms to obtain an averaged Nm, with 1.2mWpower dissipation (0.24µJ per averaged sample). The 1-point calibration, per-formed once at production test, takes about 0.5ms.

References:[1] C. Poirier, R. McGowen, C. Bostak, et al, “Power and Temperature Control on a 90nmItanium®-Family Processor,” ISSCC Dig. Tech. Papers, pp. 304-305, Feb. 2005.[2] P. Chen, C-C. Chen, C-C. Tsai, W-F. Lu, “A Time-to-Digital-Converter-Based CMOS SmartTemperature Sensor,” IEEE J. Solid-State Circuits, vol. 40, no. 8, pp. 1642-1648, Aug. 2005.[3] S. Sidiropoulos and M. Horowitz, “A semidigital dual delay-locked loop,” IEEE J. Solid-State Circuits, vol. 32, no. 11, pp. 1683-1692, Nov. 1997.

978-1-4244-3457-2/09/$25.00 ©2009 IEEE

D =L

W⋅

CL

COX

⋅1μ⋅ln(3− 4Vth VDD )VDD (1−Vth VDD )

Tc-αG(P)MC = NCΔ0

T⎯αG(P)MC = NmΔ0

Nm = (T/TC)⎯α NC

(1)

(2)

(3)

(4)

(5)

Page 2: ISSCC 2009 / SESSION 3 / MICROPROCESSOR TECHNOLOGIES / 3

69DIGEST OF TECHNICAL PAPERS •

ISSCC 2009 / February 9, 2009 / 4:30 PM

Figure 3.7.1: CMOS temperature sensor based on temperature-dependentdelays of CMOS inverters.

Figure 3.7.2: Simulated absolute (top) and normalized (bottom) inverter delaysin different process corners.

Figure 3.7.3: Basic architecture of DLL-based CMOS digital temperature sensor.

Figure 3.7.5: Detailed schematic of our DLL-based CMOS all-digital tempera-ture sensor.

Figure 3.7.6: Measured Nm-vs-T after no proper calibration (top left); measuredNm-vs.-T after proper calibration (top right). Nm takes into account the phaseinterpolator changing steps. Measurement errors (bottom).

Figure 3.7.4: Calibration mode (top) and measurement mode (bottom).

3

Page 3: ISSCC 2009 / SESSION 3 / MICROPROCESSOR TECHNOLOGIES / 3

• 2009 IEEE International Solid-State Circuits Conference 978-1-4244-3457-2/09/$25.00 ©2009 IEEE

ISSCC 2009 PAPER CONTINUATIONS

Figure 3.7.7: Die micrograph. As metal fills block the circuit view, we overlaythe circuit layout.