186 ieee transactions on computer-aided design of...

13
186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015 A New Frontier in Ultralow Power Wireless Links: Network-on-Chip and Chip-to-Chip Interconnects Soumyasanta Laha, Student Member, IEEE, Savas Kaya, Senior Member, IEEE, David W. Matolak, Senior Member, IEEE, William Rayess, Student Member, IEEE, Dominic DiTomaso, Student Member, IEEE, and Avinash Kodi, Senior Member, IEEE Abstract—This paper explores the general framework and prospects for on-chip and off-chip wireless interconnects imple- mented for high-performance computing (HPC) systems in the context of micro power wireless design. HPC intercon- nects demand very high (10 Gb/s) transmission rates using ultraefficient (1 pJ/bit) transceivers over extremely short (100 cm) ranges. In an attempt to design such wireless interconnects, first a model for the wireless communication channel properties is developed. The use of CMOS-based energy- efficient on–off keying (OOK) transceiver architectures operating in the 60–90 GHz bands is considered as a practical solution. In order to address strict performance requirements of wireless HPC interconnects, and taking advantage of the recent devel- opments in device scaling, compact low-power and innovative circuits based on novel double-gate MOSFETs (DG-MOSFETs) are proposed in the implementation of the architecture. The per- formance of a compact low-noise amplifier (LNA) design using common source (CS) inductive degeneration with 32 nm DG- MOSFETs is investigated by quantitative analysis and simulation. The proposed inductor-less two-stage cascode cascade LNA is optimized for 90 GHz operation and has the advantage of gain switching over its CMOS counterpart without the use of addi- tional switching transistors, which makes it remarkably power efficient and faster. As further examples of efficient and compact DG-MOSFET circuits for OOK transceiver design, a three-stage CS 5 dB tunable power amplifier operating up to 90 GHz, and a novel 90 GHz voltage controlled oscillator are also presented. This is followed by the proposal of an array of four monopole antennas studied using full-wave EM solver. Index Terms—90 GHz, antenna, channel modeling, CS inductive degeneration low-noise amplifier (LNA), double gate MOSFET (DG-MOSFET), gain switching, LC oscillator, power amplifier (PA), wireless network on chip. I. I NTRODUCTION O NE of the most distinguishing features of the twenty-first century integrated electronics has been the proliferation Manuscript received March 28, 2014; revised August 7, 2014; accepted November 15, 2014. Date of publication December 11, 2014; date of current version February 16, 2015. This work was supported by the NSF Award under Grant ECCS-1129010. This paper was recommended by Associate Editor J. Henkel. S. Laha, D. DiTomaso, and A. Kodi are with the School of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701 USA, (e-mail: {sl922608,kaya,dd292006,kodi}@ohio.edu). S. Kaya is with the School of Electrical Engineering and Computer Science, Ohio University, Athens, OH 45701 USA (e-mail: [email protected]). D. W. Matolak and W. Rayess are with the Department of Electrical Engineering, University of South Carolina, Columbia, SC 29208 USA, (e-mail: [email protected], [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TCAD.2014.2379640 Fig. 1. Ultrashort range high-fidelity micro-power wireless interconnects can have dramatic impacts on a number of integrated applications and HPC systems. of wireless connectivity and explosive growth of mul- tifunctional and portable systems utilizing sub-10 GHz communications. Extrapolating this trend to next decades, it is obvious that there is a compelling need for even higher band- widths and data rates, ultralow power in yet more capable systems connected in very short length scales (see Fig. 1). This is especially true for rapidly evolving integration of circuits and systems on flexible and organic substrates for wearable and implantable electronics, many-layer sensor net- works, many-core computers, data centers, and “internet of things” just to point out few. Moreover, creative and judicious use of high-speed ultrashort (< 100 cm) range wireless links in these highly integrated platforms are expected to provide not only significant reduction in complexity and size, but also improvements in efficiency, reconfigurability, fault tolerance, and overall performance [1]. In many of the existing and promising applications utiliz- ing the inherent flexibility and advantages of wireless links, the power is scarce or even absent all together so it must be scav- enged from the ambient energy sources. Thus, the ubiquitous use of wireless connectivity in embedded and distributed elec- tronics systems invariably implies massive challenges in terms of low-power design. Moreover, in most cases, notably in sen- sor networks, the link range is an equally important concern that determine overall design targets. Yet, perhaps luckily, the 0278-0070 c 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Upload: others

Post on 23-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

A New Frontier in Ultralow Power Wireless Links:Network-on-Chip and Chip-to-Chip Interconnects

Soumyasanta Laha, Student Member, IEEE, Savas Kaya, Senior Member, IEEE,David W. Matolak, Senior Member, IEEE, William Rayess, Student Member, IEEE,

Dominic DiTomaso, Student Member, IEEE, and Avinash Kodi, Senior Member, IEEE

Abstract—This paper explores the general framework andprospects for on-chip and off-chip wireless interconnects imple-mented for high-performance computing (HPC) systems inthe context of micro power wireless design. HPC intercon-nects demand very high (≥ 10 Gb/s) transmission rates usingultraefficient (∼ 1 pJ/bit) transceivers over extremely short(≤ 100 cm) ranges. In an attempt to design such wirelessinterconnects, first a model for the wireless communicationchannel properties is developed. The use of CMOS-based energy-efficient on–off keying (OOK) transceiver architectures operatingin the 60–90 GHz bands is considered as a practical solution.In order to address strict performance requirements of wirelessHPC interconnects, and taking advantage of the recent devel-opments in device scaling, compact low-power and innovativecircuits based on novel double-gate MOSFETs (DG-MOSFETs)are proposed in the implementation of the architecture. The per-formance of a compact low-noise amplifier (LNA) design usingcommon source (CS) inductive degeneration with 32 nm DG-MOSFETs is investigated by quantitative analysis and simulation.The proposed inductor-less two-stage cascode cascade LNA isoptimized for 90 GHz operation and has the advantage of gainswitching over its CMOS counterpart without the use of addi-tional switching transistors, which makes it remarkably powerefficient and faster. As further examples of efficient and compactDG-MOSFET circuits for OOK transceiver design, a three-stageCS 5 dB tunable power amplifier operating up to 90 GHz, anda novel 90 GHz voltage controlled oscillator are also presented.This is followed by the proposal of an array of four monopoleantennas studied using full-wave EM solver.

Index Terms—90 GHz, antenna, channel modeling, CSinductive degeneration low-noise amplifier (LNA), double gateMOSFET (DG-MOSFET), gain switching, LC oscillator, poweramplifier (PA), wireless network on chip.

I. INTRODUCTION

ONE of the most distinguishing features of the twenty-firstcentury integrated electronics has been the proliferation

Manuscript received March 28, 2014; revised August 7, 2014; acceptedNovember 15, 2014. Date of publication December 11, 2014; date of currentversion February 16, 2015. This work was supported by the NSF Awardunder Grant ECCS-1129010. This paper was recommended by AssociateEditor J. Henkel.

S. Laha, D. DiTomaso, and A. Kodi are with the School of ElectricalEngineering and Computer Science, Ohio University, Athens, OH 45701 USA,(e-mail: {sl922608,kaya,dd292006,kodi}@ohio.edu).

S. Kaya is with the School of Electrical Engineering and Computer Science,Ohio University, Athens, OH 45701 USA (e-mail: [email protected]).

D. W. Matolak and W. Rayess are with the Department of ElectricalEngineering, University of South Carolina, Columbia, SC 29208 USA,(e-mail: [email protected], [email protected]).

Color versions of one or more of the figures in this paper are availableonline at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TCAD.2014.2379640

Fig. 1. Ultrashort range high-fidelity micro-power wireless interconnectscan have dramatic impacts on a number of integrated applications and HPCsystems.

of wireless connectivity and explosive growth of mul-tifunctional and portable systems utilizing sub-10 GHzcommunications. Extrapolating this trend to next decades, it isobvious that there is a compelling need for even higher band-widths and data rates, ultralow power in yet more capablesystems connected in very short length scales (see Fig. 1).This is especially true for rapidly evolving integration ofcircuits and systems on flexible and organic substrates forwearable and implantable electronics, many-layer sensor net-works, many-core computers, data centers, and “internet ofthings” just to point out few. Moreover, creative and judicioususe of high-speed ultrashort (< 100 cm) range wireless linksin these highly integrated platforms are expected to providenot only significant reduction in complexity and size, but alsoimprovements in efficiency, reconfigurability, fault tolerance,and overall performance [1].

In many of the existing and promising applications utiliz-ing the inherent flexibility and advantages of wireless links, thepower is scarce or even absent all together so it must be scav-enged from the ambient energy sources. Thus, the ubiquitoususe of wireless connectivity in embedded and distributed elec-tronics systems invariably implies massive challenges in termsof low-power design. Moreover, in most cases, notably in sen-sor networks, the link range is an equally important concernthat determine overall design targets. Yet, perhaps luckily, the

0278-0070 c© 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Page 2: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 187

data rate and bit-error-rate (BER) are fairy low (< 10 kbps and< 10−3, respectively) in many such scenarios. In other words,the peak power usage and power efficiency for per unit oftransmission length (W/m) may be considered as most impor-tant performance metrics and often the quality of transmissionis not detrimental. Also true in such cases, as in bio-medicalimplants or mobile sensors, is the fact that there is not acompeting alternative or established “gold-standard” to beat.

In this paper, we focus on a new and different fron-tier, and arguably a more challenging design task: ultralowpower (< 1 mW), extremely high-performance (> 10 Gb/s),and high fidelity (BER < 10−12) wireless links for on-chipand off-chip data transfer at ultrashort (< 10 cm) ranges. Asalso indicated in Fig. 1, these parameters pushes all designtargets to beyond what is possible today. Both the antennasize and data throughput necessitate the use of extremely highfrequencies, reaching up to THz range. In this new frontier formicro-power (μpower) transceiver design, not the power effi-ciency over link range but the energy efficiency per transmittedbit and very low error rates become the essential delimiters.This is because the existing solutions are based on wired linksthat are extremely efficient and robust competitors that mustbe outdone. In other words, it is not sufficient to build a goodlink, but the designers are posed with a problem to actuallybuild a better-than-wired link.

Interconnects in multicore high-performance comput-ing (HPC) architectures are the main drivers of such seeminglyincompatible design requirements even though intraroom mul-timedia and mobile data cloud applications are also potentialbeneficiaries. In the case of HPC solutions, the need is manyfold: both on-chip (intercore) and off-chip (chip-to-chip andboard-to-board) use can be envisaged. Since HPC systemsof today are limited primarily by the interconnect perfor-mance, wireless interconnects are a potential solution thatcan provide energy efficient communication, while providinghigh bandwidth and low latency, especially between multiplecores [2]–[7] and memory banks. The other unique and spe-cific benefits of wireless interconnects for HPC systems mayinclude the following.

1) High energy efficiency for long, one-hopcommunication.

2) Reduced complexity compared waveguides or wires.3) Compatibility with CMOS wireless technology designs.This paper is intended to serve two orthogonal purposes.

On the system level, it is structured to define a generalframework and parameters that would be demanded fromμpower transceivers in this new frontier, including the con-text and requirements for HPC systems, media access chan-nel (MAC) and infrastructure design for the communicationlayer as well as the technological basis for the challengesintroduced. On the device and circuits level, it comprises anumber of hitherto unexplored and novel, ultracompact, andtunable 90 GHz low-noise amplifier (LNA), power ampli-fier (PA), and voltage controlled oscillator (VCO) circuitsbased on double-gate MOSFETs (DG-MOSFETs) as well asantenna structures believed to be applicable to building sys-tems in the introduced framework. Hence, this paper attemptsto explore the complex task of designing on-chip wireless

interconnects within realistic constraints at all levels includingthe system/architecture, communication medium, and anten-nas as well as circuits and devices that are completelynovel (LNA/VCOs) or have been completely redesigned forultralow power operation at 90 GHz.

The structure of this paper is as follows. A general descrip-tion of the wireless interconnects for HPC systems for on andoff-chip communication is introduced in Section II. This isfollowed by a description of the general context for mediumaccess and channel modeling perspectives required for thedesign of efficient, ultralow BER, and short-range μpowerOOK transceiver in Section III. The feasibility of reverber-ation chamber model for the dispersion loss of the wirelesschannel of the on chip interconnect is also analyzed with thecomputation of root mean square delay spread (RMS-DS) inthis section. Section IV deals with the DG-MOSFET deviceand subsequently Section V deals with the design of proposednovel 90 GHz DG-MOSFET LNA and VCO circuits alongwith a wideband PA circuit operating at the same frequency, asexamples to how novel nano-scale CMOS devices can impactmm-wave IC design. This paper concludes with a pilot studyin Section VI that introduces full-wave simulations of a four-monopole antenna array that can be used for both on andoff-chip wireless interconnects.

II. WIRELESS INTERCONNECTS FOR HPC

As transistors continue to scale down, the number of coresthat can be integrated on the same chip has continued toincrease. This increase in core count is driven by the highdemands of multithreaded and HPC workloads. Cores com-municate through the interconnection network which connectscores and other I/O devices both at the on-chip and off-chiplevels. Conventional electrical interconnects suffer from highlatency due to limited off-chip bandwidth and dissipate tentimes more energy to communicate than to compute a float-ing point operation [8]. Potential solution for interconnectsinclude emerging technologies such as wireless and photonicinterconnects.

The potential benefits of wireless link over wired havealready been discussed in Section I. To analyze, it can beobserved that as wire distance increases, both the resis-tance and capacitance of the electrical links also increases.Additionally, electrical links require repeaters made up of sev-eral inverters in order to propagate the electrical signal. Bothwire distance and repeaters contribute to electrical link energyper bit. Therefore, at long distances electrical wires have bothhigh latency and energy. The energy per bit for three inter-connect technologies: electrical, wireless, and photonic can beobserved from Fig. 2. The energy per bit estimates for elec-trical and photonic interconnects were calculated using theDSENT modeling tool in 45 nm technology [9]. The estimatefor the wireless energy was estimated using trends in wirelessinterconnect technology available from [10]. As shown in thefigure, wireless interconnects are assumed to have a low energyper bit of about 1 pJ/bit at up to 20 mm. Wireless interconnectshave an energy per bit advantage over electrical links pasta distance of approximately 10 mm. Photonic interconnects

Page 3: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

188 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

Fig. 2. Energy per bit for three interconnect technologies: electrical, wireless,and photonics.

have a low energy per bit of approximately 0.17 pJ/bit, how-ever, this does not include the laser energy as well as lossesdue to components such as splitters. Incorporation of these“hidden costs” would significantly erode the advantage ofphotonics in Fig. 2, making wireless solutions especially favor-able until low-power, compact, and low-cost lasers can bebuilt on Si substrates. Therefore, wireless interconnects haveboth technology and energy advantages over other interconnecttechnologies, especially beyond 10 mm of electrical length.

In addition to energy, both latency and bandwidth areimportant factors for interconnect technologies. Since wirelesssignals propagate near the speed of light, data can be sent chip-to-chip in one clock cycle after the signal has been modulatedand sent to the transmitting antenna. Electrical links, on theother hand, can take 3–4 cycles to transmit 20 mm across chipand several hundred cycles to transmit off-chip. In terms ofbandwidth, wireless interconnects are limited by the frequencyspectrum [11] which in turn limits the available bandwidthfor communication. Electrical links can achieve an increase ofbandwidth by adding additional wires or increasing the clockrate incurring either an area or energy penalty. Overall, wire-less interconnects have a limited bandwidth but a better energyefficiency and latency at long distances (> 10 mm) comparedto electrical links. Therefore, wireless interconnect technologycan be used as a supplement to electrical interconnects to pro-vide long distance communication at low energy and latency.Hybrid network-on-chip architectures, which use both wire-less and wired links, have been studied on various topologiesand shown to improve performance and energy [7], [12], [13].Fig. 3 shows a proposed wireless HPC interconnect that com-bines processing elements (PEs), memory controllers (MCs),memory, and interboard traffic managers. Wireless channelscan directly connect to other cores on different system boardsor to MCs that can directly reach DRAM chips therebyreducing both the bandwidth and energy consumption issues.

III. MAC LAYER DESIGN FOR HPCWIRELESS LINKS

A. Frequency Band Selection

To take advantage of wireless interconnects for HPCsystems, data rates must be high—on the order of 10 Gb/s

Fig. 3. Proposed wireless HPC interconnect that combines PEs, MCs,memory, and interboard traffic managers.

per link. In addition, multiple links are required, spanninga range up to 100 cm. These implies carrier frequencieswell above 10 GHz for multiple channels in a frequencydivision arrangement. In the spirit of newly developedIEEE standard 802.11ad for short range indoor commu-nications, we consider in this paper 60–100 GHz as thetarget band for the short term, CMOS compatible solutions.However, in the long-term μpower solutions in the 150–500GHz, and above 500 GHz may be needed, utilizing hybridSiGe BiCMOS and/or III–V [14]–[16]. Clearly, the higherone goes in frequency, the larger the available bandwidth andsmaller the antennas. Moreover, channel modeling can employsimplified high-frequency methods with greater accuracy. Yet,as the frequency increases, design of μpower circuits maybecome even more challenging and complex since use ofpure CMOS technology for the transceivers becomes almostimpossible. In the 150–500 GHz band, where THz CMOS isheaded [17] and the bandwidth is still sufficient, the channelmodeling becomes more complex, and small antennas lessefficient. However, inefficient near-field antennas may betolerable due to ultrashort ranges involved. Hence, untilSi-based CMOS mm-wave solutions are fully developed inthe next decade to reach 150–500 GHz operation, the onlyoption is to utilize the current CMOS mm-wave circuitrycapable of reaching up to 100 GHz operation. Therefore,100 GHz technology is the natural development platform forwireless HPC interconnects despite the inefficient antennasand complex channel modeling that requires full-wave elec-tromagnetic solutions, until THz CMOS is mature and widelyavailable.

B. Channel Path Loss and Dispersion

In terms of the channel itself, attenuation, or path loss,increases with frequency, which requires either higher trans-mission powers or directional antennas or shorter links asone increases frequency. The other significant channel effect—delay dispersion—will depend strongly on the actual on-chipand off-chip (chassis) landscape, and it is difficult to say any-thing quantitative as a function of frequency for this effect.Moreover, antenna design is most challenging at the low-est frequencies, because the severe wireless network on chip

Page 4: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 189

physical size constraints may force antennas to be electricallysmall at lower frequencies. Electrically small antennas gen-erally implies inefficiency, and with the short link distances,near-field effects can pertain, and this will create interferenceand mutual coupling among antennas.

Two potential path loss models in [7] and [18] are applica-ble to the problem at hand. As is common in terrestrial settings,these models take a log-distance form, where path loss L indB at distance d is given by

L(d) = A + 10nlog10(d/d0) + X (1)

with A the path loss in dB at a reference distance d0, d > d0the link distance, n the dimensionless propagation path lossexponent, and X a zero mean Gaussian random variable in dBwith standard deviation σx. Reference distance d0 is chosen forconvenience and within the far field of antennas; n = 3 or 4 iscommon in cellular settings when no line-of sight path exists,and n = 2 in an obstacle-free vacuum. For HPCs, d0 couldbe on the order of 1 mm or so. From plots in [7], we haveestimated n = 1.98, d0 = 1 cm, and for polyimide, A = 44.6dB, and for silicon, A = 79.6 dB. The applicable distancerange is 1–100 cm (so here, d0 can be less than d). Hence,assuming a deterministic model (ignoring X) the path loss canbe estimated using the equation above.

The dispersion in HPC interconnects can be studied usingthe reverberation chamber model as an approximation. Thisassumes the wireless links are contained within a conduct-ing box. The RMS-delay spread (DS), denoted στ , is thencomputed given some material parameters for the reverber-ation chamber in terms of several quality (or Q) factors asfollows [19]:

στ = 1(2π fSQ

) (2)

where SQ is the sum of reciprocals of the various Q-factors,taking into account the presence of any electromagneticallyabsorptive material, chamber apertures, conductive wall prop-erties, and antenna sizes. The aperture Q is the only pertinentif there are large openings in the chamber walls; hence, for our“closed-box” estimates, we need only use the wall and antennafactors [20], given by Qwall = 3V/(2μrδS), and Qantenna =16π2V/(mλ3), where V is cavity volume (d × d × h), S issurface area (2d2 + 4hd), μr = μw/μ0 is the relative wallpermeability, with μw, the actual wall permeability and m0,the permeability of free space, m is the antenna impedancemismatch factor (= 1 for a perfect impedance match), andδ is the skin depth, given by 1/

√π f μwσw with σw the wall

conductivity. For nonferric materials, we can assume μw = μ0.For an air-filled cavity, we show in Fig. 4 an example plotof the delay spread in nanoseconds versus frequency from300 GHz to 100 THz, for copper and aluminum conductors,and for d = 20 mm and several values of h. As noted in [19],the functional form of the plots are approximately στ ∼ 1/

√f .

Clearly, the parameters in this figure are relevant only for on-chip wireless links and similar plots can be generated for otherdistances and frequency ranges of interest.

For cavities filled with dielectric (lossy), the delay spreadswill be smaller, since the propagating waves get attenuated

Fig. 4. RMS-DS versus frequency for “micro” reverberation chambersrepresenting the case for on-chip wireless interconnects.

with each path from wall-to-wall within the chamber(i.e., we add Q−1

absorber > 0 to SQ). Thus, the results in Fig. 4can be viewed as upper bounds for this model. The largestRMS-DS occurs at the lowest frequency, and for the largestcavity size (h = d/5). This delay spread value is approxi-mately 5 ns, but this chamber height is likely unrealisticallylarge. If we assume that the h = d/100 case is representa-tive, the maximum RMS-DS is 0.6 ns at f = 300 GHz. Aconvention in communication systems is to declare a channelnondispersive if the RMS-DS is less than one-tenth of a sym-bol duration [21]; in general, the larger the RMS-DS, the lowerthe distortion-free data rate. The 0.6 ns delay spread impliesa minimum symbol duration of Ts = 6 ns, and this wouldthen correspond to a maximum symbol rate Rs = 1/Ts ∼= 167Msymbols/s (167 Mb/s if binary modulation).

This is an extremely low (i.e., uncompetitive) data rate forHPC systems, but note that the reverberation chamber analysisis a worst case one in terms of delay spread. As noted, thedelay spreads would be substantially smaller if a lossy dielec-tric were within the cavity instead of air, and as frequencyincreases, delay spread also decreases; for example, if we usedf = 40 THz, the maximum RMS-DS would drop by an orderof magnitude, yielding a data rate of 1.67 Gsymbol/s by ourprevious rules. Operation at such frequencies is not currentlypractical, and this value is only provided as an example basedon the very rough estimates of Fig. 4. Note also that the verylow error probabilities required of HPC interconnects (∼ 10−12

for wired links [7]) would likely require even smaller valuesof dispersion unless some higher-layer error correction couldbe applied. Sophisticated forward error correction coding anddecoding could dramatically reduce error probability, but thiswould be at the expense of throughput and latency, plus circuitarea and power, and like equalization, error correction at ratesof multiple gigabits per second is not trivial with present-daydevices.

Although the reverberation chamber analysis is clearly quitepessimistic, the degree of pessimism in the result is diffi-cult to quantify precisely without more sophisticated analysisand landscape specification. As another idealized example,if we assume d = 20 mm and h = d/10, the two-raymodel [22] yields a maximum delay spread of approximately

Page 5: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

190 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

Fig. 5. Generic architecture of the OOK transceiver proposed for the HPCwireless routers with four channels operating at 10 Gb/s centered around85 GHz.

Fig. 6. Wireless radio signal power required by different data rates (Tx/Rx)and distances (Tx only) in OOK architecture for a BER = 10−12 in freespace.

2 ps, corresponding to a maximum symbol rate on the orderof 50 Gsymbols/s, a very competitive for HPC systems. Thesetwo examples illustrate the enormous range of possible WiNoCdelay spreads, which depend on the environment. Such resultsare clearly initial ones, and more work is required to increasethe accuracy of delay spread estimates.

C. Modulation Scheme and Link Budget

The generic architecture of the wireless transceiver pro-posed for iWISE implementation [12] can be observed fromFig. 5. The transceiver uses on–off keying (OOK) since thesimplicity of this noncoherent modulation technique leads to avery low power consumption as well as ultracompact architec-ture. We target ultralow power operation (< 1 pJ/bit) as wellas ultrashort (≤ 2 cm) range, based on 100 GHz RF-CMOStechnology and on-chip antennas.

Another important consideration for the design of μpowerwireless channels for the HPC interconnects is the carefulanalysis of the signal budget. In order to determine the appro-priate signal levels and the required amplification levels, alink-budget analysis is presented in Fig. 6. According to thisfigure, which considers losses in air and a 10 dB error mar-gin, typical signal levels for a 30 Gb/s link over 1 cm is below−13 dBm. It is clear that different power levels are requiredfor different link ranges (in order to attain the maximumallowable BER). Thus, power levels and required technologyparameters to achieve them must be tuned for the specific con-ditions that pertain. According to Fig. 6, the signal power isa stronger function of the distance than the data rate, there-fore the gain tuning in the LNA/PA circuits may substantially

(a)

(b)

Fig. 7. (a) Generic DG-MOSFET device structure. (b) ID-Vfg characteristicsof an n-type independent and common mode DG (Vbg = Vfg) transistors.Inset: resulting shift in the front gate threshold.

lower overall power dissipation. However, the transmissionlosses and interference affects must be more preciselyevaluated, once antenna technology and full-3-D architecture isdetermined.

IV. DG-MOSFET DEVICES

Novel devices that are the product of the final decadeof CMOS downscaling can endow CMOS mm-wave circuitengineering with many exciting possibilities. For instance,sub-22 nm MOSFETs built on silicon on insulator (SOI)substrates with ultrathin channels and precisely engineeredsource/drain contacts have been replaced by 3-D-engineeredmultigate devices [23], [24]. Such multigate MOSFET archi-tectures can efficiently control the channel from multiplesides instead of the top-side in planar bulk MOSFETs. Theability to alter channel potential by multiple (e.g., double,triple, cylindrical) gates provides a relatively easier and robustway to control the channel electrostatics, reducing the shortchannel effects (SCEs) and leakage concerns considerably.While multigate SOI structures are well recognized for digi-tal performance [25], [26], they are also strong contenders formm-wave applications in rapidly expanding wireless commu-nications market [27].

Being simpler for 3-D structural optimization and rela-tively easier to fabricate as compared to other MOSFETstructures (MIGFET, �-MOSFET, tri-gate, and so on), theDG-MOSFET architecture, Fig. 7(a) is chosen for the sim-ulation study exploring the potential of multigate devicesfor mm-wave analog applications. Such a structure can bebuilt using a modified (two gates separated and independentlyaccessed) FinFET device architecture and is also referred toas independent-gate FinFET. Although it requires additionalrouting for the second (back) gate connection, the functionalitygained by this second independent gate bias certainly makes

Page 6: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 191

Fig. 8. Small signal equivalent of the DG-MOSFET for the CS inductivegeneration topology.

this complexity worthwhile, for a tunable architecture such asthis that can trade-off between power and gain. The simula-tions are performed with Synopsys HSPICE RF using the ASUPTM FinFET models [28]. The typical transfer characteristicsof an n-type DG-MOSFET for both common mode (two gatesare joined) and independent mode (two gates are disconnected)operation are shown in Fig. 7(b).

The advantages of DG-MOSFET as a device over conven-tional CMOS can be summarized as follows.

1) Excellent control of SCEs due to presence of two gatesand ultrathin body.

2) Lower subthreshold leakage current due to reduced SCE.3) Higher ON (drive) current due to two gates.4) Reduced gate leakage, as afford to use thicker gate oxide

due to control from two gates.The above advantages are primarily applicable to the

common (symmetric) gate operation. When applied to con-struct different circuits in this mode many other advantagescome into sight. One such is demonstrated here with theLNA design. On the other hand, the advantages entitled tothe independent mode primarily materialize through circuitapplications and are as follows.

1) Implementation of tunable circuits.2) Circuits with reduced transistor count.The DG-MOSFET LNA designed in this paper, operate in

both common mode and independent mode operation. Thecommon mode advantage has been put forward by a quanti-tative analysis of area efficiency while the independent modeoperation has the advantage of gain switching by incorporatingback gate tuning. These are described in the next section.

V. DG-MOSFET MM-WAVE CIRCUITS

A. Quantitative Analysis: DG-MOSFET CS SourceDegeneration LNA

The small signal model of CS cascode source degenerationwith DG-MOSFET is depicted in Fig. 8. Solving Kirchoff’scurrent law at node S, we get

(Vin − VS) sCgfs + (Vin − VS) sCgbs

+ gmf(Vin − VS) + gmb (Vin − VS) = VS

Z. (3)

The DG-MOSFET considered here is a symmetric device,and therefore, we can safely assume Cgfs = Cgbs = Cgs in

this analysis. Also, gmf = gmb = gm, since the DG-MOSFETis considered to operate in common mode configuration(two gates are joined). Therefore, the current law can bemodeled as

2(Vin − VS)sCgs + 2gm(Vin − VS) = VS

Z. (4)

After arrangement, we have

VS

Vin= 2sCgs + 2gm

1Z + 2sCgs + 2gm

. (5)

The input current, Iin can be written as

Iin = sCgs(Vin − VS). (6)

The input impedance (Zin = Vin/Iin) is thus

Zin = 1

sCgs

(1 − VS

Vin

) . (7)

Inserting the value for VS/Vin from (5) in the above equationand after a few steps of arithmetic, we obtain the following:

Zin = 1

sCgs+ 2Z + 2Zgm

sCgs. (8)

Replacing Z by sLS, the input impedance is rewritten as

Zin = 1

sCgs+ 2sLS + 2gmLS

Cgs. (9)

The last term of (9) is the resistive term and of our interest.The other two reactance terms resonate and cancel each other.The factor “2” in the term is absent in design with conventionalCMOS [29]. Now, the value of source degeneration inductor,LS, in the last resistive term of (9) should be chosen in amanner such that it equals or approaches 50 � impedance tomatch the impedance of the 50 � antenna. As earlier informed,the factor “2” present in (9) for DG-MOSFET, is absent in thedesign with conventional CMOS. This implies, the value of LS

can be half in DG-MOSFET when compared to design withCMOS to achieve the 50 � impedance matching. At this stage,we have ignored the role of the transit frequency.

Now it is known, the expression gm/Cgs in the resistive term,is the transit frequency (ωT ) of the device, and therefore, a con-stant for a particular device. The ωT of DG-MOSFET is almostequal to its single gate counterpart for channel lengths of 75nm and above while it is higher for channel lengths shorterthan 75 nm owing to the good gate control on the channel andreduced SCE [30]. As can be observed in [30], for channellengths lower than 50 nm, the ωT of DG-MOSFET is almostthree times to that of single gate CMOS. Therefore, the abovestatements confirm the inductance, LS, in DG-MOSFET can bea sixth to that of single gate CMOS in a CS inductive degen-eration topology to achieve the same 50 � input impedancefor antenna power matching for an LNA. We have [31]

LS ∝ N2 (10)

and from a fair approximation

A ∝ N (11)

Page 7: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

192 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

Fig. 9. Basic CS cascode source degeneration LNA in both the CMOS andDG-MOSFET technologies for area analysis.

implies

A ∝ √LS. (12)

Here, N is the number of turns of inductor and A isthe physical layout area of the inductor. From the aboveproportionality

ACMOS

ADG−MOSFET=

√LS(CMOS)

LS(DG−MOSFET)

. (13)

Considering the inductance value of CMOS to be six timesthat of DG-MOSFET from aforesaid explanations coupledwith (13)

ACMOS

ADG−MOSFET≈ 2.5. (14)

Therefore, the area occupied in the CS inductive degenera-tion topology by a CMOS is nearly 2.5 times more than thatof DG-MOSFET.

Now considering a basic cascode CS inductive degenerationLNA that consists of three inductors [29], the total inductorarea of the LNA in CMOS is 7.5AL while in DG-MOSFET it is6AL (Fig. 9). The area of inductors L1 and L2 in both the tech-nologies are considered to be same and the area of inductorLS in CMOS is taken to be 2.5AL versus AL in DG-MOSFETfollowing (14). Hence, the LNA area (based on inductor size)in DG-MOSFET design can be decreased by almost 20% whencompared to single gate CMOS. It should be noted as theinductor is the major occupier of area in mm-wave circuits,with several orders larger in magnitude than that of the activeand other passive devices, the above figures can be accepted toverify the DG-MOSFET to be a viable alternative to conven-tional CMOS for area efficient mm-wave LNA designs in thisCS inductive degeneration topology. The inductor geometryconsidered for the analyses is spiral.

B. Microstrip TL Inductors

LNA designed at mm-wave frequencies such as at 90 GHz,generally incorporates transmission lines to substitute onchip inductors or on chip transformers. This is because,although not impossible, it is quite difficult to build onchip inductors for circuits operating at mm-wave frequencies.

Fig. 10. Inductance versus microstrip aspect ratio (w/h) at l = 50 μm.Inset: resulting characteristic impedance of the microstrips TL on alternativesubstrates.

Also, at microwave frequencies, the conventional lumped com-ponents cannot work as their desired performance as aninductor or capacitor is significantly dwarfed by the undesireddevice parasitics, hence replaced by microstrip transmissionlines.

Considering the above reasons, we adopt microstrip trans-mission lines to substitute inductors in our 90 GHz mm-waveLNA design. The microstrip transmission line is a transmis-sion line geometry with a single conductor trace on one sideof a dielectric substrate and a single ground plane on theopposite side (inset: Fig. 10). These structures achieve induc-tance with the flow of current in the conductor, whereas thecapacitance is associated with the conducting strip separatedfrom the ground plane by the dielectric substrate [32]. Thedielectric substrate considered in this paper is GaAs. In thelayered structures, microstrip transmission lines and coplanarwaveguides are commonly used as the transmission lines.Although microstrip structures have lower Q-factor whencompared with the coplanar wave guides, they have less cou-pling between the adjacent lines and are easier to layout andintegrate [33].

The inductor in question can be formed by shorting themicrostrip center conductor to the lower level ground planeat one end of the transmission line. This results in an induc-tance which, for a given geometry and in a specified frequencyrange, is mostly independent of frequency [34]. Moreover,the microstrip transmission line provides an inductance whichcould be used on any type of substrate, with either low or highresistivity. Beyond the III-V technology, it is worth mention-ing that micro-strip inductors can have big impact in siliconCMOS platforms since they could utilize two or all of themetal wiring levels, allowing a wide range of inductance andquality factor design tradeoffs. An important feature in thistrade-off is the ability to utilize lower (below the inductor)metal wiring levels, as well as lower silicon and polysiliconareas for other than inductor design purposes, without affectingthe operation of the inductor. Thus, silicon RF-CMOS circuitsand its evolution to DG-MOSFETs/FinFETs are also well posi-tioned to take advantage of this technique as a way to savearea and reduce parasitics.

Page 8: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 193

Fig. 11. DG MOSFET-based LNA in common source cascode configuration.Transistors MN1 and MN2 operate in the symmetric mode while MN3 andMN4 operate in the independent mode with the back gates used for dynamictuning.

The microstrip characteristic impedance, Z, and the induc-tance, L, are computed as follows:

Z = 1

√μ0

ε0εelog

⎣Fh

w+

1 +(

2h

w

)2⎤

⎦ (15)

L = Z

2π fsin

(2π l

λ

)(16)

where, F & εe are obtained from expressions in [35]. To illus-trate the design process and serve as example, the impedanceof the microstrip transmission line is plotted for differentsubstrates along with GaAs in the inset of Fig. 10.

At 90 GHz, the free-space wavelength (λ0) is 3.3 mm.The wavelength in GaAs substrate is given by λGaAs =(λ0/

√εGaAs). Considering εGaAs to be 10, λGaAs is given

as 1 mm. As the length of the microstrip transmission lineconsidered for realizing the inductance is in the range of30 μm, which is nearly 3% of the wavelength, we can justifyl λGaAs. Hence, (2) can be rewritten as

L = Zl

λf. (17)

Considering this equation the inductance L is plotted fordifferent aspect ratio (w/h) of the microstrip transmission lineas shown in Fig. 10.

C. LNA Design and Gain Switching

The LNA implemented with DG-MOSFET, is demonstratedhere with the design of the circuit as shown in Fig. 11, con-sisting of four DG-MOSFETs in a two stage cascade commonsource inductive degeneration cascode topology. This is a mod-ified design following the LNA implemented in 90 nm CMOStechnology [36]. The common source transistors MN1 andMN2 operate in the symmetric mode while the two transistorsMN3 and MN4 are configured for independent mode operation.The transistor MN3 in common source topology is connectedto MN4 in cascode configuration. The transistor MN4 acts incommon gate configuration. The width of MN1 is taken to be1 μm while the width for transistors MN3 and MN4 are kept

higher at 2.4 μm for better input return loss and optimizedgain performance. The supply, VDD is kept constant at 1.2 V.MN3 is biased at 1 V (Vb). The back gate of the transistorsMN3 and MN4 are biased for gain switching.

The series peaking circuit consists of an inductiveload, T2, implemented with transmission lines that allowsfor low voltage operation and resonates with the interstagecapacitance, C1, enabling a higher operating frequency [29].The inductor T1, implemented in transmission lines is set toresonate with the gate source capacitance of MN1. The sourcedegeneration circuit consisting of T4 yields (in real part) wideband impedance matching to maximize the interstage powertransfer. The MTL T5 tunes out the middle pole of the cascode,thus compensating for the lower fT [37] of DG-MOSFETwhich is nearly 160 GHz at 32 nm.

The variable voltage bias introduced in the back gate of thetwo cascode transistors MN3 and MN4 implements the gainswitching operation of the DG-MOSFET LNA. The two cas-code transistors are considered for the gain switching primarilybecause the cascode here is not a part of the input transistorat the first stage as can be seen from Fig. 11. The variationof the back gate voltage of a transistor alters the gm of thattransistor. The change in gm in turn affects the input matchingof the LNA, which should ideally be unaffected for maximumpower transfer and minimized reflection losses between theantenna and the LNA. The cascode switching is also attrac-tive because it reduces the current flowing through the loadby a well defined ratio that aids in the achievement of smallgain steps, which is yet another important issue that must betaken into consideration. As observed in the next section, thesmall step size considered in the biasing of the two transis-tors also assist in attaining the small gain steps. Therefore,the back gate bias should not be applied at the input transistorat the first stage (MN1 particularly) which plays the crucialrole of impedance matching in this CS inductive degenerationLNA. Another valid reason of not applying the bias to theinput transistor is to avoid the thermal noise arising out ofthe transistor which can degrade the LNA noise figure whengm is high.

On the contrary, in LNA implemented with conventionalCMOS, the gain switching is usually achieved with the addi-tion of extra transistor(s) that alters the total transconductanceof the transistor which in turn alters the gain. The added tran-sistor that essentially acts as a switch can degrade the originalspeed of the LNA operation. Therefore, the DG-MOSFETbased LNA gain switching is advantageous over its CMOScounterpart since it does not require any additional hardwareand therefore results in no supplementary delay. Moreover, thepower consumption attributed to switching of transistors in theconventional CMOS is significantly reduced in DG-MOSFETbased design. The LNA dissipates 12.1 mW of power for aback gate bias of 0.7 V. The noise figure in the DG-MOSFETcircuit also improves with the absence of these extra switchingdevices.

D. Simulation Results

The simulation shows the 3-dB bandwidth to be 20 GHz,ranging from 80 to 100 GHz. The forward gain (S21) achieves

Page 9: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

194 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

Fig. 12. Gain (S21) of the LNA varies with Vbg. This is measured forVbg = 0.2–0.7 V.

Fig. 13. Input and output return losses of the LNA (S11 and S22) fordifferent Vbg. S11 remains the same for all back-gate biases.

a peak value of 17 dB at 90 GHz for Vbg = 0.7 V (Fig. 12).Beyond this maximum operating voltage, the gain gets sat-urated and is independent of Vbg. The peak gain reducesgradually as Vbg is reduced and drops to ∼12.5 dB forVbg = 0.2 V. The power dissipated (Pdc) by the LNA alsovaries with Vbg, reaching 12.1 mW at Vbg = 0.7 V. The reflec-tion losses (S11 and S22) obtained from the same simulationsare shown in Fig. 13. The additional gate maintains nearlythe same input matching while the gain is switched withoutany additional hardware/physical resistance. The LNA noisefigure (NF) depends upon the back gate bias, dropping to aminimum at peak gain as expected. At 90 GHz, it ranges from6.8 dB at Vbg = 0.7 V to 7.5 dB at Vbg = 0.2 V (Fig. 14).Clearly, the back gate tuning provides a convenient tool tooptimize specific device performance parameters, setting upunique trade-offs such as that between power and gain.

The proposed LNA is unconditionally stable in the oper-ating frequency range, as can be seen from the simulatedrollet stability factor, i.e., K > 1 (Fig. 15) (also, < 1from easy computation) [29]. The circuit is also simulatedfor linearity performance using a two tone frequency analy-sis near 60 GHz and the observed maximum third-order inputintercept point (IIP3) is −5.0 dBm (for 0.2 V) as shown inFig. 16. The linearity does not vary considerably because

Fig. 14. Noise figure dependence on Vbg of the LNA is evident. The NFchanges marginally in the tuning range of Vbg at 90 GHz.

Fig. 15. Value of K > 1, as one of the necessary condition indicates theunconditional stability of the circuit.

Fig. 16. Computed IIP3 is −5 dBm at Vbg = 0.2 V.

of the source degeneration and the minimum is observed at−6.2 dBm (0.7 V). These parameters are compared againstsome recent mm-wave LNA designs for different technolo-gies (Table I) which include a recent work on a variable gainLNA in 65 nm CMOS [42]. Table II lists the performancetrade-offs for different operating Vbgs among gain, NF, andPdc at 90 GHz for our design.

Page 10: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 195

TABLE IPERFORMANCE COMPARISON OF DIFFERENT MM-WAVE LNAS

TABLE IILNA PERFORMANCE TRADE-OFFS FOR DIFFERENT Vbg’S

Fig. 17. Simulated response of three-stage PA circuit with tunable gain viaDG-MOSFET. Inset: circuit topology [43].

E. DG-MOSFET PA

Following similar principles of operation and design forthe LNA, DG-MOSFETs can also be used to build a classA PA (inset: Fig. 17) [43]. The gain of this circuit can alsobe tuned via the back-gate of the DG-MOSFETs used, whichcan be used for adaptive wireless communication among themultiple cores/chips in a typical HPC architecture. With thesignal levels determined from Fig. 6 for a particular data rateand distance, one can alter the required gain for a given wire-less link for power efficiency. Fig. 17 envisages the practicalityto carry out such a dynamic gain allocation within 5 dB using32 nm DG-MOSFET devices up-to 100 GHz. As the currentDG-MOSFET transistor models are further improved, it shouldbe possible to extend the design of the PA operation into the

Fig. 18. Independent mode tunable DG-MOSFET LC oscillator.

150–500 GHz range as the transistor-scaling reaches to 14 nmlevel as foreseen by the ITRS roadmap (2011 edition). Thedetailed analysis on this circuit can be observed at [43].

F. DG-MOSFET VCO

The LC VCO is another classic example of howindependent-mode DG-MOSFET can be utilized for compactand more innovative circuits. The compact resonant LC VCOimplemented with a DG-MOSFET can be tuned via back gatevoltage bias to vary the oscillation frequency without anyrequirement of a MOS varactor. This unique aspect has beenexploited for the design of the VCO of the 90 GHz transceiver,as shown in Fig. 18.

Unlike the characteristics of the VCO implemented in singlegate CMOS with MOS varactors, the oscillation frequency inthe compact DG-MOSFET VCO decreases with increase involtage once the threshold (VT ) is reached (Fig. 19). This isbecause the inversion charge density, Qi, varies proportionallyto Cox(Vbg − VT ) in both the linear and saturation regions [44],where Cox is the oxide capacitance. Therefore, as Vbg increasesbeyond VT , Qi increases. The increase in Qi results in theincrease of the total gate capacitance of the back gate (Cbg),which in turn lowers the oscillation frequency, f . It shouldbe noted that beyond VT , Cbg only depends on Cox [44]. Theoscillation frequency beyond VT is given as

f = 1

2π√

L1(C1 + Cbg). (18)

A novel formulation can be devised for the characteristic ofthe DG-MOSFET LC VCO which can be written as follows:

f = KDGVCO(Vbg − VT) + fc. (19)

Page 11: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

196 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

Fig. 19. Frequency tuning with back gate bias for the independent modeLC oscillator. Inset: output waveform of the DG-MOSFET LC oscillatorat 90 GHz.

The slope of the characteristic, KDGVCO, is termed the sensi-tivity of the VCO and is a negative quantity as can be observedfrom the curve fitting of the oscillator characteristic in Fig. 19.In the above equation, fc is the oscillation frequency in the sub-threshold region of the back gate of the devices and is given as

fc = 1

2π√

L1C1. (20)

The voltage response of the oscillator at 90 GHz can beobserved from the inset of Fig. 19.

The DG-MOSFET can actually be extended to incorporateseveral other ultrapower ultra compact wireless componentsthat include OOK modulator, envelope detector, and RF mixer.One can refer to [45] for the details on these novel designs.More matured and definitive analysis can be attempted whenadvanced commercial processes and more refined compactmodels are available for DG-MOSFETs in the near future [46].

VI. CHANNEL CHARACTERIZATION

AND ANTENNA DESIGN

The characteristics of the chip-to-chip channel will have aprofound effect upon wireless link performance [47]. Thus,accurate models for these channels are required to spec-ify channel bandwidths, modulation/demodulation schemes,and any mitigations for channel distortion such as equaliza-tion (including transmitter “prefiltering”). One favorable aspectof these channels is that they are essentially time in-variant.Thus, one can develop a set of models that pertain to theexpected link distances among network nodes.

Another issue that must be addressed is interference. Asshown in [18] for the intracomputer-chassis environment, var-ious clock and other signals (and their harmonics) inevitably“leak” from components and are radiated from wired trans-mission lines. It is possible to characterize these signals viaexperiment and through the literature, and develop our trans-mission schemes to account for this interference. The bestapproach is to use spectral bands that are relatively free of suchsignals, when possible. In the future, spread spectrum may beused to suppress interference, but in the near term, we willuse “interference avoidance” and will investigate interferencesuppression as needed via equalization.

Fig. 20. Conceptual illustration of four-monopole antenna arrays for on-chip wireless interconnects. The main picture is the cross section of a singlemonopole, while the inset shows the 3-D arrangement.

Any wireless channel’s characteristics are intimately relatedto the antennas used at the transmitter and receiver. Hence,our channel characterization will require concurrent antennadesign and optimization. Depending upon frequency band,we may be able to use small arrays at the highest frequen-cies (several hundred GHz). The challenge there is impedancematching and ensuring that manufacturing of such minutestructures does not cause appreciable deviation from physicaldesign parameters. Accounting for pattern-deforming effectsof nearby structures is also important.

To develop models for the channels, we make use of anal-ysis to the full wave electromagnetics solvers (HFSS [48]).One initial design we have explored is an array of antennasabove the top metallization layer of the chip. In the example,here, we employ four monopoles above a ground plane that isthe top metallization layer of a chip. These antennas are fedfrom beneath, allowing an “antenna farm” separate from thetransceiver electronics that reside in the lower IC layers. Thetop metallization layer serves as a ground plane, and may becovered by a thin layer of dielectric material such as silicondioxide or polyimide. Above the dielectric layer will be air,and possibly also a casing. Fig. 20 illustrates the structure ofone of the monopoles and 3-D sketch of the entire model. Theentire chip is covered in a ceramic box (for thermal purposes),and the monopoles are near the square surface corners.

Results obtained with HFSS for this four-monopole arraystructure are shown in Fig. 21 in terms of the loss betweenantenna pairs (insertion loss, in dB) versus frequency along aside (16 mm) and a diagonal across 22.62 mm. If we declare thebandwidth to be the smallest frequency span for which the lossvaries by at most 1 dB, bandwidths ranging from 4–6 GHz areattained with this simple design. Note that the channels are notat exactly 90 GHz, but would be centered as desired by slighttuning of the antennas; channel bandwidths could also be con-siderably enlarged with the use of fixed equalizers. The groundplane size is also exaggerated for expediting the numerical com-putations, hence more realistic results with ground planes ofactual size will not be quite as good.

Page 12: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

LAHA et al.: NEW FRONTIER IN ULTRALOW POWER WIRELESS LINKS: NETWORK-ON-CHIP 197

Fig. 21. Insertion loss versus frequency for four-monopole arrangement with90 GHz design frequency along the side-to-side and diagonal directions.

Our channel models can be used to explore tradeoffs amongpotential modulation and equalization schemes, largely viasimulations. We are currently building an extensive modelingand simulation environment that contains multiple dispersivechannel models to simulate our proposed wireless transmis-sion and reception schemes and evaluate their performance.This will be done concurrently with simulation of severalduplexing/MUX and MA schemes for the frequency bands,data rates, and links of interest.

VII. CONCLUSION

The significance of μ-power wireless links to build flex-ible and efficient interconnects in HPC systems has beenemphasized and explicated in terms of system, communi-cation channel, and circuit design. The system and channelrequirements for the wireless interconnects have been under-lined. In particular, the viability of nano-scale DG-MOSFETsas an alternative to conventional CMOS devices in design-ing high-efficiency mm-wave circuits in wireless interconnectsis explored via a two-stage cascade LNA design primarilyintended for ultralow-power compact OOK transceivers. Thearea and performance improvements in the proposed two-stagecascode LNA based on 32 nm DG-MOSFETs with CS induc-tive degeneration is verified quantitatively and simulation,along with its unique capability for dynamic gain switching, afeature not found in conventional CMOS circuits without addi-tional area and delay overhead. A comparative analysis showsthat the proposed LNA should have impressive performancemetrics that exceed the bulk MOSFET and even match SiGecounterparts. The ultralow power dissipation (≤ 12.1 mW),area efficiency, high frequency operation (90 GHz), a wide3-dB bandwidth of the LNA (10 GHz) makes it ideally suitedfor the OOK receivers where the power (gain) level may beadjusted depending on the link distance via DG-MOSFETsback gates. A three-stage PA operating up to 90 GHz aswell as a varactor less VCO exploiting gain-tuning capabilityof DG-MOSFETs are also illustrated. Additionally, a four-monopole antenna array structure is also studied for on-chipinterconnects via full-wave simulations. The attenuation

(insertion loss) is slightly less than that of free-space due to theantenna gains, and the 10-dB return loss bandwidth is approxi-mately 6 GHz for these simple antennas in the idealized closedchamber conditions.

ACKNOWLEDGMENT

The authors would like to thank Ansoft Corporation for theirgenerous contribution.

REFERENCES

[1] P. Ramm, A. Klumpp, J. Weber, and M. M. V. Taklo, “3D system-on-chiptechnologies for more than Moore systems,” Microsyst. Technol., vol. 16,no. 7, pp. 1051–1055, 2010.

[2] A. Ganguly et al., “Scalable hybrid wireless network-on-chip architec-tures for multicore systems,” IEEE Trans. Comput., vol. 60, no. 10,pp. 1485–1502, Oct. 2010.

[3] P. Y. Chiang et al., “Short-range, wireless interconnect within a com-puting chassis: Design challenges,” IEEE Des. Test. Comput., vol. 27,no. 4, pp. 32–43, Jul./Aug. 2010.

[4] M. Chang et al., “CMP network-on-chip overlaid with multi-band RFinterconnect,” in Proc. IEEE Int. Symp. High Perform. Comput. Archit.,Salt Lake City, UT, USA, 2008, pp. 191–202.

[5] L. Carloni, P. Pande, and Y. Xie, “Networks-on-chip in emerging inter-connect paradigms: Advantages and challenges,” in Proc. ACM/IEEEInt. Symp. Netw. Chip (NoCs), San Diego, CA, USA, 2009, pp. 93–102.

[6] D. Halperin, S. Kandula, J. Padhye, P. Bahl, and D. Wetherall,“Augmenting data center networks with multi-gigabit wireless links,” inProc. ACM SIGCOMM 2011 Conf., Toronto, ON, Canada, pp. 38–49.

[7] S. Lee et al., “A scalable micro wireless interconnect structure forCMPs,” in Proc. Int. Conf. Mobile Comput. Netw. (Mobicom), Beijing,China, 2009, pp. 217–228.

[8] R. Stevens et al., “Architectures and technology for extremescale computing,” Dept. Energy, Off. Sci., Washington, DC, USA,Tech. Rep. PNNL-19379, Dec. 2009.

[9] C. Chun et al., “DSENT—A tool connecting emerging photonics withelectronics for opto-electronic networks-on-chip modeling,” in Proc.IEEE Int. Symp. Netw. Chip (NOCS), Copenhagen, Denmark, 2012,pp. 201–210.

[10] S. Kaya et al., “On ultra-short wireless interconnects for NoCs andSoCs: Bridging the ‘THz Gap,”’ in Proc. IEEE 56th Int. MidwestSymp. Circuits Syst. (MWSCAS), Columbus, OH, USA, Aug. 2013,pp. 804–808.

[11] D. Matolak et al., “Wireless networks-on-chips: Architecture, wirelesschannel, and devices,” IEEE Wireless Commun. Mag., vol. 19, no. 5,pp. 58–65, Oct. 2012.

[12] D. DiTomaso, A. Kodi, S. Kaya, and D. Matolak, “iWISE: Inter-routerwireless scalable express channels for network-on-chips (NoCs) archi-tecture,” in Proc. IEEE Symp. High-Perform. Interconnects, Santa Clara,CA, USA, 2011, pp. 11–18.

[13] S. Deb, A. Ganguly, P. P. Pande, B. Belzer, and D. Heo, “WirelessNoC as interconnection backbone for multicore chips: Promises andchallenges,” IEEE J. Emerg. Sel. Topic Circuits Syst., vol. 2, no. 2,pp. 228–239, Jun. 2012.

[14] S. Voinigescu et al., “Characterization and modelling of a SiGe HBTtechnology for transceiver applications in the 100-300 GHz range,”IEEE Trans. Microw. Theory Techn., vol. 60, no. 12, pp. 4024–4034,Dec. 2012.

[15] A. Balteanu, S. Shopov, and S. P. Voinigescu, “A 2× 44gb/s 110-GHz wireless transmitter with direct amplitude and phase modulation in45-nm SOI CMOS,” in Proc. IEEE Comp. Semicond. Integr. CircuitSymp. (CSICS), Monterey, CA, USA, Oct. 2013, pp. 1–4.

[16] L. Samoska, “An overview of solid-state integrated circuit amplifiersin the submillimeter-wave and THz regime,” IEEE Trans. THz Sci.Technol., vol. 1, no. 1, pp. 9–24, Sep. 2011.

[17] T. Kleine-Ostmann and T. Nagatsuma, “A review on terahertz com-munications research,” J. Infrared Millim. THz. Waves, vol. 32, no. 2,pp. 143–171, 2011.

[18] Y. P. Zhang, Z. M. Chen, and M. Sun, “Propagation mechanisms of radiowaves over intra-chip channels with integrated antennas: Frequency-domain measurements and time-domain analysis,” IEEE Trans. AntennasPropag., vol. 55, no. 10, pp. 2900–2906, Oct. 2007.

Page 13: 186 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF ...oucsace.cs.ohio.edu/~avinashk/papers/tcad2015.pdf · latency due to limited off-chip bandwidth and dissipate ten times more energy

198 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 2, FEBRUARY 2015

[19] E. Genender et al., “Simulating the multipath channel with a reverbera-tion chamber: Application to bit error rate measurements,” IEEE Trans.Electromagn. Compat., vol. 52, no. 4, pp. 766–777, Nov. 2010.

[20] D. A. Hill et al., “Aperture excitation of electrically large, lossy cavi-ties,” IEEE Trans. Electromagn. Compat., vol. 36, no. 3, pp. 169–178,Aug. 1994.

[21] G. L. Stuber, Principles of Mobile Communication. Dordrecht,The Netherlands: Kluwer, 2001.

[22] J. D. Parsons, The Mobile Radio Propagation Channel. Hoboken, NJ,USA: Wiley, 2000.

[23] T. Skotnicki, J. A. Hutchby, T.-J. King, H.-S. Wong, and F. Boeuf,“The end of CMOS scaling: Toward the introduction of new mate-rials and structural changes to improve MOSFET performance,”IEEE Circuits Devices Mag., vol. 21, no. 1, pp. 16–26, Jan./Feb. 2005.

[24] I. Ferain, C. A. Colinge, and J.-P. Colinge, “Multigate transistors as thefuture of classical metal-oxide-semiconductor field-effect transistors,”Nature, vol. 479, pp. 310–316, Nov. 2011.

[25] J.-J. Kim and K. Roy, “Double gate-MOSFET subthreshold circuit forultralow power applications,” IEEE Trans. Electron Devices, vol. 51,no. 9, pp. 1468–1474, Sep. 2004.

[26] A. Amara and O. Rozeau, Planar Double-Gate Transistor. Dordrecht,The Netherlands: Springer, 2009.

[27] A. Lazaro and B. Iniguez, “RF and noise performance of double gateand single gate SOI,” Solid State Electron., vol. 50, no. 5, pp. 826–842,2006.

[28] W. Zhao and Y. Cao, “Predictive technology model for nano-CMOSdesign exploration,” ACM J. Emerg. Technol. Comput. Syst., vol. 3, no. 1,pp. 1–17, 2007.

[29] B. Razavi, RF Microelectronics. Boston, MA, USA: Prentice Hall, 2012.[30] J.-P. Raskin, T. M. Chung, V. Kilchytska, D. Lederer, and D. Flandre,

“Analog/RF performance of multiple gate SOI devices: Wideband sim-ulations and characterization,” IEEE Trans. Electron Devices, vol. 53,no. 5, pp. 1088–1095, May 2006.

[31] D. S. Gardner et al., “Review of on-chip inductor structures withmagnetic films,” IEEE Trans. Magn., vol. 45, no. 10, pp. 4760–4766,Oct. 2009.

[32] F. Gardiol, Microstrip Circuits. Hoboken, NJ, USA: Wiley, 1994.[33] Y. Sun, J. Borngraber, F. Herzel, and W. Winkler, “A fully inte-

grated 60 GHz LNA in SiGe: C BiCMOS technology,” in Proc. IEEEBipolar/BiCMOS Circuits Technol. Meeting, Santa Barbara, CA, USA,2005, pp. 14–17.

[34] W. Abadeer, R. A. Groves, and P. Hansen, “Inductor for integratedcircuits,” U.S. Patent 6 714 113 B1, Mar. 2004.

[35] E. Hammerstad and O. Jensen, “Accurate models for microstripcomputer aided design,” in Proc. IEEE MTT-S Int. Microw. Symp.,Washington, DC, USA, 1980, pp. 407–409.

[36] T. Yao, M. Gordon, K. Yau, M. Yang, and S. P. Voinigescu, “60-GHzPA and LNA in 90-nm RF-CMOS,” in Proc. IEEE Radio Freq. Integr.Circuits (RFIC) Symp., San Francisco, CA, USA, 2006, pp. 4–7.

[37] W.-S. Kim, X. Li, and M. Ismail, “A 2.4 GHz CMOS low noise amplifierusing an inter-stage matching inductor,” in Proc. IEEE Midwest Symp.Circuits Syst., Las Cruces, NM, USA, 1999, pp. 1040–1043.

[38] L. Samoska et al., “Cryogenic MMIC low noise amplifiers for W-bandand beyond,” in Proc. Int. Symp. Space THz Technol., Tucson, AZ, USA,Apr. 2011, pp. 1–4.

[39] A. Leuther et al., “Metamorphic HEMT technology for low noise appli-cations,” in Proc. InP Rel. Mater. Conf., Newport Beach, CA, USA,May 2009, pp. 188–191.

[40] B. Floyd et al., “SiGe bipolar transceiver circuits operating at 60 GHz,”IEEE J. Solid-State Circuits, vol. 40, no. 1, pp. 156–167, Jan. 2005.

[41] S. Pellarano, Y. Palaskas, and K. Soumyanath, “A 64 GHz LNA with15.5 dB gain and 6.5 dB NF in 90 nm CMOS,” IEEE J. Solid-StateCircuits, vol. 43, no. 7, pp. 1542–1552, Jul. 2008.

[42] N. Li et al., “A 24 dB gain 51-68 GHz CMOS low noise amplifierusing asymmetric-layout transistors,” in Proc. Eur. Solid-State CircuitsConf. (ESSCIRC), Seville, Spain, 2010, pp. 342–345.

[43] S. Laha, S. Kaya, A. Kodi, and D. Matolak, “60 GHz OOK transmitterin 32 nm DG FinFET technology,” in Proc. IEEE Int. Conf. WirelessCircuits Syst. (ICWITS), Maui, HI, USA, 2012, pp. 1–4.

[44] Y. Taur and T. Ning, Fundamentals of Modern VLSI Devices. Cambridge,U.K.: Cambridge Univ. Press, 1998.

[45] S. Laha and S. Kaya, Analog Circuits. Rijeka, Croatia: InTech,Dec. 2012, ch. Radio Frequency IC Design with nanoscaleDG-MOSFETSs.

[46] R. Merritt. (Oct. 16, 2012). “TSMC taps ARM’s V8 on road to 16-nmFinFET,” EE Times. [Online]. Available: http://www.Eetimes.com/electronicsnews/4398727/TSMC-taps-ARM-V8-in-road-to-16-nm-FinFET

[47] D. W. Matolak, A. Kodi, and S. Kaya, “Channel modeling for wirelessnetworks-on-chips,” IEEE Commun. Mag., vol. 51, no. 6, pp. 180–186,Jun. 2013.

[48] (Dec. 2013). Ansys Corp. [Online]. Available: http://ansoft.com/

Soumyasanta Laha (S’09) received the B.Sc. degree in physics from theUniversity of Calcutta, Kolkata, India, the M.Sc. degrees in applied physicswith concentration in microelectronics and embedded digital systems withdistinction from the Sikkim Manipal Institute of Technology, Gangtok, India,and the University of Sussex, Sussex, U.K., in 2001, 2003, and 2007, respec-tively, and the Ph.D. degree in electrical engineering from the Russ Collegeof Engineering and Technology, Ohio University, Athens, OH, USA, in 2014.

His current research interests include nanoscale energy-efficient RFtransceivers. He has published over 20 papers in the above areas and has overthree years of industry experience in analog electronics, embedded systems,and RFIC design in India and U.K.

Mr. Laha was the recipient of the Stocker Research Fellowship Award byOhio University.

Savas Kaya (SM’01) received the M.Phil. degree from the University ofCambridge, Cambridge, U.K., and the Ph.D. degree from the Imperial Collegeof Science, Technology, and Medicine, London, U.K., in 1994 and 1998,respectively.

He was a Post-Doctoral Researcher at the University of Glasgow, Glasgow,U.K., from 1998 to 2001, where he researched on transport and scalingin Si/SiGe MOSFETs and fluctuation phenomena in decanano MOSFETs.He is currently with the Russ College of Engineering, Ohio University,Athens, OH, USA. His current research interests include transport theory,device modeling and process integration, nanofabrication, nanostructures, andnanobiosensors.

David W. Matolak (SM’88) received the B.S. degree from Pennsylvania StateUniversity, University Park, State College, PA, USA, the M.S. degree fromthe University of Massachusetts, Amherst, MA, USA, and the Ph.D. degreefrom the University of Virginia, Charlottesville, VA, USA, all in electricalengineering.

He was with the Rural Electrification Administration, Washington,DC, USA, the UMass LAMMDA Laboratory, Amherhst, AT&T BellLaboratories, North Andover, MA, USA, the University of Virginia’sCommunication Systems Laboratory, Charlottesville, Lockheed MartinTactical Communication Systems, Salt Lake City, UT, USA, theMITRE Corporation, McLean, VA, USA, and Lockheed Martin GlobalTelecommunications, Reston, VA, USA, for over 20 years on communicationsystems. From 1999 to 2012, he was with the School of Electrical Engineeringand Computer Science, Ohio University, Athens, OH, USA, and since 2012, hehas been with the Department of Electrical Engineering, University of SouthCarolina, Columbia, SC, USA. His current research interests include commu-nication over fading channels, radio channel modeling, and ad hoc networking.

William Rayess (S’12) received the B.E. degree in computer and commu-nications engineering from Notre Dame University, Zouk Mosbeh, Lebanon,and the MCTP degree from Ohio University, Athens, OH, USA, in 2008 and2009, respectively. He is currently pursuing the Ph.D. degree in electricalengineering from the University of South Carolina, Columbia, SC, USA.

Dominic DiTomaso (S’06) received the B.S. and M.S. degrees in electricalengineering and computer science from Ohio University, Athens, OH, USA, in2010 and 2012, respectively, where he is currently pursuing the Ph.D. degreefrom the Department of Electrical Engineering and Computer Science.

His current research interests include wireless interconnects,network-on-chips, and computer architecture.

Avinash Kodi (SM’06) received the M.S. and Ph.D. degrees in electricaland computer engineering from the University of Arizona, Tucson, AZ, USA,in 2003 and 2006, respectively.

He is currently an Associate Professor with the Department of ElectricalEngineering and Computer Science, Ohio University, Athens, OH, USA.His current research interests include computer architecture, opticalinterconnects, chip multiprocessors, and network-on-chips.

Prof. Kodi was the recipient of the National Science Foundation CAREERAward in 2011.