gaussian pyramid: comparative analysis of hardware

23
ยฉ ANAFOCUS 2007 Gaussian Pyramid: Comparative Analysis of Hardware Architectures F. D. V. R. Oliveira 1 , J. G. R. C. Gomes 1 , J. Fernรกndez-Berni 2 , R. Carmona-Galรกn 2 , R. del Rรญo 2 , ร. Rodrรญguez-Vรกzquez 2 1 Universidade Federal do Rio de Janeiro, Brazil 2 Instituto de Microelรฉctronica de Sevilla (IMSE โ€“ CNM) CSIC โ€“ Universidad de Sevilla, Spain Workshop on Architecture of Smart Cameras Cรณrdoba Spain

Upload: others

Post on 11-Apr-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Gaussian Pyramid: Comparative Analysis of Hardware

ยฉ ANAFOCUS

2007

Gaussian Pyramid: Comparative Analysis of

Hardware Architectures

F. D. V. R. Oliveira1, J. G. R. C. Gomes1, J. Fernรกndez-Berni2, R. Carmona-Galรกn2, R. del

Rรญo2, ร. Rodrรญguez-Vรกzquez2

1Universidade Federal do Rio de Janeiro, Brazil

2Instituto de Microelรฉctronica de Sevilla (IMSE โ€“ CNM)

CSIC โ€“ Universidad de Sevilla, Spain

Workshop on Architecture

of Smart Cameras

Cรณrdoba

Spain

Page 2: Gaussian Pyramid: Comparative Analysis of Hardware

Embedded Vision Processing Architecture

CONVENTIONAL ARCHITECTURE GPUs DSPs FPGAs โ€ฆ

โ€ข Hardware parallelization takes place after having serialized the data previously

โ€ข The fact that the imager requires the physical realization of a 2-D array of elementary cells topographically assigned to the corresponding pixel values can be exploited for early parallelization and distributed memory

Page 3: Gaussian Pyramid: Comparative Analysis of Hardware

Embedded Vision Processing Architecture

PROPOSED ARCHITECTURE

โ€ข Drastic reduction of memory accesses during low-level processing stages, where pixel-wise operations are common

โ€ข Pixel circuitry to accelerate vision algorithms. This circuitry can be implemented in the analog domain for the sake of power and area efficiency

Page 4: Gaussian Pyramid: Comparative Analysis of Hardware

Long-Term Research

Major drawbacks

Reduced fill factor

Large pixel pitch

โ†’ Limited sensitivity

โ†’ Small image size

โ†’ Spatial aliasing

Major achievements

Concept demonstration

Programmable embedded functionalities

Image-to-Decision chain at >1,000fps using 60nW per pixel (industrial chip)

Spatial Gaussian filtering @20nJ/filter

Content-aware HDR acquisition with >145dB intra-frame DR

Major challenges

Implementation of in-pixel embedded functionalities at minimum area cost

Increase hardware-software integration

Page 5: Gaussian Pyramid: Comparative Analysis of Hardware

Drawbacks and Major Challenges

CONVENTIONAL PIXEL

Photo-sensitive

area

pixel pitch ๐‘ท

Amplification & R

ead

ou

t C

ircu

itry

๐‘ท๐Ÿ

Page 6: Gaussian Pyramid: Comparative Analysis of Hardware

Drawbacks and Major Challenges

MULTI-FUNCTIONAL PIXEL

Photo-sensitive

area

pixel pitch ๐‘ท

Amplification &

Re

ado

ut

Cir

cuit

ry

keep the pixel pitch โ†’ reduce the sensitive area

Photo-sensitive

area

pixel pitch ๐‘ท

Amplification &

Re

ado

ut

Cir

cuit

ry

Processing circuitry

& m

em

ory

keep the sensitive area โ†’ reduce the image resolution

Photo-sensitive

area

pixel pitch ๐‘ท + โˆ†

Amplification &

Re

ado

ut

Cir

cuit

ry

Processing circuitry

& m

em

ory

Photo-sensitive

area

Amplification &

Re

ado

ut

Cir

cuit

ry

pixel pitch ๐‘ท

Page 7: Gaussian Pyramid: Comparative Analysis of Hardware

Drawbacks and Major Challenges

How to minimally impact on the

image quality while maximally

exploiting the advantages of focal-

plane processing

Page 8: Gaussian Pyramid: Comparative Analysis of Hardware

Fundamental Processing Primitive

[J. Campbell and V. Kazantev, โ€œUsing an Embedded Vision Processor to Build an Efficient Object Recognition System,โ€ White Paper, Synopsis, 2015]

CMOS IMPLEMENTATION GAUSSIAN FILTERING

- Basic operation in many vision pipelines

Page 9: Gaussian Pyramid: Comparative Analysis of Hardware

Fundamental Processing Primitive

CMOS IMPLEMENTATION GAUSSIAN FILTERING

Page 10: Gaussian Pyramid: Comparative Analysis of Hardware

Fundamental Processing Primitive

CMOS IMPLEMENTATION GAUSSIAN FILTERING

Original full-resolution image

Examples:

Sobel operators

Binomial kernel Original half-resolution image Pre-distorted half-resolution image

Original kernel Reduced kernel

Binomial kernel output

Page 11: Gaussian Pyramid: Comparative Analysis of Hardware

Fundamental Processing Primitive

CMOS IMPLEMENTATION GAUSSIAN FILTERING

Page 12: Gaussian Pyramid: Comparative Analysis of Hardware

Fundamental Processing Primitive

CONVENTIONAL VS. FOCAL-PLANE REALIZATION GAUSSIAN FILTERING

Page 13: Gaussian Pyramid: Comparative Analysis of Hardware

Time Analysis

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

CONVENTIONAL VS. FOCAL-PLANE REALIZATION GAUSSIAN FILTERING

Focal-plane processing time

๐‘›๐‘˜: size of the Gaussian kernel

๐‘๐ฟ๐‘’๐‘ฃ: number of pyramid levels

๐œ๐ถ๐‘…: time required to perform one charge redistribution

๐œ๐ด๐ท๐ถ: time required to perform the analog-to-digital conversion of one pixel

๐‘๐ด๐ท๐ถ: Number of ADCs

๐‘€ ร— ๐‘: Image resolution

Page 14: Gaussian Pyramid: Comparative Analysis of Hardware

Time Analysis

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

CONVENTIONAL VS. FOCAL-PLANE REALIZATION GAUSSIAN FILTERING

Digital implementation processing time

๐œ๐‘€๐‘’๐‘š: time required to access a single memory position

๐‘๐‘๐‘ข๐‘ ๐‘€๐‘’๐‘š: number of parallel accesses to memory

๐œ๐‘œ๐‘: time required to perform a single MAC operation

Page 15: Gaussian Pyramid: Comparative Analysis of Hardware

Time Analysis

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

CONVENTIONAL VS. FOCAL-PLANE REALIZATION GAUSSIAN FILTERING

Parameters of time analysis equations

Page 16: Gaussian Pyramid: Comparative Analysis of Hardware

Time Analysis

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

Page 17: Gaussian Pyramid: Comparative Analysis of Hardware

Energy Analysis

Trickier highly dependent on the architecture and technology parameters; no global parameter either (clock period in time analysis) Standard circuit blocks

MAC unit SRAM memory cell

Page 18: Gaussian Pyramid: Comparative Analysis of Hardware

Energy Analysis

Equations associated with every aspect of the hardware

Focal-plane energy analysis

๐ธ๐‘๐‘–๐‘ฅ๐ถ๐‘Ž๐‘๐‘ก๐‘ข๐‘Ÿ๐‘’ = ๐ถ๐น๐ท โˆ™ ๐‘‰2๐‘‘๐‘‘๐‘€

+ ๐ถ๐‘…๐‘ ๐‘ก โˆ™ ๐‘‰2๐‘‘๐‘‘๐‘€

+ ๐ถ๐‘‡๐‘‹ โˆ™ ๐‘‰2๐‘‘๐‘‘๐‘€

๐ธ๐‘โ„Ž๐‘”๐‘…๐‘’๐‘‘๐‘–๐‘ ๐‘ก๐‘Ÿ = (๐‘๐ฟ๐‘’๐‘ฃ โˆ’ 1)๐‘›๐‘˜ โˆ™ 2๐‘€ โˆ™ 2๐‘ โˆ™ (2๐ถ๐‘› โˆ™ ๐‘‰2๐‘‘๐‘‘๐‘€

)

.

.

.

Digital implementation energy analysis

๐ธ๐‘€๐ด๐ถ๐‘‘๐‘ฆ๐‘›๐‘Ž๐‘š๐‘–๐‘ = ๐›ผ โˆ™ ๐‘๐‘‘ โˆ™ ๐ถ๐‘› โˆ™ ๐‘‰2๐‘‘๐‘‘(๐‘๐‘œ๐‘ โˆ™ 3๐œ๐‘œ๐‘)/๐œ๐ถ๐‘™๐‘˜

๐ธ๐‘€๐ด๐ถ๐‘ ๐‘ก๐‘Ž๐‘ก๐‘–๐‘ = ๐‘๐‘‚๐‘“๐‘“ โˆ™ ๐‘‰๐‘‘๐‘‘ โˆ™ ๐ผ๐‘™๐‘’๐‘Ž๐‘˜ โˆ™ ๐œ๐ท๐‘–๐‘”๐‘–๐‘ก๐‘Ž๐‘™ . . .

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

Page 19: Gaussian Pyramid: Comparative Analysis of Hardware

Energy Analysis

Equations associated with every aspect of the hardware

A/D conversion energy analysis

Page 20: Gaussian Pyramid: Comparative Analysis of Hardware

Energy Analysis

Parameters of energy analysis equations

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

Page 21: Gaussian Pyramid: Comparative Analysis of Hardware

Energy Analysis

Parameters of energy analysis equations

[F. V. R. Oliveira et al, โ€œGaussian Pyramid: Comparative Analysis of Hardware Architectures,โ€ IEEE Transactions on Circuits and Systems I, in press, 2017]

Page 22: Gaussian Pyramid: Comparative Analysis of Hardware

Conclusions

โ€ข Hypothesis early vision stages can be accelerated at the focal plane at low energy cost by adding extra per-pixel circuitry

โ€ข Comprehensive analysis for Gaussian pyramid generation with minimum pixel area impact

โ€ข Major conclusion Potential advantages of focal-plane processing are case-specific

โ€ข A/D conversion: critical stage

โ€ข Regarding processing time, the focal-plane approach ideally requires one ADC per column to report significant advantages

โ€ข Regarding energy saving, the focal-plane approach renders best results for SAR, cyclic or ๐šบ๐šซ

Page 23: Gaussian Pyramid: Comparative Analysis of Hardware

Questions?

THANK YOU VERY MUCH FOR YOUR ATTENTION

Jorge Fernรกndez-Berni [email protected]