3d s y la p is d - download.e-bookshelf.de · introduction to microdisplays david armitage, ian...

SID S e r i e s i n D i s p l a y Te c h n o l o g y

Ernst Lueder

3D Displays

Ernst Lueder University of Stuttgart, Germany & Independent Consultant, USA

This book addresses electrical engineers, physicists, designers of flat panel displays (FDPs), students and also scientists from other disciplines interested in understanding the various 3D technologies. A timely guide is provided to the present status of development in 3D display technologies, ready to be commercialized as well as to future technologies.

Having presented the physiology of 3D perception, the book progresses to a detailed discussion of the five 3D technologies: stereoscopic and autostereoscopic displays; integral imaging; holography and volumetric displays, and:

Introduces spatial and temporal multiplex for the two views needed for stereoscopic and autostereoscopic displays;

Outlines dominant components such as retarders for stereoscopic displays, and fixed as well as adjustable lenticular lenses and parallax barriers for autostereoscopic displays;

Examines the high speed required for 240 Hz frames provided by parallel addressing and the recently proposed interleaved image processing;

Explains integral imaging, a true 3D system, based on the known lenticulars which is explored up to the level of a 3D video projector using real and virtual images;

Renders holographic 3D easier to understand by using phasors known from electrical engineering and optics leading up to digital computer generated holograms;

Shows volumetric displays to be limited by the number of stacked FPDs; and,

Presents algorithms stemming from computer science to assess 3D image quality and to allow for bandwidth saving transmission of 3D TV signals.

Cover design: Cylinder

PPC FINAL AW16mm

3D Displays

SID S e r i e s i n D i s p l a y Te c h n o l o g y

3D

Displays

Lueder

SID

Series Editor: Anthony C. Lowe, The Lambent Consultancy, Braishfield, UK

The Society for Information Display (SID) is an international society which has the aim ofencouraging the development of all aspects of the field of information display. Complementaryto the aims of the society, the Wiley–SID series is intended to explain the latest developments

in information display technology at a professional level. The broad scope of the seriesaddresses all facets of information displays from technical aspects through systems and

prototypes to standards and ergonomics.

RED bOx RuLES ARE FOR PROOF STAgE ONLy. DELETE bEFORE FINAL PRINTINg.

3D Displays

Wiley–SID Series in Display Technology

Series Editor:

Anthony C. Lowe

Display Systems: Design and Applications

Lindsay W. MacDonald and Anthony C. Lowe (Eds.)

Electronic Display Measurement: Concepts, Techniques, and Instrumentation

Peter A. Keller

Reflective Liquid Crystal Displays

Shin-Tson Wu and Deng-Ke Yang

Colour Engineering: Achieving Device Independent Colour

Phil Green and Lindsay MacDonald (Eds.)

Display Interfaces: Fundamentals and Standards

Robert L. Myers

Digital Image Display: Algorithms and Implementation

Gheorghe Berbecel

Flexible Flat Panel Displays

Gregory Crawford (Ed.)

Polarization Engineering for LCD Projection

Michael G. Robinson, Jianmin Chen, and Gary D. Sharp

Fundamentals of Liquid Crystal Devices

Deng-Ke Yang and Shin-Tson Wu

Introduction to Microdisplays

David Armitage, Ian Underwood, and Shin-Tson Wu

Mobile Displays: Technology and Applications

Achintya K. Bhowmik, Zili Li, and Philip Bos (Eds.)

Photoalignment of Liquid Crystalline Materials: Physics and Applications

Vladimir G. Chigrinov, Vladimir M. Kozenkov, and Hoi-Sing Kwok

Projection Displays, Second Edition

Matthew S. Brennesholtz and Edward H. Stupp

Introduction to Flat Panel Displays

Jiun-Haw Lee, David N. Liu, and Shin-Tson Wu

LCD Backlights

Shunsuke Kobayashi, Shigeo Mikoshiba, and Sungkyoo Lim (Eds.)

Liquid Crystal Displays: Addressing Schemes and Electro-Optical Effects, Second Edition

Ernst Lueder

Transflective Liquid Crystal Displays

Zhibing Ge and Shin-Tson Wu

Liquid Crystal Displays: Fundamental Physics and Technology

Robert H. Chen

3D Displays

Ernst Lueder

3D Displays

Ernst LuederUniversity of Stuttgart, Germany & Independent Consultant, USA

This edition first published 2012

� 2012, John Wiley & Sons, Ltd

Registered office

John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom

For details of our global editorial offices, for customer services and for information about how to apply for

permission to reuse the copyright material in this book please see our website at www.wiley.com.

The right of the author to be identified as the author of this work has been asserted in accordance with the

Copyright, Designs and Patents Act 1988.

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted,

in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted

by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.

Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be

available in electronic books.

Designations used by companies to distinguish their products are often claimed as trademarks. All brand

names and product names used in this book are trade names, service marks, trademarks or registered trademarks

of their respective owners. The publisher is not associated with any product or vendor mentioned in this book.

This publication is designed to provide accurate and authoritative information in regard to the subject matter

covered. It is sold on the understanding that the publisher is not engaged in rendering professional services.

If professional advice or other expert assistance is required, the services of a competent professional

should be sought.

Library of Congress Cataloguing-in-Publication Data

Lueder, Ernst, 1932-

3D displays / Ernst Lueder.

p. cm.

Includes bibliographical references and index.

ISBN 978-1-119-99151-9 (cloth)

1. Three-dimensional display systems. I. Title. II. Title: Three D

displays.

TK7882.I6L84 2012

621.39087–dc232011032490

A catalogue record for this book is available from the British Library.

Print ISBN: 978-1-119-99151-9

ePDF ISBN: 978-1-119-96275-5

oBook ISBN: 978-1-119-96276-2

ePub ISBN: 978-1-119-96304-2

Mobi ISBN: 978-1-119-96305-9

Set in 10/12pt Times by Thomson Digital, Noida, India

http://www.wiley.com

To Helen

whose skills in computers and language were very helpful

Contents

Preface xi

Series Preface xiii

Introduction xv

1 The Physiology of 3D Perception 11.1 Binocular Viewing or Human Stereopsis 1

1.2 The Mismatch of Accommodation and Disparity and the Depths of Focus and of Field 3

1.3 Distance Scaling of Disparity 6

1.4 Interocular Crosstalk 7

1.5 Psychological Effects for Depth Perception 10

1.6 High-Level Cognitive Factor 10

Acknowledgments 11

References 11

2 Stereoscopic Displays 132.1 Stereoscopic Displays with Area Multiplexing 13

2.1.1 Retarders for the generation of polarizations 13

2.1.2 Wire grid polarizers for processing of the second view 20

2.1.3 Stereoscopic display with two LCDs 22

2.2 Combined Area and Time Division Multiplex for 3D Displays 26

2.3 Stereoscopic Time Sequential Displays 31

2.3.1 Time sequential viewing with an active retarder 31

2.3.2 Fast time sequential 3D displays by the use of OCB LCDs 33

2.3.3 Time sequential 3D displays with black insertions 33

2.4 Special Solutions for Stereoscopic Displays 41

2.5 Stereoscopic Projectors 48

2.6 Interleaved, Simultaneous, and Progressive Addressing of AMOLEDs and AMLCDs 60

2.7 Photo-Induced Alignment for Retarders and Beam Splitters 68

Acknowledgments 68

References 69

3 Autostereoscopic Displays 733.1 Spatially Multiplexed Multiview Autostereoscopic Displays with Lenticular Lenses 73

3.2 Spatially Multiplexed Multiview Autostereoscopic Displays with Switchable

Lenticular Lenses 85

3.3 Autostereoscopic Displays with Fixed and Switchable Parallax Barriers 95

3.4 Time Sequential Autostereoscopic Displays and Directional Backlights 104

3.4.1 Time sequential displays with special mirrors or 3D films 105

3.4.2 Time sequential displays with directionally switched backlights 109

3.5 Depth-Fused 3D Displays 115

3.6 Single and Multiview 3D Displays with a Light Guide 125

3.7 Test of 3D Displays and Medical Applications 129

Acknowledgments 129

References 130

4 Assessment of Quality of 3D Displays 1334.1 Introduction and Overview 133

4.2 Retrieving Quality Data from Given Images 135

4.3 Algorithms Based on Objective Measures Providing Disparity or Depth Maps 136

4.3.1 The algorithm based on the sum of absolute differences 136

4.3.2 Smoothness and edge detection in images 140

4.4 An Algorithm Based on Subjective Measures 146

4.5 The Kanade–Lucas–Toman (KLT) Feature Tracking Algorithm 153

4.6 Special Approaches for 2D to 3D Conversion 158

4.6.1 Conversion of 2D to 3D images based on motion parallax 159

4.6.2 Conversion from 2D to 3D based on depth cues in still pictures 161

4.6.3 Conversion from 2D to 3D based on gray shade and luminance setting 162

4.7 Reconstruction of 3D Images from Disparity Maps Pertaining to Monoscopic

2D or 3D Originals 165

4.7.1 Preprocessing of the depth map 165

4.7.2 Warping of the image creating the left and the right eye views 167

4.7.3 Disocclusions and hole-filling 172

4.7.4 Special systems for depth image-based rendering (DIBR) 176

Acknowledgments 182

References 183

5 Integral Imaging 1855.1 The Basis of Integral Imaging 186

5.2 Enhancement of Depth, Viewing Angle, and Resolution of 3D Integral Images 188

5.2.1 Enhancement of depth 189

5.2.2 Enlargement of viewing angle 193

5.2.3 Enhancing resolution 195

5.3 Integral Videography 196

5.4 Convertible 2D/3D Integral Imaging 207

Acknowledgments 214

References 214

6 Holography for 3D Displays 2176.1 Introduction and Overview 217

6.2 Recording a Hologram and Reconstruction of the Original 3D Image 218

6.3 A Holographic Screen 227

viii CONTENTS

6.4 Digital Holography Based on the Fourier Transform 229

6.5 A Holographic Laser Projector 232

Acknowledgments 235

References 235

7 Volumetric 3D Displays 2377.1 The Nature of Volumetric Displays 237

7.2 Accessing and Activating Voxels in Static Volumetric Displays 238

7.3 Swept Volume or Mechanical 3D Displays 245

Acknowledgments 252

References 252

8 A Shot at the Assessment of 3D Technologies 253

Index 257

CONTENTS ix

Preface

Flat panel display technology andmanufacture have now reached the level ofmaturity required to introduce

3D displays to the marketplace. The book covers five approaches to realize 3D perception, namely

stereoscopic and autostereoscopic displays, integral imaging, holography and volumetric displays.

I owe thanks to Dr. Tony Lowe who with his thorough understanding of scientific trends very much

supported the book on 3D technologies. I very much profited from Dan Schott’s excellent knowledge

about flat panel display technologies and I am very grateful for that. Based on his profound evaluation of

new display technologies, Dr. Christof Zeile drew my attention to various new publications. I very much

appreciate his support.

I would also like to express my appreciation of the excellent work performed by the typesetters.

The competent contribution to the index by Neil Manley is gratefully acknowledged.

As in earlier books, I am greatly indebted to Heidi Schuehle for diligently and observantly typing the

manuscript and to Rene Troeger for the professional and accomplished drawing of the figures.

Ernst Lueder

Scottsdale, USA, October 2011

Series Preface

Professor Lueder wrote his first book “Liquid Crystal Displays” for the Wiley-SID Series in Display

Technology in the year 2000. That bookwent on to become the best seller in the entire series and is now in

its second edition. I am therefore delighted to be writing a foreword to Ernst Lueder’s newest work, this

time on the topical subject of 3D Displays.

Most sighted human beings have a perception of what 3D means. We are familiar with what we see

around us, that we perceive some objects to be nearer than others, that distant objects traversing our field of

view appear to move more slowly than and are obscured by those nearer to us, and so on. A smaller but

growing fraction of the population is familiarwith 3Dmovies and television.However, amajoritywill have

only a vague understanding of how our brains operate on visual stimuli to create our familiar three-

dimensional view of the world. When it comes to creating 3D images on displays, further levels of

complexity are required not only to avoid eye strain by displaying inconsistent or misleading visual cues,

but to process prodigiously large quantities of data at sufficient speeds to enable real-time 3D visualisation.

This book sets out to present its subject in amannerwhich places it on a soundmathematical basis.After an

overview of the physiology of 3D perception, there follow detailed descriptions of stereoscopic and

autostereoscopic displays which are, after all, the most developed of 3D display technologies. Much

attention is given to the synthesis of 3D from 2D content, a most important topic, given the quantity of 2D

content already available. Quality issues are addressed next, with particular emphasis onmethods to improve

thevisual quality of 3D imagery and to reduce the bandwidth required to transmit it, with special emphasis on

a method known as depth image-based rendering. The book then describes three types of displays (integral

imaging, holography and volumetric displays) which, although less developed than stereoscopic and

autostereoscopic displays, are able to present real three-dimensional images in which the view changes -

with nearer objects obscuring more distant ones - as the viewer changes position. This is in contrast to

providing a mere illusion of three-dimensionality, as is the case with many stereoscopic images.

The book concludes with a chapter aptly named “AShot at the Assessment of 3DTechnologies” This is

not somuch a guess at what is coming next, but rather a logical in futuro extension of the technologies and

methods already described and, to my reading, a credible one.

This is a complete book, full of the necessary equations, with many illustrations and repletewith references.

The subject matter, whilst complex, is very clearly presented and will provide readers with a sound technical

basis from which to develop their skills further into the exciting field of three-dimensional display science.

Anthony Lowe

Braishfield, UK, 2011

Introduction

The design and manufacture of displays are now mature enough to introduce three-dimensional (3D)

displays into themarketplace. This happened first with displays for mobile devices in the form of near-to-

the-eye displays, but home TV will follow suit.

This book covers five approaches to realize 3D perception, namely, stereoscopic and autostereoscopic

displays, integral imaging, holography, and volumetric displays.

The intention guiding the book is to promote a well-founded understanding of the electro-optic effects

of 3D systems and of the addressing circuits. Equations are as a rule not simply stated but are derived, or, if

not fully done so, at least hints for the derivation are given.An example of this concept is the explanation of

the basics of holography by phasors, which will be outlined, but which are also known from electrical

engineering or from the Jones vector. This renders complex facts associated with holograms easier

to understand.

Emphasis is placed on stereoscopic and autostereoscopic displays as they are closest to being

commercialized. The basic components of stereoscopic displays are patterned retarders and to a lesser

degree wire grid polarizers. Autostereoscopic displays rely on beam splitters, lenticular lenses, parallax

barriers, light guides and various types of 3D films. All of these elements are explained in detail.

The glasses required for stereoscopic displays distinguish between the left and the right eyeviews either

by shutters or by circular polarization. Linearly polarized glasses have the disadvantage of being sensitive

to tilting of the head.

Special attention is given to 3D systems working in a spatial or temporal multiplex, as well as in a

combination of the two, and to novel fast addressing schemes. In order to suppress crosstalk and blur, a

240 Hz frame rate is preferred. The increased speed of addressing is handled by parallel processing and by

the recently published interleaved addressing, which also parallels the images. Special care is taken to

outline how the autostereoscopic approach is able to provide side views, the perspectives, of the object.

This paves the way for an understanding of integral images (IIs) with a pickup stage for information

similar to the lenticular lenses of the autostereoscopic displays. Very naturally this leads to the ingenious

design of an II projector working with real and virtual images where the viewer can walk around the

displayed object, thus enjoying a first solution for a true 3D display.

The chapter on holography leads the reader on to digital computer-generated holography, which is not

yet a real-time process.

Volumetric displays consist of a stack of LCDs, each of which is devoted to a particular depth, where

also the limitations of the fusion of the images become noticeable.

Notably, Chapter 4 is devoted to familiarizing designers of flat panel displays with the work done by

computer scientists on the assessment and improvement of 3D image quality. Algorithms are introduced

for evaluating the properties of 3D displays based on objective and subjective criteria and on tracking the

motion of selected special features. Special attention is drawn to establishing disparity maps and

preparing a 3D image ready for transmission with a bandwidth-saving “depth image - based rendering”

(DIBR). Head tracking for 3D reception by a group of single viewers is not included.

xvi INTRODUCTION

1

The Physiology of 3DPerception

1.1 Binocular Viewing or Human Stereopsis

As one eye is capable only of perceiving a planar image, 3D viewing is commonly achieved by the

cooperation of both eyes in providing each eyewith a view of the object. The images that the eyes receive

from the same object are different according to the different locations of the eyes. This binocular viewing

provides the perception of depth, the third dimension, as further explained by the horopter circle in

Figure 1.1. This circle serves as a reference from which the depth is determined [1,2]. If the eyes are

focusing, for which the synonyms fixating, accommodating, or converging are also used, on point M on

the horopter circle, the ciliarymuscles of the eyes rotate the eyeballs into such a position that the light from

Mpasses the pupils parallel to the axes of the lenses in the eyes. The axes intersect atM. Then the light hits

the retina in Figure 1.1 at the foveasml for the left eye andmr for the right eye. The foveas are in the center

of the retina and exhibit the highest density of light receptors. The rotation of the eyes is called the

vergence. Obviously the axes of the eyes are no longer parallel, which will provide the depth information

required by the brain [1,3]. In this situation light from point P hits the retinas at the points pl for the left eye

and pr for the right eye. The anglesa at the periphery of the circle are, as is known fromgeometry, the same

for all points P on the circle above the distance b between the pupils. As a consequence, also all the angles gfor points on the horopter circle are equal [4]. The angle g at the retina, measured as a rule in arcmin, is

called the disparity or the parallax. As all the pointsM and P on the horopter circle have the same disparity

g in both eyes, the difference d in the disparities of all points on this circle is zero. The further P is away

from M, but still on the horopter circle, the larger is the disparity [2,3]. Obviously the larger disparity is

associated with a smaller depth. The disparity information is transferred to the brain, which translates it

into a perceived depth. How the brain fuses the two disparities into a 3D image is not yet fully understood.

As all points on the horopter circle exhibit a zero difference in disparities, the circle serves as a reference

for the depth. The fusion of the disparities and the depth perception as described works only in Panum’s

fusional area in Figure 1.1 [3]. In this area, reliable depth perception decreases monotonically with

3D Displays, First Edition. Ernst Lueder.� 2012 John Wiley & Sons, Ltd. Published 2012 by John Wiley & Sons, Ltd.

increasing magnitude of the disparity. This relationship is called the patent stereopsis. For a point Q in

Figure 1.1 [3] not on the horopter circle but closer to the eyes and still in Panum’s area, the disparities on

the retina are given by the points ql for the left eye and qr for the right eye with the disparities g1 and g2.These points lie across the fovea on the other side of the retina and exhibit a so-called crossed disparity,

while the points farther away than the horopter have an uncrossed disparity. Their image points

corresponding to qr and ql for crossed disparities lie on the opposite side of the fovea.

For point Q the disparities g1 and g2 are no longer equal. The value g1� g2 6¼ 0 together with the

disparities themselves provide information to the brain on how much the depth of Q is different from

the depth on the horopter. However, how the brain copes with this difference of disparities is again not

fully known.

When moving an object from the horopter closer to the eye, the patent stereopsis is finally lost at a

distance of around 2mor less from the eyes. Fusion of the imagesmay no longer work and double images,

called diplopia, appear [3]. Due to overlarge disparities, the eyes perceive the object they are trying to

accommodate and its background separately. The brain unsuccessfully tries to suppress the background

information. On the other hand, the further away from the horopter the object is, the smaller is the

disparity, because the axes of the lenses become closer to being parallel. Finally, at distances beyond about

10m the differences between the small disparities can no longer be resolved and the depth information is

lost. This coincides with our inability to estimate the difference in depth of objects that are too far away.

The average distance b of the pupils in Figure 1.1 of adults in the USA is 6.5 cm, and for 90% of these

adults it lies between 6 and 7 cm [5]. The total range of disparity is about 80 arcmin for the perception of

spatial frequencies from 2 to 20 cycles per degree and about 8 arcdegrees for low spatial frequencies

around0.1 cycles per degree [3]. Thismeans that for low spatial frequencies larger disparities are available

than for larger spatial frequencies. As a consequence, the sensitivity of disparities for low spatial

frequencies is larger than for larger spatial frequencies. The same facts apply also for lower and larger

temporal frequencies of the luminance in an image.

The smallest still recognizable disparity, the stereoacuity Dmin, is 20 arcsec in the spatial frequency

range of about 2–20 cycles per degree,while themaximumperceivable disparityDmax is 40 arcmin for low

spatial frequencies [3]. As the values forDmin andDmax apply to both the crossed and uncrossed disparities

standing for different ranges of depths, the values can be added to a total of 80 arcmin for high and

8 arcdegrees for low spatial frequencies, as already given above [6,7]. Again this is also true for temporal

Figure 1.1 Horopter circle.

2 THE PHYSIOLOGY OF 3D PERCEPTION

frequencies in dynamic imageswith a larger sensitivity of disparities for lower temporal frequencies and a

lower sensitivity for larger temporal frequencies of luminance.

There are two visual pathways from the retina to the brain. The parvocellular-dominated dorsal–cor-

tical path connects the central retina to the ventral–cortical areas in the visual cortexwhere spatial patterns

and color are analyzed. The magno-cellular-dominated dorsal–cortical path leads from the central and

peripheral retina to dorsal–cortical areas in the visual cortex, where optical flow information for heading

control and biological motion are investigated. Further information on these paths can be found in [8–10].

The stereoanomalies are associated with defects in these paths of information where there are neurons

sensitive to only crossed or uncrossed disparities. The perception of depth is thought to involve responses

from both types of neurons. In stereoanomalous individuals, one type of these neurons fails to be sensitive

to their information. Then the other type of neurons dominates the response to all disparity information. In

the case where neurons are only sensitive to uncrossed disparities belonging to objects located further

away than the horopter circle, the information from crossed disparities stemming from objects closer to

the eye than the horopter is suppressed in favor of objects far away. The individual perceives the close-up

information as far away informationwith a far away depth.When the neurons are only sensitive to crossed

disparities, the individual perceives the far away information with a depth close to the eye [11,12].

Individuals who are stereoblind, as a rule resulting from a disease called strabismus, are assumed to be

entirely lacking in disparity-sensitive neurons.

Under degraded stimulus conditions such as brief stimulus exposure, stereoanomalies are found in 30%

of the population [13]. In addition, 6–8%of the population are stereoblind. The relatively large percentage

of people incapable of perceiving a 3D image would merit more attention.

Another physiological disturbance is binocular rivalry. In this case an individual views a stereo

display with a very large disparity or with interocular misalignment or distortion such that no fusion of

the two eyes’ image takes place [7,14]. One eye inhibits the visual activities of the other eye. One view

may be visible, as the other eye’s view is suppressed, which reverses over time. This is a problem

which may be experienced with headworn displays, where two images from different sources may be

misaligned or distorted [15].

Two physiological stimuli of depth can be detected by one eye alone. These are disparity and motion

parallax. Under this parallax the shift of a moving object toward a still background is understood. The eye

together with the brain extracts from this parallax a 3D perception with an associated depth.

Similar tomotion parallax is Pulfrich’s phenomenon [16].One eye is coveredwith a filter which darkens

the image. The processingof the dark image is delayed in relation to theprocessingof the bright image. This

leads to disparity errorswhen theviewermoves relative to an object. However, it can also be used to provide

a depth cue, as the delay renders the two eyes’ images differently as usually caused by depth.

1.2 The Mismatch of Accommodation and Disparityand the Depths of Focus and of Field

Nowwe are ready to consider a phenomenon explicablewith known stereoptic facts. Aswe shall see later,

in stereoscopic and autostereoscopic displays the two required views of an object are presented next to

each other on the screen of a display. The distance to the eyes of the viewer is constant for all scenes

displayed. That is the cause of a problem, as the eyes accommodate to the two images with a vergence

associated with the disparity. The disparity stimulates a depth perception in the brain. On the other hand,

the accommodation of points on the screen also conveys depth information, which is the constant distance

to the screen. The two depth details are contradictory, and are called themismatch of accommodation and

vergence or disparity. Thismay cause discomfort for viewers, manifested by eyestrain, blurred vision, or a

slight headache [7]. Fortunately the problems stemming from this mismatch are experienced mainly for

short viewing distances of around 0.5m. A quick and obvious explanation is the already mentioned fact

that for larger distances the disparities become smaller and are crowded together on the retina, so the

THE MISMATCH OF ACCOMMODATION AND DISPARITY 3

resolution of depth associated with disparity is diminished. Therefore the depth information based on

disparity no longer changes much with increasing distances and is more easily matched with the depth

information based on accommodation. In practice itwas found that a viewing distance of 2mormore from

a TV screen no longer leads to annoying discomfort [7].

A more thorough explanation is derived from the depth of focus and the depth of field, which is also

important for the design of a 3D system formoving viewers [17].We assume that the eyes have focused on

an object at point C in Figure 1.2, providing a sharp image. The depth of focus describes the range of

distance from a point P nearer to the eye thanC to a point D further away thanC inwhich an object can still

be detected by applying a given criterion for detection. If the distance of point P is p and that of D is d then

the depth of focus T in diopters is

T ¼ 1

p� 1

dð1:1Þ

where p and d are expressed in m. The depth of field is

F ¼ d � p ð1:2Þalso in m.

Diopters are defined by 1/f, where f is the focal length of a lens in m; in our case the lens is the eyewith

that f where the eyes experience a sharp image.

Possible criteria for the detectability of features in a display are:

(a) the deterioration of visual acuity or of resolving power;

(b) the discrimination of least perceptible blurring of the image;

(c) the loss of visibility or detectability of target details through loss of contrast; and

(d) the perceptual tolerance to out-of-focus blur which results in a stimulus for a change in

accommodation.

The first three criteria depend on the perception of out-of-tolerance blur, while the last one depends on

physiological tolerance. Point P is called the proximal blurring point, while D is the distal blurring point.

Below P and beyond D the image is no longer accepted.

The results reported now are based on criterion (a) and the out-of-focus blur in criterion (d) [17].

A checkerboard test pattern is used and test persons provide the percentage of correct answers in detecting

the correct pattern. The test pattern had a size of 1.25 arcmin corresponding to a Snellen notation of 20/25.

The diameter of the pupils was 4.6mm. The test result is shown in Figure 1.3. The abscissa represents the

displacement of the test pattern from the fixation point C measured in diopters. Hence the abscissa

indicates in diopters the degree to which the test pattern is out of focus. The ordinate represents the

percentage of the correct visual resolution perceived for the test pattern. This percentage exhibits a

Gaussian probability density.

Figure 1.2 Depth of focus and depth of field.


The midpoint of the depth of focus is always slightly nearer to the eye than the focus point C.

For a 50% correct visual resolution, the depth of focus has awidth of 0.66 diopters, whereas for 99% the

width shrinks to 0.38 diopters. This shrinking is about 0.06 diopters for an increase in the visual

resolution of 10% of the proximal blurring. The depth of focus at the 99% level is an important one for the

out-of-focus blur at which the visual resolution begins to deteriorate.

The diagram in Figure 1.3 depends upon the location of the fixation point C. This is evident from

Table 1.1withmeasured distances for the fixation point C inm, the distances p of the proximal and d of the

distal blur also in m, as well as the resulting depth of focus T in diopters. Only if T were constant for all

points C would the diagram be independent of the location of C. The fixation point C for the diagram in

Figure 1.3 is about 1m from the eye. The depth of field, d� p, in m increases with increasing distance to

the fixation point C; it can even become infinite.

Further results in [17] relate to the influence of luminance, pupil diameter, and size of object in arcmin

on the depth of focus. The larger the luminance, the smaller the diameter of the pupil. At 0.03 cd/m2 the

diameter is 6mm, at 30 cd/m2 it is 3mm, and at 300 cd/m2 only 2mm.A linear decrease in the diameter of

the pupil is associated with a logarithmic increase in luminance. For a 1mm decrease of this diameter the

depth of focus increases by 0.12 diopters.

For an increase in the object by 0.25 arcmin the depth of focus increases by 0.35 diopters. At a size of

2 arcmin the depth of focus reaches 2 diopters.

Figure1.3 Percentage of correct resolutionperceived versus displacement of the test pattern from the fixation pointC

in Figure 1.2.

Table 1.1 Dependence of proximal and distal blur as well as depth of focus T on location of C

Distance of fixation

point C in m

Distance of proximal

blur p in m

Distance of distal

blur d in m

Depth of focus

T in diopters

1 0.75 1.5 1333� 0.666 ¼ 0.667

2 1 5 1� 0.2 ¼ 0.8

3 1.5 1 0.666

THE MISMATCH OF ACCOMMODATION AND DISPARITY 5

The results in Figure 1.3 are very important for those 3D displays where the viewer only has a sharp

picture at a given distance from the screen. Figure 1.3 reveals howmuch the viewer has tomove backward

and forward while still perceiving an acceptable image.

Newer values for the depth of field depending on the distance of the fixation point C are given in

Table 1.2 [18]. Obviously the depth of field increases strongly with increasing distance of the fixation

point. So fixation or accommodation on a given point is no longer so important for larger distances. As a

consequence for themismatch of accommodation and disparity, accommodation plays aminor rolewhich

also alleviates discomfort. This is no longer true for a fixation point at 0.5m or closer, meaning that

discomfort certainly is a problem for near-to-the-eye displays. For regular 3D displays a viewing distance

of at least 2m should sufficiently minimize discomfort, as already stated above.

In view of this result, discomfort when viewing 3D movies from larger distances should not occur as a

rule. This, however, is not the case, because there is a different effect identified as the cause of discomfort,

as discussed in Section 1.6.

Stereoscopic and autostereoscopic displays provide only an illusion of 3D perception. This is among

other effects due to the difficulty stemming from themismatch of accommodation and disparity, resulting

in a conflict of depth perception. Contrary to this, integral imaging, holography, and volumetric displays,

which will be treated later, do not exhibit this mismatch. There, the viewer, when moving, has the

impression of walking around the 3D object, thus experiencing true 3D. On the other hand the viewer

would always see the same image in the case of stereoscopic solutions.

1.3 Distance Scaling of Disparity

In stereopsis there are two definitions of perceived distance or depth. The egocentric view refers to the

conventional distance D between an observer and an object and is usually measured in m. On the other

hand, relative depth is based on the depth interval between a viewer and the reference point on the horopter

circle and ismeasured in radians of the disparity g on the retina in Figure 1.1. The disparity information g isconnected to D by a strongly nonlinear relation stemming from the geometry shown in Figure 1.1. This

relation has to be differently approximated or recalibrated or, in other words, scaled for different regions

of distance D [19,20].

For obtaining a veridical or true value, egocentric distance information D together with the relative

depth g are needed by the brain. It is assumed that the brain combines binocular disparity gwith egocentricdistance cues for the process of disparity scaling.

For a large distance D in real-world scenery, the magnitude of the disparity g varies, as we have seenintuitively from the geometry in Figure 1.1, approximately with the inverse of D2. It was found that g isalso proportional to the interpupillary distance b. This leads to the equation

g ¼ b d0

D2ð1:3Þ

in which d0, with the dimensions of cm arcmin, is an experimentally determined proportionality factor,

called the depth interval and sometimes also the predicted depth [21]; d0 is different for each D and is

approximated by a constant in an interval around D.

Table 1.2 Newer values for those in Table 1.1

Distance of fixation point C Distance of low end

of depth of field

Distance of high end

of depth of field

0.5 0.4 0.67

1 0.67 z

2 1 1


In stereoscopic displays the disparity g depends approximately on the separation S between the two

images required for 3D perception and inversely on the viewing distance D. Thus

g ¼ S=D: ð1:4Þ

For Equation 1.4, a symmetrical convergence and targets on the retina close to the midsaggital plane

symmetrically dividing the body are required.

For a fixed value of S a stereoscopic display will provide the depth interval d0 as

d0 ¼ DS

b� S: ð1:5Þ

In the denominator the þ sign applies for crossed disparities and the� sign for uncrossed disparities.

In [22] it was found that this determination of d0 is very robust toward unequal luminances in the two

views. If the luminance exceeds 0.63 cd/m2 a 60% difference in the interocular luminance does not harm

the perception of the depth interval d0. However, for greater interocular luminance differences the

perceived depth may be far away from the correct value. Another luminance-related effect is the

discomfort created by interocular luminance differences of more than 25% [22]. Finally, interocular

differences in contrast of up to 83% did not affect depth perception, while the threshold for discomfort in

this case was between 25% and 50%.

1.4 Interocular Crosstalk

Information leaking from the view in one eye into that of the other eye is known as crosstalk, which as a

rule severely damages the quality of the perceived image but can also affect the fusion of the two images.

At no crosstalk the fusion is limited by 27 arcmin for crossed disparity and by 24 arcmin for uncrossed

disparity. For a 200ms stimulus, crosstalk has only a small effect on fusion,which is no longer true for a 2 s

stimulus [23]. In this case, 2–7% crosstalk can already hamper fusion and can cause discomfort [24].

Autostereoscopic displays may apply spatial multiplexing of the two views, for which an array of

lenticular lenses or parallax barriers is used. Lenticular lenses exhibit chromatic aberrations, while

barriers produce diffraction by which image content can leak into the wrong eye. The remedy is to limit

aberration and diffraction at least for a given position of the viewer.

For stereoscopic and autostereoscopic displays with temporal multiplexing, crosstalk occurs due to the

persistence of a display, inwhich the image content of one eye’s view is still visible in the next framewhen

that eye is exposed to a new view. This is shown in Figure 1.4. Temporal multiplexing can also induce

flicker seen in the visual periphery. This disrupts vision in large field-of-view immersive displays. The

cause is that these displays stimulate the magno-cellular-dominated dorsal–cortical area, which draws

connections from the peripheral retina, and above all have a transient response and high temporal acuity,

perceived as flicker. A remedy is a high frame rate enabling the visual system to integrate the intermittent

information in the periphery [6].

A further, very strong source of crosstalk is blurring of the edges of a moving image. Blur occurs in all

displays where the luminance of the image is held constant during the entire frame time. This occurs in

liquid crystal displays (LCDs) and in organic light-emitting diode displays (OLED displays). A relatively

brief description of this important phenomenon is given here, while a more detailed one can be found on

pages 298–300 of [25].

Blur is explained in Figure 1.5a, where a black stripe at rest on an LCD screen can be seen, while

Figure 1.5b shows the stripe moving to the right. The edges of the stripe in Figure 1.5a are perfectly sharp

but are blurred by themovement in Figure 1.5b. Themain cause is that an image on anLCD is held constant

during the frame time Tf, which for a frame frequency of f ¼ 60Hz is given by T ¼ 1/f ¼ 16.66ms.

INTEROCULAR CROSSTALK 7

This holding property does not occur in plasma display panels (PDPs) and in cathode ray tubes (CRTs) and

hence they are free of blur.

Now we consider what happens to the pixels in the column representing the left edge in Figure 1.5a.

Assuming black stands for fully white and white for fully black, then the luminance in the column has to

decay from fully white to fully black during the movement. This decay is gradual, as shown in Figure 1.6.

The reason is the delayed rotation of the liquid crystal molecules in response to an electric field applied in

the first frame in Figure 1.6. The display still provides full luminance at the beginning of the

frame time. One frame time later at time Tf the luminance is held at the value it had decayed to at

timeTf, as again indicated one frame time later in Figure 1.6. This stepwise decaying luminance continues

in Figure 1.6 until fully black is reached. The stepwise decay leads to the blurred left edge in Figure 1.5b

andwith the same explanation also for the right edge in Figure 1.5b. The duration of the decay is called the

blurred edge width (BEW). This duration can also be measured in the number of pixels that the first

Figure 1.4 Crosstalk due to persistence of luminance in an LCD.

Figure 1.5 (a) A stationary image and (b) the blurred edge of a moving image on an LCD.

Figure 1.6 Decay of luminance of a display and stepwise approximation representing the holding property of an

LCD.


column has to travel to the right until a decay to black is reached. This number is n pixels per frame time

which providesBEW � n: ð1:6Þ

The stripe in Figure 1.5a becomeswider by the blur on each side. A severe consequence for 3D displays

is that the gap between the two views required for 3D is filled with blur, which may even extend into the

two original images. Thus blur represents crosstalk in both eyes.

Diminishing thewidth n of blur is mandatory for shrinking and even avoiding crosstalk in 3D displays.

In pursuing this goal we have to understand how blur depends on the frame frequency f and the frame time

Tf. This understanding is provided by the physiological law for the perception of a moving picture by the

human eye. It states among other things that the eyeballs track themovement of an object perfectly; this is

called the smooth pursuit along the track of the movement.

We apply this rule to Figure 1.7a with the coordinates t, with the frame times Tf, 2Tf, etc., and the

locationm, where themovement n per time Tf to the right of a black bar (white areas) is indicated. The eye

tracking of the movement is carried out in the direction of the arrows along the slanted line in Figure 1.7a.

We assume that after time 3Tf, the maximum luminance Lm is reached.

This generates the luminance V(x) over the location x in the diagram in Figure 1.7a, starting with a

luminanceof zeroatx ¼ 0andLmatx ¼ n. Thediagramrepresents the traceof the luminanceon the retina.

In order to determine the influence of the frame time, we plot in Figure 1.7b the same diagram but with

half the frame time Tf/2 and the same speed n of movement, resulting in an advance to the right by n/2

within Tf/2. The pertinent construction of V(x) reveals that the maximum luminance is reached after just

Figure 1.7 Speed of luminous response (a) for 60Hz and (b) for 120Hz frame frequencies.

INTEROCULAR CROSSTALK 9

half themovement at x ¼ n/2. As n represents thewidth of the blur, this blur is halved and hence crosstalkshrinks to half the width by doubling the frame frequency.

This result has enormous significance for reducing crosstalk by blur in 3D displays. As a rule for 3D

displays, a frame frequency of 240Hz is used for reducing crosstalk by a factor of four in comparison to a

60Hz frame. In this case crosstalk is virtually invisible, but the addressing circuits have to work at four

times the speed.

1.5 Psychological Effects for Depth Perception

The physiological depth perception discussed so far is complemented by a few monocular psychological

effects and experiences which are also encountered in 2D displays. They are learned while growing up.

With these effects, every 2D picture can induce a 3D impression.

The size of an image on the retina, the real size of which is known, indicates the depth by its actual

smaller perceived size. The hiding of an object by closer objects can also create a 3D impression. This

perception of so far hidden objects is called dynamic disclosure, while the occlusion of objects is called

dynamic occlusion. Both effects convey the sensation of depth. A 3D perspective is also created by two

straight lines which intersect at a vanishing point. This was discovered by medieval painters and is now

used extensively. Objects becoming bluish and misty with increasing distance also induce the perception

of depth. Further, one instinctively assumes that the illumination comes from above, so the length and

direction of the shadow also help to perceive depth. Finally, motion parallax, described in the next section,

is another strong trigger for depth. Further depth cues to be detailed later are the luminance, contrast, and

sharpness of an object.

1.6 High-Level Cognitive Factor

Immersive stereo displays such as 3Dmovies and 3D TV create real-world scenes by presenting a variety

of cues to depth and distance. These cues include binocular disparity, focusing on depth by accommoda-

tion, motion parallax, linear perspective, and texture perspective. For ease of viewing, all these cognitive

factors are supposed to provide the samemagnitude of depth, otherwise the viewer experiences high-level

cue conflict – high level because reasoning is involved, as we shall see below. Cue conflict induces

discomfort, as viewers may encounter in watching 3D movies [7].

This can be illustrated by a stereo scene of an American football game [26]. Binocular disparity

information may provide a depth perception of a few inches, while linear and texture perceptive

information could convey depths of several tens of yards consistent with the football field. A viewer

exposed to this conflict over a longer time will complain of discomfort. Psychologists explain this by

assuming that human reasoning is based on two modes of operation: activity in an analytical and in an

intuitive system [26]. The analytical system permits conscious rule-based reasoning, while the intuitive

system is based on situational pattern recognition. The latter approach uses information derived from

immersive stereo displays consisting of the perception of simultaneous redundant traits such as the

psychological features mentioned in Section 1.5. That way, immersive stereo displays stimulate

the intuitive system. However, there are exceptions, such as the perception of motion parallax, which

provides a stereo impression in the intuitive system that, as a rule, does not exist in immersive

stereo displays.

For a deeper understanding of this statementwe have to look at themotion parallax shown in Figure 1.8.

If a viewer is moving to the right with eyes fixed on the stationary point F, then the stationary objects

behind F are perceived as moving in the same direction to the right as the viewer, while those in front of F

are perceived as moving to the left. This motion parallax is part of the intuitive experience of the viewer

and provides the locomotive viewer with depth information relative to F.


Now we consider single view immersive stereoscopic and autostereoscopic displays in which the 3D

impression can only be perceived from a single given position of the viewer. This implies that there is no

full motion parallax, because the entire scene is placed on the plane of the display screen. The eyes focus

on point F on that plane and all objects are perceived to be further away than F, that is, in an areawhere the

objects seem tomove in the same direction as theviewer.Objects closer than Fand theirmovement against

the viewer’s movement do not exist [27].

The fact that there is no full motion parallax in single view stereoscopic and autostereoscopic displays

contributes to confusion in the viewer’s brain, causing discomfort. This discomfort disappears in

multiview stereoscopic and autostereoscopic displays where the regular motion parallax can be detected

from a few specific viewing positions.

Obviously the discomfort originating from immersive stereoscopic and autostereoscopic displays is not

caused by the mismatch of accommodation and disparity which, as we know, plays virtually no role in

viewing from larger distances associated with immersive displays.

Acknowledgments

The author gratefully acknowledges permission to reproduce a figure granted by the institution named

below. The source of the figure is also listed below together with its corresponding number in this book.

Optical Society of America (OSA)

Journal of the Optical Society of America, vol. 49, no. 2, March 1959, p. 276, figure 2 Reproduced

as Figure 1.3

References

1. Wheatstone, C. (1838) Contributions to the physiology of vision. Philos. Trans. R. Soc. A, 128, 371.

2. Okoshi, T. (1976) Three Dimensional Imaging Techniques, Academic Press, New York.

3. Patterson, R. (2009) Human factors of stereoscopic displays. SID 09, p. 805

4. Bader, G. (1999) Elektrooptische Signalverarbeitung zur Darstellung autostereoskopischer Bewegtbilder und zur

Strukturerkennung. PhD thesis. University of Stuttgart.

5. Ferwerda, J.G. (1990) The World of 3D: 3D Book Productions, Borger.

6. Patterson, R. (2007) Human factors of 3D displays. J. SID, 151(11),861.

7. Patterson, R. (2009) Human factors of stereodisplays: an update. J. SID, 17(112),987.

8. Livingstone, M.G. and Hubel, O.H. (1988) Segregation of form, color movement and depth: anatomy, physiology

and perception. Science, 240, 740.

9. Schiller, P.H. et al. (1990) Role of the color opponent and broad band channels in vision. Visual Neurosci., 5, 321.

Figure 1.8 Motion parallax.

REFERENCES 11

3d s y la p is d - download.e-bookshelf.de · introduction to microdisplays david armitage, ian...

Documents