milton
DESCRIPTION
PresentationTRANSCRIPT
![Page 1: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/1.jpg)
Do’s and Don’ts of using Videoconferencing for Remote Teaching:
A Human Factors Approach
Milton Chen, PhD
Human Computer Interaction Lab
Stanford University
Presented at the 21st NORDUnet Network Conference 8/27/2003
![Page 2: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/2.jpg)
Executive Summary
1. Don’t use video if the task doesn’t need it
2. Don’t use “non-fluent” video
3. Don’t make call setup difficult
4. Don’t use voice activated switching
5. Don’t sacrifice audio
6. Do show self view
7. Do show audience
![Page 3: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/3.jpg)
Distance education at Stanford
6000 students, $20 M in tuition each yearCan hear but not see the remote students
a 1969 classroom
a 2003 operator console
a 2003 lecture viewer
![Page 4: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/4.jpg)
Consequence of not seeing the students
Little interaction with remote students– Local students asked 3 questions per session– Remote students asked 1 question in 6 month
* based on classroom observation of 4 CS classes
![Page 5: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/5.jpg)
F2F interaction is crucial for discussions
Importance of f2f interaction
0%
50%
100%
students TAs faculty
extremely
very
moderately
somewhat
not
* 120 students, 15 TAs, and 41 faculty[Report to the School of Engineering Dean’s Office ’01]
![Page 6: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/6.jpg)
Distance learning
The consequence
student of the future teacher of the future
The visionany where, any time
The realityany where, any time except live
![Page 7: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/7.jpg)
The Stanford Video Auditorium
desktop interface
15’ x 5’ video wall
![Page 8: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/8.jpg)
The Stanford Video Auditorium
![Page 9: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/9.jpg)
AccomplishmentsIntel President Paul Otellini demonstrated vsee
during his keynote at IDF
Candidate system for International Space Station• With Bob Bradford, MSFC
Featured Internet2 project to break video wall record• Attempt to see all 200 members of Internet2 simultaneously
![Page 10: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/10.jpg)
Videoconferencing Solutions
Max video links
Max video resolution
Bandwidth at 352x288 15fps
Microsoft NetMeeting,
Yahoo Super Webcam
1 352x288 ~200 Kbps
Polycom, Tandberg, … 4 352x288 ~200 Kbps
vsee ~16 720x480* ~100 Kbps
* At 30 fps on a 3 GHz Pentium 4
![Page 11: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/11.jpg)
Instructor’s view Student’s view
Independent Students
StanfordIceland
![Page 12: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/12.jpg)
The great expectation
19271st demo by AT&T
1964PicturePhone
1991/92Mbone VICCUseeMe
1996NetMeeting
expe
ctat
ion
![Page 13: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/13.jpg)
first mobile phone, 1924 first handheld phone, 1973
1st Revolution: Possible 2nd Revolution: Ubiquitous
first videophone, 1927
![Page 14: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/14.jpg)
Tyranny of real classroomsvsee
A history of failuresHarmful effect of video
“We express ourselves into existence.” - Iris Murdoch
![Page 15: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/15.jpg)
Harmful effect of video
Time and resource sink
Make user look bad– Gaze less potent => are you ignoring me?
– Gesture less potent => am I not interesting?
– Slow response => user is slow?
– Lack of lip sync => user is not believable?
– Lack of eye contact => user is not motivated? [Reeves and Nass ’96]
![Page 16: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/16.jpg)
Eye contact stirs us to action
[Sharbat Gula, photographed by McCurry ‘83]
![Page 17: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/17.jpg)
Eye contact fires up our brain
[Kampe et al. ’01 Nature]
![Page 18: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/18.jpg)
Methodology
Observers watch videos of looker
Large display with camera at the center
![Page 19: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/19.jpg)
Eye contact?
![Page 20: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/20.jpg)
Sensitivity is asymmetric
* 16 observers judged recorded videos of 1 looker
![Page 21: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/21.jpg)
An anatomical explanation
looking at you looking sideways
looking up
looking down eye closing
Illustrations from The Artist’s Guide to Facial Expression[Faigin ’90]
![Page 22: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/22.jpg)
Tyranny of real classroomsvsee
A history of failuresHarmful effect of video
Eye contact findingLip sync finding
“We shape our tools, and there after our tools shape us”
- Marshal McLuhan
![Page 23: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/23.jpg)
Why read lips
Improves comprehension – Background noise [Sumby and Pollack ’54]– Hearing loss [Binnie, Montgomery, Jackson ’86]
[Yarbus ’67]
![Page 24: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/24.jpg)
Audio ahead of the video
Videoconferencing– 1 msec to encode 30-msec audio with TrueSpeech– Up to 250 msec to encode a 720x480 frame with
high-quality MPEG-4
Detectable skew130 msec [Dixon and Spitz ’80]
80 msec [Steinmetz ’96]
![Page 25: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/25.jpg)
Conventional lip synchronization
encodenetworkdecode
A
a v
time
Unsynchronized
encodenetworkdecodesync
a, v
Audio delay lineA
delayskew
![Page 26: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/26.jpg)
Attribute delay and skew to remote person
=> person is slow?
=> person is not believable?
[Reeves and Nass ’96]
encodenetworkdecode
A
a v
time
Unsynchronized
encodenetworkdecodesync
a, v
Audio delay lineA
delayskew
![Page 27: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/27.jpg)
A new lip sync method
encodenetworkdecodesync
synchronized and low perceived latency
a v a v
encodenetworkdecode
A
a v
time
Unsynchronized
encodenetworkdecodesync
a, v
Audio delay lineA
Round trip delay
![Page 28: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/28.jpg)
Methodology
Recorded 3 speakers– 44.1KHz x 16 bps uncompressed audio– 320x240x30fps uncompressed video– Sentences consist of easy to lipread words
Speaker 1female native
speaker
Speaker 2male native
speaker
Speaker 3male non-native
speaker
![Page 29: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/29.jpg)
Perception of variable AV skew
* 16 subjects judged recorded videos of 1 speaker
0
25
50
75
100
200,unsync 200,slow 200, fast sync
initial skew (msec) , stretch period
lip s
ynch
roni
zatio
n (%
)
![Page 30: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/30.jpg)
Tyranny of real classroomsvsee
A history of failuresHarmful effect of video
Eye contact findingLip sync findingDo’s and Don'ts
“The heart is stirred more slowly by the ear than by the eye.”
– Horace
![Page 31: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/31.jpg)
1. Don’t use video if you don’t need it
![Page 32: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/32.jpg)
Benefit of video medium
Facilitate communication process– Stimulate interactivity when group is medium size
– Support tasks that require complex collaboration• Negative feedback
• Negotiation
Build relationship– Establish identity
– Build trust
– Form friendship
![Page 33: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/33.jpg)
2. Don’t use “non-fluent” video
![Page 34: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/34.jpg)
A language fluency model for videoconferencing
Are you fluent in videoconferencing ?
Factors that make gaining fluency difficult– Disruption
• Audio quality < 8KHz x 8 bits per sample
• Video quality < 320 x 240 x 10 fps
– Loss of control• Voice-activated switching
• Room-to-room
– Expressiveness• No lip sync
• No eye contact
![Page 35: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/35.jpg)
3. Don’t make call setup difficult
![Page 36: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/36.jpg)
Videoconferencing today?
Ericsson’s mobile phone, 1901
![Page 37: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/37.jpg)
4. Don’t use voice activated switching
![Page 38: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/38.jpg)
3 sins
Activate peripheral vision
Out of sight out of mind
Artificial social hierarchy
![Page 39: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/39.jpg)
5. Don’t sacrifice audio
![Page 40: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/40.jpg)
Less tolerant of audio artifacts
Push button to talk
Half-duplex
Latency
Network loss
![Page 41: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/41.jpg)
6. Do show self view
![Page 42: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/42.jpg)
Mental model discrepancy
We think face-to-face
We see through a tube
![Page 43: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/43.jpg)
7. Do show audience
![Page 44: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/44.jpg)
Convey vs. feedback
Where do you actually look?
The typically class layout
![Page 45: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/45.jpg)
Do’s and Don’ts
1. Don’t use video if the task doesn’t need it
2. Don’t use “non-fluent” video
3. Don’t make call setup difficult
4. Don’t use voice activated switching
5. Don’t sacrifice audio
6. Do show self view
7. Do show audience
![Page 46: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/46.jpg)
A plane that does not fly is not a plane
First flight, Wrights 1903
A videophone that limits communication is not a videophone
What is a videophone
![Page 47: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/47.jpg)
SummaryTyranny of real classrooms
– Case study: not seeing => no participation
VSee
A history of failures– Poor video can be worse than no video
• Findings on eye contact and lip sync
Do’s and Don’ts
![Page 48: Milton](https://reader036.vdocuments.mx/reader036/viewer/2022062707/5585d978d8b42aa6518b48d5/html5/thumbnails/48.jpg)
Acknowledgement– Prof. Ebba Hvannberg
– Prof. Pat Hanrahan and Terry Winograd– Prof. Cliff Nass, Tom Moran, Anoop Gupta
I would love to hear from you!– http://vsee.stanford.edu
Collaborate on the Internet2 demo?