eye contact between pedestrians and driversb. eye contact eye contact has long been seen as a...

6
Eye Contact Between Pedestrians and Drivers Dina AlAdawy * Michael Glazer * Jack Terwilliger * Henri Schmidt * Josh Domeyer Bruce Mehler * Bryan Reimer * Lex Fridman * Center for Transportation and Logistics, Massachusetts Institute of Technology Toyota Collaborative Safety Research Center Abstract—When asked, a majority of people believe that, as pedestrians, they make eye contact with the driver of an ap- proaching vehicle when making their crossing decisions. This work presents evidence that this widely held belief is false. We do so by showing that, in majority of cases where conflict is possible, pedestrians begin crossing long before they are able to see the driver through the windshield. In other words, we are able to circumvent the very difficult question of whether pedestrians choose to make eye contact with drivers, by showing that whether they think they do or not, they can’t. Specifically, we show that over 90% of people in representative lighting conditions cannot determine the gaze of the driver at 15m and see the driver at all at 30m. This means that, for example, that given the common city speed limit of 25mph, more than 99% of pedestrians would have begun crossing before being able to see either the driver or the driver’s gaze. In other words, from the perspective of the pedestrian, in most situations involving an approaching vehicle, the crossing decision is made by the pedestrian solely based on the kinematics of the vehicle without needing to determine that eye contact was made by explicitly detecting the eyes of the driver. I. I NTRODUCTION When asked, most people will say that they make eye contact with the driver of an approaching vehicle before crossing the road (see Survey section). This work provides evidence that this belief is largely false, and consequently, that people misattribute the source of their crossing decisions. We focus on the subclass of pedestrian-vehicle interactions where collision is possible as a consequence of non-verbal miscommunication; specifically, situations where a vehicle is decelerating for a pedestrian and there is no crosswalk or signal. While in most road contexts, the law clearly defines who has the right of way, the reality of social-interaction in these contexts is that laws are often bent and broken. Just like many drivers partake in regularly exceeding the speed limit, many pedestrians partake in regularly crossing the street when and where they are not legally allowed to. In this world, where formal rules of the road are not strictly followed, companies and researchers are beginning to test autonomous vehicles on public roads. The important question becomes: What are the cues that pedestrians and vehicles use in ultimately resolving these situations so that we can design autonomous vehicle control algorithms that interact successfully with pedestrians. For the pedestrian, despite the aforementioned common misconception, the crossing decision appears to be primarily based on vehicle kinematics [1]. In fact, one explanation for the findings in this work is that pedestrians form models of driver intent by observing vehicle kinematics, and because of forming such a model believe that they in-fact “see” the driver. We conduct a survey and two experiments that provide evidence that they do not see the driver. Our work has several actionable takeaways for the design of safe human-centered autonomous vehicle control algorithms [3]. First, the general inability to see the presence of a driver in an approaching vehicle means that an autonomous vehicle that has no driver can still, in large part, participate in the intricate pedestrian-vehicle interaction without the pedestrian being directly aware of the driver’s absence. Second, this work implies that vehicle kinematics is in fact the primary form of non-verbal cuing from vehicle to pedestrian. Acceleration and deceleration of the vehicle is something that an autonomous vehicle can control precisely and directly learn from real-world interactions. II. RELATED WORK Eye contact is considered to be one of the main non-verbal cues exchanged in driver-pedestrian crossing interactions. For pedestrians, making eye contact with the driver may facilitate the assessment of drivers’ awareness of their presence or serve as a signal to communicate their intent to cross [9]. With the introduction of full autonomous cars and the elimination of driver-provided cues, a lot of effort is being directed towards the understanding of the role of non-verbal cues such as driver presence and eye contact. A. Driver Presence Testing the impact of driver absence on pedestrians’ crossing decisions is so far limited to Wizard-of-Oz studies due to the limited possibility of operating driverless vehicles in urban environments. In this study [10], most pedestrians managed to make crossing decisions in front of a seemingly driverless car based on vehicle cues alone. 80% of the interviewed pedestrians noticed the missing driver. It’s however unclear § Corresponding author: [email protected] arXiv:1904.04188v1 [cs.HC] 8 Apr 2019

Upload: others

Post on 11-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

Eye Contact Between Pedestrians and DriversDina AlAdawy∗ Michael Glazer∗ Jack Terwilliger∗ Henri Schmidt∗

Josh Domeyer† Bruce Mehler∗ Bryan Reimer∗ Lex Fridman∗§

∗Center for Transportation and Logistics,Massachusetts Institute of Technology

†Toyota Collaborative Safety Research Center

Abstract—When asked, a majority of people believe that, aspedestrians, they make eye contact with the driver of an ap-proaching vehicle when making their crossing decisions. Thiswork presents evidence that this widely held belief is false. Wedo so by showing that, in majority of cases where conflict ispossible, pedestrians begin crossing long before they are able tosee the driver through the windshield. In other words, we are ableto circumvent the very difficult question of whether pedestrianschoose to make eye contact with drivers, by showing that whetherthey think they do or not, they can’t. Specifically, we show thatover 90% of people in representative lighting conditions cannotdetermine the gaze of the driver at 15m and see the driver at allat 30m. This means that, for example, that given the commoncity speed limit of 25mph, more than 99% of pedestrians wouldhave begun crossing before being able to see either the driveror the driver’s gaze. In other words, from the perspective of thepedestrian, in most situations involving an approaching vehicle,the crossing decision is made by the pedestrian solely based onthe kinematics of the vehicle without needing to determine thateye contact was made by explicitly detecting the eyes of the driver.

I. INTRODUCTION

When asked, most people will say that they make eye contactwith the driver of an approaching vehicle before crossingthe road (see Survey section). This work provides evidencethat this belief is largely false, and consequently, that peoplemisattribute the source of their crossing decisions. We focus onthe subclass of pedestrian-vehicle interactions where collisionis possible as a consequence of non-verbal miscommunication;specifically, situations where a vehicle is decelerating for apedestrian and there is no crosswalk or signal. While in mostroad contexts, the law clearly defines who has the right of way,the reality of social-interaction in these contexts is that lawsare often bent and broken. Just like many drivers partake inregularly exceeding the speed limit, many pedestrians partakein regularly crossing the street when and where they are notlegally allowed to.

In this world, where formal rules of the road are not strictlyfollowed, companies and researchers are beginning to testautonomous vehicles on public roads. The important questionbecomes: What are the cues that pedestrians and vehiclesuse in ultimately resolving these situations so that we candesign autonomous vehicle control algorithms that interactsuccessfully with pedestrians. For the pedestrian, despite the

aforementioned common misconception, the crossing decisionappears to be primarily based on vehicle kinematics [1]. Infact, one explanation for the findings in this work is thatpedestrians form models of driver intent by observing vehiclekinematics, and because of forming such a model believe thatthey in-fact “see” the driver. We conduct a survey and twoexperiments that provide evidence that they do not see thedriver.

Our work has several actionable takeaways for the design ofsafe human-centered autonomous vehicle control algorithms[3]. First, the general inability to see the presence of a driverin an approaching vehicle means that an autonomous vehiclethat has no driver can still, in large part, participate in theintricate pedestrian-vehicle interaction without the pedestrianbeing directly aware of the driver’s absence. Second, this workimplies that vehicle kinematics is in fact the primary form ofnon-verbal cuing from vehicle to pedestrian. Acceleration anddeceleration of the vehicle is something that an autonomousvehicle can control precisely and directly learn from real-worldinteractions.

II. RELATED WORK

Eye contact is considered to be one of the main non-verbalcues exchanged in driver-pedestrian crossing interactions. Forpedestrians, making eye contact with the driver may facilitatethe assessment of drivers’ awareness of their presence or serveas a signal to communicate their intent to cross [9]. With theintroduction of full autonomous cars and the elimination ofdriver-provided cues, a lot of effort is being directed towardsthe understanding of the role of non-verbal cues such as driverpresence and eye contact.

A. Driver Presence

Testing the impact of driver absence on pedestrians’ crossingdecisions is so far limited to Wizard-of-Oz studies due to thelimited possibility of operating driverless vehicles in urbanenvironments. In this study [10], most pedestrians managedto make crossing decisions in front of a seemingly driverlesscar based on vehicle cues alone. 80% of the interviewedpedestrians noticed the missing driver. It’s however unclear

§ Corresponding author: [email protected]

arX

iv:1

904.

0418

8v1

[cs

.HC

] 8

Apr

201

9

Page 2: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

Fig. 1: Sample stimuli images of approaching vehicle in 9 different lighting conditions.

when they noticed it and how it affected their crossing decisionas the interviews were conducted after the interaction. [7]installed a dummy wheel in a right-hand steered vehicle,where the real steering wheel was hidden from pedestrians.In contrast to the study in [10], pedestrians highlighted thenecessity of driver-centric cues, as many indicated they wouldnot cross in front of a driverless vehicle due to lack ofconfirmation that they were seen. Conversely, most participantswere willing to cross when the driver engaged in eye contact.

B. Eye Contact

Eye contact has long been seen as a central component ofnon-verbal communication in the context of pedestrian-vehicleinteraction. For example, the U.S. Department of Transporta-tion recommends pedestrians to seek eye-contact with driversto confirm, that they are seen. However, lost in the term “eyecontact” is the distinction between (1) looking to communicatean intent and (2) looking to see. The two goals are disjoint,and the degree to which each is involved in pedestrian-vehiclecommunication is important to understand and what our workaims to provide insights on.

[9], for instance, show that gaze is a powerful tool pedestriansuse to communicate their crossing intent to drivers. They alsosuggest that pedestrians often establish eye contact with driversto make sure they are seen. [11] argue that drivers usuallyyield if there are pedestrians present at the curb who clearlycommunicate their intention of crossing often by seeking eye

contact. [5] also report that significantly more drivers stoppedat a crosswalk when a pedestrian looked them in the eyes incomparison to when they looked above their head.

In contrast to the common idea about the relevance of eyecontact in pedestrian-vehicle interactions, [2] argue that cars’body language - encoded in the kinematics of an approachingvehicle - is much more significant to traffic negotiationsbetween pedestrians and drivers. They suggest, that pedestriansonly seek eye-contact as a confirmation of driver awareness,if the car “behaves” different from what they expected.

To the best of our knowledge, what is not shown in any ofthe above studies is the distinction between a pedestrian’sbehavior of “seeking eye contact” and actually being able tosee the driver or the driver’s eyes. We believe that the latteris not feasible in most scenarios of an approaching car, butinstead, the act of seeking eye contact in itself is used associal signaling [4]. This distinction is one our work seeks tohighlight as we believe it is of critical importance to the designof autonomous vehicles that safely and effectively interact withpedestrians.

III. METHODS

We conducted 2 experiments to investigate humans’ abilityto see inside cars. Both experiments were programmed usingjsPsych, a JavaScript library for running behavioral experi-ments in a web browser and run on Amazon’s crowdsourcing

Page 3: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

Experiment 1:Driver vs. No DriverCan you see if there is a driver?

Experiment 2:Looking forward vs. Looking to pedestrianCan you see where the driver’s eyes are looking?

Fig. 2: An in-cab view of the driver’s body and head position used for the two experiments.

platform Mechanical Turk.

In the first experiment (DRIVER) we examined whetherpeople are able to perceive driver’s presence in the car underdifferent lighting conditions and at different distances. Thesecond experiment (EYES) aimed at identifying the influenceof lighting conditions and vehicle’s distance on pedestrians’perception of driver eye-gaze.

A. Stimuli

The stimuli in both experiments were 4K photos (3840×2160)of a Lincoln MKZ car, that were taken using a Sony FDR-AX53. We used a tripod at adult eye level (about 165cm) andtook the images from the left or the right side of the vehicleto reflect pedestrians’ viewing angle on oncoming cars in acrossing interaction.

The stimuli dataset includes images of the vehicle in 9 lightingconditions: 1) sunshine (left-view), 2) sunshine (right-view),3) sun at zenith, 4) sunset (left-view), 5) sunset (right-view),6) shadow, 7) glare, 8) tree shadow and 9) night (see Fig. 1).For each lighting condition, images are taken at 6 distances:5m, 10m, 15m, 20m, 25m and 30m. And for each distance andlighting condition, images are taken featuring 3 driver states:looking forward, looking to pedestrian, absent (see Fig. 2).

B. Experiments

Both experiments followed a mixed factorial design with twoindependent variables: Distance as the within-subjects factorand lighting as the between-groups factor. Participants wereassigned to a random lighting condition and performed arepeated measurement task for all distances in that particularlighting condition.

1) Task: At the beginning of an experiment, we collected dataon gender and age and asked 2 survey questions:

• Q1: When crossing the street as a pedestrian, when thereis a crosswalk and there are no traffic lights, do you tryto make eye contact with the driver?

• Q2: Do you believe pedestrians are able to see througha car’s windshield and make eye contact with the driver?

Then we showed a sample image of the stimuli dataset ata 5m distance and instructed participants to only focus onthat specific car and ignore any other parked cars in thescene. The stimuli of the respective experiment were thenpresented to the participant in a random order accompaniedby a question for each stimulus. Stimuli for one group wereall in one lighting condition and 1 out of 6 distances. For eachdistance, 2 instances were shown of the driver states of interestto the particular experiment. After the 12 trials, we asked theparticipant Q2 again.

2) Experiment 1: DRIVER: In this experiment we included 6distances and 2 stimuli for each distance: one where the driveris looking forward and one where the driver is absent. For eachimage, participants were asked to answer the question:

Can you see, if there is a driver in the car?

• [n] No, I can’t see if there is a driver.

• [0] Yes, I can see. There is no driver.

• [1] Yes, I can see. There is a driver.

3) Experiment 2: EYE: In this experiment we included 6distances and 2 stimuli for each distance: one where the driveris looking forward and one where the driver is looking to thecamera. For each image, participants were asked to answer thequestion:

Can you see where the driver’s eyes are looking?

• [n] No, I can’t see where the driver’s eyes are looking.

• [f] Yes, I can see. They are looking forward.

• [c] Yes, I can see. They are looking to the camera.

C. Data collection

For each experiment subjects were recruited voluntarilythrough Mechanical Turk. Each experiment involved 180participants (DRIVER: 79 females and 101 males, EYES: 83females and 97 males). Their ages ranged from 19 to 74 (M= 39, SD = 12) for the DRIVER experiment and from 21 to66 (M = 39, SD = 11) for the EYES experiment. Participantsfor each experiment were randomly assigned to one out of 9lighting condition treatments. Hence, we had 20 participantsfor each lighting condition in each experiment. Subjects were

Page 4: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

rewarded 1.50 USD for participating in the EYES experimentand 0.75 USD for the DRIVER experiment. We only acceptedworkers who had a minimum of 1000 accepted HITs and aminimum of 98% acceptance rate. Furthermore, we measuredthe response times to the individual stimuli as well as the dura-tion of the experiment and excluded data from 3 participants:1 in the DRIVER experiment and 2 in the EYE experimentbased on their very short response times (Mean response time< 400ms).

IV. RESULTS

1) Survey: Prior to the experiment, when asked whetherthe subject usually seeks eye contact with the driver whencrossing the street (Q1), 122 (34%) said they don’t seek eyecontact while 235 (66%) indicated they do. This confirmsour assumption about the relevance of eye contact as a non-verbal communication mechanism in traffic. To the questionabout the feasibility of such communication through the car’swindshield (Q2), 282 (79%) subjects said they believe theycan see through cars’ windshields and engage in makingeye contact with the driver whereas only 75 (21%) believedthey are not able to do that. Interestingly, when asked thesame question (Q2) again after the experiment, we noticed adecrease in the total number of people saying they believethey are able to see through windshields from 282 (79%) to190 (53%). Particularly, one third of the participants (104)change their response from “Yes, I am able to see throughwindshields” to “No, I am not able to see through windshields”after the experiment.

Number of responses Number of “I can’t see”DRIVER 2148 1530 (71%)

EYES 2136 1876 (87%)

TABLE I: Experiment results

2) Experiment: Table I shows that 71% of the responses tostimuli were I can’t see, meaning that in all those instancessubjects were unable to see whether there was a driver or not.In the EYES experiment, subjects were not able to detect thedriver’s eye-gaze in 87% of the instances. Decisive responses,where subjects made a prediction, had an accuracy of 0.8 forthe DRIVER experiment and 0.6 for the EYES experiment(see Fig. 3).

In Fig. 4 and Fig. 5 we group the responses by the distancesand lighting conditions. For all lighting conditions and all dis-tances except at 5m distance, the highest number of responsesto the stimuli is I can’t see. The percentage of such uncertainresponses is around 60% at 10m and increases with moredistance to reach 90% of all responses at a 25m distance in theDRIVER experiment. For the perception of driver’s eye-gaze,responses indicate that participants can’t see whether a driveris looking at them at a 5m distance (60% I can’t see). This risesup to 91% of responses at 15m distance. These results showthat seeing through cars’ windshields is very limited and canonly occur at very short distances to the vehicle. When applied

to the crossing context, this implies that crossing pedestriansdon’t mainly rely on driver-provided cues (eye contact) asthose usually occur when the decision to cross has alreadybeen made. For example, assuming a speed of 20mph, thetime to collision at 30m is 3.3s which is below pedestrians’threshold for crossing. Hence, a pedestrian must have madetheir decision to cross before the car reaches 30m distanceat that speed. Pedestrians are therefore not necessarily basingtheir decision of crossing on the perception of driver-relatedcues.

Based on this data, we conclude that pedestrians are oftenunable to see through the windshield and engage in makingeye contact with the driver. According to the answers to Q1and Q2, however, a common contradicting belief exists aboutour ability to make eye contact. Assuming pedestrians looktowards the windshield when communicating with the driver,perhaps there is something functionally similar to an eye-gaze that provides them with acknowledgment and thereforesimulates engaging in eye contact with the driver.

The discrepancy between the reports of pedestrians and theseresults may be that people create post-hoc explanations oftheir decisions [8]. The perceived coupling of the kinematicbehavior with pedestrians’ gaze behavior may lead them tobelieve that their gaze resulted in the vehicle stopping. Overall,the results speak to the larger challenge of relying on reportsfrom people in the design of automated vehicles for tasks thatinvolve tacit knowledge or high degrees of automaticity [8].Even subtle influences such as question wording have beenshown to change how people report their experiences [6].These results in the larger context emphasize the importanceof data when making decisions about the design of safety-relevant systems.

V. CONCLUSION

In this paper we examine the role of eye contact and driverpresence in vehicle-pedestrian interactions. We challenge thecommon notion that decision making in crossing interactions islargely influenced by direct human-to-human communicationbetween pedestrians and drivers. We show that over 90% ofpeople cannot determine the gaze of the driver at 15m andsee the driver at all at 30m. This means that, for example,given the common city speed limit of 25mph, more than 99%of pedestrians would have begun crossing an unsignalizedcrosswalk before being able to see either the driver or thedriver’s gaze [1]. In other words, from the perspective ofthe pedestrian, in most situations involving an approachingvehicle, the crossing decision is made by the pedestrian solelybased on the kinematics of the vehicle without needing todetermine that eye contact was made by explicitly detectingthe eyes of the driver.

Page 5: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

No Driver DriverParticipant

No

Driv

erD

riverGro

und

Trut

h 178(66.67%)

89(33.33%)

40(11.40%)

311(88.60%)

50

100

150

200

250

300

(a) Experiment 1: DRIVER

Not Looking LookingParticipant

Not

Loo

king

Look

ing

Gro

und

Trut

h 88(73.33%)

32(26.67%)

75(53.57%)

65(46.43%)

40

50

60

70

80

(b) Experiment 2: EYES

Fig. 3: Predictions of driver’s presence and eye-gaze in the subset of cases where subject claimed to be able to see well enoughto answer the question accurately, and then doing so erroneously a large percent of the time, especially for the eye-gazedetection experiment.

5 10 15 20 25 30Distance (m)

0

20

40

60

80

Per

cent

of R

espo

nses

Can you see if there is a driver?

"I can see. There is a driver.""I can see. There is no driver.""I can't see."

(a) DRIVER: grouped by distance

1 2 3 4 5 6 7 8 9Lighting Condition

0

20

40

60

80

Per

cent

of R

espo

nses

Can you see if there is a driver?

"I can see. There is a driver.""I can see. There is no driver.""I can't see."

(b) DRIVER: grouped by lighting

Fig. 4: Effect of distance and lighting on perception of driver’s presence.

5 10 15 20 25 30Distance (m)

0

20

40

60

80

100

Per

cent

of R

espo

nses

Can you see where the driver's eyes are looking?

"I can see. Camera.""I can see. Forward.""I can't see."

(a) EYES: grouped by distance

1 2 3 4 5 6 7 8 9Lighting Condition

0

20

40

60

80

100

Per

cent

of R

espo

nses

Can you see where the driver's eyes are looking?"I can see. Camera.""I can see. Forward.""I can't see."

(b) EYES: grouped by lighting

Fig. 5: Effect of distance and lighting on perception of driver’s eye-gaze.

Page 6: Eye Contact Between Pedestrians and DriversB. Eye Contact Eye contact has long been seen as a central component of non-verbal communication in the context of pedestrian-vehicle interaction

ACKNOWLEDGMENT

This work was in part supported by US DOT’s Region I NewEngland University Transportation Center at MIT (NEUTC)and Toyota’s Collaborative Safety Research Center. The viewsand conclusions being expressed are those of the authors andmay not necessarily reflect the views of sponsoring organiza-tions.

REFERENCES

[1] M. A. Brewer, K. Fitzpatrick, J. A. Whitacre, andD. Lord. Exploration of pedestrian gap-acceptance be-havior at selected locations. Transportation researchrecord, 1982(1):132–140, 2006.

[2] D. Dey and J. Terken. Pedestrian interaction withvehicles: roles of explicit and implicit communication.In Proceedings of the 9th International Conference onAutomotive User Interfaces and Interactive VehicularApplications, pages 109–113. ACM, 2017.

[3] L. Fridman. Human-centered autonomous vehicle sys-tems: Principles of effective shared autonomy. CoRR,abs/1810.01835, 2018. URL https://arxiv.org/abs/1810.01835.

[4] M. S. Gobel, H. S. Kim, and D. C. Richardson. The dualfunction of social gaze. Cognition, 136:359–364, 2015.

[5] N. Gueguen, S. Meineri, and C. Eyssartier. A pedestriansstare and drivers stopping behavior: A field experiment atthe pedestrian crossing. Safety science, 75:87–89, 2015.

[6] E. F. Loftus and G. Zanni. Eyewitness testimony: Theinfluence of the wording of a question. Bulletin ofthe Psychonomic Society, 5(1):86–88, jan 1975. doi:10.3758/BF03336715.

[7] V. M. Lundgren, A. Habibovic, J. Andersson,T. Lagstrom, M. Nilsson, A. Sirkka, J. Fagerlonn,R. Fredriksson, C. Edgren, S. Krupenia, et al. Willthere be new communication needs when introducingautomated vehicles to the urban context? In Advancesin Human Aspects of Transportation, pages 485–497.Springer, 2017.

[8] R. E. Nisbett and T. D. Wilson. Telling more than we canknow: Verbal reports on mental processes. PsychologicalReview, 84(3):231–259, 1977. doi: 10.1037/0033-295X.84.3.231.

[9] A. Rasouli, I. Kotseruba, and J. K. Tsotsos. Agreeingto cross: How drivers and pedestrians communicate. InIntelligent Vehicles Symposium (IV), 2017 IEEE, pages264–269. IEEE, 2017.

[10] D. Rothenbucher, J. Li, D. Sirkin, B. Mok, and W. Ju.Ghost driver: A field study investigating the interactionbetween pedestrians and driverless vehicles. In Robotand Human Interactive Communication (RO-MAN), 201625th IEEE International Symposium on, pages 795–802.IEEE, 2016.

[11] M. Sucha, D. Dostal, and R. Risser. Pedestrian-drivercommunication and decision strategies at marked cross-ings. Accident Analysis & Prevention, 102:41–50, 2017.