crowdsourcing soft data for improved urban situation...

7
Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry Park, Anders Johannson Systems Centre, Department of Civil Engineering University of Bristol Bristol, UK {cebsp, a.johansson}@bristol.ac.uk David Nicholson Advanced Technology Centre BAE Systems Bristol, UK [email protected] Abstract—Conventional “hard” sensing in urban spaces is challenged by the complexity of the environment, creating gaps in situation assessment and possible confusion due to data association errors. Crowdsourced “soft” reports from human observers may remedy this problem but require techniques for fusing hard and soft data. This paper describes an experimental crowdsourcing system to evaluate the potential improvement in situation assessment resulting from the fusion of hard and soft data. The paper then applies a new combination of Bayesian inference algorithms, Particle Filtering and Softmax learning, to a canonical test problem: tracking a single moving object moving along a road network. The fusion of soft reports with intermittent hard data is shown to yield a marked improvement in situation assessment performance. Key to achieving such gains in practice will be appropriate incentives to reward trustworthy reporters along with methods to reduce sensitivity to any remaining untrustworthy reports Index Terms—crowdsourcing; tracking; fusion; uncertainty; trust I. I NTRODUCTION Reliable and accurate situation assessment (SA) underpins decision-making in a wide range of applications. Typically SA is derived from “hard” physical sensors, such as radar, and contextual sources of information such as geographical databases [1]. Consider an urban scene in which a decision-maker requires an accurate and complete picture of the whereabouts of one or more moving objects of interest. This could pose a difficult computational challenge for an information fusion process because the hard sensors offer only sparse coverage of the objects due to limited fields of view and obscuration. Crowdsourcing is an emerging approach for mitigating difficult computational problems by utilizing widely available human resource [2]. In the case of urban SA, human “sensors” could be tasked to provide “soft” observations about an object that extend and complement hard data from physical sensors. They extend the hard data by plugging gaps in coverage and complement the physical attributes of objects measured by hard sensors by providing soft relational measurements [3]. This paper describes a key aspect of our work in the devel- opment of a crowdsourcing platform for investigating urban SA for applications such as defence, policing, emergency response, and future smart cities. It focuses on the problem of how to fuse hard and soft observations within a rigorous and robust mathematical framework for SA. The goal was to quantify under what circumstances and to what extent the soft reports are able to improve SA performance. The background to this work is: (a) a growth in the theory and application of crowdsourcing and our ongoing effort to harness this for improving urban SA, and (b) the need to integrate existing mathematical approaches for SA, particularly hard/soft fusion, in a common framework. This background is expanded in Section II, and demon- strates examples of applying the crowdsourcing paradigm to urban SA. It is followed in Section III by a description of a simulation-based example designed to evaluate SA perfor- mance with and without soft data. Section IV describes a par- ticularly versatile Bayesian inference framework for hard/soft data fusion. A number of simulation-based experiments were performed to evaluate this framework and the results are presented and discussed in Section V, followed by conclusions in Section VI. II. BACKGROUND A. Crowdsourcing Some problems remain immensely challenging to solve in sufficient time with the required level of accuracy due to their computational demands or the limitations on data collected by conventional means. Crowdsourcing is now being increasingly utilized as an effective means of overcoming these problems [4]. The idea behind crowdsourcing is to outsource a problem to a human task force and request data (e.g. observations or labels) that facilitates in its solution. The data is typically re- turned in the form of a soft report or preference. Incentives are awarded to stimulate timely and truthful reporting. However, the crowdsourced data is still likely to be of variable quality and must be carefully filtered and aggregated to reduce bias and uncertainty. In the case of urban SA, human observers could be tasked to report on vehicle positions relative to certain landmarks. These tasks might be undertaken in a live situation, making use of smartphone devices to input data. Alternatively, they could be performed offline by tasking participants to report on video playback of a scene from their laptop or desktop computers at home.

Upload: others

Post on 27-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

Crowdsourcing Soft Data for Improved UrbanSituation Assessment

Barry Park, Anders JohannsonSystems Centre, Department of Civil Engineering

University of BristolBristol, UK

{cebsp, a.johansson}@bristol.ac.uk

David NicholsonAdvanced Technology Centre

BAE SystemsBristol, UK

[email protected]

Abstract—Conventional “hard” sensing in urban spaces ischallenged by the complexity of the environment, creating gapsin situation assessment and possible confusion due to dataassociation errors. Crowdsourced “soft” reports from humanobservers may remedy this problem but require techniques forfusing hard and soft data. This paper describes an experimentalcrowdsourcing system to evaluate the potential improvement insituation assessment resulting from the fusion of hard and softdata. The paper then applies a new combination of Bayesianinference algorithms, Particle Filtering and Softmax learning, toa canonical test problem: tracking a single moving object movingalong a road network. The fusion of soft reports with intermittenthard data is shown to yield a marked improvement in situationassessment performance. Key to achieving such gains in practicewill be appropriate incentives to reward trustworthy reportersalong with methods to reduce sensitivity to any remaininguntrustworthy reports

Index Terms—crowdsourcing; tracking; fusion; uncertainty;trust

I. INTRODUCTION

Reliable and accurate situation assessment (SA) underpinsdecision-making in a wide range of applications. TypicallySA is derived from “hard” physical sensors, such as radar,and contextual sources of information such as geographicaldatabases [1].

Consider an urban scene in which a decision-maker requiresan accurate and complete picture of the whereabouts of oneor more moving objects of interest. This could pose a difficultcomputational challenge for an information fusion processbecause the hard sensors offer only sparse coverage of theobjects due to limited fields of view and obscuration.

Crowdsourcing is an emerging approach for mitigatingdifficult computational problems by utilizing widely availablehuman resource [2]. In the case of urban SA, human “sensors”could be tasked to provide “soft” observations about an objectthat extend and complement hard data from physical sensors.They extend the hard data by plugging gaps in coverage andcomplement the physical attributes of objects measured byhard sensors by providing soft relational measurements [3].

This paper describes a key aspect of our work in the devel-opment of a crowdsourcing platform for investigating urbanSA for applications such as defence, policing, emergencyresponse, and future smart cities. It focuses on the problemof how to fuse hard and soft observations within a rigorous

and robust mathematical framework for SA. The goal was toquantify under what circumstances and to what extent the softreports are able to improve SA performance. The backgroundto this work is: (a) a growth in the theory and applicationof crowdsourcing and our ongoing effort to harness this forimproving urban SA, and (b) the need to integrate existingmathematical approaches for SA, particularly hard/soft fusion,in a common framework.

This background is expanded in Section II, and demon-strates examples of applying the crowdsourcing paradigm tourban SA. It is followed in Section III by a description ofa simulation-based example designed to evaluate SA perfor-mance with and without soft data. Section IV describes a par-ticularly versatile Bayesian inference framework for hard/softdata fusion. A number of simulation-based experiments wereperformed to evaluate this framework and the results arepresented and discussed in Section V, followed by conclusionsin Section VI.

II. BACKGROUND

A. Crowdsourcing

Some problems remain immensely challenging to solve insufficient time with the required level of accuracy due to theircomputational demands or the limitations on data collected byconventional means. Crowdsourcing is now being increasinglyutilized as an effective means of overcoming these problems[4].

The idea behind crowdsourcing is to outsource a problemto a human task force and request data (e.g. observations orlabels) that facilitates in its solution. The data is typically re-turned in the form of a soft report or preference. Incentives areawarded to stimulate timely and truthful reporting. However,the crowdsourced data is still likely to be of variable qualityand must be carefully filtered and aggregated to reduce biasand uncertainty.

In the case of urban SA, human observers could be taskedto report on vehicle positions relative to certain landmarks.These tasks might be undertaken in a live situation, makinguse of smartphone devices to input data. Alternatively, theycould be performed offline by tasking participants to reporton video playback of a scene from their laptop or desktopcomputers at home.

Page 2: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

This paper is concerned with the data fusion aspects of acrowdsourcing workflow for generating urban SA. Two of themain considerations in this regard are the heterogeneity of thedata both in terms of its type (hard and soft) and its level oftrust and uncertainty. Only the type heterogeneity issue willbe considered here.

B. Crowdsourcing and Urban SA

There are many examples of crowdsourcing approachesbeing used to generate large amounts of both hard and softdata from human reporters, for applications closely related toSA.

Hard data collected by crowdsourced reporters has beenaggregated to improve estimates in radiation levels [5], whileDARPA have carried out experiments that have shown crowd-sourcing can be used to locate objects across the mainland US[6], and people globally [7], in a matter of hours.

SA has been improved in disaster relief through soft crowd-sourced data. Ushaidi, initially developed to map reports ofviolence in Kenya in 2008, provides a platform for users tosubmit soft text-based data that can used for crisis mapping[8]. A significant bottleneck is the need for volunteers to labeland aggregate incoming data due to its subjective nature.

This paper provides a first proof of concept for the fusionof soft, categorical, crowdsourced data with conventional harddata sources applied to urban SA.

C. Hard/Soft Data Fusion

When data is generated by physical sensors and by crowd-sourcing, its combination will likely require a hard/soft datafusion framework. Previous work, although not directly framedin the context of crowdsourcing, has focused on three maintheoretical frameworks summarized below: Belief Theory,Random Set Theory, and Probability Theory

1) Belief Theory (BT): Dempster-Shafer BT provides richmeasures of uncertainty that can capture the kinds of ambi-guity and vagueness inherent in soft observations. Illustrativeexamples of its application to fuse hard and soft data for testingvarious propositions relating to object position and identitywere presented in [9] and [10] respectively. However, thereremain issues around whether BT can deal in a consistentway with the significant conflicts that are likely to arise incrowdsourced human observations.

2) Random Set Theory (RST): RST is a general mathemati-cal framework for representing and computing hard/soft fusionproblems. It has previously been used to fuse soft observationwith and without hard data to both localize [11] and trackan object [12]. RST requires parametric models to transformobservations into random finite sets. This is not as flexible asnonparametric Bayesian models that can accommodate highlynonlinear (e.g. range-only) observations given an adequatesupply of training data.

3) Bayesian Probability Theory (BPT): BPT is an ax-iomatic framework for data fusion and is the basis for estab-lished techniques such as particle filters for object tracking andBayesian Networks for SA. More recently BPT-based machine

Fig. 1. Crowdsourcing system diagram

learning has been applied to hard/soft data fusion test cases inthe military [13] and bioinformatics [14] application domains.BPT can be used for flexible nonparametric modeling of softdata and combined with decision theory to allocate humaneffort in crowdsourcing [15]. In this paper a BPT approach isused.

III. URBAN SA EXAMPLE

A. System Description

A software system is currently under development forinvestigating the impact of soft data in urban SA. The aim ofthis system is to combine the noise and uncertainty presentin real human responses with the methodical, flexible andrepeatable experimentation available through simulation. Sucha simulation approach allows the ground truth of the envi-ronment to be compared to the picture compilation possiblethrough fusing human reports with simulated hard sensors.

The simulation system is designed to run offline with ahuman task force of workers recruited from a web servicesuch as Amazon’s Mechanical Turk [16]. Accessing existingservices such as Mechanical Turk enables the recruitment of alarge number of people, in an on-demand manner, while alsogiving the requester an element of control over the type of datagathered through their choice of questions and user interface.

A top-level system diagram is displayed in Fig. 1. Undersome payment scheme, based on performance validation, aworker is tasked to report on one of multiple simulated urbanscenes running on an external web server. Each simulation is ashort duration time slice from a longer contiguous simulation.The aim is to ensure: tasks are quick to complete; goodcoverage over the entire simulation time; overlapping reportson the same scene so the benefits of fusion can be exploited.

Fig. 2 displays a screenshot of the simulation window andthe task input menu that will be presented to a worker whoaccepts a task. Focusing on a specific object, the workers are

Page 3: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

Fig. 2. Urban SA simulation window and task input menu.

tasked to provide input about its type, what it did, and wherethis happened. The worker can click on the simulation windowto locate an event and choose from a set of menu optionsthat describe the event, e.g. the size of an object { small,medium, big } and its mobility {accelerated, decelerated},{turned left, turned right}. These soft observation preferencesare then submitted to a report database for further analysis,such as hard and soft data fusion processing to derive SA.

It is envisaged that SA derived as a result of this systemcould be used to inform ‘pattern of life’ analysis for urbantraffic flow or forensic studies of particular traffic incidents inapplications such as smart cities and disaster management.

B. Data Fusion Test Case

The data fusion component of the system has been evaluatedon a test case involving a single mobile object and simulatedobservation data (hard and soft). The aim of the test casewas to prototype a novel combination of hard/soft data fusionalgorithms and to motivate a future crowdsourcing experimentby demonstrating proof of concept.

In the test case a constant speed object moves along anetwork of roads. The roads are treated as lines with zerowidth. For tracking purposes, there was a known upper boundon the objects velocity of twice its constant value. Observationdata was generated in the form of multiple simulated hard andsoft reports. For evaluation purposes, four experimental set-upswere considered:

1) Good hard sensor coverage: Three hard sensors weredispersed along the roads. They were placed so that atleast one sensor was able to detect the object at any time,i.e. the object was always within the maximum range ofone sensor.

2) Poor hard sensor coverage: Only a single hard sensorwas available. It detects the object at the start of thesimulation but then the object moves out of range of thesensor.

3) Good hard sensor coverage with periodic soft reports:As for the first setup with the addition of soft reports.

4) Poor hard sensor coverage with periodic soft reports: Asfor the second setup with the addition of soft reports.

The simulated hard sensor reports are real-valued bearing-only observations of the object of interest. The observationsare perturbed by zero-mean Gaussian random noise. The hard

sensor is assumed to have a maximum range Rh, with theprobability of detection PD given by

PD = 1− r/Rh (1)

where r is the absolute distance between object and sensor.For all the experiments, Rh was set to 200m. No falsedetections (clutter) were generated for the purpose of thisevaluation.

The simulated soft sensor reports are discrete multi-categorical values for the range and bearing of the object withrespect to a reference landmark. Each sensor provides oneof 17 mutually exclusive and exhaustive soft location labelsfrom the 16 range-bearing categories {‘near to’, ‘far from’}× {‘north’, ‘north-east’,. . . ,‘north-west’} and the additionalrange-only category {‘next to’}. This work uses a modelof noisy human responses without bias or deceit, and thuspresents an initial baseline for fusion of crowdsourced datafor situation assessment.

The observation data, along with (assumed known) geo-graphic information about the road layout and the referencelandmarks are then passed to the data fusion algorithmsdescribed in the next section.

IV. FUSION APPROACH

The BPT framework from Section II forms the basis of thedata fusion approach. The desired output is an estimate for theposterior probability density function p(Xk|H1:k, S1:k) where:k is a discrete time index; Xk is the continuous random statevector of the object at time k; and H1:k and S1:k are therespective sequences of hard and soft data up to time k.

A. Prediction

In the urban environments of interest here, objects areusually constrained to move along road networks. A Bayesianestimator must be able to deal with the multi-modality of theprobability distributions created by road intersections as wellas the dynamic motion uncertainties arising from different roadcharacteristics.

Here the multi-modality issue was handled with a ParticleFilter (PF) [17]. The PF prediction was implemented using aconstant velocity motion model with additive Gaussian noise.A relatively high variance in the noise element was used.This was an intentionally sub-optimal solution designed tochallenge the fusion process when there were gaps betweenobservations, even for the simplest test case of a constantvelocity object on a straight road.

B. Hard Fusion

The hard fusion process is a conventional PF update fora noisy bearing-only sensor observation, followed by theapplication of road constraints.

Depending on the implementation details of the soft fusionalgorithm (see below) it may be necessary to compact theparticle representation of state into a Gaussian mixture modelrepresentation. This is commonly done by resampling theparticles and applying a variational Bayesian procedure to

Page 4: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

0 10 20 30

0.65

0.7

0.75

0.8

0.85

0.9

0.95

TrainingPdataPsizeP(perPrangePcategory)

Cla

ssifi

catio

nPP

roba

bilit

y

σ =P20

(a)

0 10 20 300.6

0.7

0.8

0.9

1

Trainingbdatabsizeb(perbrangebcategory)

Cla

ssifi

catio

nbP

roba

bilit

yb

σ =b0σ =b10σ =b20

σ =b30σ =b40σ =b50

(b)

−200 0 200−300

−200

−100

0

100

200

300

x position (m)

y po

sitio

n (m

)

(c)

Fig. 3. Training softmax function: a) Mean training curve and variance over 50 simulation runs for range noise level σ = 20, b) Mean training curves forsoftmax learning with varying levels of noise in the measured range, c) Probability contours for the softmax likelihood function and training data for the ‘nextto’ (red circles), ‘near to’ (blue circles) and ‘far from’ (green circles) observation range observation categories

infer the best fit model (including the number of mixturecomponents) [18].

C. Soft Fusion

The soft fusion update at time k is given by Bayes’ Rule:

p(Xk|H1:k, S1:k) ∝ p(Sk|Xk)p(Xk|H1:k, S1:k−1) (2)

A soft report Sk = j is an observed category for Xk, wherej ∈ {1, . . . ,m} and m = 17 for the test case since thereare 2× 8 soft range-bearing categories and 1 soft range-onlycategory.

The first key issue is how to specify the likelihood modelp(Sk|Xk). This is a function that maps the continuous statespace variable Xk = x to discrete probabilities on thesoft observation space. Following [17], a natural choice for‘continuous-to-discrete’ mapping is the softmax function:

p(Sk|Xk) =exp(wT

j x+ bj)∑mh=1 exp(w

Th x+ bh)

(3)

where wj , wh ∈ Rn and bj , bh ∈ R1 are vector weights andscalar biases for classes j, h ∈ {1, . . . ,m}, respectively.

The softmax parameters are learned from labeled trainingdata using convex optimization procedures based on maximumlikelihood (or MAP) estimation.

Training can be an expensive process, both in term of datavolume and processing time, so constraints on the softmaxmodel were enforced to ease the problem. These amountedto an assumption that given a state estimate, its bearingobservation (relative to a landmark) is perfectly known, i.e.the soft reporters are able to locate the object in the correctoctant of a circle around the landmark without error. Trainingtherefore reduces to estimating the mapping between the stateand the soft range observations.

The soft sensor model used in these experiments totrain and test the softmax function had the following

range categories:‘next to’<100m, 100m<‘near to’<200m, and200m<‘far from’<300m. A soft sensor with no noise willassign the ‘correct’ categorical label every time.

Noise was introduced to this model by perturbing the rangemeasured by the soft sensor, between object and landmark,by zero mean Gaussian noise, with variance σ2, leading topossible mis-assignment in the soft range category. In laterexperiments, this data will be created by real crowdsourcedhuman observers.

To quantify the training data requirement, mean trainingcurves for varying soft range noise levels were produced.Based on 50 simulation runs, these curves show the probabilityof correctly predicting unseen observation labels. Fig. 3ashows an example of the mean and variance in the classifi-cation probability across the 50 simulations when σ = 20. Itcan be seen that on average, with 10 training points per rangecategory, the softmax model correctly predicts the correctrange-bearing category approximately 90% of the time.

Fig. 3b shows the impact on the mean classification perfor-mance with increasing noise.

An example of a learned softmax function is shown inFig. 3c This displays the probability contours of a learnedsoftmax likelihood function along with the 10 samples oftraining data in each range category that were used to trainit. This model was trained using σ = 0. Notice the ‘fuzzy’boundaries between the categories. The degree of ‘fuzziness’is controlled by the softmax weights. Also of note are thesquare edges between the different range categories. Thismeans that even with infinite training data, and a noise free softsensor model, the softmax function will not always correctlycategorise the soft range. This is a limitation of the currentapproach, however, with a small amount of injected noise inthe range data, or limited amounts of training data, the impactof this is minimal.

The second main issue is how to evaluate (2) given asoftmax likelihood function (3) and a prior p(Xk|·). The

Page 5: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

400 600 800 1000

200

300

400

500

600

700

800

900

xapositiona(m)

yapo

siti

ona(

m)

ObjectaPositionSensorsLandmarks

ParticlesFilteraMean

(a)

400 600 800 1000

200

400

600

800

x position (m)y

posi

tion

(m

)

Sensors

Landmarks

ParticlesFilter mean

(b)

400 600 800 1000 12000

200

400

600

800

x position (m)

y po

sitio

n (m

)

Sensor 3

Object PositionSensor

Landmark

Particles

Sensor 2

Sensor 1

(c)

Fig. 4. Simulation screenshots: a) Particle filter estimate of an object showing increase in variance due to poor hard sensor coverage b) Situation assessmentafter fusion of a soft report stating object is “near to” and “south east” of upper landmark. Probability contours of the soft sensor likelihood function areshown. c) Test scenario used for performance analysis

simplest solution is likelihood weighted importance sampling(LWIS) [17], for which the new particle weights following asoft update at time k can be determined, up to a normalizationconstant:

w̃ik ∝ wi

k × p(Sk|xik) (4)

This approach, while computationally convenient, is sus-ceptible to inconsistent estimates in situations that drive manyparticle weights to near zero values. A more involved butrobust approach, Variational Bayesian Importance Sampling(VBIS), has been developed in [19].

VBIS computes a reliable Gaussian mixture approximationto (2) via variational Bayesian importance sampling (VBIS).This treats all probability density functions over Xk as fullGaussian mixture models, compressing weighted particlesafter the hard and soft sensor update. This provides a consistentand compact representation of state and avoids potential issueswith sample degeneracy.

Both LWIS and VBIS were applied in this investigation andfound to perform very similarly. The results in the next sectionare therefore based on the simpler LWIS method.

V. RESULTS AND DISCUSSION

This section contains a selection of results for the testcase outlined in Section III. First, an illustration of poorsituation assessment is shown in Fig. 4a. This has arisenbecause the object has turned right at a road intersection,moving out of range of two hard sensors. The particle-basedstate estimate has propagated particles consistent with thepossible multiple motion hypotheses given the known roadlayout. The consequence is the mean object position estimateis significantly displaced from the true object position.

A soft report is now received stating that the object is “nearto” and “south east” of the upper landmark, as show in Fig.4b. Probability contours are displayed for the learned softmaxlikelihood function. The soft report is fused with the current

particle-based estimate and the impact is clear: an improvedmean object position estimate and a reduction in the spreadof particles (variance).

Detailed statistical performance analysis results are nowpresented for different combinations of hard and soft sensorupdates. The results were obtained for the simple test scenarioshown in Fig. 4c. A single object moves along a road next towhich are situated 3 hard sensors. At any time step at leastone sensor has the object within its range. A single landmarkis used as the anchor point for periodic soft reports.

First, it is instructive to look at the estimate of one ofthe object’s position variables for a representative run of thesimulator. This is shown in Fig. 5a for a situation in whichthere is poor hard sensor coverage with no soft reports. In thisscenario, the target is within range of a hard sensor until timestep k = 100, where the object moves beyond its range. Thereis a significant drift in estimated position relative to the trueobject position in this case.

The estimates for a situation in which there is again poorhard sensor coverage but periodic soft reports every 50 timesteps are shown in Fig. 5b. The soft reports provide sufficientinformation to offset much of the drift that was evident in theprevious case.

The statistical performance analysis was based on 20 MonteCarlo runs of the simulator, after which the root-mean-squareposition estimate of the object was calculated at each time step.First, the results for poor hard sensor coverage, with periodicsoft reports were generated. This is shown in Fig. 5c, wherethe soft range noise was set to σ ∈ {0, 20, 40}. Each softmaxfunction was trained on 10 data points per range category,leading to a correct classification probability of approximately95% , 90% and 80% respectively. The main impact of thenoise can be seen when the object approaches the boundarybetween two soft range categories, which in this case occurredbetween the time steps 150− 200, and 250− 300. The noisehas little impact when the object is located near the centre

Page 6: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

100 200 300 400 500

0

100

200

300

400

500

600

SimulationOTime

yOpo

siti

onO(

m)

FilterOPositionOEstimate± 2OStandardODeviations

TrueOObjectOPosition

(a)

100 200 300 400 500

0

100

200

300

400

500

600

SimulationjTime

yjpo

siti

onj(

m)

Filterjpositionjestimate± 2jStandardjDeviations

TruejObjectjPosition

(b)

0 100 200 300 400 5000

20

40

60

80

100

Simulation Time

RM

S e

rror

(m

)

σ = 0σ = 20σ = 40

(c)

100 200 300 400 500

0

50

100

150

200

250

SimulationfTime

RM

Sfe

rror

f(m

)

Goodfsensorfcoverage

Goodfsensorfcoveragef-fPeriodicfsoftfreports

Poorfsensorfcoverage

Poorfsensorfcoveragef-fPeriodicfsoftfreports

(d)

Fig. 5. Simulation results for simple single road test case: a) Position estimate for a situation with poor hard sensor coverage and no soft reports b) Positionestimate for a situation with poor hard sensor coverage and periodic soft reports c) RMS error performance showing impact of noise in the soft sensor rangeon object tracking d) RMS error performance showing the benefit of soft reports in situations of poor hard sensor coverage

of a soft range-bearing region, which is shown for time stepsk > 300.

Using the softmax model with σ= 0, Fig. 5d shows theimpact of soft reports for various hard and soft sensorcombinations. Bounds on performance are provided by thepoor and good hard sensor coverage cases. In the case ofgood sensor coverage, the soft reports have little impact onperformance. However, soft sensor reports clearly improvesituation assessment performance when hard sensor coverageis poor.

VI. CONCLUSIONS

Crowdsourcing applications raise interesting new challengesfor data fusion researchers. The crowdsourced data fromhuman reporters is likely to be of a soft type, of variablequality, and may not always be trustworthy.

This paper has focused on the soft aspect of crowdsourceddata and shown how it can be fused with hard data to provide

SA performance benefit when the coverage provided by hardsensors is poor.

Poor hard sensor coverage will typically be the case inurban environments, where sensing opportunities on objectsare limited leaving gaps that human reporters may be able tofill to provide completer SA with a sufficient level of accuracyto reliably inform decision makers. This paper has consideredmulti-categorical soft reports in which an object’s range and/orbearing position were specified with vague descriptors such as‘near to’ and ‘north east’ relative to a reference landmark.

Bayesian Probability Theory was applied to formulate thehard and soft data fusion problem for a mobile object on a roadnetwork. Prediction and hard data fusion were implementedwith a Particle Filter. The likelihood of a soft report wasmodeled with a softmax function that was learned fromlabelled training data. Soft fusion was implemented via alikelihood weighted importance sampling (LWIS) approach.

Several aspects of this approach merit a more detailedanalysis, including:

Page 7: Crowdsourcing Soft Data for Improved Urban Situation ...fusion.isif.org/proceedings/fusion2013/html/pdf... · Crowdsourcing Soft Data for Improved Urban Situation Assessment Barry

• When is it valid to apply geometric constraint assump-tions to simplify training of the softmax function?

• What is the performance trade-off associated with learn-ing an “average” likelihood model for a group of softreporters rather than separate models for individual re-porters?

• When does particle degeneracy become an issue? Can thisbe predicted so the most appropriate soft fusion algorithm(LWIS or VBIS) can be applied?

• What are the implications on performance of more com-plex object situations, e.g. multiple objects with dataassociation ambiguity?

This paper has only provided a proof-of concept that fusingsoft reports from human sensors can improve urban SA. Itsgreatest limitation in this regard was the use of synthetic softdata. The next step is therefore to conduct a trial with real softdata generated by human observers recruited from AmazonMechanical Turk. This expands the scope of the data fusionproblem considerably.

The first additional challenge relates to the expected vari-ability and quality of crowdsourced data. This aspect has beennoticed for the Galaxy Zoo Supernovae application, a partof the highly successful Zooniverse family of citizen scienceprojects [20]. Elsewhere this has been handled by a Bayesianclassifier combination method with a dynamic component totrack changes over time in individuals reporting confidence[21].

Related to the challenge of variable quality human reportsis their degree of trustworthiness. This can be partly mitigatedby aligning the rewards that are paid to human reporters withtheir performance. There are also computational trust modelsfor representing trust and combining reports on the basis oftrustworthiness [22].

This paper has employed a Bayesian machine learningframework for hard and soft data fusion, but there may becircumstances when other frameworks for combining uncertaindata are more advantageous. For example, when there areinsufficient labelled data to accurately train a machine learningalgorithm. The comparison of related fusion methods for thetype of test problem explored in this paper is recommended asa possible test case for the International Society of InformationFusions Evaluation of Techniques of Uncertainty Representa-tion Working Group

In conclusion, crowdsourcing is a promising technology forimproving urban SA and data fusion will be a core componentof any solution. However, the real key to success will rely uponintegrating data fusion with methodology and perspectivesfrom other fields, in particular computer science and socialsystems science.

ACKNOWLEDGMENT

This work was jointly supported by the EPSRC fundedIndustrial Doctorate Centre in Systems (Grant EP/G037353/1)and BAE Systems. Thanks are due to Nisar Ahmed (Cornell)for providing software and advice on the VBIS soft fusion

algorithm, and to Steve Reece (Oxford) and Gopal Ramchurn(Southampton) for useful comments.

REFERENCES

[1] S. Das, High-Level Data Fusion. Artech House Publishers, Sep. 2008.[2] J. Howe, “The Rise of Crowdsourcing,” Wired, vol. 14, no. 6, Jun. 2006.[3] D. L. Hall and J. M. Jordan, Human-Centered Information Fusion:

Artech House Electronic Warfare Library, 1st ed. Norwood, MA, USA:Artech House, Inc., 2010.

[4] D. C. Brabham, “Crowdsourcing as a Model for Problem Solving,”Convergence: The International Journal of Research into New MediaTechnologies, vol. 14, no. 1, pp. 75–90, Feb. 2008.

[5] M. Kamel Boulos, B. Resch, D. Crowley, J. Breslin, G. Sohn, R. Burtner,W. Pike, E. Jezierski, and K.-Y. Chuang, “Crowdsourcing, citizensensing and sensor web technologies for public and environmentalhealth surveillance and crisis management: trends, ogc standards andapplication examples,” International Journal of Health Geographics,vol. 10, no. 1, p. 67, 2011.

[6] G. Pickard, W. Pan, I. Rahwan, M. Cebrian, R. Crane, A. Madan, andA. Pentland, “Time-critical social mobilization,” Science, vol. 334, no.6055, pp. 509–512, 2011.

[7] I. Rahwan, S. Dsouza, A. Rutherford, V. Naroditskiy, J. McInerney,M. Venanzi, N. Jennings, and M. Cebrian, “Global manhunt pushes thelimits of social mobilization,” Computer, vol. 46, no. 4, pp. 68–75, 2013.

[8] R. Munro, “Crowdsourcing and the crisis-affected community,”Information Retrieval, vol. 16, no. 2, pp. 210–266, 2013. [Online].Available: http://dx.doi.org/10.1007/s10791-012-9203-2

[9] S. Acharya and M. Kam, “Evidence combination for hard and soft sensordata fusion,” in Information Fusion (FUSION), 2011 Proceedings of the14th International Conference on, 2011, pp. 1–8.

[10] T. L. Wickramarathne, K. Premaratne, M. N. Murthi, M. Scheutz,S. Kubler, and M. Pravia, “Belief theoretic methods for soft and harddata fusion,” in Acoustics, Speech and Signal Processing (ICASSP), 2011IEEE International Conference on, 2011, pp. 2388–2391.

[11] A. N. Bishop and B. Ristic, “Fusion of natural language propositions:Bayesian random set framework,” in Information Fusion (FUSION),2011 Proceedings of the 14th International Conference on, 2011, pp.1–8.

[12] B. Khaleghi and F. Karray, “Human-centered multi-target tracking: ARandom Set theoretic approach,” in Information Reuse and Integration(IRI), 2012 IEEE 13th International Conference on, 2012, pp. 193–200.

[13] S. Reece, S. Roberts, D. Nicholson, and C. Lloyd, “Determining intentusing hard/soft data and Gaussian process classifiers,” in InformationFusion (FUSION), 2011 Proceedings of the 14th International Confer-ence on, 2011, pp. 1–8.

[14] G. R. G. Lanckriet, T. De Bie, N. Cristianini, M. I. Jordan, and W. S.Noble, “A statistical framework for genomic data fusion,” Bioinformat-ics, vol. 20, no. 16, pp. 2626–2635, 2004.

[15] E. Kamar, S. Hacker, and E. Horvitz, “Combining human and machineintelligence in large-scale crowdsourcing,” in Proceedings of the 11thInternational Conference on Autonomous Agents and Multiagent Systems- Volume 1, ser. AAMAS ’12. Richland, SC: International Foundationfor Autonomous Agents and Multiagent Systems, 2012, pp. 467–474.

[16] Amazon’s mechanical turk. [Online]. Available:https://www.mturk.com/mturk/welcome

[17] B. Ristic, S. Arulampalam, and N. Gordon, Beyond the Kalman Filter:Particle Filters for Tracking Applications. Artech House, 2004.

[18] C. M. Bishop, Pattern Recognition and Machine Learning (InformationScience and Statistics). Secaucus, NJ, USA: Springer-Verlag New York,Inc., 2006.

[19] N. R. Ahmed, E. M. Sample, and M. Campbell, “Bayesian Multicat-egorical Soft Data Fusion for Human #x2013;Robot Collaboration,”Robotics, IEEE Transactions on, vol. 29, no. 1, pp. 189–206, 2013.

[20] Zooniverse. [Online]. Available: https://www.zooniverse.org/[21] E. Simpson, S. Roberts, I. Psorakis, and A. Smith, “Dynamic Bayesian

Combination of Multiple Imperfect Classifiers,” Jun. 2012.[22] S. Reece, A. Rogers, S. Roberts, and N. R. Jennings, “A Multi-

Dimensional Trust Model for Heterogeneous Contract Observations,”in Twenty-Second AAAI Conference on Artificial Intelligence, 2007, pp.128–135, event Dates: July 2007.