self-mapping radio maps for location fingerprinting

15
1 23 Wireless Networks The Journal of Mobile Communication, Computation and Information ISSN 1022-0038 Wireless Netw DOI 10.1007/s11276-014-0864-1 Self-mapping radio maps for location fingerprinting Gareth Ayres & Jason Jones

Upload: swansea

Post on 27-Mar-2023

0 views

Category:

Documents


0 download

TRANSCRIPT

1 23

Wireless NetworksThe Journal of Mobile Communication,Computation and Information ISSN 1022-0038 Wireless NetwDOI 10.1007/s11276-014-0864-1

Self-mapping radio maps for locationfingerprinting

Gareth Ayres & Jason Jones

1 23

Your article is protected by copyright and all

rights are held exclusively by Springer Science

+Business Media New York. This e-offprint is

for personal use only and shall not be self-

archived in electronic repositories. If you wish

to self-archive your article, please use the

accepted manuscript version for posting on

your own website. You may further deposit

the accepted manuscript version in any

repository, provided it is only made publicly

available 12 months after official publication

or later and provided acknowledgement is

given to the original source of publication

and a link is inserted to the published article

on Springer's website. The link must be

accompanied by the following text: "The final

publication is available at link.springer.com”.

Self-mapping radio maps for location fingerprinting

Gareth Ayres • Jason Jones

� Springer Science+Business Media New York 2014

Abstract Localisation of a mobile device in Wi-Fi based

indoor environments has received much attention and is an

important area of research, considering the uptake of

devices with built-in Wi-Fi and often blanket coverage in

enterprise. Most localisation techniques in this area require

an off-line calibration phase where a radio map is manually

built using the laborious efforts of administrators. This

radio map is then subject to becoming inaccurate with time

and requires refreshing as the equipment is upgraded and

replaced. We propose a system that can automatically build

and calibrate a radio map for use with location finger-

printing techniques to provide indoor and outdoor locali-

sation with Wi-Fi. This system first uses a self-mapping

technique that makes use of user roaming patterns to build

a weighted undirected graph of Access Point locations. We

then combine this initial map with a map built by users of

smartphones with built-in Wi-Fi and GPS to generate

anchor nodes. We show that the unique combination of

these two techniques provides a zero-configuration cali-

bration map for use with location fingerprinting techniques,

which not only saves time and effort in the calibration

phase, but provides a constantly fine-tuning and self-heal-

ing map.

Keywords Wireless � Localization � Visualization �Graph theory � Force-directed layout

1 Introduction

The recent explosion in popularity of mobile devices such

as the Apple iPhone and Google Android phones has

accelerated the use and deployment of Location Based

Services (LBS). LBS’s require different levels of granu-

larity of location data with some applications working

sufficiently well with low-grained location data such as a

town/city while others require much more fine-grained data

such as room/building.

The problem of localisation while outdoors has been

largely addressed with the use of three technologies, Global

Positioning System (GPS), mobile phone network locali-

sation and Wi-Fi localisation. GPS localisation has proven

very popular with consumers, with GPS chips being

embedded in many personal devices such as traffic navi-

gators, smartphones and sports watches. The GPS system

works through the use of a Global Navigation Satellite

System to provide timing data to devices from multiple

satellites. A device can then use this data to trilaterate its

position on the earth. The GPS system works well in open

spaces, but runs into problems in city centers where the

signal suffers from reflection.

Mobile phone network localisation has become a pop-

ular method to quickly locate a phone on a GSM network,

all be it with significantly less accuracy than GPS. The U.S

FCC 911 (E911) mandate has pushed the development of

these technologies to assist authorities tracking users in

times of an emergency [1].

Wi-Fi data can also usually be obtained from smart-

phones, and as a result this has begun to emerge as a

solution to outdoor localisation. It has become common

place for competing mobile OS providers to build their

own a priori map of Wi-Fi access points that is used to help

assist in the location of smartphones. The smartphone’s

G. Ayres (&) � J. Jones

College of Engineering, Swansea University, Swansea SA28PP,

UK

e-mail: [email protected]

J. Jones

e-mail: [email protected]

123

Wireless Netw

DOI 10.1007/s11276-014-0864-1

Author's personal copy

then contribute new information back to the provider to

improve and add to the map.

Indoor localisation using Wi-Fi on the other hand is an

ongoing problem to which an adequate solution has yet to

be adopted by any mainstream smartphone developers.

There have been many proposed solutions, such as location

fingerprinting, which will be covered in Sect. 2.

In this paper we propose a novel system that augments a

self-mapping and calibrating Wi-Fi radio map derived from

mobile device roaming behavior, with a smartphone vari-

able accuracy radio map, to produce a zero-configuration

self-mapping radio map for use in indoor and outdoor

localisation.

The remainder of this paper is broken down as such:

Sect. 2 covers related work in indoor localisation, location

fingerprinting and zero-configuration radio maps. Section 3

provides an overview of the LocPriS system developed to

implement the concepts and produce results. Section 4

introduces the self-mapping using undirected weighted

graphs technique. Section 5 introduce the smartphone

variable accuracy radio map android application and

technique. Section 6 then brings Sects. 4 and 5 together to

present the Augmented Radio Map. Section 7 provides a

conclusion of the work.

2 Related work

Zero-configuration indoor Wi-Fi localisation is an active

area of research that has its roots in indoor localisation for

mobile sensor networks as well as mobile robotics. Both

these areas of research have a need to build a self-map of a

wireless environment and to locate themselves within it.

These areas of research combined with indoor localisation

in WLAN environments have produced much work. This

section will summarise the work in these areas related to

this paper.

2.1 Indoor localisation

Much work on localisation of mobile devices in an indoor

environment has been undertaken in order to provide

context-aware and location based services. The first areas

of research involved specialist hardware devices that made

use of TOA (time of arrival), TDOA (time difference) and

AOA (angle of arrival). These specialist setups made use of

infrared (Active Badges) [2], ultrasonic (Active Bat) [3]

and beacon based (Cricket) [4] high frequency wireless

communications to determine location. While these sys-

tems worked well, the requirement for specialist hardware

was a hurdle to mainstream adoption [5].

With significant research in localisation techniques and

other issues relevant to LBS’s focusing on indoor systems

with specialist hardware throughout the 1990s until early

2000 the next wave of research was initiated by Microsoft

with a paper they published called RADAR [6], which

details a number of methods of performing localisation

through the use of a emerging technology, wireless LAN’s

(Wi-Fi), without the need of specialist hardware. This

paper is discussed more in Sect. 2.2.

The direction of research began to shift during the early

2000s from using indoor location tags to existing Wi-Fi

infrastructure to provide LBS’s. The increase in popularity

of GPS and smartphones helped drive this change of

direction to focus on Wi-Fi localisation techniques.

2.2 Location fingerprinting

In 2001 Microsoft Research published a paper that detailed

a passive system called RADAR which made use of the

802.11 based Wi-Fi infrastructure for localisation, with

only additional software needed and no additional spe-

cialist hardware. This paper sparked a drive to improve

localisation of indoor devices using the native wireless

equipment used by 802.11b networks at the time.

In the RADAR system, the RF received signal strength

indicator values (RSS) is used as a measure of distance

between an Access Point (AP) and mobile node (MN). An

empirical/deterministic algorithm is described (Nearest

Neighbour in Signal Space NNSS) which is used to store

the RSS of each AP for a specific location in a Vector in a

off line phase. This off-line phase is used to create a radio

map. Each point that RSS values are added to the vector is

called a calibration point (CP). This information is used to

compute the 2D position by triangulation, with both an

empirical method and a signal propagation modelling

method used.

RSS based positioning has developed to generally use

three different techniques: Association/Cell ID based

methods, triangulation (pattern matching and probabilistic

filtering) and fingerprinting. Cell ID based is the simplest,

but yields a very low accuracy equal to that of the total

range of RSS from an AP. Triangulation through lateration

is more accurate with three or more AP’s, but is compu-

tationally demanding and requires knowledge of the exact

position of AP’s, and does not work well when considering

radio propagation/multi-path problems encountered by

non-line of sight (NLOS). However, for indoor positioning,

non-line-of-sight propagation and attenuations caused by

walls, other structures, and even people causes significant

fluctuations to RSS, which makes the simple path loss

models too inaccurate in many real life situations. Location

Fingerprinting has become an established area of further

research as it copes well with the problem of NLOS in Wi-

Fi by modelling the environment and making use of the

propagation characteristics by design.

Wireless Netw

123

Author's personal copy

Typically two phases are required for location finger-

printing; an offline training phase to build a radio map of

calibration points (CPs) for all locations and an online

location determination phase where a radio map is refer-

enced by a device to determine its location.

The radio map, or a priori, is created during a calibration

phase that includes reading RSS values at predetermined

locations for a fixed period of time. RADAR suggested

manual or simulation calibration phases. Manual requires

walking around and taking readings at each CP while

simulation requires a detailed model of a building, both

taking considerable time and effort to achieve. It is also

usual to include building floor plans as images, and require

manually linking location to position on a floor plan

(labelling). It has been suggested that the linking of actual

location to radio map could be achieved through prompting

a user to label locations during the online phase instead of

the calibration phase [7]. This phase of calibrating the

system is also labour intensive and subject to errors when

AP’s move or fail.

The number of CP’s in a radio map, and the time spent

at each [8] has also been the subject of much work, in an

attempt to reduce the burden of the offline calibration

phase. It has been shown that more CP’s result in more

accurate location estimates but a larger search time and

computational cost. While less CP’s result in less accurate

location estimates but a smaller search time and are more

efficient [9].

It has also been reported that different devices report

differing RSS values at the same location as a result of the

different hardware and design properties of Wi-Fi chipsets

[10]. This, along with the fact that the orientation and

direction of devices can also return different readings,

results in a significant possibility of error in a radio map

[11].

Location fingerprinting techniques fall into two cate-

gories, probabilistic or deterministic. In deterministic

techniques typically the estimated location of a MN is

considered static in time and makes use of the Nearest

Neighbor techniques, with the Kth NN and Weighted NN

techniques proven to be successful in early works [6].

Probabilistic Algorithms have proven to be the most

popular technique; they consist of RSS data summarised

into Probability Density Functions (PDF). At each CP, a

radio map has a PDF for each AP approximated using a

Kernel Function or Histograms [12]. Using a random var-

iable to represent location estimation results in a smaller

radio map in memory and less computation time in the

localisation phase [13]. This combined with filtering using

Kalman or Bayesian filtering results in improved location

estimation using continuous random variables. This

approach makes use of previous location data, not just

static calibration points [12].

2.3 Reduced/zero-configuration radio maps

Accurate location estimation using measurements of signal

strength is still a difficult task due to the noisy character-

istics of signal propagation. Physical characteristics such as

reflection, refraction, diffraction and absorption means

signal propagation suffers from multi-path fading effects in

an indoor environment and RSS values frequently change.

With 802.11 techniques such as channel hopping, automatic

equipment and transmission power control and changes in

network layout an up to date radio map for use with location

fingerprinting is a research challenge of its own.

Frequently changing fingerprints result in the time/

labour intensive radio maps becoming inaccurate or

expired quickly, with a new costly offline recalibration

phase necessary. This challenge has been addressed by

projects such as RedPin [14] which make use of RSS fin-

gerprints and manual interaction from clients to continually

update a radio map with actual location on floor plans.

Reducing the size of radio maps and the number of

required CPs has been suggested as a way to speed up the

off line phase. The use of probabilistic estimation tech-

niques has resulted in smaller radio map requirements for

fingerprinting, but still requires linking to a floor plan or

map [15].

The link between a floor plan and a CP on a radio map is

sometimes called labelled data. There has been some work

to reduce the effort of labelling data when building radio

maps by allowing one floor to be manually labelled and

assume the other floors follow a similar layout [16].

Anchor nodes have been proposed as a solution to

provide zero-configuration, along with access points that

monitor each other’s RSS values [17]. The anchor nodes

however need to be manually entered and positioned before

a system is active and AP’s require custom layer 2 soft-

ware. If such an anchor node were to move or be replaced

with different hardware, it would have significant impact

on the reliability of such a system.

3 System overview

LocPriS is a modular extensible framework [18] that we

are developing to provide tools for the development, ana-

lysis, comparison and visualisation of LBS’s that preserve

privacy and security. The framework will also assist in the

development and testing of Location Based Services

through an exposed API (Fig. 1).

3.1 Visualising roaming with LocPriS

LocPriS can build a model of wireless access points (AP),

mobile nodes (MN) and the movements of the MN’s

Wireless Netw

123

Author's personal copy

between AP’s. This model can then be analysed empiri-

cally or visualised in 2d/3d as in Fig. 2. The visualization is

produced using custom software written in Java, PHP and

Javascript. The D3.js framework is used for all 2d visu-

alisations, and the jME framework used for 3d.

3.1.1 Automatic labelling and grouping

Although trivial, the hostname of each AP can sometimes

be used to automatically label AP’s and group them. This

technique can be useful in enterprise environments where

multiple location/buildings/floors have similar hostnames.

These self-assigned labels or groups can then be used to

help eliminate or speed up the manual calibration phase.

Groups of similar hostnames can be presumed to rep-

resent similar geographic locations. Fulton-AP-F1-2 for

example could be inferred to represent a cluster of AP’s in

the Fulton building. These groups can then be labelled and

visualised instead of individual AP’s in order to present a

visualisation that is simpler to interpret and conceptualise.

This is shown in Fig. 2, where automatic groups for each

building are shown as red circles on a campus map which

are generated in LocPriS using manually entered locations.

The green lines in Fig. 2 represent the roaming between

groupings by a MN as it moves around the campus.

3.1.2 Association/disassociation data

As MN’s roam, they associate to different AP’s. These

associations and disassociations create events (SNMP

traps) in the wireless system which can be sent to LocPriS

and then stored in a database. These events contain addi-

tional information such as time, AP name and MN iden-

tifier. The LocPriS system can pseudo-anonymise this data

to enhance user privacy for use with location fingerprint-

ing. While it is the aim to discover the location of AP’s

using self-mapping, it is possible to set the latitude/longi-

tude of an AP manually in LocPriS. Figure 3 shows the

location of all AP’s at the Student Village mapped using

their lat/long location which was manually entered in order

to evaluate the effectiveness of self-mapping in later

sections.

3.2 Android application: variable accuracy location

data

Using smartphones it is now possible to generate much

location data with variable accuracy from different sources.

These sources can include GPS, the mobile phone network

LocPriSData Collection Module

(Real-Time data collection or Wireless Traces )Mobility Simulation Module

(Internal and External Simulation )

Visualisation Module(2D & 3D Visualisation)

Statistical Output Module(Output to CSV File)

Location Based Service Module / API

Location Privacy Module(Implementation of Privacy Techniques )

Fig. 1 The modular design of the LocPriS Framework

Fig. 2 Automatic groupings based on labels and fixed locations

(Color figure online) Fig. 3 Map of all AP’s in Student Village

Wireless Netw

123

Author's personal copy

and Wi-Fi estimation. The specific use of Android and a

custom LocPriS application that has been developed is

discussed in further detail in Sect. 5.

4 Self-mapping using undirected weighted graphs

The LocPriS system receives real time association and

disassociation data about all connected MN’s from wireless

AP’s via the wireless controllers. This data is then for-

warded to the LocPriS server software and stored in two

mySQL database tables.

This data is generated by devices roaming between

AP’s. Each roam results in an association and disassocia-

tion message being sent to the LocPriS system using SNMP

packets. Figure 4 shows an example of a user in a typical

scenario walking past access points, between buildings,

with a device roaming for connectivity as the user moves.

This information can be used to build an undirected

weighted graph G with vertices V and edges E. Let an

access point be considered a vertex V, and the con-

nectivity/relationship between each access point as a

device roams considered the edge E. While the graph of

one user’s movements is directed, when multiple devices

over a larger time period are considered, the same edge

between vertices will be covered in both directions. So the

direction becomes irrelevant.

The edge weight W can then be defined as the time taken

to associate between access points. Given Fig. 4, access

points A1 and A2 are nodes, and the edge between the two

nodes signifies the relationship that node A2 was next

roamed to from node A1. The edge weight is W = 8 as there

was an 8 s time period between associations.

As many weights, from many devices will be fed into the

system for two given nodes, each with a potentially dif-

ferent weight, it is important to use the most effective

maximum weight. This is required to rule out some sce-

narios where invalid data would be generated, such as when

a device remains still or a device is switched off and then

back on. In order to calculate the most effective weights to

use, we recorded the GPS locations of the 204 access points

at the university’s student village, which is a private area of

land roughly quarter of a mile squared in area containing

130 houses and 204 AP’s. This can be seen in Fig. 3, with

each access point positioned on the map with a red dot.

We used 1 month of data from March 2013 and plotted

each nodes GPS location into a Euclidian plane and cal-

culated the distance between neighboring AP’s. We then

paired these data with the minimum and average time

recorded to associate between neighbor AP’s. The corre-

lation coefficient between the time taken (t) in seconds to

roam between two AP’s, and the displacement (d) between

those two AP’s in meters was then calculated to discover

the best maximum weight to use for future graphs.

Correlðt; dÞ ¼Pðt � �tÞðd � �dÞ

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiPðt � �tÞ2

qPðd � �dÞ2

where �t ¼ average time; �d ¼ average displacement

This was undertaken for all the access points using all

the data from March 2013, and used maximum weights

ranging between 25 and 3,000 s. The weight is then said to

be the time taken in seconds to roam between two AP’s.

This gave us the plot which can be seen in Fig. 5, and

shows that the minimum time seen to roam between two

AP’s gives a better result than the average time. It also

shows the highest correlation coefficient is for a maximum

weight of 250 s.

Using the information discovered in Fig. 5, the best

maximum weight Wmax = 250 s is recommended, and used

throughout the rest of this paper. While the weight is

Fig. 4 One device roaming

0

0.1

0.2

0.3

0.4

0.5

0.6

0 500 1000 1500 2000 2500 3000

CORR

ELAT

ION

CO

EFFI

CIEN

T

SECONDS

Correla�on Min Correla�on Avg

Fig. 5 Correlation coefficient of weights

Wireless Netw

123

Author's personal copy

derived from time, the unit of the weight once the graph is

built is meaningless, as the graph is dimensionless. Once

the graph is anchored in Sect. 6, the weights are used to

scale the graph and keep edges relative.

For the full 12 months of 2013 there were 50,539,780

associations recorded by LocPriS running on the Swansea

University wireless system, for all campuses. During this

time, there were 870 distinct AP’s identified in the system

and 21,382 distinct devices seen.

The sample data seen in Table 1 shows the information

stored in the LocPriS database in the association table. The

edge weight between V(AP1) and V(AP2) is W = AP2(Wi)

- AP1(Wi) = 1,304,467,963 - 1,304,467,955 = 8 s.

Figure 4 represents one devices movement, for a trivial

number of AP’s. The LocPriS system builds d subgraphs

Gd = (V,E), where d = {1…number of devices}, based on

the connectivity and mobility of a device d and then

compares the weights, adding only the smallest weight seen

between two edges to graph G, where W \ 250. So edge

G(E) will contain {W0…Wd} weights, and Ed(W) will

contain the smallest value of {W0…Wd}.

The AP Name is irrelevant to the Graph, but used only

as a label to aid interpretation of visualisations. In our

dataset the AP name is the hostname of the AP. Fortunately

the hostnames at Swansea are descriptive of the location to

differing degrees, but usually contain at least the building

name and floor. This makes grouping and automatically

labelling AP’s based on AP Name a future possibility. The

remainder of this paper will focus on data taken from the

academic year 2013–2014.

4.1 Roaming statistics and device characterisation

Devices roaming between access points produce the data

used to build the graphs for the radio map. Some inter-

esting statistics and trends have been observed when ana-

lysing the roaming statistics from the LocPriS system

which may affect the effectiveness of the data and tech-

niques used. We have analysed 12 months roaming data

from 2013 to 2014 from the Student Village.

Table 2 shows the top 17 device vendors in descending

order of association count, including a row for ‘All Ven-

dors’ which shows the total for all devices seen in 2013-14

at the Student Village. This table shows which device types

are popular on the wireless network at Swansea, and how

much of the data used to build the graphs came from which

device types. It also provides an insight into which of the

popular devices on campus have a high or low number of

Association/Device. The number of AP’s (Vertices) seen

Table 1 Sample associations data

AP name Device ID Timestamp

AP1 7899 1304467955

AP2 7899 1304467963

Table 2 Top device vendors by

association countVendor Association

count

Assoc

%

Edges Vertices Devices Assoc/

Device

All vendors 13,422,957 100.0 23,462 204 18,674 718.80

Apple 6,263,451 50.0 20,102 204 8,415 744.32

Samsung 1,606,796 12.8 15,784 204 1,985 809.47

Hon hai precision ind 937,605 7.5 3,276 204 1,284 730.22

Nintendo co., ltd. 813,108 6.5 3,366 191 71 11,452.23

HTC corporation 477,820 3.8 10,236 204 666 717.45

Liteon technology

corporation

427,762 3.4 2,345 204 781 547.71

Murata manufactuaring 401,243 3.2 10,840 204 614 653.49

Nokia corporation 376,124 3.0 8,628 204 355 1,059.50

Sony mobile

communications

365,596 2.9 8,952 204 756 483.59

Intel corporate 295,498 2.4 2,460 204 751 393.47

LG electronics 207,446 1.7 7,834 204 175 1,185.41

Research in motion 148,693 1.2 14,448 204 381 390.27

Asustek computer inc. 61,926 0.5 3,450 203 120 516.05

Huawei technologies 54,258 0.4 3,866 194 73 743.26

Motorola mobility 50,952 0.4 3,088 199 49 1,039.84

Gemtek technology 35,702 0.3 200 149 81 440.77

Wistron neweb corp 13,638 0.1 238 125 52 262.27

Total for listed vendors 12,537,618 100.0

Wireless Netw

123

Author's personal copy

by each manufacturer (from a total of 204) and the number

of distinct associations between AP’s (Edges) are also

shown.

We have also calculated the correlation coefficient

between displacement and time for each device vendor type,

and broken down the two most popular vendors Apple and

Samsung for further analysis. It can be seen in Table 3 that

Apple devices account for 50 % of all wireless devices in the

Student Village, with Samsung making up nearly 13 %.

Students at Swansea can use any make device they wish on

the wireless network, and all are self-bought by students.

By using the wireless registration data it is possible to

brake the Apple vendor down into the known models. The

same applies for Samsung, but we have had to combine all

tablets/phones under Android as this was all that was

possible with the registration data. The totals for ‘All

Apple’ and ‘All Samsung’ is larger than the sum of the sub-

types as not all devices seen on the system were registered,

so these were not included in the granular analysis.

It can be seen from the data in Table 3 that the ‘Apple-

OSX’ and ‘Samsung-Laptop’ have significantly poorer

correlation coefficients than the mobile devices such as the

‘iPhone’. This is presumably a result of the difference in

use of mobile devices such as phones and laptops. It is also

likely a result of the behaviors of such devices, where

smartphones stay connected to wireless in a user’s pocket

as they roam, whereas a laptop would typically be shut

down or suspended when carried.

The devices with a correlation coefficient of less than

0.5 are given in italics, and greater than 0.5 in roman.

Using this information to remove the devices which pro-

vide a low or poor correlation may help improve the overall

accuracy of the map.

Other factors which may also play part in the roaming

decisions of device are badly written wireless drivers

which roam between near access points too aggressively, or

possible infrastructure problems in some locations.

4.2 Visualising roaming using LocPriS

Using the roaming data from LocPriS along with the

grouping of the AP data per building using the automatic

labelling of AP names, it is possible to quickly visualise the

mobility of devices to get an idea of roaming patterns.

Figure 6 shows a visualisation of associations of all devices

during a 1 h period, overlaid onto a Swansea Universiy

Student Village map. The association between two AP’s is

represented by a line. Each AP was manually positioned

using manually collected GPS locations and is represented

by a red circle.

However, removing the dependency of having location

for buildings and access points manually entered and

recorded is the goal of a self-mapping radio map.Table 3 Device type correlation coefficient

Vendor Devices Associations % Correl Coeff

All Vendors 18,674 100.0 0.61

All Apple 8,415 50.0 0.60

Apple-iPhone 4,298 N/A 0.61

Apple-iPad 753 N/A 0.62

Apple-OSX 135 N/A 0.29

All Samsung 1,985 12.8 0.64

Samsung-Android 1,321 N/A 0.65

Samsung-Laptop 90 N/A 0.03

HTC Corporation 666 3.8 0.63

Hon Hai Precision 1,284 7.5 0.45

Murata manufacturing 614 3.2 0.62

Nintendo 71 6.5 0.43

Nokia Corporation 355 3.0 0.63

Liteon Technology 781 3.4 0.26

Intel 751 2.4 0.32

Sony 756 2.9 0.64

RIM 381 0.6 0.51

LG Electronics 175 1.7 0.62

Wistron Neweb 52 0.1 0.27

Asustek Computer inc 120 0.5 0.57

Motorola Mobility 49 0.4 0.61

Huawei Technologies 73 0.4 0.54

Gemtek Technology 81 0.3 0.02 Fig. 6 One hour associations with automatic grouping and manual

entered AP locations (Color figure online)

Wireless Netw

123

Author's personal copy

4.3 Graph drawing of the undirected weighted graph

All the associations and disassociations recorded by Loc-

PriS at the Student Village for March 2013 were used to

build an undirected weighted graph, with a maximum

weight of 250 s. The LocPriS system calculates the nodes,

edges and weights then outputs the result as a Graph G.

Using force-directed layout techniques, and the visual-

ization package D3.js, it is possible to visualise Graph G.

The graph in Fig. 7 represents the Self-Mapping Radio

Map (SMRM) generated from all the associations collected

for March 2013 on the Swansea University student village

wireless infrastructure.

This graph is an undirected weighted graph that is using

a maximum weight of 250 s to determine the minimum

weight of each neighbour relationship between AP’s.

The orientation of the SMRM is irrelevant, as without

anchor nodes there is no way to determine correct orien-

tation or if a flip/twist has occurred.

In Fig. 7, groupings and clustering of AP’s where there

is a relationship between edge weight of neighbouring AP’s

and the time taken to roam between them is becoming

evident.

The graph was visualized using the custom visualization

module in LocPriS, and can be edited and manipulated in

real time using a web browser.

The SMRM now requires a meaningful orientation and

reference points to link it to the real-world locations of

AP’s for use with further location fingerprinting techniques

or for immediate localisation. Section 5 will describe how

these reference points can be collected and defined in a new

map, with Sect. 6 then showing how the two maps can be

combined to produce a self-mapping radio map. The ori-

entation and positioning could also be achieved manually,

by an administrator dragging some anchor nodes into

position on a real map, but this would not achieve the zero-

configuration or continuously updating and correcting

functionality that the map in the next section helps achieve.

5 Smartphone variable accuracy based radio maps

The rise in smartphone’s with built-in location technolo-

gies such as GPS, mobile network and Wi-Fi localisation

has resulted in a boom of location based services. Fortu-

nately, smartphone’s with the Android OS come with a

SDK [19] that is freely available for application developers

to make use of the smartphone technology to gather loca-

tion information.

A LocPriS application has been developed for Android

which collects location data and Wi-Fi fingerprints and

sends them to the LocPriS system. This data is then used to

calculate anchor nodes (AN) that then form the Variable

Accuracy Radio Map (VARM).

5.1 LocPriS Android application

Developing the LocPriS application in Android has the

advantage that it can be easily distributed to one of the

many Android devices available to users, through the

market place. This application could then be used by many

users or administrators to help build a detailed radio map as

they roam day to day.

A HTC Hero running Android 2.1 was used for the

development and experimentation of the application as

seen in this paper, but the application should work on any

Android device.

The left screen shot in Fig. 8 shows the summary page

of the Android application. From this page it can quickly

be seen that there are a number of items of information

available including: the total movements detected, the total

AP’s seen by the application and the number of broadcast

SSIDs seen.

The right screen shows the detailed view from the

LocPriS application. The detailed view is used to get more

information and turn features on and off. The Refresh

button can be used to start a Wi-Fi scan, and use the last

known good location data once. The Send button will

access the data structures of the application and send the

data to the LocPriS server. The Auto On button indicates

the automatic location sensing is off, and can be turned on

by clicking the Auto On button. This allows a user to run

the application silently, and roam around while constantly

sending data to LocPriS server.

The data text visible is a debug/detailed summary out-

put. Information available to the application from the SDK

includes: The location with an estimated location accuracyFig. 7 Self-mapping radio map (SMRM) for March 2013

Wireless Netw

123

Author's personal copy

in meters, the RSS value of the AP seen, the latitude and

longitude of the current location along with a timestamp

and the location provider type.

The location data comes from either the built in GPS on

the device, or the estimated location based on the mobile

network localisation technique provided by Android. This

uses the mobile telephony towers to calculate location, and

is generally a less accurate than GPS except for some dense

city centers where GPS is poor and there is a high con-

centration of mobile network towers.

5.2 LocPriS application calibration points and anchor

nodes

The LocPriS application was used around Swansea Uni-

versity Student Village to collect a sample of data. It is

envisaged that the application would only need to be used

by one site administrator, and even only once in order to

collect sufficient anchor nodes. The application could be

built as a service that is running silently on android devices

without user intervention.

A Calibration Point (CP) can be considered the esti-

mated location of a MN (running the LocPriS application)

when a RSS value reading, paired with GPS/Mobile loca-

tion data was taken (Location Fingerprint). A user running

the LocPriS Android Application walked on an anticlock-

wise path around the Student Village. Figure 9 shows the

path taken by the users, along with the CP’s (red pins in left

image) and the AP roaming history obtained by the

LocPriS system (right image). As the exact locations of

APs is already known in Fig. 9, this can be considered the

ground truth for AP locations.

Multiple CP’s with the same AP visible are then used to

compute the Anchor Node location, which is the estimated

location of the AP. This is achieved by using the location

of the CP with the lowest location accuracy error as the

AN. If multiple CP’s with similar location accuracy error

as found then the one with the lowest RSS value is used.

The location of the AN then become the location provided

by the CP, as derived from the android application.

It is possible to use the location of multiple CP’s and

techniques such as those described in RADAR [6] to per-

form trilateration to give a AN with a potentially better

location accuracy, but this is not explored in this paper.

The use of GPS or mobile network localisation to obtain

AN locations results in AN’s that are placed outdoors, or

on the periphery of buildings. Also limiting the maximum

location accuracy error to 5 m (typically GPS) will help

remove the creation of unsuitable AN’s.

The AN seen in Table 4 has had its location determined

by an android device with a GPS lock with an accuracy of

\5 m. This AN can then be used along with other ANs to

orientate and position other radio maps, as discussed in

Sect. 6.

Combining the anchor nodes along with the variable

accuracy location data then provides a Variable Accuracy

Radio Map (VARM) derived from android devices

reporting data back to the LocPriS server.

Fig. 8 Screen shots of LocPriS

application

Wireless Netw

123

Author's personal copy

5.3 Visualising variable accuracy location data

The Android application reports location data with latitude

and longitude values, accompanied by an accuracy value.

This is the degree of accuracy in meters that the android

location service reports the location data is correct to within.

The data collected from the LocPriS application con-

tains this accuracy value, and it is important when calcu-

lating anchor node locations.

The markers in Fig. 10 visualise the location accuracy

estimated by the android application when sending location

data coupled with Wi-Fi data to the LocPriS server.

Again blue represents mobile network location, and red,

GPS estimated locations. The blue locations in this instance

are all large as mobile network localisation is not very

accurate at the Student Village. In a dense city environment

where traditionally GPS is not so accurate but there is

excellent mobile network coverage, the mobile network

location data may be the better option. The red location

estimates are generally much smaller at the Student

Village, with some instances where location using GPS

was given a poor accuracy value.

6 Augmented radio map

With the two independently derived radio maps built, it is

possible to combine them to produce a radio map that

Fig. 9 (Left) Calibration points

via GPS. (Right) Ground truth

from MN roaming (Color figure

online)

Table 4 An anchor node (Node-id 41) with a fixed location

\node id = ’’39’’ label = ’’ful-wap-fg-niche’’/[\node id = ’’41’’ label = ’’uh-aironet-f1-lift’’[\attvalues[\attvalue for = ’’latitude’’ value = ’’51.6100090742111’’/[\attvalue for = ’’longitude’’ value = ’’-3.98159444332123’’/[\/attvalues[\/node[\node id = ’’42’’ label = ’’pre-wap-gnd-1’’/[

Fig. 10 Variable accuracy radio map (VARM) (Color figure online)

Wireless Netw

123

Author's personal copy

contains enough information on the location of access

points to perform indoor and outdoor localisation, or to

perform further Location Fingerprinting to improve accu-

racy. We call this an Augmented Radio Map (ARM). The

ARM is equivalent to the a priori map used traditionally in

location fingerprinting techniques.

The SMRM contains information useful to indoor and

outdoor localisation, but it lacks orientation and a link to

the real world locations. The smartphone VARM contains

orientation through the use of anchor nodes and helps

improve accuracy through GPS on buildings periphery

AP’s by linking anchor nodes to the real world locations.

By infusing the anchor node information into the

SMRM, it is possible to correctly orientate and position an

augmented radio map that contains information useful to

indoor and outdoor localisations and more importantly, as a

zero-configuration self-mapping radio map for use with

Location Fingerprint techniques.

The known GPS locations of the 204 AP’s at the student

village were used to anchor all nodes in a SMRM map to

their known locations, as seen in Fig. 3. Then a random

node was unanchored one at a time, and the average error

between the nodes location in the Augmented Radio Map

(ARM) and its known location was calculated and plotted

in Fig. 11. This shows the number of anchored nodes

required to maintain a specific level of error (distance in

meters) for the ARM map. While it may be assumed only a

small number of nodes are required to correctly orientate a

map, rarely will the three most ideally placed nodes be

selected as anchors. We also found that although the map

can be considered one graph, subgraphs that naturally

form, can be flipped/twisted even if the graph is correctly

orientated. The twisting and flipping of these subgraphs can

only be resolved with either anchor nodes in the subgraph

or more edges between the subgraph and supergraph.

Figure 11 shows a steady increase in error as nodes are

unanchored. We believe this is a result of the subgraph

flipping/twisting problem being corrected for each subgraph,

as well as the correction to AP locations as a result of

mapping 3D AP locations into a 2D plane. We have observed

that mapping buildings with multiple floors of APs directly

above each other into 2D results in a spread of vertices near

each other, instead of a tower of vertices atop each other. This

is a result of converting the 3D world into a 2D graph where

the edge weights are interpreted as distance between nodes in

the x/y dimension stead of the z-dimension.

The augmented radio map is a map of the AP’s in an

enterprise wireless deployment, both indoors and outdoors,

and will correct itself and improve its accuracy as more

calibration information is feed into the two source maps.

The accuracy of localisation is entirely dependent on the

quality of the two source maps, and any further finger-

printing techniques used. The more anchor nodes generated

by android application users will increase the accuracy of

the anchor nodes placed on the edge of buildings which are

used to orientate and position the self-mapping radio map.

Figure 12 shows the ARM overlaid onto a diagram of

Swansea University student village. The SMRM is posi-

tioned and orientated correctly using some of the anchor

nodes derived from the calibrations points of a user who

had walked around the student village with the application

running and collecting data. The blue dots in Fig. 12 show

the estimated location of AP’s, with the AP’s used as

anchor nodes coloured red.

While there are some obvious errors with the ARM, the

majority of AP’s are positioned atop/near the correct

building. Continuously adding more data to the source

maps should improve the ARM accuracy. The greater the

density of access points inside a building, combined with

sufficient roaming devices producing roaming data for

decreased weights, will also increase the accuracy of the

self-mapping radio map.

-10

10

30

50

70

90

110

1 14 27 40 53 66 79 92 105

118

131

144

157

170

183

196

DIST

ANCE

(MET

ERS)

UNANCHORED NODES

Error (distance ) / Anchors

Fig. 11 UnAnchored nodes/error (distance)

Fig. 12 Augmented radio map at the Student Village (Color figure

online)

Wireless Netw

123

Author's personal copy

The augmented radio map exists in a real time computed

state in the memory of the LocPriS system. While the computed

SMRM graph data is stored in LocPriS in constantly updating

database tables, as is the VARM data. The LocPriS system then

provides access to the augmented radio map, and rebuilds it as

necessary and as the source radio maps change. LocPriS can

then be references by any fingerprinting systems or even used

directly for location estimation of wireless devices.

7 Conclusion

In this paper we introduce a method to generate a Self-

Mapping Radio Map to be used with location fingerprinting

and indoor localisation that makes use of the LocPriS

system and requires zero-configuration.

We introduce the novel approach of using the roaming

data generated by mobile devices moving between access

points as weights for a self-mapping radio map (SMRM) of

access point locations using undirected weighted graphs.

The best maximum weight in the graph to use was calcu-

lated as 250 s, and the minimum time taken to roam

between AP’s was show to be better than the average.

We then show the use of a smartphone application to

collect location and wireless fingerprints at calibration

points in order to calculate anchor nodes. These anchor

nodes form a variable accuracy radio map (VARM) that

can then be combined with the self-mapping radio map

(SMRM) to orientate and position the SMRM into a aug-

mented radio map (ARM) for use as an a priori with further

location fingerprinting techniques.

Through the use of 1 month’s wireless roaming data

from the Swansea University Student Village, we show how

our technique can produce a self-mapping radio map. Such

a system can not only be used to locate access points and

hence devices in indoor and outdoor locations, but it is self-

correcting and improving, and requires zero configuration.

Further areas of interest which we have not covered in

detail in this paper but which could benefit from deeper

investigation are what influences the best maximum weight

values used, if different scenarios or deployment environ-

ments influence this, and if different graph layout tech-

niques would help improve visualisation and accuracy. The

implications of mapping real world three dimensional data

to a 2D graph/map is also an interesting area. In our sce-

nario at the Student village most buildings contained only

one or two AP’s and floors, but in a scenario where

buildings contain multiple floors of AP’s this will introduce

a new area to the graph layout which warrants further work.

Device roaming behavior between access points proved to

be an interesting topic with some unexpected results. This

value being critical to the graph composition means that the

further testing of what influences devices roaming times

between AP’s would be interesting. We believe some unex-

pected roaming statistics could be the results of some user not

walking on campus but carrying a device on a bike or in a

vehicle. Also some Wi-Fi devices do not roam as expected,

possibly as a result of bad roaming algorithms or hardware.

References

1. Woody, C., Acker, L. et al. (2001). The impact of the FCC’s

position on wireless E911.

2. Want, R., et al. (1992). The active badge location system. ACM

Transaction on Information Systems, 10, 91–102.

3. Harter, A., et al. (1999). The anatomy of a context-aware appli-

cation (MobiCom 1999) (pp. 59–68). New York, USA: ACM Press.

4. Priyantha, N. B., Chakraborty, A., & Balakrishnan, H. (2000).

The Cricket location-support system. In Proceedings of the 6th

annual international conference on Mobile computing and net-

working (MobiCom ’00) (pp. 32–43). New York, USA: ACM

Press. doi:10.1145/345910.345917.

5. Hightower, J., & Borriello, G. (2001). A survey and taxonomy of

location systems for ubiquitous computing. IEEE Computer 8(34),

57–66.

6. Bahl, P., & Padmanabhan, V. N. (2000). RADAR: An in-building RF-

based user location and tracking system. INFOCOM 2000. Nineteenth

Annual Joint Conference of the IEEE Computer and Communications

Societies. Proceedings. IEEE (Vol. 2, pp. 775–784).

7. Bhasker, E. S., Brown, S. W., & Griswold, W. G. (2004).

Employing user feedback for fast, accurate, low-maintenance

geolocationing. In In Proc. PerCom (pp. 111–120).

8. John Krumm, J. C. P. (2003). Minimizing calibration effort for an

indoor 802.11 device location measurement system. Microsoft

Research, MSR-TR-2003-82.

9. Perttula, A., et al. (2009). WLAN positioning on mobile phone. In

Proceedings of 13th IAIN World Congress, Stockholm, Sweden.

October 27–30, 2009.

10. Bolliger, P., et al. (2009). Improving location fingerprinting

through motion detection and asynchronous interval labeling. In

Proceedings of the 4th International Symposium on Location and

Context Awareness (pp. 37–51). Tokyo: Springer.

11. Taheri, A., Singh, A., & Emmanuel, A. (2004). Location fin-

gerprinting on infrastructure 802.11 wireless local area networks

(WLANs) using Locus. In Local Computer Networks, 2004. 29th

Annual IEEE International Conference on (pp. 676–683).

12. Castro, P., et al. (2001). A probabilistic room location service for

wireless networked environments. In Proceedings of the 3rd Interna-

tional Conference on Ubiquitous Computing. Atlanta, GA: Springer.

13. Youssef, M. A., Agrawala, A., & Udaya Shankar, A. (2003).

WLAN location determination via clustering and probability

distributions. In Pervasive Computing and Communications,

2003 (PerCom 2003). Proceedings of the First IEEE Interna-

tional Conference on (pp. 143–150).

14. Bolliger, P. (2008). Redpin—adaptive, zero-configuration indoor

localization through user collaboration. In Proceedings of the

First ACM International Workshop on Mobile Entity Localization

and Tracking in GPS-Less Environments (pp. 55–60). San

Francisco, CA: ACM.

15. Chai, X., & Yang, Q. (2005). Reducing the calibration effort for

location estimation using unlabeled samples. In Proceedings of

the Third IEEE International Conference on Pervasive Comput-

ing and Communications (pp. 95–104). IEEE Computer Society.

16. Hua-Yan, W., et al. (2010). Indoor localization in multi-floor

environments with reduced effort. In Pervasive Computing and

Wireless Netw

123

Author's personal copy

Communications (PerCom), 2010 IEEE International Conference

on (pp. 244–252).

17. Lim, H., et al. (2006). Zero-configuration, robust indoor locali-

zation: theory and experimentation. In INFOCOM 2006. 25th

IEEE International Conference on Computer Communications.

Proceedings (pp. 1–12).

18. Ayres, G., & Mehmood, R. (2010). LocPriS: A security and

privacy preserving location based services development frame-

work. In Proceedings of the 14th International Conference on

Knowledge-Based and Intelligent Information and Engineering

Systems: Part IV (pp. 566–575). Cardiff: Springer.

19. Google. Android SDK. http://developer.android.com/sdk/index.

html

Gareth Ayres is a Ph.D. student

in the School of Engineering at

Swansea University and also

works as a Wireless Network

Officer for Information Services

and Systems (ISS) at Swansea.

He previously completed an

undergraduate degree at Swan-

sea before working in ISS, and

then completed a master’s

degree in communication sys-

tems before starting his Ph.D.

He now works and researches in

the area of wireless networking

and privacy and enjoys

combining the practical side of his work with ISS with the theoretical

aspect of his Ph.D.

Jason Jones received a B.Sc. in

Computer Science from Swan-

sea University in 1994. He

subsequently moved across to

the College of Engineering and

received his Ph.D. in High Per-

formance Visualisation in 1999.

After that he stayed in Swansea

as a post-doctoral researcher

working on a number of large

European projects involving the

major Aerospace companies in

Europe including BAE Systems,

Airbus and EADS. Dr. Jones

then took up the post of Centre

Manager for a new Virtual Reality Centre in the University for

3 years before finally moving back to the College of Engineering as

their High Performance Computing Research Manager.

Wireless Netw

123

Author's personal copy