car-to-car communication, cognitive radio and related
TRANSCRIPT
1 © 2008 Nokia Vers1.0.ppt / 20. June 2008 / Edmund Coersmeier
Car-to-car Communication, Cognitive Radio and related Parallel Processing
15. July 2008
Edmund Coersmeier
2 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Overview
• Car-to-car communication
• Cognitive Radio
• Software Radio
• Channel estimation processing
• Multicore processing
• Conclusion
3 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Car-to-car communication
4 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Basic idea
• Automatic car-to-car communication
SpeedDriving Direction
PositionSpeed
Driving DirectionPosition
5 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Related Technologies• Starting with established wireless technologies• Include step-by-step latest wireless systems
6 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Unicast versus Geocast
• Unicast• communication between two parties
unknown structure recipientsender
unknown structure adressed regionsender
• Multicast• several parties receive the same
information• Geocast for car-to-car communication
Sender Unknownstructure
Recipient
AddressedRegion
Unknownstructure
Sender
7 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Geographic Clustering
• Clusters mapped on landscape
• GPS to link parties to the corresponding clusters
• One car acts as cellular uplink node
• Other cars communication via WLAN
UMTS
WLAN
ClusterCluster
Cellular
WLAN
8 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Dynamic Gateway
• No GPS positioning
• Cellular uplink node is established, when no other uplink is available
• Advantage• Communication
range improved
UMTS
WLAN
individuelleKommunikations-reichweite
Cellular
WLANCommunication
Range
9 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Required cellular connections
• Geographic clustering• Required UMTS connections • Highway scenario• Max = 43 connections
120 50 100 150 200 300 400 600 800 0
5
10
15
20
25
30
35
40
45
Cars
Con
nect
ions
maximum max. 98% averagetheoretical
120 50 100 150 200 300 400 600 8000
5
10
15
20
25
Cars
Con
nect
ions
maximum max. 98%averagetheoretical
• Dynamic gateway• Required UMTS connections • Highway scenario• Max = 22 connections
• Both setups converge with increasing number of cars
10 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Conclusion on Car-to-Car communication
• WLAN and latest cellular technologies can initiate car-to-car communication for consumer applications
• Dynamic Gateway approach more efficient than Geocast approach
• Both approaches are stable when number of cars increase
11 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Cognitive Radio
12 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Definition
• Cognitive Radio approach intends to efficiently utilize the limited wireless spectrum
• Cognitive Radio scans the spectrum and decides which spectrum is the best applicable for the corresponding wireless technology
• Cognitive Radio approach is based on three cognitive tasks• Radio-scene analysis (receiver)• Channel-state estimation and predictive modeling (receiver)• Transmit-power control and spectrum management (transmitter)
13 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Cognitive Cycle [source: Simon Haykin]
• Transmitter and receiver interact
• -> Real-time feedback channel
14 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Motivation
15 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Requirement when combining Cognitive and Software Radio• Cognitive Radio for spectrum efficiency
• analyzing user application• definition of wireless requirements• spectrum scanning• definition of radio characteristics
• Software Radio • adjusts transmitter and receiver algorithms• transforms algorithms to an applicable architecture • maps the architecture on available processor platform• balances between different, parallel operating radios
• To achieve efficient receiver implementations Software Radio requires • strong flexibility in terms of
• algorithm complexity• power consumption
• support from Cognitive Radio
16 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Cognitive Radio Enhancement
17 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Channel-State Estimation• Channel-State Estimate to judge about channel capacity
• Semi-blind training • Supervised training mode via short training sequence • Tracking via data feedback
• Rate feedback to transmitter to setup• data rate• transmit-power control
18 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Transmit Power Control
• Power initialization
• Inner Loop• Allocation of a number of channels
• Outer Loop investigates achieved data rate• exceeding• matching• undershooting
• Outer Loop adjusts the transmit power of each transmitter• All transmitters run from data-rate perspective with optimal transmit power• What is about the receiver complexity?
19 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Cognitive Radio Enhancement
• Each receiver includes an option to ask for low receiver complexity• Transmit-power increase• High quality channel selection
• Transmit-power increase• Other transmitters reduce power• Other receivers increase complexity
• High quality channel selection• Find a better fitting free channel• Exchange already allocated channels
20 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Receiver Algorithms with different Complexities
• Different receiver complexities based on channel-state estimation• Receiver complexities can change at any time
21 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Multicore Processing for Software Radio Architecture
22 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Motivation• Number of wireless technologies rises quickly
GSM GPRSBT
time
Inte
grat
ion
Com
plex
ity
AM/FM
3GCDMA
WLANDVB-HGPS
WIMAX
EVDO
3.9G
3.5GUWB
NextG1GBit
DRM
DMB-T
• Phone form factors limit the number of dedicated chip sets• Different mobile device use cases require different technology combinations • Implementation costs are high if dedicated digital baseband chips are used
23 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
How to reduce number of dedicated digital baseband chips?
• Starting position
GSM BT GPS WLAN DVB-H
Radio controllerPHY BB
Upper Radio Layers
• Solution
• Idea – replace digital dedicated baseband chip sets by multi-processor platform
GSM BT GPS WLAN DVB-H
Radio controllerPHY BB
Upper Radio Layers
Multi-Processor-Baseband-Engine
24 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Channel Estimation Processing
25 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Processor Load as Measure for Algorithm Complexity
• OFDM PHY layer algorithms running on a single DSP (here Digital Radio Mondiale)
• Decoding and channel estimation require most processor load
• Future software radio implementations require more scalability of complex algorithms than today available
• Research question: how to scale radio algorithms from processor load perspective?
26 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Channel Estimation via Wiener filter
• The channel transfer function Ĥ can be interpolated from pilot carriers using the Wiener filter equation
• Performance can be improved if filter coefficients are computed online (no pre-computation)
• In this case matrix inversion is the most time consuming task
27 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Processing for Channel Estimation
Filtering in frequency direction
Filtering in time direction
Read in new symbol
Channel equalization
Time synchronization tracking
Symbols
Proc
essi
ng T
ime
[ms]
28 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Processing in Frequency Direction
• Coeff. update requires most processor load.
• Only 15% for interpolation.
Interpolation infrequency direction 15%
Wiener coefficientcalculation 49%
Calculationfrequency-
correlation 29%Store coefficientsin matrix 7%
29 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Channel Estimation Optimization
30 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Dynamic channel observation
00,5
11,5
22,5
33,5
44,5
5
0 10 20 30 40 50 60 70 80
OFDM symbols
Time
[ms]
taumax taumaxoff Coefficient update required
Changing Channel Properties
• τmax = max. excess delay.
• τmaxoff = impulse offset.
31 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Reducing Processor Load
Static and dynamic processing time
10,5
24,2
2
15,8
0
5
10
15
20
25
30
Coefficient calculation only Complete channel estimation
Time
[ms]
Every OFDM symbolDynamic update
32 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Conclusion – Channel Estimation
• Channel estimation based on run-time coefficient calculation is complex
• Channel properties need to be monitored
• Dynamic coefficient update leads to significant SW radio power reduction
33 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Mapping traditional Algorithms on Multi-Processor Platforms
34 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Matrix Inversion• Toeplitz matrix inversion is a key mathematic operation for online Wiener filter
coefficient calculation to enable high performance channel estimation
• Evaluation is done in ARM Realview EB Rev. B MPCore platform• CT11MPCore • 4 x ARM11 CPU with 200MHz core frequency• L1 cache with 32kB memory for data, 32kB memory for instructions• unified L2 cache 1MB running at core frequency 200MHz (shared memory)• L220 cache controller• external bus frequency 20MHz• no floating point co-processors used
• Matrix inversions are based on C++ code with 64-Bit floating point arithmetic• Integer modeling / word-length limits will decrease relative speed-up because
• times between synchronization points will decrease• relative increase of synchronization overhead
35 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Single-CPU versus Multi-Cores• ARM processors scale execution time to the expected operation count• Cholesky, LDT, QR are separated into 4 cores, ST includes only 2 cores• Only inversion of large matrices (> 100x100) is improved a little bit from speed perspective
36 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Speed up through Multi-Threading• Speed up from -360% up to only +10.7% whereas 400% theoretically possible• Algorithms can slow down when matrix calculation is faster than synchronization• Most complex algorithms QR and Cholesky most significantly speed up
37 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Analyzing Matrix Inversion Results
• The longer math calculation versus thread synchronization time the better speed up
• For large matrices, calculation starts to be longer than synchronization time• But speed up is far from factor 2 or 4 which could be theoretically possible• Data interdependency inside the algorithms are high
To utilize multi-processor platform processing power
a significant change of data dependencies is required
38 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Introducing scalability to software radio algorithms
39 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Receiver scaling through multi-processor platforms
• Pure hardware-optimized design can be replaced by multi-processor platform• Several radios and their algorithms run in parallel• The same radio functions need to optimally fit on different processor types
A change of mathematics fitting to different processor types is required
40 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Example of parallel radio algorithms – channel decoding
• Viterbi• high signal processing performance• optimal for hardware implementation• sub-optimal for software radio approach• difficult to parallelize
• Recurrent Neural Networks• do not outperform Viterbi signal processing performance• similar mathematics as adaptive filters• easy to parallelize several networks
22)(
22)(
)(~)()(min)(min nynnnenn
�� wXww
2
2
22
22 minminmin aGrcre
acc
T����
Least-Squares - Adaptive Filter Recurrent Neural Network
41 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
N-parallel channel decoders
• Run several networks in parallel
• The more networks, the higher the channel decoding performance
• Each network • operates independently of all others• works on the same data set
• Research topic – optimize complexity of each channel decoder network
Scalability introduces flexibility for the multi-processor platform processor load
42 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Simulation results for recurrent neural networks
• Number of Recurrent Neural Networks can be adjusted to channel quality• Optimal design approach for multi-processor platforms
Direct Decision
Viterbi RNN 6
RNN 4
RNN 2
43 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Acknowledgment
The presented work was partly carried out within the German funded BMBF-project
3GET - 3G Evolving TechnologiesNo. 01 – BU - 356
The authors have the responsibility for the content of this presentation
44 © 2008 Nokia Vers1.0.ppt / 20. june 2008/ Edmund Coersmeier
Conclusion – Multicore Processing
• Parallel processor platform should be able to replace optimized hardware
• To utilize multi-processor platform processing power a significant change in algorithm data dependencies is required
• Software Radio needs to handle several radios in parallel
• A change of mathematics fitting to different processor types is required
• Radio functions need to be scalable to balance• required system performance • multi-processor platform load