
  • SIGNAL PROCESSING

    B14 Option – 4 lectures

    Stephen Roberts

  • Recommended texts

    • Lynn. An introduction to the analysis and processing of signals. Macmillan.

    • Oppenheim & Schafer. Digital Signal Processing. Prentice Hall.

    • Orfanidis. Introduction to Signal Processing. Prentice Hall.

    • Proakis & Manolakis. Digital Signal Processing: Principles, Algorithms and Applications. Prentice Hall.

    • Matlab Signal processing toolbox manual.


  • Lecture 1 - Introduction

    1.1 Introduction

    Signal processing is the treatment of signals (information-bearing waveforms or data) so as to extract the wanted information, removing unwanted signals and noise. There are compelling applications as diverse as the analysis of seismic waveforms or the recording of moving rotor blade pressures and heat transfer rates. In these lectures we will consider just the very basics, mainly using filtering as a case example for signal processing.


  • 1.1.1 Analogue vs. Digital Signal Processing

    Most of the signal processing techniques mentioned above could be used to process the original analogue (continuous-time) signals or their digital versions (the signals are sampled in order to convert them to sequences of numbers). For example, the earliest type of “voice coder” developed was the channel vocoder, which consists mainly of a bank of band-pass filters. The trend, however, is towards digital signal processing systems; even the well-established radio receiver has come under threat. The other great advantage of digital signal processing lies in the ease with which non-linear processing may be performed. Almost all recent developments in modern signal processing are in the digital domain.

    This lecture course will start by covering the basics of the design of simple (analogue) filters, as this is a pre-requisite to understanding much of the underlying theory of digital signal processing and filtering.


  • 1.2 Summary/Revision of basic definitions

    1.2.1 Linear Systems

    A linear system may be defined as one which obeys the Principle of Superposition. If x1(t) and x2(t) are inputs to a linear system which give rise to outputs y1(t) and y2(t) respectively, then the combined input ax1(t) + bx2(t) will give rise to an output ay1(t) + by2(t), where a and b are arbitrary constants.

    Notes

    • If we represent an input signal by some support in a frequency domain, F_in (i.e. the set of frequencies present in the input), then no new frequency support will be required to model the output, i.e.

      F_out ⊆ F_in

    • Linear systems can be broken down into simpler sub-systems which can be re-arranged in any order, i.e.

    x −→ g1 −→ g2 ≡ x −→ g2 −→ g1 ≡ x −→ g1,2
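    This interchangeability of linear sub-systems can be checked numerically. A minimal sketch in Python/NumPy (the two FIR impulse responses g1 and g2 are illustrative choices, not taken from the notes):

    ```python
    import numpy as np

    rng = np.random.default_rng(1)
    x = rng.standard_normal(32)          # arbitrary input signal
    g1 = np.array([1.0, -1.0])           # first-difference filter (illustrative)
    g2 = np.array([0.5, 0.5])            # two-point average (illustrative)

    # g1 then g2, g2 then g1, and the single combined system g_{1,2}
    y12 = np.convolve(np.convolve(x, g1), g2)
    y21 = np.convolve(np.convolve(x, g2), g1)
    y_combined = np.convolve(x, np.convolve(g1, g2))

    assert np.allclose(y12, y21)         # order of the sub-systems does not matter
    assert np.allclose(y12, y_combined)  # equivalent to the combined system
    ```

    The same rearrangement would fail for non-linear blocks (e.g. a squarer), which is precisely why superposition matters.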


  • 1.2.2 Time Invariance

    A time-invariant system is one whose properties do not vary with time (i.e. the input signals are treated the same way regardless of their time of arrival); for example, with discrete systems, if an input sequence x(n) produces an output sequence y(n), then the input sequence x(n − n0) will produce the output sequence y(n − n0) for all n0.

    1.2.3 Linear Time-Invariant (LTI) Systems

    Most of the lecture course will focus on the design and analysis of systems which are both linear and time-invariant. The basic linear time-invariant operation is “filtering” in its widest sense.

    1.2.4 Causality

    In a causal (or realisable) system, the present output signal depends only upon present and previous values of the input. (Although all practical engineering systems are necessarily causal, there are several important systems which are non-causal (non-realisable), e.g. the ideal digital differentiator.)


  • 1.2.5 Stability

    A stable system (over a finite interval T) is one which produces a bounded output in response to a bounded input (over T).

    1.3 Linear Processes

    Some of the common signal processing functions are amplification (or attenuation), mixing (the addition of two or more signal waveforms) or un-mixing1, and filtering. Each of these can be represented by a linear time-invariant “block” with an input-output characteristic which can be defined by:

    • The impulse response g(t) in the time domain.

    • The transfer function in a frequency domain. We will see that the choice of frequency basis may be subtly different from time to time.

    As we will see, there is (for the systems we examine in this course) an invertible mapping between the time and frequency domain representations.

    1 This linear unmixing turns out to be one of the most interesting current topics in signal processing.


  • 1.4 Time-Domain Analysis – convolution

    Convolution allows the evaluation of the output signal from an LTI system, given its impulse response and input signal. The input signal can be considered as being composed of a succession of impulse functions, each of which generates a weighted version of the impulse response at the output, as shown in Figure 1.1. The output at time t, y(t), is obtained simply by adding the effect of each separate impulse function – this gives rise to the convolution integral:

    y(t) = ∑_τ x(t − τ) g(τ) δτ  →  ∫_0^∞ x(t − τ) g(τ) dτ   (as δτ → 0)

    τ is a dummy variable which represents time measured “back into the past” from the instant t at which the output y(t) is to be calculated.


    Figure 1.1: Convolution as a summation over shifted impulse responses.


  • 1.4.1 Notes

    • Convolution is commutative. Thus y(t) is also given by:

      y(t) = ∫_0^∞ x(τ) g(t − τ) dτ

    • For discrete systems, convolution is a summation operation:

      y[n] = ∑_{k=0}^{∞} x[k] g[n − k] = ∑_{k=0}^{∞} x[n − k] g[k]

    • Relationship between convolution and correlation. The general form of the convolution integral

      f(t) = ∫_{−∞}^{∞} x(τ) g(t − τ) dτ

      is very similar2 to that of the cross-correlation function relating two variables x(t) and y(t):

      R_xy(τ) = ∫_{−∞}^{∞} x(t) · y(t − τ) dt

      Convolution is hence an integral over lags at a fixed time, whereas correlation is an integral over time at a fixed lag.

    2 Note that the lower limit of the integral can be −∞ or 0. Why?
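    The discrete convolution sum, its commutativity and its relationship to correlation can all be sketched numerically (the sequences are arbitrary illustrative values):

    ```python
    import numpy as np

    g = np.array([1/3, 1/3, 1/3])        # impulse response: 3-tap moving average (illustrative)
    x = np.array([1.0, 2.0, 3.0, 4.0])   # input sequence

    # Direct evaluation of y[n] = sum_k x[k] g[n - k]
    n_out = len(x) + len(g) - 1
    y_direct = np.zeros(n_out)
    for n in range(n_out):
        for k in range(len(x)):
            if 0 <= n - k < len(g):
                y_direct[n] += x[k] * g[n - k]

    # Library convolution agrees, and convolution is commutative
    assert np.allclose(y_direct, np.convolve(x, g))
    assert np.allclose(np.convolve(x, g), np.convolve(g, x))

    # Correlation is convolution with the second sequence time-reversed
    assert np.allclose(np.correlate(x, g, mode='full'), np.convolve(x, g[::-1]))
    ```

    The last assertion is the lag-reversal distinction made above: correlation slides one sequence past the other without flipping it.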


  • • Step response. The step function is the time integral of an impulse. As integration (and differentiation) are linear operations, the order of application in an LTI system does not matter:

      δ(t) −→ ∫ dt −→ g(t) −→ step response

      δ(t) −→ g(t) −→ ∫ dt −→ step response


  • 1.5 Frequency-Domain Analysis

    LTI systems, by definition, may be represented (in the continuous case) by linear differential equations and (in the discrete case) by linear difference equations. Consider the application of the linear differential operator, D, to the function f(t) = e^{st}:

    D f(t) = s f(t)

    An equation of this form means that f(t) is an eigenfunction of D. Just like the eigen-analysis you know from matrix theory, this means that f(t) and any linear operation on f(t) may be represented using a set of functions of exponential form, and that these functions may be chosen to be orthogonal. This naturally gives rise to the use of the Laplace and Fourier representations.


  • The Laplace transform:

    X(s) −→ Transfer function G(s) −→ Y(s)

    where

    X(s) = ∫_0^∞ x(t) e^{−st} dt   (Laplace transform of x(t))

    and

    Y(s) = G(s) X(s)

    where G(s) can be expressed in a pole-zero representation of the form:

    G(s) = A (s − z_1) . . . (s − z_m) / [ (s − p_1)(s − p_2) . . . (s − p_n) ]

    (NB: The inverse transformation, i.e. obtaining y(t) from Y(s), is not a straightforward mathematical operation.)


  • The Fourier transform:

    X(jω) −→ Frequency response G(jω) −→ Y(jω)

    where

    X(jω) = ∫_{−∞}^{∞} x(t) e^{−jωt} dt   (Fourier transform of x(t))

    and

    Y(jω) = G(jω) X(jω)

    The output time function can be obtained by taking the inverse Fourier transform:

    y(t) = (1/2π) ∫_{−∞}^{∞} Y(jω) e^{jωt} dω


  • 1.5.1 Relationship between time & frequency domains

    Theorem
    If g(t) is the impulse response of an LTI system, then G(jω), the Fourier transform of g(t), is the frequency response of the system.

    Proof
    Consider an input x(t) = A cos ωt to an LTI system. Let g(t) be the impulse response, with Fourier transform G(jω). Using convolution, the output y(t) is given by:

    y(t) = ∫_0^∞ A cos ω(t − τ) g(τ) dτ

         = (A/2) ∫_0^∞ e^{jω(t−τ)} g(τ) dτ + (A/2) ∫_0^∞ e^{−jω(t−τ)} g(τ) dτ

         = (A/2) e^{jωt} ∫_{−∞}^{∞} g(τ) e^{−jωτ} dτ + (A/2) e^{−jωt} ∫_{−∞}^{∞} g(τ) e^{jωτ} dτ

    (the lower limit of integration can be changed from 0 to −∞ since g(τ) = 0 for τ < 0)

         = (A/2) {e^{jωt} G(jω) + e^{−jωt} G(−jω)}

    Let G(jω) = C e^{jφ}, i.e. C = |G(jω)| and φ = arg{G(jω)}. Then

    y(t) = (AC/2) {e^{j(ωt+φ)} + e^{−j(ωt+φ)}} = CA cos(ωt + φ)

    i.e. an input sinusoid has its amplitude scaled by |G(jω)| and its phase changed by arg{G(jω)}, where G(jω) is the Fourier transform of the impulse response g(t).
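    This result can be checked by simulating a sinusoid through a simple LTI system. The sketch below assumes SciPy and a hypothetical first-order low-pass G(s) = 1/(s + 1), for which |G(jω)| = 1/√(1 + ω²):

    ```python
    import numpy as np
    from scipy import signal

    sys = signal.TransferFunction([1.0], [1.0, 1.0])   # G(s) = 1/(s + 1), illustrative
    w = 2.0                                            # input frequency; amplitude A = 1
    t = np.linspace(0, 40, 8001)
    x = np.cos(w * t)

    _, y, _ = signal.lsim(sys, U=x, T=t)               # simulate the LTI system

    # Once the transient has decayed, the output amplitude equals |G(jw)|
    steady = y[t > 20]
    gain_measured = (steady.max() - steady.min()) / 2
    gain_theory = 1 / np.sqrt(1 + w ** 2)
    assert abs(gain_measured - gain_theory) < 1e-2
    ```

    Measuring the phase shift of the steady-state output would similarly recover arg{G(jω)}.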


  • Theorem
    Convolution in the time domain is equivalent to multiplication in the frequency domain, i.e.

    y(t) = g(t) ∗ x(t)  ⇔  Y(jω) = G(jω) X(jω)

    and

    y(t) = g(t) ∗ x(t)  ⇔  Y(s) = G(s) X(s)

    Proof
    Consider the general integral (Laplace) transform of a shifted function:

    L{f(t − τ)} = ∫_t f(t − τ) e^{−st} dt = e^{−sτ} L{f(t)}

    Now consider the Laplace transform of the convolution integral:

    L{f(t) ∗ g(t)} = ∫_t [ ∫_τ f(t − τ) g(τ) dτ ] e^{−st} dt

                   = ∫_τ g(τ) e^{−sτ} dτ · L{f(t)}

                   = L{g(t)} L{f(t)}

    By allowing s → jω we prove the result for the Fourier transform as well.
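    The theorem is easy to verify numerically with the discrete Fourier transform, provided the sequences are zero-padded so that circular convolution matches linear convolution (the random signals are illustrative):

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.standard_normal(64)
    g = rng.standard_normal(16)

    N = len(x) + len(g) - 1                    # zero-pad to the full linear-convolution length
    Y = np.fft.fft(x, N) * np.fft.fft(g, N)    # multiplication in the frequency domain
    y_freq = np.fft.ifft(Y).real

    y_time = np.convolve(x, g)                 # convolution in the time domain
    assert np.allclose(y_freq, y_time)
    ```

    Without the zero-padding the two results would differ, because the DFT implicitly assumes periodic signals.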


  • 1.6 Filters

    Low-pass: to extract a short-term average or to eliminate high-frequency fluctuations (e.g. noise filtering, demodulation, etc.)

    High-pass: to follow small-amplitude high-frequency perturbations in the presence of a much larger slowly-varying component (e.g. recording the electrocardiogram in the presence of a strong breathing signal)

    Band-pass: to select a required modulated carrier frequency out of many (e.g. radio)

    Band-stop: to eliminate single-frequency interference (e.g. mains); also known as notch filtering


    Figure 1.2: Standard filters – |G(ω)| against ω for low-pass, high-pass, band-pass and notch responses.


  • 1.7 Design of Analogue Filters

    We will start with an analysis of analogue low-pass filters, since a low-pass filter can be mathematically transformed into any other standard type. Design of a filter may start from consideration of:

    • The desired frequency response.

    • The desired phase response.

    The majority of the time we will consider the first case. Consider some desired response, in the general form of the (squared) magnitude of the transfer function, i.e. |G(s)|². This response is given as

    |G(s)|² = G(s) G∗(s)

    where ∗ denotes complex conjugation. If G(s) represents a stable filter (its poles are in the LHS of the s-plane) then G∗(s) is unstable (as its poles will be in the RHS). The design procedure then consists of:

    • Considering some desired response |G(s)|² as a polynomial in even powers of s.


  • • Designing the filter using the stable part of G(s)G∗(s).

    This means that, for any given filter response in the positive frequency domain, a mirror image exists in the negative frequency domain.

    1.7.1 Ideal low-pass filter

    Figure 1.3: The ideal low-pass filter |G(ω)|, with unit gain for |ω| < ωc. Note the requirement of response in the negative frequency domain.

    Any frequency-selective filter may be described either by its frequency response (more common) or by its impulse response. The narrower the band of frequencies transmitted by a filter, the more extended in time is its impulse response waveform. Indeed, if the support in the frequency domain is decreased by a factor of a (i.e. made narrower) then the required support in the time domain is increased by a factor of a (you should be able to prove this).

    Consider an ideal low-pass filter with a “brick wall” amplitude cut-off and no phase shift, as shown in Fig. 1.3. Calculate the impulse response as the inverse Fourier transform of the frequency response:

    g(t) = (1/2π) ∫_{−∞}^{∞} G(jω) e^{jωt} dω = (1/2π) ∫_{−ωc}^{ωc} 1 · e^{jωt} dω = (1/2πjt)(e^{jωc t} − e^{−jωc t})

    hence,

    g(t) = (ωc/π) (sin ωc t / ωc t)

    Figure 1.4 shows the impulse response for the filter (this is also referred to as the filter kernel). The output starts infinitely long before the impulse occurs – i.e. the filter is not realisable in real time.


    Figure 1.4: Impulse response g(t) (filter kernel) for the ideal low-pass filter. The zero crossings occur at integer multiples of π/ωc.
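    A quick numerical check of the kernel (ωc = 2 is an illustrative choice; NumPy's `sinc` is the normalised sin(πx)/(πx)):

    ```python
    import numpy as np

    wc = 2.0                                             # illustrative cut-off frequency

    def g(t):
        """Ideal LPF kernel g(t) = (wc/pi) * sin(wc t)/(wc t)."""
        return (wc / np.pi) * np.sinc(wc * t / np.pi)    # np.sinc(x) = sin(pi x)/(pi x)

    # Peak value wc/pi at t = 0; zero crossings at integer multiples of pi/wc
    assert np.isclose(g(0.0), wc / np.pi)
    k = np.arange(1, 6)
    assert np.allclose(g(k * np.pi / wc), 0.0, atol=1e-12)

    # Non-causal: the kernel is clearly non-zero for t < 0
    assert abs(g(-np.pi / (2 * wc))) > 0.1
    ```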

    A delay of time T such that

    g(t) = (ωc/π) · (sin ωc(t − T) / ωc(t − T))

    would ensure that most of the response occurred after the input (for large T). The use of such a delay, however, introduces a phase lag proportional to frequency, since arg{G(jω)} = −ωT. Even then, the filter is still not exactly realisable; instead, the design of analogue filters involves the choice of the most suitable approximation to the ideal frequency response.


  • 1.8 Practical Low-Pass Filters

    Assume that the low-pass filter transfer function G(s) is a rational function in s. The type of filter to be considered in the next few pages is the all-pole design, which means that G(s) will be of the form:

    G(s) = 1 / (a_n s^n + a_{n−1} s^{n−1} + . . . + a_1 s + a_0)

    or

    G(jω) = 1 / (a_n (jω)^n + a_{n−1} (jω)^{n−1} + . . . + a_1 jω + a_0)

    The magnitude-squared response is |G(jω)|² = G(jω) · G(−jω). The denominator of |G(jω)|² is hence a polynomial in even powers of ω. Hence the task of approximating the ideal magnitude-squared characteristic is that of choosing a suitable denominator polynomial in ω², i.e. selecting the function H in the following expression:

    |G(jω)|² = 1 / (1 + H{(ω/ωc)²})

    where ωc is the nominal cut-off frequency and H is a rational function of (ω/ωc)².


    The choice of H is determined by functions such that 1 + H{(ω/ωc)²} is close to unity for ω < ωc and rises rapidly after that.

    1.9 Butterworth Filters

    H{(ω/ωc)²} = {(ω/ωc)²}^n = (ω/ωc)^{2n}

    i.e.

    |G(jω)|² = 1 / (1 + (ω/ωc)^{2n})

    where n is the order of the filter. Figure 1.5 shows the response on linear (a) and log (b) scales for various orders n.


    Figure 1.5: Butterworth filter response |G(ω)| for n = 1, 2, 6 on (a) linear and (b) log scales. On a log-log scale the response, for ω > ωc, falls off at approximately −20n dB/decade.
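    The magnitude response above is simple to evaluate directly; a short sketch (ωc normalised to 1):

    ```python
    import numpy as np

    def butter_mag(w, wc, n):
        """|G(jw)| = 1/sqrt(1 + (w/wc)**(2n)) for an n-th order Butterworth LPF."""
        return 1.0 / np.sqrt(1.0 + (w / wc) ** (2 * n))

    wc = 1.0
    # 3 dB down (1/sqrt(2)) at the cut-off frequency, whatever the order
    for n in (1, 2, 6):
        assert np.isclose(butter_mag(wc, wc, n), 1 / np.sqrt(2))

    # Higher order gives a steeper roll-off beyond cut-off
    assert butter_mag(2.0, wc, 6) < butter_mag(2.0, wc, 2) < butter_mag(2.0, wc, 1)
    ```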


  • 1.9.1 Butterworth filter – notes

    1. |G| = 1/√2 for ω = ωc (i.e. the magnitude response is 3 dB down at the cut-off frequency).

    2. For large n:

       • in the region ω < ωc, |G(jω)| ≈ 1
       • in the region ω > ωc, the steepness of |G| is a direct function of n.

    3. The response is known as maximally flat, because

       d^k G / dω^k |_{ω=0} = 0   for k = 1, 2, . . . , 2n − 1

    Proof
    Express |G(jω)| in a binomial expansion:

    |G(jω)| = {1 + (ω/ωc)^{2n}}^{−1/2} = 1 − (1/2)(ω/ωc)^{2n} + (3/8)(ω/ωc)^{4n} − (5/16)(ω/ωc)^{6n} + . . .

    It is then easy to show that the first 2n − 1 derivatives are all zero at the origin.


  • 1.9.2 Transfer function of Butterworth low-pass filter

    |G(jω)| = √(G(jω) G(−jω))

    Since G(jω) is derived from G(s) using the substitution s → jω, the reverse operation can also be done, i.e. ω → −js:

    G(s) G(−s) = 1 / (1 + (−js/ωc)^{2n})

    Thus the poles of 1/(1 + (−js/ωc)^{2n}) belong either to G(s) or to G(−s). The poles are given by:

    (−js/ωc)^{2n} = −1 = e^{j(2k+1)π},   k = 0, 1, 2, . . . , 2n − 1

    Thus

    −js/ωc = e^{j(2k+1)π/2n}

    Since j = e^{jπ/2}, we have the final result:

    s = ωc exp j[π/2 + (2k + 1)π/2n]

    i.e. the poles all have the same modulus ωc and equi-spaced arguments. For example, for a fifth-order Butterworth low-pass filter (LPF), n = 5:

    π/2n = 18° → π/2 + (2k + 1)π/2n = 90° + (18°, 54°, 90°, 126°, etc. . . .)

    i.e. the poles are at:

    108°, 144°, 180°, 216°, 252°   (in the L.H. s-plane, therefore stable)
    288°, 324°, 360°, 396°, 432°   (in the R.H. s-plane, therefore unstable)

    We want to design a stable filter. Since each unstable pole is (−1) × a stable pole, we can let the stable ones be in G(s) and the unstable ones in G(−s). Therefore the poles of G(s) are ωc e^{j108°}, ωc e^{j144°}, ωc e^{j180°}, ωc e^{j216°}, ωc e^{j252°}, as shown in Figure 1.6.


    Figure 1.6: Stable poles of the 5th-order Butterworth filter.
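    The pole positions derived above can be cross-checked against SciPy's analogue Butterworth design routine (ωc = 1 rad/s, n = 5):

    ```python
    import numpy as np
    from scipy import signal

    wc = 1.0
    _, p, _ = signal.butter(5, wc, analog=True, output='zpk')  # analogue prototype poles

    # All five poles lie on a circle of radius wc ...
    assert np.allclose(np.abs(p), wc)

    # ... equi-spaced in the left half-plane at 108, 144, 180, 216, 252 degrees
    angles = np.sort(np.degrees(np.angle(p)) % 360)
    assert np.allclose(angles, [108, 144, 180, 216, 252])
    ```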


  • G(s) = ∏_k 1/(1 − s/p_k)
         = 1 / [ (1 + s/ωc) {1 + 2 cos 72° (s/ωc) + (s/ωc)²} {1 + 2 cos 36° (s/ωc) + (s/ωc)²} ]

    G(s) = 1 / [ 1 + 3.2361(s/ωc) + 5.2361(s/ωc)² + 5.2361(s/ωc)³ + 3.2361(s/ωc)⁴ + (s/ωc)⁵ ]

    Note that the coefficients are “palindromic” (they read the same in reverse order) – this is true for all Butterworth filters. The poles always lie on the same radius, at π/n angular spacing, with “half-angles” at each end. If n is odd, one pole is real.

    n    a1       a2       a3       a4       a5       a6       a7       a8
    1    1.0000
    2    1.4142   1.0000
    3    2.0000   2.0000   1.0000
    4    2.6131   3.4142   2.6131   1.0000
    5    3.2361   5.2361   5.2361   3.2361   1.0000
    6    3.8637   7.4641   9.1416   7.4641   3.8637   1.0000
    7    4.4940   10.0978  14.5918  14.5918  10.0978  4.4940   1.0000
    8    5.1258   13.1371  21.8462  25.6884  21.8462  13.1371  5.1258   1.0000

    Butterworth LPF coefficients a_k for n ≤ 8
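    The n = 5 row of the table (and the palindromic property) can be reproduced by forming the monic polynomial from the stable poles:

    ```python
    import numpy as np
    from scipy import signal

    _, p, _ = signal.butter(5, 1.0, analog=True, output='zpk')
    coeffs = np.real(np.poly(p))    # denominator polynomial built from its roots

    # Palindromic coefficients: 1, 3.2361, 5.2361, 5.2361, 3.2361, 1 (to 4 d.p.)
    assert np.allclose(coeffs, coeffs[::-1])
    assert np.allclose(coeffs, [1, 3.2361, 5.2361, 5.2361, 3.2361, 1], atol=1e-4)
    ```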


  • 1.9.3 Design of Butterworth LPFs

    The only design variable for Butterworth LPFs is the order of the filter, n. If a filter is to have an attenuation A at a frequency ωa:

    |G|²(ωa) = 1/A² = 1 / (1 + (ωa/ωc)^{2n})

    i.e.

    n = log(A² − 1) / (2 log(ωa/ωc))

    or, since usually A ≫ 1,

    n ≈ log A / log(ωa/ωc)

    Butterworth design – example

    Design a Butterworth LPF with at least 40 dB attenuation at a frequency of 1 kHz and 3 dB attenuation at fc = 500 Hz.

    Answer
    40 dB → A = 100; ωa = 2000π rad/s and ωc = 1000π rad/s. Therefore

    n ≈ log₁₀ 100 / log₁₀ 2 = 2 / 0.301 = 6.64

    Hence n = 7 meets the specification.

    Check: substitute s/ωc = j2 into the transfer function from the above table for n = 7:

    |G(j2)| = 1 / |87.38 − 93.54j|

    which gives A = 128.
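    The worked example can be reproduced end-to-end with SciPy (`signal.butter` for the design, `signal.freqs` to evaluate the analogue response at ωa):

    ```python
    import numpy as np
    from scipy import signal

    # Spec: 3 dB at fc = 500 Hz, at least 40 dB (A = 100) at 1 kHz
    wc, wa, A = 2 * np.pi * 500, 2 * np.pi * 1000, 100.0

    # Exact order formula n = log(A^2 - 1) / (2 log(wa/wc)), rounded up
    n = int(np.ceil(np.log10(A ** 2 - 1) / (2 * np.log10(wa / wc))))
    assert n == 7

    # Attenuation actually achieved by the 7th-order filter at wa
    b, a = signal.butter(n, wc, analog=True)
    _, h = signal.freqs(b, a, worN=[wa])
    atten = 1 / np.abs(h[0])
    assert atten > A                # specification met, with margin
    assert round(atten) == 128      # matches the check in the notes
    ```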
