Acoustic impulse response measurement using speech and music signals

John Usher

Barcelona Media – Innovation Centre | Av. Diagonal, 177, planta 9, 08018 Barcelona

John Usher -- In-situ RIR measurement using music and speech

Using adaptive filters to estimate acoustic IRs

In-situ acquisition of electro-acoustic IR, with audience.

Continuous: fast enough for changing environment conditions.

Use speech and music signal radiated from loudspeaker.

AF for IR is nothing new! Used for:

Acoustic echo and feedback cancellation. Upmixing (2 → 5.1, 2 → 3D). ANC. Room EQ (using noise).

Basic principle:

[Block diagram labels: audio source (voice or music), LS, mic., adaptive filter (AF = h), AF update, error.]

The adaptive filter is updated to model the acoustic IR so that the error signal level (power) is minimized.
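The principle can be written compactly as follows. This is a sketch in standard adaptive-filter notation; the symbols d(n) for the microphone signal, y(n) for the filter output and L for the filter length are assumptions, not taken from the slides:

```latex
y(n) = \sum_{k=0}^{L-1} h_k(n)\, x(n-k), \qquad
e(n) = d(n) - y(n), \qquad
\mathbf{h}_{\mathrm{opt}} = \arg\min_{\mathbf{h}} \; \mathrm{E}\!\left[ e^{2}(n) \right]
```

Here x(n) is the signal sent to the loudspeaker and e(n) is the error signal whose power the filter update tries to minimize.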

Application for room EQ (filtered-x)

[Block diagram labels: audio source (voice or music), EQ filter, RIR estimation (h), TD and FD smoothing, Homo. Deco., filter inversion.]

Localizing objects in a room

Emit speech warning from loudspeaker in room.

Extract RIR using adaptive filter.

Detect reflection onset timing, e.g. using running kurtosis.
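One way the running-kurtosis idea could be sketched: compute the kurtosis of a sliding window over the estimated RIR and flag the samples at which it rises above a threshold. The window length, the threshold and both function names are illustrative assumptions, not values from the slides.

```python
def running_kurtosis(signal, window):
    """Kurtosis (fourth standardized moment) of every length-`window` segment."""
    result = []
    for start in range(len(signal) - window + 1):
        seg = signal[start:start + window]
        mean = sum(seg) / window
        var = sum((v - mean) ** 2 for v in seg) / window
        if var == 0.0:
            result.append(0.0)  # flat segment: define kurtosis as 0
            continue
        m4 = sum((v - mean) ** 4 for v in seg) / window
        result.append(m4 / var ** 2)
    return result


def onset_indices(rir, window=32, threshold=6.0):
    """Sample indices at which the running kurtosis rises above `threshold`."""
    kurt = running_kurtosis(rir, window)
    onsets = []
    for i in range(1, len(kurt)):
        if kurt[i] >= threshold > kurt[i - 1]:
            # The window starting at i ends at sample i + window - 1,
            # which is the sample that has just entered the window.
            onsets.append(i + window - 1)
    return onsets


# Toy RIR: low-level alternating "noise" with two isolated reflections.
toy_rir = [0.01 if k % 2 == 0 else -0.01 for k in range(200)]
toy_rir[50] = 1.0
toy_rir[120] = 0.5
toy_onsets = onset_indices(toy_rir)
```

A sparse, impulsive event entering the window makes the distribution heavy-tailed, so the kurtosis jumps; diffuse, noise-like parts of the response keep it low. Only rising edges are reported, so an event already present in the first window is not flagged.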

Application for live sound: De-noising & spatial re-mixing

[Block diagram labels: clean audio signal (from the desk), audio source (voice or music), LS, adaptive filter (AF = h), AF update (NLMS), mic., error, room signal, audience signal (applause etc.).]

Filter update algorithm (NLMS):

[Block diagram labels: x(n), LS, mic., e(n), update.]
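The update equation itself is not preserved in this transcript. As a reference point, the textbook NLMS update can be sketched as below; the step size `mu`, the regularization constant `eps`, the name `num_taps` and the microphone-signal name `d` are assumptions, not the settings used in the talk.

```python
import random


def nlms(x, d, num_taps, mu=0.5, eps=1e-8):
    """Textbook NLMS: adapt an FIR filter h so that (h * x) approximates d.

    x: reference signal sent to the loudspeaker, x(n)
    d: microphone signal (assumed name)
    Returns (h, y, e): final filter, filter output and error signal e(n).
    """
    h = [0.0] * num_taps
    buf = [0.0] * num_taps  # most recent input samples, newest first
    y_out, e_out = [], []
    for x_n, d_n in zip(x, d):
        buf = [x_n] + buf[:-1]
        y_n = sum(h_k * b_k for h_k, b_k in zip(h, buf))
        e_n = d_n - y_n
        power = sum(b * b for b in buf) + eps  # input power normalization
        step = mu * e_n / power
        h = [h_k + step * b_k for h_k, b_k in zip(h, buf)]
        y_out.append(y_n)
        e_out.append(e_n)
    return h, y_out, e_out


# Toy check: recover a short, made-up "room" response from white noise.
rng = random.Random(0)
true_h = [0.8, -0.4, 0.2, 0.1]
x_sig = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
d_sig = [sum(true_h[k] * x_sig[n - k] for k in range(len(true_h)) if n - k >= 0)
         for n in range(len(x_sig))]
h_est, y_sig, e_sig = nlms(x_sig, d_sig, num_taps=4)
```

The function returns both the filter output and the error signal, which are the two signals the live-sound application relies on for separation.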

Small-room experiment set-up:

[Figure labels: audio source (voice or music); noise signal (white noise or babble); Lav. 1, Lav. 2.]

A. Source is loudspeaker reproducing noise, speech or music. Multichannel noise from loudspeakers.

B. Source is live spoken voice. Predict IR between two lav. mics.

Results

Error criterion:

1) Start with reference RIR (measured using swept-sine technique).

2) Allow adaptive filter to converge for 10 seconds to get AF spectra.

3) Calculate misalignment: mean of difference between the ref. and AF spectra (80 Hz to 12 kHz).
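The exact misalignment formula is not preserved in this transcript. One plausible reading, shown here purely as an assumption, is the mean absolute difference in dB between the two magnitude spectra over the 80 Hz to 12 kHz band. The function names, the FFT size and the naive DFT are illustrative choices.

```python
import cmath
import math


def magnitude_db(h, n_fft, fs, f_lo, f_hi):
    """Magnitude spectrum of h in dB for DFT bins between f_lo and f_hi (Hz)."""
    out = []
    for k in range(n_fft // 2 + 1):
        freq = k * fs / n_fft
        if f_lo <= freq <= f_hi:
            val = sum(h_n * cmath.exp(-2j * math.pi * k * n / n_fft)
                      for n, h_n in enumerate(h))
            out.append(20.0 * math.log10(max(abs(val), 1e-12)))
    return out


def misalignment_db(h_ref, h_af, fs, n_fft=256, f_lo=80.0, f_hi=12000.0):
    """Mean absolute dB difference between reference and AF spectra (assumed metric)."""
    ref = magnitude_db(h_ref, n_fft, fs, f_lo, f_hi)
    est = magnitude_db(h_af, n_fft, fs, f_lo, f_hi)
    return sum(abs(r - e) for r, e in zip(ref, est)) / len(ref)


# Toy usage: an estimate that is exactly twice the reference response.
h_reference = [1.0, 0.5, -0.25, 0.1]
h_estimate = [2.0 * v for v in h_reference]
error_db = misalignment_db(h_reference, h_estimate, fs=48000)
```

A naive DFT keeps the sketch dependency-free; for RIRs of realistic length an FFT routine would be the practical choice.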

Rate of Convergence

Comparison of filter spectra using noise, speech and music (high SNR):

Robustness to SNR (25, 12, 3 dB SNR): masker = noise.

Robustness to SNR: masker = babble.

Comparison with DCFFT:

Dual Channel FFT method:

Following AES reviewer recommendation, compared with commercial DCFFT system (“SMAART”).
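For orientation, a dual-channel FFT measurement is usually described as dividing the averaged cross-spectrum between the reference and measured signals by the averaged auto-spectrum of the reference. The sketch below shows that estimator with rectangular, non-overlapping frames and a naive DFT. It is not SMAART's implementation, and all names are hypothetical.

```python
import cmath
import math
import random


def dft(frame):
    """Naive discrete Fourier transform of one frame."""
    n = len(frame)
    return [sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(frame))
            for k in range(n)]


def dual_channel_fft(x, y, frame_len):
    """Transfer-function estimate: averaged cross-spectrum / averaged auto-spectrum."""
    sxx = [0.0] * frame_len
    sxy = [0j] * frame_len
    for start in range(0, len(x) - frame_len + 1, frame_len):
        spec_x = dft(x[start:start + frame_len])
        spec_y = dft(y[start:start + frame_len])
        for k in range(frame_len):
            sxx[k] += abs(spec_x[k]) ** 2
            sxy[k] += spec_x[k].conjugate() * spec_y[k]
    # Bins with no reference energy cannot be estimated; return 0 there.
    return [sxy[k] / sxx[k] if sxx[k] > 0.0 else 0j for k in range(frame_len)]


# Toy usage: the "system" is a pure gain of 0.5.
rng = random.Random(0)
ref_sig = [rng.uniform(-1.0, 1.0) for _ in range(128)]
meas_sig = [0.5 * v for v in ref_sig]
h_dcfft = dual_channel_fft(ref_sig, meas_sig, frame_len=16)
```

Unlike the adaptive filter, this estimator works frame by frame in the frequency domain, so its accuracy depends on the frame length relative to the length of the response being measured.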

Comparison of NLMS vs DCFFT:

Effectiveness of AF RIR acquisition method with long RIRs.

6 RIRs:

Obtained from Dirac fed into Altiverb.

(NB: No background noise simulated.)

Football stadium, Caen Cathedral, church, EMT plate, Filmorch. Stage Berlin, Castle.

RT60: 9.6-1.1 secs.

1.2, 2.3, 3.5, 6.0, 7.8, 9.6.

What happens if we just model the early part of the IR?

… Not much: most of the spectral detail is in the early part.

For longer IRs, the adaptive filter should be longer.

Rate of Convergence for different RTs. 340 ms window, 32 x overlap.

Conclusions:

RIR acquisition for small and large rooms:

Adaptive filter updated using NLMS and overlapped window.

Tested with RT60 = 0.5-10 secs.

Using music, speech and noise as excitation signals.

Less accurate using live voice and two mics.

Convergence in <3 sec. (<2 dB mean error).

Little change in performance with SNRs down to 0 dB.

Conclusions:

Music vs speech:

Music: AF matches RIR 60 Hz to 12 kHz.

Speech: AF matches RIR 100 Hz to 8 kHz.

No considerable improvement for filter sizes >340 ms, i.e. we only need to model the first 1/8th of the RIR to have a good approximation of the spectrum.

Adaptive whitening algorithm (LPC residuals) can speed up convergence for highly coloured signals, but only in low SNRs.
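As an illustration of whitening with LPC residuals, the sketch below fits a linear predictor to a signal with the autocorrelation method and Levinson-Durbin recursion, then outputs the prediction error, which is spectrally flatter than the input. This is a block-based (non-adaptive) simplification, and the function names and test signal are assumptions; the adaptive variant used in the talk is not described in the slides.

```python
import random


def lpc_coefficients(signal, order):
    """Prediction-error filter [1, a1, ..., a_order] via Levinson-Durbin."""
    n = len(signal)
    r = [sum(signal[i] * signal[i + lag] for i in range(n - lag))
         for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or perfectly predictable signal
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a


def lpc_residual(signal, a):
    """Prediction error e(n) = sum_k a[k] * s(n - k), with a[0] = 1."""
    return [sum(a[k] * signal[n - k] for k in range(len(a)) if n - k >= 0)
            for n in range(len(signal))]


# Toy usage: a strongly coloured first-order autoregressive signal.
rng = random.Random(0)
coloured = [0.0]
for _ in range(1999):
    coloured.append(0.9 * coloured[-1] + rng.uniform(-1.0, 1.0))
coeffs = lpc_coefficients(coloured, order=1)
whitened = lpc_residual(coloured, coeffs)
```

In a whitening scheme of this kind, the same prediction-error filter would typically be applied to both the loudspeaker and microphone signals before the filter update, so that the adaptive filter sees a flatter excitation spectrum.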

Applications:

· In-situ continuous room EQ using filtered-x approach.

· Object localization using speech message (e.g. using running kurtosis).

· Re-mixing live music: ambient sound separation using filter output and error signal (e.g. get clean signal + room ambiance + audience applause).

Cheers!

John Usher
