speech material

4
n i t y a @ e e . i i t b . a c . i n E E D e p t . , I I T B o m b a y Compressi on Ratio Single-band Compression Multiband Compressi on Sliding-band Compression 5 10 20 30 Comparison of dynamic range compression using single- band, multiband, and sliding-band compression schemes Speech material “you will mark ut please” concatenated with scaling factors of 0.1, 0.8, 0.1, 0.4, 0.1 Processing Single-band, multiband, and sliding-band dynamic range compression using Win. len. = 25.6 ms, FFT len. = 512 Results from offline processing

Upload: muniya

Post on 15-Feb-2016

33 views

Category:

Documents


0 download

DESCRIPTION

Results from offline processing. Comparison of dynamic range compression using single-band, multiband, and sliding-band compression schemes. Speech material “you will mark ut please” concatenated with scaling factors of 0.1, 0.8, 0.1, 0.4, 0.1 Processing - PowerPoint PPT Presentation

TRANSCRIPT

Slide 1

Compression RatioSingle-band CompressionMultiband CompressionSliding-band Compression5102030Comparison of dynamic range compression using single-band, multiband, and sliding-band compression schemesSpeech materialyou will mark ut please concatenated with scaling factors of 0.1, 0.8, 0.1, 0.4, 0.1ProcessingSingle-band, multiband, and sliding-band dynamic range compression using Win. len. = 25.6 ms, FFT len. = 512

Results from offline [email protected] Dept., IIT Bombay

Time (s)

Distortions during spectral transitions: Example of swept sinusoidal input. Sliding band compression outputMultiband compression (18 auditory critical bands) outputSingle-band compression outputInput: constant amplitude, 125 250 Hz linearly swept frequency, 200 ms sweep durationCR = 30, Ta = 6.4 ms, Tr = 192 ms. [email protected] Dept., IIT BombayExample: "you will mark ut please" concatenated with scaling factors for variation in the input level. CR = 2, Ta = 6.4 ms, Tr = 6.4 & 192 ms.

Input waveform Scaling factor Unprocessed waveform Processed Tr = 6.4 ms, low Pmc Processed Tr = 192 ms, low Pmc Processed Tr = 6.4 ms, high Pmc Processed Tr = 192 ms, high Pmc

Time (s)

Processing of different speech materials with varying levels: No audible roughness or distortion during informal [email protected] Dept., IIT BombayResults from real-time processingInformal listening: real-time output perceptually similar to the offline outputPESQ for real-time w.r.t. offline : 3.5Signal delay = 36 msUse of processing capacity: 41% (lowest proc. clock for satisfactory operation = 50 MHz, max. clock = 120 MHz)

Unprocessed waveform Offline processed waveformReal-time processed waveform

Example: "you will mark ut please" concatenated with scaling factors for variation in the input level. CR = 2, Ta = 6.4 ms, Tr = 192 ms, low Pmc.Time (s)[email protected] Dept., IIT Bombay