real-time recognition of whale calls using soundid neil j boucher, soundid. australia michihiro...

21
Real-time Recognition of Whale Calls using SoundID Neil J Boucher, SoundID. Australia Michihiro Jinnai, Nagoya Women’s University. Japan

Upload: rebecca-beswick

Post on 01-Apr-2015

221 views

Category:

Documents


4 download

TRANSCRIPT

  • Slide 1

Slide 2 Real-time Recognition of Whale Calls using SoundID Neil J Boucher, SoundID. Australia Michihiro Jinnai, Nagoya Womens University. Japan Slide 3 SoundID www.soundid.net Founded 2002 Engineers specialising in sound recognition Aim to produce human expert and better quality recognition Designed to run in real-time Designed to handle terabytes of data with ease Have our own patented methods Who Are We? Slide 4 SoundID Mk I in glorious 1-d! Slide 5 1-d Recognition Results monitoring a Dawn Chorus over 70 minutes Slide 6 The 1-d method works well for most bird calls as just shown. Frogs and bats also easy. Whales are harder because the 1/f noise and other background noises smear the 1-d image. Worse still, most of the whale noises are in-band. Birds are Easy, Whales are Hard Slide 7 2-d Depiction of Same Call Slide 8 Two Methods Compared Slide 9 The Bottom of the Coke Bottle or FFT-like View of the Same Call at Frame-Width 513 Points Slide 10 Euclidean Distance is the simple straight line (first order) distance between two points. Geometric Distance (measured in angular degrees) is the angle between two vectors that we use to describe the difference in shape. It is a fourth order measure, vaguely related to the skew of a standard distribution. Geometric Distance Slide 11 As a measure of similarity we can use the GD between two n-dimensional shapes to measure their similarity. Two identical shapes have a GD = zero Two totally dissimilar shapes have a GD of 90 degrees. The spectra of two sounds that sound similar to the human ear can have a GD in the range 0 to about 6 degrees. The Measure of Geometric Distance Slide 12 Once we have a library of reference sounds we can compare that library with the detected target sound. Each sound in the library is compared in turn with the target and the one with the lowest GD is declared the closest match. Then, if the lowest GD is small enough to be declared a match (typically GD