audiofiles harika basana ([email protected] ), elizabeth chan ([email protected] ), nikolai...

AUDIOFILES Harika Basana ([email protected] ), Elizabeth Chan ([email protected] ), Nikolai Sinkov([email protected] ), Frank Zhang ([email protected] ) 6100 Main Street, Rice University, Houston, Texas 77005 GOAL To explore the MP3 technology and To explore the MP3 technology and to implement various audio data compression to implement various audio data compression algorithms. algorithms. Analyze This Audio compression is to compress an audio file into a smaller-sized file. People cannot differentiate between these two files by just hearing. Due to its smaller size, the new file can be easily transferred via the Internet. People try to find better audio compression algorithms that retain satisfying audio quality. Algorithms Average Energy Algorithm Zeroes out selected high and low frequencies of the audio file. Procedure Perform the Discrete Cosine Transform (DCT). Calculate the signal’s energy. Find the mean and the standard deviation of from the energy spectrum. Keep all frequencies with energies within 1 standard deviation (std) from the mean. Zero out frequencies with energies outside this range. Similarly, keep frequencies with energies within 2 and 3 stds from the mean. Perform the Inverse DCT and get the output. Results Amount of compression is insignificant. Algorithm would probably work better if the signal is very short, has monotonous tones, and has little noise. Ding.wav before compression Ding.wav with frequencies within 1 std from the mean Ding.wav with frequencies within 2 std from the mean Ding.wav with frequencies within 3 std from the mean Psycho Acoustic Algorithm Linear, tangent or arctangent quantization of the signal. Procedure Perform the Discrete Cosine Transform (DCT) Quantize the signal in one of the following ways : Diagram of the quantization “buckets” for the three methods Give certain frequency bands more bits (1000 – 5100 Hz and 12500 - 15200Hz). Throw away frequencies below 20Hz and above 20,000Hz. Perform the Inverse DCT. Results Compression is very significant. Quality is good for the amount of compression. Arctangent quantization yields the best quality. Original signal sampled at 44100Hz The x-axis DT sample and the y-axis is the amplitude After linear quantization After arctangent quantization After tangent quantization Masking Algorithm The presence of a signal at a particular f raise the perceptual threshold of signals the masking frequency. Procedure Go through every sample and remove the follow samples if they are below a certain thresho Results No significant improvement. Need a better implementing to get good results. Conclusion We didn’t create MP3 files. Used the underlying concepts. Produced much smaller files. Psycho Acoustic Algorithm is the best, in - amount of compression - sound quality of the output. Improvements Implement windowing Implement temporal masking Bibliography: http://www.sospubs.co.uk/sos/may00/articles/m http://www.besar.dcs.gla.ac.uk/labs/audiolab/ tutorials/mp3/mp3how.php and more…

Upload: jasper-johnson

Post on 19-Jan-2016

219 views

Category:

Documents

0 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Page 1: AUDIOFILES Harika Basana (ilsai@rice.edu ), Elizabeth Chan (lizychan@rice.edu ), Nikolai Sinkov(nik05@rice.edu ), Frank Zhang (frankzsj@rice.edu ) 6100

GOALTo explore the MP3 technology and To explore the MP3 technology and to implement various audio data compressionto implement various audio data compressionalgorithms.algorithms.

Analyze This Audio compression is to compress an audio file into a smaller-sized file.

People cannot differentiate between these two files by just hearing.

Due to its smaller size, the new file can be easily transferred via the Internet.

People try to find better audio compression algorithms that retain satisfying audio quality.

Algorithms

Average Energy Algorithm

Zeroes out selected high and low frequencies of the audio file.

Procedure Perform the Discrete Cosine Transform (DCT).

Calculate the signal’s energy.

Find the mean and the standard deviation of from the energy spectrum.

Keep all frequencies with energies within 1 standard deviation (std) from the mean.

Zero out frequencies with energies outside this range. Similarly, keep frequencies with energies within 2 and 3 stds from the mean.

Perform the Inverse DCT and get the output.

Results Amount of compression is insignificant.

Algorithm would probably work better if the signal is very short, has monotonous tones, and has little noise.

Ding.wav before compressionDing.wav before compression Ding.wav with frequencies within 1 std from the mean

Ding.wav with frequencies within 1 std from the mean

Ding.wav with frequencies within 2 std from the mean

Ding.wav with frequencies within 3 std from the mean

Psycho Acoustic Algorithm

Linear, tangent or arctangent quantization of the signal.

Procedure Perform the Discrete Cosine Transform (DCT)

Quantize the signal in one of the following ways :

Diagram of the quantization “buckets” for the three methods

Give certain frequency bands more bits (1000 – 5100 Hz and 12500 - 15200Hz).

Throw away frequencies below 20Hz and above 20,000Hz. Perform the Inverse DCT.

Results Compression is very significant.

Quality is good for the amount of compression.

Arctangent quantization yields the best quality.

Original signal sampled at 44100Hz

The x-axis DT sample and the y-axis is the amplitude

Original signal sampled at 44100Hz

The x-axis DT sample and the y-axis is the amplitude After linear quantization After linear quantization

After arctangent quantization After arctangent quantization After tangent quantization After tangent quantization

Masking Algorithm The presence of a signal at a particular frequency can raise the perceptual threshold of signals close to the the masking frequency.

Procedure Go through every sample and remove the following samples if they are below a certain threshold.

Results No significant improvement. Need a better way of implementing to get good results.

Conclusion We didn’t create MP3 files.

Used the underlying concepts.

Produced much smaller files.

Psycho Acoustic Algorithm is the best, in terms of - amount of compression - sound quality of the output.

Improvements Implement windowing

Implement temporal masking

Bibliography:http://www.sospubs.co.uk/sos/may00/articles/mp3.htmhttp://www.besar.dcs.gla.ac.uk/labs/audiolab/real_site/tutorials/mp3/mp3how.php and more…

Constructional Profiles as the Basis of Semantic Analysis Suzanne Kemmer Rice University [email protected]

BIOE 301 Lecture Twenty: Clinical Trials Mike Cordray [email protected]

El hombre prehistórico Jose J Salazar [email protected]

Preparation and Characterization of Uranium Oxides … · Preparation and Characterization of Uranium Oxides in Support of the K Basin Sludge Treatment Project SI Sinkov CH Delegard

The Cognitive is Phenomenal Too Lecture 6 Critique of Tye & Wright Charles Siewert Rice University [email protected]

POWER MAP OF 400KV / 220KV / 132KV / 66KV LINES & …getco.co.in/getco_new/pages/files/maps/Mehsana_Zone.pdf · kali dungari mitha basana bhandu kherva udalpur mandalilanghaj linch

Diane Butler Rice University [email protected] SAA/CSA Joint Annual Meeting, August 12, 2009

1 How to Prepare and Present a Technical Poster Tracy Volz [email protected] Cain Project Rice University

Sonochemical Digestion of High-Fired Plutonium Dioxide Samples€¦ · Sonochemical Digestion of High-Fired Plutonium Dioxide Samples S. I. Sinkov G. J. Lumetta October 2006 Prepared

Brought to you by Rice University Office of Information ... · 10/19/2017 · Barry Ribbeck {Security} - brr at Rice.edu Dylan Jacobs {Network} –dtj1 at Rice.edu Paul Engle {IAM}

Data Display How to Effectively Communicate Your Findings Mary Purugganan, Ph.D. [email protected] cainproj/ Leadership & Professional

Linking Alan L. Cox [email protected] Some slides adapted from CMU 15.213 slides