Mel scale
File:Mel-scale from 200 to 1500, in intervals of 50.ogg Mel Scale is a perceptual scale of pitches judged by listeners to be equal in distance from one another. The point of reference between this scale and actual frequency measurement is the pitch of a 1 kHz tone, 40 dB above the listener's threshold of hearing, which is arbitrarily assigned a pitch of 1000 mels. The scale is named after the word 'melody' to indicate that the scale is based on pitch comparisons. The concept of the Mel Scale is crucial in the fields of music, acoustics, and particularly in speech recognition and audio processing technologies.
Overview[edit | edit source]
The Mel Scale was developed as a way to represent pitches perceived by the human ear. Human hearing does not perceive pitches linearly; instead, changes in pitch at lower frequencies are detected more easily than changes at higher frequencies. This non-linear perception is quantified by the Mel Scale, which more closely aligns with the human auditory system's response than the standard linear frequency scale measured in Hertz (Hz).
History[edit | edit source]
The concept of the Mel Scale was introduced in the 1930s by Stevens, Volkmann, and Newman. The researchers were interested in understanding the relationship between the physical properties of sound and the perceived pitch. They conducted experiments asking listeners to judge the pitch of sounds in order to develop a scale that would more accurately reflect the human perception of pitch.
Calculation[edit | edit source]
The frequency in mels, \(M\), can be calculated from frequency in Hertz, \(f\), using various formulas. A commonly used formula is: \[ M = 2595 \log_{10}\left(1 + \frac{f}{700}\right) \] This formula approximates the human ear's response more closely than linear scales, especially for frequencies below 1000 Hz.
Applications[edit | edit source]
The Mel Scale is widely used in audio and speech processing. One of its most significant applications is in the development of Mel Frequency Cepstral Coefficients (MFCCs), which are features derived from the Mel Scale that are used in automatic speech recognition (ASR) and speaker identification systems. The scale's ability to mimic the human ear's response makes MFCCs particularly effective for these applications.
See Also[edit | edit source]
Mel scale Resources | |
---|---|
|
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD's Wellness Encyclopedia |
Let Food Be Thy Medicine Medicine Thy Food - Hippocrates |
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
WikiMD is not a substitute for professional medical advice. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Contributors: Prab R. Tumpati, MD