In musical terms, what properties are varied by the human voice to produce different words / syllables? Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Alternative Numerical Representation of PitchWhat is the difference between male head voice and falsetto?What is the technology where two vocals overlap simultaneously in songs?Transposing the human voiceHow Do Time Signatures Affect a Song?Singing On Pitch?Why do Talkbox and Autotune effects sound similar?What differentiates great music from good?Do classical pieces sound different today than the originals due to temperament?Psychological barriers in vocals
Would it be possible to dictate a bech32 address as a list of English words?
How do I find out the mythology and history of my Fortress?
Do I really need to have a message in a novel to appeal to readers?
Why is there Net Work Done on a Pressure/Volume Cycle?
Crossing US/Canada Border for less than 24 hours
Would it be easier to apply for a UK visa if there is a host family to sponsor for you in going there?
One-one communication
Belief In God or Knowledge Of God. Which is better?
Sum letters are not two different
Co-worker has annoying ringtone
Strange behavior of Object.defineProperty() in JavaScript
How to pronounce 伝統色
Why are vacuum tubes still used in amateur radios?
How to compare two different files line by line in unix?
How do living politicians protect their readily obtainable signatures from misuse?
How could we fake a moon landing now?
What is "gratricide"?
How does the math work when buying airline miles?
Putting class ranking in CV, but against dept guidelines
How fail-safe is nr as stop bytes?
What do you call the main part of a joke?
Converted a Scalar function to a TVF function for parallel execution-Still running in Serial mode
Put R under double integral
What is the home of the drow in Flanaess?
In musical terms, what properties are varied by the human voice to produce different words / syllables?
Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Alternative Numerical Representation of PitchWhat is the difference between male head voice and falsetto?What is the technology where two vocals overlap simultaneously in songs?Transposing the human voiceHow Do Time Signatures Affect a Song?Singing On Pitch?Why do Talkbox and Autotune effects sound similar?What differentiates great music from good?Do classical pieces sound different today than the originals due to temperament?Psychological barriers in vocals
Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?
I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.
What musical property is it then that makes words sound different from each other?
theory voice
New contributor
add a comment |
Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?
I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.
What musical property is it then that makes words sound different from each other?
theory voice
New contributor
1
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago
add a comment |
Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?
I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.
What musical property is it then that makes words sound different from each other?
theory voice
New contributor
Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?
I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.
What musical property is it then that makes words sound different from each other?
theory voice
theory voice
New contributor
New contributor
New contributor
asked 1 hour ago
JShorthouseJShorthouse
1311
1311
New contributor
New contributor
1
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago
add a comment |
1
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago
1
1
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago
add a comment |
3 Answers
3
active
oldest
votes
I think you might be best served by linguistics, specifically phonetics.
Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f
in 'fan is not voiced, but when voiced it becomes v
like 'van.'
How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.
You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.
By the way, in voice training these topics are called diction.
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
add a comment |
As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.
Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.
For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.
The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.
Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).
Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.
For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.
Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).
There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.
Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.
Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).
If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:
- Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises
- Different mouth positions to filter the sound sources in different ways to create different formants
- Different loudness envelopes, or how the loudness changes with time
- Different formant envelopes, or how the filtering changes with time
Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.
add a comment |
It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "240"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmusic.stackexchange.com%2fquestions%2f82996%2fin-musical-terms-what-properties-are-varied-by-the-human-voice-to-produce-diffe%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
I think you might be best served by linguistics, specifically phonetics.
Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f
in 'fan is not voiced, but when voiced it becomes v
like 'van.'
How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.
You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.
By the way, in voice training these topics are called diction.
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
add a comment |
I think you might be best served by linguistics, specifically phonetics.
Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f
in 'fan is not voiced, but when voiced it becomes v
like 'van.'
How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.
You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.
By the way, in voice training these topics are called diction.
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
add a comment |
I think you might be best served by linguistics, specifically phonetics.
Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f
in 'fan is not voiced, but when voiced it becomes v
like 'van.'
How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.
You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.
By the way, in voice training these topics are called diction.
I think you might be best served by linguistics, specifically phonetics.
Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f
in 'fan is not voiced, but when voiced it becomes v
like 'van.'
How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.
You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.
By the way, in voice training these topics are called diction.
edited 1 hour ago
answered 1 hour ago
Michael CurtisMichael Curtis
12.3k744
12.3k744
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
add a comment |
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
1
1
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?
– Michael Curtis
1 hour ago
add a comment |
As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.
Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.
For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.
The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.
Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).
Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.
For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.
Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).
There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.
Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.
Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).
If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:
- Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises
- Different mouth positions to filter the sound sources in different ways to create different formants
- Different loudness envelopes, or how the loudness changes with time
- Different formant envelopes, or how the filtering changes with time
Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.
add a comment |
As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.
Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.
For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.
The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.
Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).
Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.
For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.
Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).
There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.
Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.
Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).
If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:
- Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises
- Different mouth positions to filter the sound sources in different ways to create different formants
- Different loudness envelopes, or how the loudness changes with time
- Different formant envelopes, or how the filtering changes with time
Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.
add a comment |
As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.
Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.
For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.
The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.
Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).
Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.
For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.
Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).
There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.
Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.
Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).
If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:
- Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises
- Different mouth positions to filter the sound sources in different ways to create different formants
- Different loudness envelopes, or how the loudness changes with time
- Different formant envelopes, or how the filtering changes with time
Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.
As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.
Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.
For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.
The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.
Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).
Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.
For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.
Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).
There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.
Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.
Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).
If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:
- Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises
- Different mouth positions to filter the sound sources in different ways to create different formants
- Different loudness envelopes, or how the loudness changes with time
- Different formant envelopes, or how the filtering changes with time
Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.
edited 3 mins ago
answered 12 mins ago
Todd WilcoxTodd Wilcox
37.4k370125
37.4k370125
add a comment |
add a comment |
It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.
add a comment |
It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.
add a comment |
It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.
It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.
answered 1 hour ago
user45266user45266
4,4641835
4,4641835
add a comment |
add a comment |
JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.
JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.
JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.
JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Music: Practice & Theory Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmusic.stackexchange.com%2fquestions%2f82996%2fin-musical-terms-what-properties-are-varied-by-the-human-voice-to-produce-diffe%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
en.wikipedia.org/wiki/Timbre
– Your Uncle Bob
1 hour ago