In musical terms, what properties are varied by the human voice to produce different words / syllables? Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Alternative Numerical Representation of PitchWhat is the difference between male head voice and falsetto?What is the technology where two vocals overlap simultaneously in songs?Transposing the human voiceHow Do Time Signatures Affect a Song?Singing On Pitch?Why do Talkbox and Autotune effects sound similar?What differentiates great music from good?Do classical pieces sound different today than the originals due to temperament?Psychological barriers in vocals

Would it be possible to dictate a bech32 address as a list of English words?

How do I find out the mythology and history of my Fortress?

Do I really need to have a message in a novel to appeal to readers?

Why is there Net Work Done on a Pressure/Volume Cycle?

Crossing US/Canada Border for less than 24 hours

Would it be easier to apply for a UK visa if there is a host family to sponsor for you in going there?

One-one communication

Belief In God or Knowledge Of God. Which is better?

Sum letters are not two different

Co-worker has annoying ringtone

Strange behavior of Object.defineProperty() in JavaScript

How to pronounce 伝統色

Why are vacuum tubes still used in amateur radios?

How to compare two different files line by line in unix?

How do living politicians protect their readily obtainable signatures from misuse?

How could we fake a moon landing now?

What is "gratricide"?

How does the math work when buying airline miles?

Putting class ranking in CV, but against dept guidelines

How fail-safe is nr as stop bytes?

What do you call the main part of a joke?

Converted a Scalar function to a TVF function for parallel execution-Still running in Serial mode

Put R under double integral

What is the home of the drow in Flanaess?



In musical terms, what properties are varied by the human voice to produce different words / syllables?



Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Alternative Numerical Representation of PitchWhat is the difference between male head voice and falsetto?What is the technology where two vocals overlap simultaneously in songs?Transposing the human voiceHow Do Time Signatures Affect a Song?Singing On Pitch?Why do Talkbox and Autotune effects sound similar?What differentiates great music from good?Do classical pieces sound different today than the originals due to temperament?Psychological barriers in vocals










6















Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?



I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.



What musical property is it then that makes words sound different from each other?










share|improve this question







New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1





    en.wikipedia.org/wiki/Timbre

    – Your Uncle Bob
    1 hour ago















6















Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?



I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.



What musical property is it then that makes words sound different from each other?










share|improve this question







New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.















  • 1





    en.wikipedia.org/wiki/Timbre

    – Your Uncle Bob
    1 hour ago













6












6








6


1






Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?



I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.



What musical property is it then that makes words sound different from each other?










share|improve this question







New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.












Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?



I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.



What musical property is it then that makes words sound different from each other?







theory voice






share|improve this question







New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|improve this question







New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this question




share|improve this question






New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked 1 hour ago









JShorthouseJShorthouse

1311




1311




New contributor




JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






JShorthouse is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







  • 1





    en.wikipedia.org/wiki/Timbre

    – Your Uncle Bob
    1 hour ago












  • 1





    en.wikipedia.org/wiki/Timbre

    – Your Uncle Bob
    1 hour ago







1




1





en.wikipedia.org/wiki/Timbre

– Your Uncle Bob
1 hour ago





en.wikipedia.org/wiki/Timbre

– Your Uncle Bob
1 hour ago










3 Answers
3






active

oldest

votes


















4














I think you might be best served by linguistics, specifically phonetics.



Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



By the way, in voice training these topics are called diction.






share|improve this answer




















  • 1





    Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

    – Michael Curtis
    1 hour ago











  • Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

    – Michael Curtis
    1 hour ago


















2














As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



  • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

  • Different mouth positions to filter the sound sources in different ways to create different formants

  • Different loudness envelopes, or how the loudness changes with time

  • Different formant envelopes, or how the filtering changes with time

Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.






share|improve this answer
































    0














    It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.






    share|improve this answer























      Your Answer








      StackExchange.ready(function()
      var channelOptions =
      tags: "".split(" "),
      id: "240"
      ;
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function()
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled)
      StackExchange.using("snippets", function()
      createEditor();
      );

      else
      createEditor();

      );

      function createEditor()
      StackExchange.prepareEditor(
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: false,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: null,
      bindNavPrevention: true,
      postfix: "",
      imageUploader:
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      ,
      noCode: true, onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      );



      );






      JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.









      draft saved

      draft discarded


















      StackExchange.ready(
      function ()
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmusic.stackexchange.com%2fquestions%2f82996%2fin-musical-terms-what-properties-are-varied-by-the-human-voice-to-produce-diffe%23new-answer', 'question_page');

      );

      Post as a guest















      Required, but never shown

























      3 Answers
      3






      active

      oldest

      votes








      3 Answers
      3






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      4














      I think you might be best served by linguistics, specifically phonetics.



      Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



      How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



      You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



      By the way, in voice training these topics are called diction.






      share|improve this answer




















      • 1





        Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

        – Michael Curtis
        1 hour ago











      • Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

        – Michael Curtis
        1 hour ago















      4














      I think you might be best served by linguistics, specifically phonetics.



      Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



      How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



      You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



      By the way, in voice training these topics are called diction.






      share|improve this answer




















      • 1





        Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

        – Michael Curtis
        1 hour ago











      • Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

        – Michael Curtis
        1 hour ago













      4












      4








      4







      I think you might be best served by linguistics, specifically phonetics.



      Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



      How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



      You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



      By the way, in voice training these topics are called diction.






      share|improve this answer















      I think you might be best served by linguistics, specifically phonetics.



      Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



      How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



      You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



      By the way, in voice training these topics are called diction.







      share|improve this answer














      share|improve this answer



      share|improve this answer








      edited 1 hour ago

























      answered 1 hour ago









      Michael CurtisMichael Curtis

      12.3k744




      12.3k744







      • 1





        Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

        – Michael Curtis
        1 hour ago











      • Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

        – Michael Curtis
        1 hour ago












      • 1





        Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

        – Michael Curtis
        1 hour ago











      • Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

        – Michael Curtis
        1 hour ago







      1




      1





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      1 hour ago





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      1 hour ago













      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      1 hour ago





      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      1 hour ago











      2














      As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



      Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



      For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



      The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



      That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



      Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



      Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
      Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



      For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



      Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



      For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



      There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



      Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



      Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



      If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



      • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

      • Different mouth positions to filter the sound sources in different ways to create different formants

      • Different loudness envelopes, or how the loudness changes with time

      • Different formant envelopes, or how the filtering changes with time

      Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.






      share|improve this answer





























        2














        As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



        Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



        For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



        The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



        That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



        Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



        Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
        Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



        For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



        Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



        For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



        There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



        Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



        Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



        If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



        • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

        • Different mouth positions to filter the sound sources in different ways to create different formants

        • Different loudness envelopes, or how the loudness changes with time

        • Different formant envelopes, or how the filtering changes with time

        Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.






        share|improve this answer



























          2












          2








          2







          As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



          Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



          For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



          The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



          That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



          Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



          Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
          Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



          For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



          Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



          For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



          There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



          Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



          Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



          If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



          • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

          • Different mouth positions to filter the sound sources in different ways to create different formants

          • Different loudness envelopes, or how the loudness changes with time

          • Different formant envelopes, or how the filtering changes with time

          Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.






          share|improve this answer















          As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



          Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



          For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



          The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



          That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



          Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



          Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
          Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



          For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



          Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



          For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



          There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



          Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



          Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



          If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



          • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

          • Different mouth positions to filter the sound sources in different ways to create different formants

          • Different loudness envelopes, or how the loudness changes with time

          • Different formant envelopes, or how the filtering changes with time

          Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.







          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited 3 mins ago

























          answered 12 mins ago









          Todd WilcoxTodd Wilcox

          37.4k370125




          37.4k370125





















              0














              It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.






              share|improve this answer



























                0














                It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.






                share|improve this answer

























                  0












                  0








                  0







                  It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.






                  share|improve this answer













                  It's a combination of a few things. Every consonant sounds different due to the way we physically make the sound with our mouths ("ess" versus "eff"), but every vowel is different purely based on timbre. This timbre shift is caused by the shape of the mouth, I believe.







                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered 1 hour ago









                  user45266user45266

                  4,4641835




                  4,4641835




















                      JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.









                      draft saved

                      draft discarded


















                      JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.












                      JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.











                      JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.














                      Thanks for contributing an answer to Music: Practice & Theory Stack Exchange!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid


                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.

                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function ()
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmusic.stackexchange.com%2fquestions%2f82996%2fin-musical-terms-what-properties-are-varied-by-the-human-voice-to-produce-diffe%23new-answer', 'question_page');

                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Category:Fedor von Bock Media in category "Fedor von Bock"Navigation menuUpload mediaISNI: 0000 0000 5511 3417VIAF ID: 24712551GND ID: 119294796Library of Congress authority ID: n96068363BnF ID: 12534305fSUDOC authorities ID: 034604189Open Library ID: OL338253ANKCR AUT ID: jn19990000869National Library of Israel ID: 000514068National Thesaurus for Author Names ID: 341574317ReasonatorScholiaStatistics

                      Reverse int within the 32-bit signed integer range: [−2^31, 2^31 − 1]Combining two 32-bit integers into one 64-bit integerDetermine if an int is within rangeLossy packing 32 bit integer to 16 bitComputing the square root of a 64-bit integerKeeping integer addition within boundsSafe multiplication of two 64-bit signed integersLeetcode 10: Regular Expression MatchingSigned integer-to-ascii x86_64 assembler macroReverse the digits of an Integer“Add two numbers given in reverse order from a linked list”

                      Kiel Indholdsfortegnelse Historie | Transport og færgeforbindelser | Sejlsport og anden sport | Kultur | Kendte personer fra Kiel | Noter | Litteratur | Eksterne henvisninger | Navigationsmenuwww.kiel.de54°19′31″N 10°8′26″Ø / 54.32528°N 10.14056°Ø / 54.32528; 10.14056Oberbürgermeister Dr. Ulf Kämpferwww.statistik-nord.deDen danske Stats StatistikKiels hjemmesiderrrWorldCat312794080n790547494030481-4