| Home | E-Submission | Sitemap | Editorial Office |  
top_img
Audiology and Speech Research > Volume 21(1); 2025 > Article
Suprent, Samayan, and Mani: The Evolution of Male Voice Acoustics: A Lifespan Perspective

Abstract

Purpose

The acoustic parameters of voice, including fundamental frequency (F0), Jitter, Shimmer (dB), and noise-to-harmonic ratio (NHR), exhibit significant changes across the male lifespan. These variations are influenced by anatomical, physiological, and neuromuscular changes in the vocal folds and laryngeal structures. Understanding these changes is essential for developing age-specific diagnostic and therapeutic approaches in voice science. This study aims to develop the acoustic voice parameters in males from the age 6 to 70 years.

Methods

A total of 270 healthy male participants were included in this study, with 30 individuals in each of the nine age groups. The vowel /a/ was used as the stimulus for vocal recording. The Computerized Speech Lab (CSL 4500B; PENTAX, Montvale, NJ, USA) software module Multi-dimensional voice profile was utilized to analyze the vocal responses.

Results

The results revealed significant differences in all the vocal parameters across the age groups. These differences reflect the age-related changes in the acoustic characteristics of the male voice. the statistical significance of differences was present in the following vocal parameters F0, Jitter, Shimmer (dB), and NHR across different age groups of male participants. The p-value of 0.00 (p < 0.05) for all the parameters indicates that the differences between the age groups are statistically significant.

Conclusion

The review highlights the dynamic nature of male vocal acoustics across the lifespan. These findings underscore the importance of age-specific norms for acoustic parameters to improve the accuracy of voice assessments and interventions.

INTRODUCTION

Males' vocal folds, laryngeal structures, and respiratory system undergo physiological and anatomical changes throughout their lives, which causes a major change in the acoustic parameters of their voice. The noise-to-harmonic ratio (NHR), Shimmer (dB), Jitter, and fundamental frequency (F0) are the most often used acoustic metrics for evaluating vocal function. Men have a lower fundamental frequency than women, while prepubescent boys have lower and more closely spaced vocal tract resonances than girls. Each acoustic measure has a distinct clinical use, however when diagnosing voice issues, the qualities of the voice that are related to noise are given more weight. Throughout life, vocal structures undergo a variety of modifications that affect the voice’s varied characteristics, including intensity, frequency, and quality. From birth to senescence, these structural alterations take place. According to Brockmann et al.(2011), factors including gender, vowel, intensity, and F0 should be taken into account in order to achieve appropriate voice assessments.
In order to diagnose and treat voice diseases and to further our understanding of physiological voice aging, it is essential to comprehend how acoustic voice parameters change throughout the course of a person’s life. Numerous facets of these factors are examined in the literature, with an emphasis on the aging, stable adult, and developmental stages. The main conclusions of NHR, Jitter, Shimmer (dB), and F0 are summarized here. There are few research studies focussed on the acoustic characteristics of voice in children, adolescents and the older age groups. According to Titze(1994), F0 drastically changes throughout puberty, particularly in boys, since testosterone causes the larynx to develop quickly. F0 drops by around one octave as a result of this expansion. According to Harries et al.(1997), male F0 stabilizes at a frequency range of 85~125 Hz by the end of puberty, which is consistent with the growth in the thickness and length of the vocal folds. Because of consistent laryngeal architecture and neuromuscular regulation, F0 stays comparatively constant throughout maturity (Boone et al., 2005). Due to vocal fold atrophy and decreased flexibility, F0 marginally rises in older guys. According to Sataloff(2017), older guys had an average increase of 10~20 Hz in comparison to their midlife values.
Linville(2004) found that older males had more Jitter, which they attributed to glottal insufficiency, decreased neuromuscular accuracy, and age-related vocal fold bowing. Children’s elevated Shimmer (dB) is a reflection of their underdeveloped glottal closure and vocal fold vibration. Adults often have Shimmer (dB) levels between 0.2 dB and 0.5 dB, which are stable and indicative of effective phonation (Stathopoulos et al., 2011).
Due to alterations in subglottal pressure regulation and decreased glottal efficiency, elderly males exhibit a marked rise in Shimmer (dB) (Ramig & Ringel, 1983). Children’s inadequate glottal closure and less effective phonation led to higher NHR readings (Titze, 1994). This frequently occurs at times of fast development or hormonal shifts. According to Maryn et al.(2009), adults have the lowest NHR values (around 0.1~0.2), which suggests effective glottal vibration and less noise components. Higher levels of aperiodic noise and decreased glottal closure are reflected in older guys’ elevated NHR. According to Linville(2004), there is a substantial correlation between changes in NHR and perceptual indices of voice roughness and hoarseness.
Hence the human voice is a vital tool for communication, and its acoustic properties can provide crucial insights about bodily changes and development. Vocal parameters such as F0, Jitter, Shimmer (dB), and NHR are essential for evaluating the voice’s health and quality. However, a significant amount of research on vocal acoustics has either focused on certain age groups or been limited to clinical populations. There aren’t many thorough studies that examine vocal parameters throughout a broad age range, especially for healthy males.

Aim of the study

The focus of this study is to examine specific voice acoustic parameters in males between the ages of 6 to 70 in order to determine how these parameters change as people age.

MATERIALS AND METHODS

The current study proposal was approved by the Institutional Ethical Committee, and the ethical clearance number is 1427(A)/2018.

Participants

The current study included 270 male participants, ages 6 to 70 years. Each of the nine selected groups consisted of thirty individuals. The hearing and health-related factors of each participant were thoroughly evaluated to ensure they were suitable for the study. Before the study began, informed consent was given by all adult participants, and consent papers and informed consent were given by the parents or guardians of minor participants. Through this approach, it was ensured that all participants were adequately informed about the study’s objectives and methodology.

Instruments and procedures

Each participant’s voice samples were analysed using the multi-dimensional speech profile (MDVP) module of the CSL 4500b program (PENTAX, Montvale, NJ, USA). A soundproof chamber was used to record the voices. The participants were instructed to maintain a 10-cm distance from the microphone and to sit upright. Every participant was told to take a slow, deep breath and phonate the vowel for at least five seconds at a comfortable loudness. After that, the signals were processed at a sampling rate of 44,100 Hz using the MDVP module of the CSL 4500b program (PENTAX) for acoustic analysis. The following four essential acoustical metrics were examined: NHR, Jitter, Shimmer (dB), and F0. The vowel /a/ provided the values for these characteristics, which were then calculated and statistically analysed. For each parameter, the mean and standard deviation (SD) were computed. Significant variations in the MDVP parameters within and between the different male age groups were found using an analysis of variance test.

RESULTS

The above Table 1 represents the mean ± SD values of F0, Jitter, Shimmer (dB) and NHR among male participants.
The data show a gradual decrease in F0 as age increases, with a marked drop from 260.31 Hz in the 6~9 years age group to 125.97 Hz in the 61~70 years age group. This finding aligns with previous research, which suggests that F0, or the pitch of the voice, tends to decline with age, particularly after the third decade of life (Titze, 2000). According to research by Ramig and Ringel(1983), aging has an impact on vocal features. Older persons also showed decreased vocal stability and intensity, as seen by higher Jitter and Shimmer (dB).
Jitter, which reflects the variability in the frequency of vocal fold oscillations, exhibits a rise in values between ages 9 to 12 years which is around 2.6, after which it stabilizes from 16 to 50 years. However, there is a decline in Jitter values in the 60~70 years group, dropping to 0.43. This suggests that, over time, the stability of the voice improves, and frequency perturbations become less noticeable. The trend is consistent with findings that Jitter decreases in older adults, possibly due to changes in laryngeal flexibility and vocal fold elasticity (Hillenbrand et al., 1994).
The mean values of evolution of Shimmer (dB), which measures amplitude perturbation, were closer between the ages of 21 and 50, showing a mean value of 0.3; between the ages of 10 and 15 years, the mean values were 0.8; between the ages of 51 and 60, they were 0.6; and between the ages of 61 and 70, they were 0.4. This suggests that, beyond early adolescence, voice amplitude stability tends to stabilize throughout time with little volatility. According to earlier research, when people age and their vocal fold tissues alter in mass and tension, their Shimmer (dB) values tend to decrease (Simberg et al., 2005).
The NHR, which measures the volume of noise in relation to the voice’s harmonic components, fluctuates minimally between age groups, indicating that voice quality is generally constant throughout life. Between the age range of 13 to 15 and 51 to 60, the NHR mean values increased slightly; the mean value is 0.2, while the mean values for the remaining rage groups are nearly identical, at 0.1. This result is in line with studies showing that aging does not substantially change the voice’s harmonic structure, while having an impact on the fundamental frequency and perturbation measures (Brockmann et al., 2011).
Table 2 provides the statistical significance of differences in vocal parameters of F0, Jitter, Shimmer (dB), and NHR across different age groups of male participants. The p-value of 0.00 (p < 0.05) for all the parameters indicates that the differences between the age groups are statistically significant.
The post hoc analysis comparing average F0 across male age groups showed statistically significant differences (p < 0.05). The following age groups showed exceptions: 16~20 and 21~30 (p = 0.469), 16~20 and 41~50 (p = 0.098), 16~20 and 51~60 (p = 0.014), and 61~70 with 16~20 (p = 0.765) and 21~30 (p = 0.671). Furthermore, there was no significant difference (p = 1.000) between 31~40 and 51~60.
The post hoc analysis comparing Jitter across male age groups showed significant differences (p < 0.05). Comparing age groups 9~12 and 12~15 (p = 0.120), 16~20 and 21~30 (p = 0.410), 16~20 and 31~40 (p = 0.149), 31~40 and 21~30 (p = 0.534), and a number of neighbouring pairs, including 31~40 with 51~60 (p = 1.000) and 41~50 with 51~60 (p = 0.571), were among the exceptions.
The majority of pairwise comparisons, especially those across non-adjacent age groups, revealed significant differences (p < 0.05) in the post hoc analysis comparing Shimmer (dB) among male age groups. The following closely related groups, however, did not differ significantly: 9~12 and 6~9 (p = 0.078), 12~15 and 9~12 (p = 0.994), 21~30 and 31~40 (p = 0.508), 41~50 and 31~40 (p = 0.824), 51~60 and 31~40 (p = 1.000), and several more. According to these findings, Shimmer (dB) exhibits similarities between nearby or overlapping age groups but varies dramatically with age, especially when comparing far age ranges for comparison.
With the exception of significant differences between the 12~15 and younger groups (6~9, p = 0.010; 9~12, p = 0.012) and a few other pairs, such as 16~20 vs. 12~15 (p = 0.043) and 21~30 vs. 12~15 (p = 0.018), the post hoc analysis of NHR across male age groups showed primarily non-significant differences (p > 0.05).

DISCUSSIONS

The analysis of voice parameters across different age groups in male participants reveals significant insights into how various aspects of the voice evolve with age.
A steady decline in F0 was observed as age increases, with a substantial drop from 260.31 Hz in the 6~9 years age group to 125.97 Hz in the 60~70 years age group. This aligns with previous studies (Titze, 2000) that show F0 decreases with age, likely due to structural changes in the vocal cords and laryngeal muscles. Jitter values show an increase between ages 10~12 years, stabilizing from 16 to 50 years. In older age groups (61~70 years), Jitter values decrease, indicating improved voice stability. Adolescent voice change in males is a gradual, extended process that occurs over several years (Hollien et al., 1994). These findings corroborate research by Hillenbrand et al.(1994) and Kreiman & Gerratt(2000), which highlight a reduction in Jitter in older adults, likely due to changes in laryngeal flexibility and vocal fold elasticity.
Shimmer (dB) values remain close at most age groups, with a slight fluctuation in the 10~12 years and 13 to 15 years age range (mean value: 0.8) compare to other age groups. This stability in amplitude perturbation is consistent with findings from Simberg et al.(2005), who noted a reduction in Shimmer (dB) with age as vocal fold tissues undergo changes in mass and tension. NHR showed minimal variation across the age groups, indicating that the harmonic structure of the voice remains stable throughout life. This finding aligns with Brockmann et al.(2011) research, which suggests that although age affects pitch and perturbation measures, the harmonic structure of the voice remains relatively unchanged (Boone et al., 2005).
The results conclude that there is a discernible trend toward decreasing pitch (F0) and increasing frequency (Jjitter) and amplitude (Shimmer [dB]) stability as people age. Although there are age-related changes in these characteristics, the voice's harmonic content (NHR) stays constant. The post hoc test results implied that the most of the middle and older age groups have no statistically significant differences between the age groups in F0, Jitter, Shimmer (dB) and NHR. According to these results, age has an impact on the voice’s dynamic and structural characteristics, like pitch and unpredictability, but it has no discernible effect on the voice’s overall harmonic quality.

Notes

Ethical Statement

The current study proposal was approved by the Institutional Ethical Committee, and the ethical clearance number is 1427(A)/2018.

Declaration of Conflicting Interests

Regarding the research, the authors of the paper indicate that they have no conflicts of interest. The study was carried out in an entirely objective manner, and the conclusions are based only on the data and analysis, free from any biases, commercial interests, or other funding sources.

Funding

The current research received no grant from any funding.

Author Contributions

Ambethkar S: conceptualization, methodology, writing – original draft. Kala Samayan: visualization, supervision. Abhirami Mani: proof reading, data analysis.

Acknowledgments

My sincere gratitude to guide for their invaluable advice, unwavering support, and encouragement during this study, we also want to thank the administration of SRM Medical College Hospital and Research Centre for giving me the tools and chances we needed to carry out this research. We want to express my gratitude to everyone who participated in the study. Their collaboration and readiness to help made this study possible.

Table 1.
Mean and standard deviation of acoustic parameters of F0, Jitter, Shimmer (dB), and noise to harmonic ratio (NHR) across male age groups
N Standard error 95% confidence interval for mean
Minimum Maximum
Lower bound Upper bound
F0
 6~9 30 260.31977 ± 32.676103 5.965813 248.11831 272.52122 201.233 335.236
 10~12 30 235.54280 ± 28.705297 5.240846 224.82407 246.26153 189.920 293.030
 13~15 30 185.87087 ± 23.674090 4.322278 177.03082 194.71092 117.600 215.472
 16~20 30 127.58367 ± 19.154784 3.497169 120.43115 134.73618 100.290 180.750
 21~30 30 123.68440 ± 12.249935 2.236522 119.11020 128.25860 101.200 145.616
 31~40 30 140.85187 ± 18.957324 3.461118 133.77309 147.93065 104.740 167.213
 41~50 30 136.51853 ± 9.040475 1.650557 133.14276 139.89430 117.803 155.648
 51~60 30 120.11747 ± 12.728232 2.323847 115.36467 124.87027 101.200 139.970
 61~70 30 125.97523 ± 11.716310 2.139096 121.60029 130.35018 105.560 144.026
 Total 270 161.82940 ± 53.881131 3.279101 155.37343 168.28537 100.290 335.236
Jitter
 6~9 30 1.53743 ± 0.641568 0.117134 1.29787 1.77700 0.764 3.028
 10~12 30 2.60483 ± 1.310681 0.239297 2.11542 3.09425 1.023 5.930
 13~15 30 1.94807 ± 1.221588 0.223030 1.49192 2.40422 0.430 5.860
 16~20 30 0.92667 ± 0.516696 0.094335 0.73373 1.11960 0.350 2.710
 21~30 30 0.79060 ± 0.449340 0.082038 0.62281 0.95839 0.216 1.860
 31~40 30 0.68807 ± 0.407788 0.074452 0.53580 0.84034 0.187 2.145
 41~50 30 0.59473 ± 0.299419 0.054666 0.48293 0.70654 0.145 1.349
 51~60 30 0.68440 ± 0.348372 0.063604 0.55432 0.81448 0.216 1.860
 61~70 30 0.43367 ± 0.245286 0.044783 0.34208 0.52526 0.050 1.290
 Total 270 1.13427 ± 0.985538 0.059978 1.01619 1.25236 0.050 5.930
Shimmer (dB)
 6~9 30 0.70300 ± 0.390973 0.071382 0.55701 0.84899 0.028 1.973
 10~12 30 0.83507 ± 0.344033 0.062811 0.70660 0.96353 0.345 1.833
 13~15 30 0.83563 ± 0.435580 0.079526 0.67299 0.99828 0.290 1.927
 16~20 30 0.51600 ± 0.256106 0.046758 0.42037 0.61163 0.270 1.280
 21~30 30 0.36603 ± 0.219519 0.040078 0.28406 0.44800 0.187 1.280
 31~40 30 0.31647 ± 0.186859 0.034116 0.24669 0.38624 0.141 0.900
 41~50 30 0.33313 ± 0.181105 0.033065 0.26551 0.40076 0.146 0.900
 51~60 30 0.62503 ± 0.544295 0.099374 0.42179 0.82828 0.154 1.842
 61~70 30 0.41613 ± 0.277796 0.050718 0.31240 0.51986 0.177 0.947
 Total 270 0.54961 ± 0.384898 0.023424 0.50349 0.59573 0.028 1.973
NHR
 6~9 30 0.15387 ± 0.121060 0.022103 0.10866 0.19907 0.007 0.537
 10~12 30 0.15550 ± 0.105150 0.019198 0.11624 0.19476 0.018 0.438
 13~15 30 0.21037 ± 0.089920 0.016417 0.17679 0.24394 0.130 0.432
 16~20 30 0.16600 ± 0.029896 0.005458 0.15484 0.17716 0.120 0.270
 21~30 30 0.15850 ± 0.104637 0.019104 0.11943 0.19757 0.013 0.680
 31~40 30 0.14423 ± 0.031396 0.005732 0.13251 0.15596 0.116 0.270
 41~50 30 0.14423 ± 0.031396 0.005732 0.13251 0.15596 0.116 0.270
 51~60 30 0.27020 ± 0.202904 0.037045 0.19443 0.34597 0.013 0.680
 61~70 30 0.17127 ± 0.124494 0.022729 0.12478 0.21775 0.103 0.611
 Total 270 0.17491 ± 0.112857 0.006868 0.16139 0.18843 0.007 0.680
Table 2.
ANOVA test results for the significance of differences between and within age groups on vocal parameters of fundamental frequency (F0), Jitter, Shimmer (dB), and noise to harmonic ratio (NHR) in males
Sum of squares df Mean square F Sig.
F0
 Between groups 673,377.738 8 84,172.217 204.217 0.000
 Within groups 107,576.694 261 412.171
 Total 780,954.431 269
Jitter
 Between groups 129.960 8 16.245 32.288 0.000
 Within groups 131.316 261 0.503
 Total 261.275 269
Shimmer (dB)
 Between group 10.391 8 1.299 11.508 0.000
 Within groups 29.460 261 0.113
 Total 39.851 269
NHR
 Between groups 0.402 8 0.050 4.337 0.000
 Within groups 3.024 261 0.012
 Total 3.426 269

ANOVA: analysis of variance, df: degree of freedom, F: ratio of variances, Sig.: significance

REFERENCES

Boone, D. R., McFarlane, S. C., Von Berg, S. L., & Zraick, R. I. (2005). The Voice and Voice Therapy. (10th ed.), (pp.100-120). Boston, MA: Allyn & Bacon.

Brockmann, M., Drinnan, M. J., Storck, C., & Carding, P. N. (2011). Reliable jitter and shimmer measurements in voice clinics: The relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. Journal of Voice, 25(1), 44-53.
crossref pmid
Harries, M. L., Walker, J. M., Williams, D. M., Hawkins, S., & Hughes, I. A. (1997). Changes in the male voice at puberty. Archives of Disease in Childhood, 77(5), 445-447.
crossref pmid pmc
Hillenbrand, J., Cleveland, R. A., & Erickson, R. L. (1994). Acoustic correlates of breathy vocal quality. Journal of Speech and Rearing research, 37(4), 769-778.
crossref
Hollien, H., Green, R., & Massey, K. (1994). Longitudinal research on adolescent voice change in males. The Journal of the Acoustical Society of America, 96(5), 2646-2654.
crossref pmid pdf
Kreiman, J. & Gerratt, B. R. (2000). Perception of voice quality: Beyond gender and age. Journal of Voice, 14(1), 67-81.

Linville, S. E. (2004). The aging voice. Asha Leader, 9(19), 12-14.

Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P., & Corthals, P. (2009). Acoustic measurement of overall voice quality: A metaanalysis. The Journal of the Acoustical Society of America, 126(5), 2619-2634.
crossref pmid pdf
Ramig, L. A. & Ringel, R. L. (1983). Effects of physiological aging on selected acoustic characteristics of voice. Journal of Speech, Language, and Hearing Research, 26(1), 22-30.
crossref
Sataloff, R. T. (2017). Professional Voice: The Science and Art of Clinical Care. (4th ed.), (pp.100-150). San Diego, CA: Plural Publishing.

Simberg, S., Alku, P., & Lindholm, P. (1983). "Shimmer in the elderly voice.". Journal of Voice, 19(3), 422-430.

Stathopoulos, E. T., Huber, J. E., & Sussman, J. E. (2011). Changes in acoustic characteristics of the voice across the life span: Measures from individuals 4-93 years of age. Journal of Speech, Language, and Hearing Research, 54(4), 1011-1021.
crossref
Titze, I. R. (1994). Principles of Voice Production. (1st ed.), (pp.209-306). Hoboken, NJ: Prentice Hall.

Titze, I. R. (2000). Principles of Voice Production. (2nd ed.), (pp.123-125). Iowa City, IA: National Center for Voice and Speech.

TOOLS
PDF Links  PDF Links
PubReader  PubReader
ePub Link  ePub Link
XML Download  XML Download
Full text via DOI  Full text via DOI
Download Citation  Download Citation
  Print
Share:      
METRICS
0
Crossref
0
Scopus
305
View
20
Download
Related article
Editorial Office
RN. 8602, Hallym University,
1 Hallymdaehak-gil, Chuncheon-si, Gangwon-do 24252, Korea
TEL: +82-70-8274-4268   FAX: +82-33-256-3420   E-mail: audiologykorea@gmail.com
About |  Browse Articles |  Current Issue |  For Authors
Copyright © Korean Academy of Audiology.                 Developed in M2PI