Summary
Sommaire Text
in French
_________________________________________________________________
What is the efficacy of speech therapy in chronic dysphonia?
Results of a controlled trial
Janet A Wilson, PN Carding
K MacKenzie* C Sellars**, A Millar+,
** Dept of Speech and Language Therapy, and
*Department of Otolaryngology Head & Neck Surgery, Glasgow Royal Infirmary,
Scotland
I J Deary
Professor of Differential Psychology
University of Edinburgh, Scotland
Introduction
The study I should like us to consider was carried out in what might be termed - North Britain, namely the South of Scotland, and the North of England. The study began 5 years ago, in the Royal Infirmary in Glasgow, Scotland, where I then worked. The project was supported by Scottish Office Home and Health Department.
The laryngologist principally involved in recruitment was my close colleague Ken MacKenzie. The therapist responsible for treating patients was Cam Sellars. Data analysis was aided by Dr Paul Carding, who is the Senior Lecturer in Voice Pathology. Paul and I now work together in the Otolaryngology department in the university of Newcastle upon Tyne. We share research projects and also a typical UK Voice clinic, where we jointly assess certain categories of dysphonic patient.
The study was truly interdisciplinary, also having input from Prof Ian Deary, who works also on the East Coast of Britain, but North of the Scottish border, in Edinburgh, once called the Athens of the North. Ian has helped us to study a number of ORL psychosomatic conditions such as globus sensation and temporomandibular joint dysfunction. We feel that this quality of psychological input is vital in the assessment of dysphonic patients in the UK.
Dysphonia (hoarseness) represents the interaction of pathophysiological, emotional and psychological factors. UK otolaryngologists see at least 48,000 new dysphonic patients per annum. The commonest cause of dysphonia is functional dysphonia. Over half of our new referrals will undergo speech and language therapy- at least 25,000 annually, yet there is little Level 1 evidence for its efficacy. In other words there are very few randomised, prospective, controlled trials.
Many voice clinic patients have entered a vicious cycle: psychological factors exacerbate voice pathology, while poor voice quality adversely affects psychological well-being. Conventional speech and language therapy aims to
To date no major study has examined the overall efficacy of voice therapy for dysphonia, either in terms of changes in voice quality, or changes in psychological distress or laryngoscopic findings. Our postal survey of UK speech and language therapists’ voice treatments showed a wide variation in both the experience of the treating therapists and the techniques used. The greatest variation was in the use of direct techniques. there was a sense that therapy selection might have an intuitive rather than a scientific basis - for example, for chronic laryngitis, of a wide range of measures, only vocal hygiene and general lifestyle advice were accepted by >75% of therapists.
Our study objective was to provide the first major randomised comparison of voice therapy with the natural history of dysphonia.
AIMS
SUBJECTS
We recruited consecutive outpatients attending the Department of Otolaryngology, Head and Neck Surgery, Glasgow Royal Infirmary, with a primary complaint of dysphonia (hoarseness). present for a minimum of two months and without any significant pathology (e.g. polyp, papilloma, tumour, vocal cord palsy) or need for surgery.
Inclusion criteria: Age > 16 years old, motivation to resolve the voice problem, willingness to enter into regular voice therapy sessions.
Exclusion criteria: Previously treated dysphonia, neurological disease or upper aerodigestive tract malignancy; marked hearing impairment; acid reflux; multiple medical complaints; performing voice user requiring urgent intervention; puberphonia; and transsexual
We recruited 204 patients (51 men, 153 women).
By study completion 12 - 14 weeks later, approximately 33% of subjects had dropped out or been excluded, leaving 63 patients in the treatment group and 70 patients in the observation group
MEASURES
Four groups of outcome measures were used
I - Voice Quality
Patient self - report The patient’s view of the impairment has a major influence on response to treatment.
Patients rated their own voice quality using the validated Vocal Performance Questionnaire, developed by Paul Carding. This has 5 point scoring on 12 items (1-5, 5 = worst), the maximum ( most dysphonic) VPQ score thus = 60.
Observer rating A digital analogue tape recording of the patient’s reading of the "Rainbow" passage which is a standard phonetically balanced test, was analysed by a speech therapist, blind to treatment group and status. Ten aspects of voice including laryngeal tone, pitch and resonance were rated 0-5 on the Buffalo Voice Profile scale, whose "overall rating" item was chosen as the key observer-rated variable.
Computer derived (objective) acoustic variables were also extracted by the blinded therapist. Two key variables are presented - "jitter", pitch perturbation, and "shimmer", amplitude perturbation - using the Computerised Speech Laboratory (Model 4300B). Kay Elemetrics Corp: New Jersey, USA. The higher the score on these two variables, the more dysphonic the voice.
II - Laryngoscopic features were assessed by an otolaryngologist at videorecorded flexible nasolaryngoscopy (Olympus ENF 3) using on a 4-point (0-3) rating scale of 4 features - nodule formation, laryngitis, glottic escape and hyperfunction of the laryngeal musculature.
III - Psychological Measures Numerous psychological variables are well - known to be abnormal in dysphonics. These findings may be nonspecific attributes of functional disorders in general, yet are also important because they may explain differences between patients and observers. Two key measures of psychological distress are reported.
The Hospital Anxiety and Depression Scale (HADS) is a self-completed questionnaire which assesses recent anxiety and depression, and provides an overall score for each. The Clinical Interview Schedule (CIS-R) was conducted by a trained psychologist. The interviewer rates 14 aspects of non-psychotic psychiatric disturbance including sleep problems, mood disturbance and somatic symptoms.
IV - Quality of Life was assessed by the Short-Form-36 (SF-36), an internationally validated general health status measure with which many of you will be familiar.
The SF-36 is self-administered and takes less than 10 minutes to complete. Its 8 subscales quantify a broad range of health issues. High scores on the SF-36 indicate better health.
Eight subscales of general health (SF-36)
|
Subscale |
Definition |
|
PF |
Limitations on physical activities such as walking, bathing and |
|
RP |
Problems with work or other daily activities as a result of physical health status |
|
BP |
Intensity of bodily pain or limitations due to pain |
|
GH |
Perception of current health and health outlook |
|
VT |
Level of energy |
|
SF |
Extent health interferes with normal social activities |
|
RE |
Problems with daily activities as a result of emotional issues |
|
MH |
Mental Health Screening |
Design
At entry to the study baseline data were recorded and the subjects then all seen by Mr Sellars, the treating therapist. He obtained a number for random allocation of subjects to either a course of voice therapy or a period of observation. Random numbers were supplied by an independent worker in a separate department. The voice therapy was also all delivered by CS according to a protocol derived from our review of the type, form and frequency of UK speech therapy. Mr Sellars was not involved in the collection, storage or analysis of data in any way. At no time did he convey therapy status to any other member of the research team. The therapy group initially comprised 100 patients. Eight subjects were excluded for a range of reasons e.g. cognitive inability to cope with the protocol. A further 8 subjects failed to attend the first therapy appointment. A further 10 subjects failed to attend during the course of therapy. Thus 74 patients completed a course of voice therapy.
After 6 weeks of therapy or observation, all measures except the Clinical Interview Schedule were recorded again in full on the 145 patients (71%) who continued to this stage.
Following another 6-8 weeks, all measurements were repeated again on the 133 (65%) who completed the study. Comparisons were made at the 3 time points of the 70 who completed treatment with the 63 in the no treatment group.
Speech Therapy Intervention:
A routine voice history was taken and a further recording of the "Rainbow" passage made for possible reference in therapy. In line with current practice, 11 out of the 73 who completed treatment received advice regarding good vocal hygiene and optimal voice production and did not proceed to a fuller programme of voice therapy. (This option did however remain open to them until attendance for Visit 2 of the study.)
The remaining 62 subjects underwent individualized programmes of voice therapy for 2-8 treatment sessions. Treatment reflected current effective practice and was characterised a) in units of 5 minutes b) according to 20 treatment types. In the final session, a review recording of the "Rainbow" passage was made.
Statistical Analysis
A mixed model Analysis of Covariance (ANCOVA) was used. Therapy was used as a between subjects factor (voice therapy versus no treatment). Time (only visits 2 and 3) was a repeated measure. The baseline values of the relevant outcome variables were used as covariates. This allowed a direct test of the main effect of therapy. Only data which were complete for all three time points were analysed for each variable.
To establish separately the efficacy of voice therapy at treatment completion and follow-up, post hoc tests were conducted on all variables which showed significant effects of therapy, time, or therapy x time interactions. Standardised residuals were calculated and therapy and observation groups were compared by independent sample t tests conducted on the residuals.
Analysis of laryngoscopic features: Ratings for each laryngoscopic feature for each patient at visit 2 were subtracted from those at visit 1. Ratings for each feature at visit 3 were also subtracted from those at visit 1. Subjects were then assigned a category (0,1,2) according to whether they had improved, deteriorated, or stayed the same. A series of Chi Square intergroup comparisons was performed.
Quality of Life (SF-36) Analysis Subjects’ raw scores on each SF-36 subscale were normalized to a scale ranging from 0 (worst) to 100 (best). Dysphonic patients’ scores were compared with those of healthy controls. (Data taken from Oxford Healthy Life Survey 1991/2, HSRU, Oxford 2). Comparisons were made using unpaired t-tests with two-tailed p values.
RESULTS
The therapy schedules for the treated subjects were collated.
Total duration of each of 20 interventions (in minutes) for the whole cohort of subjects treated
|
Intervention |
Visit 1 |
Visit 2 |
Visit 3 |
Visit 4 |
Visit 6 |
Visit 6 |
|
History |
4035 |
710 |
690 |
530 |
455 |
475 |
|
Normal Voice |
30 |
25 |
15 |
30 |
5 |
5 |
|
Presenting Features |
535 |
120 |
55 |
125 |
35 |
20 |
|
Voice rest |
265 |
65 |
25 |
45 |
0 |
0 |
|
Vocal hygiene |
420 |
170 |
115 |
80 |
35 |
15 |
|
Life Style |
455 |
140 |
135 |
60 |
55 |
35 |
|
Counseling |
80 |
130 |
145 |
80 |
20 |
80 |
|
Posture |
0 |
305 |
20 |
5 |
0 |
0 |
|
Relaxation |
0 |
650 |
535 |
375 |
240 |
55 |
|
Breath Support |
5 |
815 |
625 |
420 |
205 |
10 |
|
Co-ordination |
10 |
135 |
285 |
355 |
255 |
175 |
|
Glottal attack |
0 |
20 |
155 |
555 |
295 |
130 |
|
Normal |
0 |
0 |
0 |
0 |
5 |
0 |
|
Pitch |
0 |
0 |
0 |
30 |
65 |
80 |
|
Projection |
0 |
5 |
0 |
10 |
35 |
35 |
|
Intonation |
0 |
0 |
0 |
0 |
0 |
0 |
|
Rate |
0 |
0 |
5 |
0 |
5 |
0 |
|
Resonance |
0 |
0 |
15 |
0 |
5 |
|
|
Complexity |
0 |
0 |
0 |
35 |
135 |
60 |
|
Generalise |
0 |
0 |
10 |
0 |
0 |
0 |
|
Maintenance |
15 |
10 |
120 |
190 |
340 |
555 |
|
TOTAL [MINS] |
5855 |
3290 |
2950 |
2925 |
2190 |
1730 |
(A further 2 subjects had >6 sessions of therapy.)
The 73 treatment group subjects were 56F, 17M, mean age 51 years (SD 14). The 72 observation subjects were 50F, 22M, mean age 52 years (SD 13).
Overall the attrition rate was as expected for such a large study sample, and since there was no significant difference between the therapy and no treatment groups, it does not affect the results. Analyses were only conducted on complete data sets in each instance.
Means (Standard deviations) for key voice and psychological variables at each visit for treatment and no treatment groups

Notes:
CSL = Computerised Speech Laboratory. Jitter = pitch perturbation, shimmer = amplitude perturbation, the higher the score on each the more dysphonic the voice. Buffalo Voice Profile analysis has an "overall rating" as 1 of 10 items scored 0-5.I - Voice Quality
The groups were also well matched for subjective and objective voice variables. Voice therapy improved self-rated quality of voice, as measured by the Voice Performance Questionnaire (p=.001), the Buffalo Scale (p<.001) and CSL measurement of amplitude perturbation or shimmer (p=.02).
The VPQ shows a main effect of therapy, with no significant effect of time, or therapy x time interaction.
On post hoc testing, voice therapy led to a significant and maintained improvement in the VPQ. The Buffalo overall rating was significant at visit 3 (follow up) but not visit 2 (end of treatment). Thus improvements in expert ratings of voice quality are largely due to the direct effect of therapy, but there are changes between the end of therapy and follow-up, with the no therapy group becoming worse.
In contrast, CSL derived Shimmer is improved at the end of treatment, but not at follow up (visit 3).
II - Laryngoscopic features
Laryngeal features at study entry showed no significant difference between the treated and observed subjects (either in the originally recruited 204 or in the 145 with 6 week repeat laryngoscopy). Grade 2-3 (moderate to severe) scores were infrequent for all 4 features and there was only minimal resolution of the abnormalities between the 2 time points.
III - Psychological Measures The treatment and no-treatment groups varied at baseline only on HADS-Anxiety scores, significantly higher in the observation group, who thus had a greater potential for reduction in anxiety over time. Both the treatment and no treatment groups have elevated baseline scores for anxiety compared with healthy controls (though not necessarily with other ORL patients ) suggesting the presence of significant psychological distress. The HADS-Anxiety showed a significant effect of time (p=.019), but not of therapy. Both groups improved over time.
IV - Quality of Life: The subjects on whom datasets were available comprised 163 subjects, 38 males and 125 females. The baseline SF-36 quality of life scores reflect a severely impaired quality of life in both the treatment and no-treatment groups compared with other groups of patients. Dysphonia patients had significantly poorer health than controls on all eight SF-36 subscales (limitation of physical activity p<0.05; other 7 all p< 0.001). The mixed model ANCOVA showed therapy to have an effect on only one Quality of Life outcome variable - Mental Health (F=5.5, p=.021), significant at the end of treatment (visit 2), but not at follow-up (visit 3).
Dysphonia patients had significantly lower scores (indicating poorer health status) than age matched healthy controls on all eight sub-scales of the SF-36.
SF-36 sub-scale scores for dysphonia patients and healthy controls
and chronic sinusitis patients
|
Dysphonia |
Healthy |
Test t (dysphonia-healthy) |
|
|
n |
163 |
744 |
|
|
Age -y |
51 (14) |
45-54 |
|
|
PF |
65.0(31.8) |
85.5(17.6) |
2.19° |
|
RP |
56.8(42.6) |
85.0(30.2) |
9.94** |
|
BP |
60.5(27.3) |
79.3(22.0) |
9.42** |
|
GH |
59.4(23.6) |
72.1(20.1) |
7.06** |
|
VT |
50.3(21.5) |
61.3(19.7) |
6.34** |
|
SF |
72.4(27.7) |
88.6(19.8) |
8.73** |
|
RE |
65.9(43.0) |
83.3(31.5) |
5.93** |
|
MH |
67.1(19.4) |
75.2(17.0) |
5.36** |
°
p< 0.05; * p<.01; **p<.001In terms of effect size ( SD units), the group difference was large for 5 of the 8 variables, PF, RP, BP, SF and MH, and moderate for the remaining 3 - GH, VT and RE.
Conclusions
This is the largest randomised controlled trial of the efficacy of voice therapy for dysphonia to date and allows us to draw the following conclusions.
1. Voice therapy for dysphonia is an effective treatment in terms of both patient and expert reports of perceptual ratings. Self-reported improvement in voice (VPQ) correlated with observer-rated changes in overall voice quality (Buffalo). A recent study of voice therapy also confirmed benefit of therapy as shown by the Voice Performance Questionnaire. (Carding et al 1999), but in only 45 subjects. Carding’s study did appear to show greater benefit from direct therapy than indirect alone.
2. Most objective acoustic parameters showed poor correlation with self or observer reports of global voice quality. The validity and reliability of these commonly used computer variables need to be quantified.
3. There were minimal changes in laryngoscopic appearance during the course of this study. This reflects that many of the subjects referred for nonsurgical voice therapy have, by definition, relatively normal laryngeal appearances (floor effect). Also, changes in voice may arise as a result of extralaryngeal factors - e.g. improved breath support, or enhanced resonance.
4. Dysphonia is associated with psychological distress. We found voice therapy had surprisingly little impact on psychological distress. A sub-group of patients remains psychologically distressed despite receiving therapy. In some, psychological distress may be a consequence of dysphonia, rather than a cause, but it seems likely that a sub-group of dysphonic patients may benefit from a more psychological treatment of their dysphonia.
5. The SF-36 results show that dysphonia has a severe impact on most aspects of a patient’s general health status. This highlights the importance of effective vocal communication for psychosocial well-being.
There have been few previous studies of the quality of life of patients with dysphonia. The patients reported by Benninger’s group were of almost identical mean age to those in the present study and included a preponderance of subjects with organic disorders - masses, cancer, vocal cord paralysis, yet with somewhat better reported health status. This apparent paradox may reflect international differences in quality of life assessment, or alternatively that patients with functional disorders may show a tendency to over-report quality of life impact. Therapy had a significant effect on one quality of life variable - mental health- and only at the end of therapy, not later review.
Voice therapy - compared with a period of observation - improves voice quality as rated by the dysphonic patient, and by an expert observer, blind to treatment group. These effects of treatment are maintained at follow-up two months later. The significant levels of psychological distress and reduced quality of life experienced by dysphonic patients seemed not to be influenced by standard voice therapy techniques. Thus a sub-group of dysphonic patients may gain additional benefit from an intervention which targets their distress.
REFERENCES
Aronson A, Peterson H, Litin, E. Psychiatric symptomology in functional dysphonia and aphonia. J Speech Hear Disord 1996; 31: 115-27.
Benninger MS, Ahuja AS, Gardner G, Grywalski C. Assessing outcomes for dysphonic patients. J Voice 1998; 12: 540 - 50.
Carding PN, Horsley IA. An evaluation of voice therapy in non-organic dysphonia. Eur J Disord Commun 1992; 27: 137-148
Carding PN, Horsley IA, Docherty GJ. A study of the effectiveness of voice therapy in the treatment of 45 patients with nonorganic dysphonia. J Voice 1999; 13: 72 - 104
Deary IJ, Scott S, Wilson IM, White A, MacKenzie K, Wilson JA. Personality and psychological distress in dysphonia. British Journal of Health Psychology 1997; 2: 333-341
Dunnet CP, MacKenzie K, Robinson K, Sellars GC, Wilson JA. Voice therapy for dysphonia - still more art than science? Eur J Disord Commun 1997; 32: 333-343
Fairbanks G. Voice and Articulation Drillbook (2nd edition) 1960; Harper & Brothers: New York.
Gerritsma EJ. An investigation into some personality characteristics of patients with psychogenic aphonia and dysphonia. Folia Phoniatr Logop 1991; 43: 13-20.
Gliklich RE and Metson R. The health impact of chronic sinusitis in patients seeking otolaryngologic care. Otolaryngology and Head and Neck Surgery 1995;113:104-9
House A, Andrews H. The psychiatric and social characteristics of patients with functional dysphonia. J Psychosom Res 1987; 31: 483-90.
Jenkinson C, Coulter A, Wright L. Short-form SF-36 health survey questionnaire: normative data for adults of working age. Br Med J 1993; 306: 1437-1440
Kazis LE, Anderson JL, Meenan RF. Effect sizes for interpreting changes in health status. Medical care 1989, 27: S178 - S189
Lewis G, Pelosi AJ, Araya R, Dunn G. Measuring Psychiatric Disorder in the Community: A standardised assessment for use by lay interviewers. Psychological Medicine 1992; 22: 465-486
Millar A, Deary IJ, MacKenzie K, Wilson JA. Patients with dysphonia (hoarseness) report primarily psycho-social problems (Abstract) The Psychologist 1997, 5: 133
Scott S, Robinson K, MacKenzie K, Wilson JA. Patient reported problems associated with dysphonia Clin Otolaryngol 22: 37-40, 1997
Scott S, Deary IJ, Wilson JA, MacKenzie K. Functional Dysphonia - a role for psychologists? Psychology, Health and Medicine 1997; 2: 169-180
Scott S, Robinson K, Wilson JA, MacKenzie K. Patient-reported problems associated with dysphonia. Clin Otolaryngol. 1997, 22, 37-40
Simon GE, Revicki DA, Grothaus L, Vonkorff M. SF-36 summary scores: are physical and mental health truly distinct? Medical Care. 1998, 36: 567 - 72
Smith E, Verdolini K, Gray S & al Effect of voice disorders on quality of life. J Med Speech Lang Pathol 4: 223-245
Ware JE, Gandek B, Kosinski M, Aaronson NK, & al The equivalence of SF-36 summary scores estimated using standard and country-specific algorithms in 10 countres; results from the IQOLA Project. J Clin Epidemiol 1998, 51: 1167-70
Ware JE, Kosinski M, Gandek B & al The factor structure of the SF-36 Health Survey in 10 countries: results from the IQOLA project J Clin Epidemiol 1998 51: 1159 - 65
Ware JE Gandek B Overview of the SF - 36 health survey and the international quality of life assessment (IQOLA) project J Clin Epidemiol 1998 51: 903-12
White A, Deary IJ, Wilson JA. Psychiatric disturbance and personality traits in dysphonic patients. Eur J Disord Commun 1997; 32: 307-314
Wilson JA, Deary IJ, Scott S, MacKenzie K. Functional dysphonia (Editorial). BMJ 1995, 311: 1039-1040
Wilson DK. Voice Problems of Children (3rd edition) 1987; Williams & Wilkins: Baltimore, USA
Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatry Scand 1983; 67: 361-70.