Chapter 10

Formula Masking for Speech Stimuli

Formulae Quick Reference

Spondee Threshold Minimum Masking Level:

Predicted SPONDEE THRESHOLD + 10 dB in case the SPONDEE THRESHOLD is a bit higher than the PTA predicts – IA (60 dB for inserts, 50 for TDH) + largest significant NTE ABG + 10 dB pad.

Insert Earphones ⇒ TE estimated SPONDEE THRESHOLD – 40 dB + Largest significant NTE ABG (500 Hz to 8k Hz)

Supra-aural Phones ⇒ TE estimated SPONDEE THRESHOLD – 30 dB + Largest significant NTE ABG (500 Hz to 8k Hz)

Word Recognition Testing Minimum Masking Level:

Presentation level – IA + 10 dB pad + Largest significant NTE ABG (500 Hz to 8k Hz)

Insert Earphones ⇒ TE signal level – 50 dB + NTE ABG dB + Largest Significant NTE ABG (500 Hz to 8k Hz)

For Supra-aural Phones ⇒ TE signal level – 40 dB + Largest Significant NTE ABG ABG (500 Hz to 8k Hz)

Maximum Masking Level

For both spondee threshold and word recognition testing: Best TE BC threshold in the 500-8k Hz range + IA – 5

For insert earphone ⇒ Best TE 500 to 8k Hz + 55 dB

For Supra-aural Phones ⇒ BC TE 500 to 8k Hz + 45 dB

For safety, assume the best bone is 10 dB lower than AC thresholds for sensorineural loss and 0 dB for conductive losses.

Down 20 Rule:

Estimated spondee threshold or word recognition test level – 20 dB.

Simple, easy to calculate, and works well for asymmetrical sensorineural loss. If the test ear has conductive loss, this rule could lead to some overmasking, but it is usually not consequential – it won’t typically interfere with speech understanding. Non-test ear conductive loss may causethis rule to undermask.

Some crossback is not the end of the world. Crossback to the TE cochlea is not going to interfere with hearing speech that is 10 dB or more greater than the crossback level. Calculate the sensation level of the stimulus at the test ear (and do this across the frequency ranges). Calculate the sensation level of the crossback at the test ear (again, across frequencies). If the stimulus is 10 dB or more greater than the crossback, then your results are not significantly affected by the crossback.

Speech Testing Introduction

Congratulations on making it this far into formula masking. The chapter on “why formula mask” discussed that when conducting speech testing, especially word recognition testing, you need to formula mask. You can’t plateau. So everything you have learned so far will now be applied where you need it most.

Speech masking requires one more cognitive stretch. You’ll have to think about hearing sensitivity all the frequencies, since speech and speech noise are wide-band stimuli.

The formula masking concepts remain the same. You need to ensure that you have enough noise at the NTE cochlea to mask any crossover. Now, we’ll add in that means thinking about cross over at each frequency. You’ll need to increase that noise level if the NTE has significant air-bone gaps. You need to ensure that the masking noise isn’t going to cross back at a level that will interfere with speech understanding. Here’s where you get a little leeway. If there is just a little crossback, but the speech signal is well above that crossback level, then the crossback won’t significantly affect the speech test results. You are still looking at the test ear bone-conduction thresholds, and basing your MMax calculations on them. Of course, usually you do speech testing via air-conduction before you test bone-conduction hearing, so you have to make an educated guess about the bone-conduction thresholds.

There is a simple formula masking rule that is going to work most of the time. You examine your presentation level, and simply put in contralateral noise that is 20 dB less intense. That’s called the “Down 20” rule. The good news is that works for asymmetrical sensorineural losses, which constitutes the bulk of the cases seen in a general audiology practice. The bad news is that it won’t work with significant conductive loss, so you need to read and understand the entire chapter.

So, without further delay, let’s delve right in.

Speech Interaural Attenuation Value: 60 dB for insert earphones, 50 dB for TDH

Given how often masking for speech is conducted, we have surprisingly little data on the appropriate interaural attenuation (IA) value when using insert earphones. A literature search reveals one study – by Sklare and Denenberg (1987). They tested seven (yes, count them, seven!) patients with unilateral profound hearing loss and found that the average insert earphone interaural attenuation value was 75 dB, the minimum was 70. With only seven subjects, the true minimum is likely below 70 dB. They reported that the standard deviation was 7 dB; so the 2 standard deviation limit that should encompass 98% of the population predicts that the minimum IA is 61 dB, which we’ll round to 60 dB.

There are more data available re: TDH-style earphones. The minimum IA value is 48 dB in most literature. We can “afford” to round that up to 50 dB so long as we are using a 10 dB “pad” – a little extra in case of calibration errors or in case we are testing those with extremely low interaural attenuation. Remember this slight “cheat” in considering when to mask – err on the side of caution.

The Acoustics of Speech, the Acoustics of Interaural Attenuation

How loud is speech at each 1/3 octave band? Where is the bulk of the speech energy? Table 10-1 shows data from Cox and Moore (1988) – there is more energy in the average speaker’s voice in the low frequencies (long term average, of speaker with overall level of 70 dB SPL). Conveniently and coincidentally, the insert earphone interaural attenuation values are greater in the low frequencies. Table 10-1 shows the lowest minimum IA values for the published studies reviewed. This is good news. Where there is more speech energy, there is more interaural attenuation, so we can treat the crossover as being equal intensity level across the frequencies.

Table 10-1. Although speech has greater low-frequency energy, insert earphones have greater low-frequency interaural attenuation. The net result is that air-conduction speech will be at a relatively even level across the frequency range when it reaches the non-test ear cochlea.
Frequency (Hz) 250 500 1k 2k 4k 8k
Spectrum Level of Speech Energy (dB SPL) 60 62 55 49 46 45
Interaural Attenuation Minimum (dB) 63 62 56 50 50 60

Check Need for Masking – Is the Crossover Above NTE Bone-Conduction Thresholds in the 500 to 8k Hz Range?

In determining the need for masking for spondee threshold or word recognition testing, consider if any of the speech energy could be heard, by bone-conduction, in the non-test ear. Examine the range from 500 Hz to 8000 Hz. Why not 250 Hz?

When formula masking spondee thresholds, we need to consider which frequencies contribute to the hearing of the words. Generally, hearing only 250 Hz energy would not be enough to permit one to differentiate which spondee was presented. But, hearing at essentially any other frequency could permit speech understanding. (Hearing at 250 Hz will contribute to the spondee understanding, however. It will aid in vowel recognition.) Word understanding of monosyllabic words also won’t occur just if hearing 250 Hz.

If obtaining a speech detection threshold, then even 250 Hz hearing should be considered: Detection requires only audibility.

The patient with an audiogram shown in Figure 10-1 may have a right ear 120 dB HL spondee threshold (if the audiometer output goes that high), or may have no recognition at all – that’s pretty likely given the cochlear distortions that a hearing loss of that magnitude would create.

Figure 10-1

Figure 10-1. You would not expect the patient to recognize spondee words in the right ear when properly masked, but you would attempt testing so that your results are complete. Assume that the audiometer output can test as high as 120 dB HL.


We would calculate the need for masking based upon the assumption that the interaural attenuation is 60 dB. If 120 dB HL speech can be presented, then we need to consider -- would the non-test ear hear the crossover (by bone conduction)? Figure 10-2 illustrates.

Figure 10-2

Figure 10-2. In the worst-case scenario, the interaural attenuation would be as little as 60 dB for speech, so words presented at 120 dB HL could be audible by bone-conduction at the areas of the cochlea encoding ~1350 Hz and lower frequencies. Examine the best bone-conduction threshold – if the cross hearing is possible at any frequency, 500-8k Hz, then masking is warranted. Masking is needed in this example. The crossover can be detected in the NTE at the cochlea, using the 500 Hz and 1k Hz places.

Word Recognition Supra-Threshold Testing Frequently Requires Masking

Pure-tone masking has trained the eye to recognize that large interaural differences signal the need for masking. You’ll need to learn new strategies in order to not miss the need for speech masking. You will need to think about the presentation level – and compare that to the non-test ear’s best bone-conduction thresholds. Because you are testing at a level that is well above threshold, and well above the non-test ear bone-conduction thresholds, you will frequently need to mask.

Word Recognition Cross-Hearing – Detection Alone May Not Permit Open-Set Word Recognition. Consider the Sensation Level

It’s easier (requires less hearing) to repeat spondee words from a list of a handful of choices than it is to correctly recognize what word was presented during word recognition testing. So for word recognition testing, a little bit of crossover might not truly improve the word recognition score. However, audiologists tend to be cautious and if there is any possibility of the non-test ear participating then contralateral masking is used. This means sometimes we will mask when it wasn’t essential. Figure 10-3 offers an example where the minimal cross hearing during word recognition testing does not do much to aid in figuring out which word was heard. Very little speech energy would be heard in the left ear. If this patient scored even modestly well (e.g. perhaps 40% or better), without contralateral masking having been used, then I would doubt that the non-test ear hearing (alone) could explain that score. Consider the degree to which the speech could be suprathreshold in the non-test ear. If it is marginal, as in Figure 10-3, then it is not an aid to word recognition. However, it doesn’t hurt to mask to eliminate any possibility that the non-test ear is contributing to the word recognition.

Figure 10-3

Figure 10-3. There is no need to mask for right ear spondee testing. The expected spondee threshold is about 60 dB HL. Crossover would be at 0 dB HL and inaudible. However, if testing word recognition at 85 dB HL in the right ear, the crossover (25 dB HL) is audible. Masking should be conducted, but a very low word recognition score would be expected if only the cross-hearing is contributing to word understanding. Only the speech energy below ~600 Hz is audible in the non-test (left) ear cochlea.

High-Intensity Speech Testing (e.g. Rollover Testing) Requires Masking (Sometimes Even Without Significant Asymmetry)

Audiologists may test word recognition at high intensity levels in order to determine if word recognition performance is poor or has decreased when intensity was increased – a sign of potential retrocochlear involvement. Since high test stimulus levels are used, crossover is high; participation of the contralateral ear would invalidate test results. Figure 10-4 illustrates.

Figure 10-4

Figure 10-4. Crossover of the right ear 100 dB HL speech signal to the non-test (left) ear could significantly improve the word recognition score. Even though the asymmetry is not large, masking when testing word recognition at high intensities is necessary.

Even with symmetrical hearing, high intensity speech testing requires masking. Even if the right ear in the audiogram in Figure 10-4 had the same hearing as the left ear, if word recognition were assessed at 100 dB HL, there will be the same crossover to the non-test ear

Spondee Threshold Masking

Recognizing the Need for Masking Summarized

Estimate spondee threshold, add 10 dB. Subtract 60 dB (IA for insert earphones). If this is above the possible bone-conduction thresholds 500 – 8k Hz, then masking is needed.

  • To be efficient with spondee threshold formula masking, you want to make sure that you are masking when needed “from the start.” If you obtain the unmasked spondee threshold, then recognize the need for masking, you have to do extra testing. To avoid this possibility, consider the highest spondee threshold you would expect. The two-frequency average plus about 10 dB would be a good estimate. You want to go 10 dB “above” your estimate so that, if the spondee threshold is a bit higher than your estimate, you don’t have to go back and mask.
  • Subtract 60 dB, the interaural attenuation value (for insert earphones, 50 for TDH).
  • Compare this level to the non-test ear bone-conduction thresholds at 500 to 8k Hz (include 250 Hz if obtaining a speech detection threshold). If the probable bone-conduction thresholds are lower – you need to mask.
  • If immittance is normal, use the NTE air-conduction thresholds as your guide for whether masking is needed. To be safest, consider the “what if” – what if the bone-conduction threshold is 10 dB lower? Use that as your estimate, again, so that you don’t obtain the unmasked threshold, then after obtaining bone-conduction thresholds, realize that you did have to mask. If immittance is abnormal and/or symptoms suggest conductive involvement, then most likely you would assume 0 dB HL bone-conduction thresholds, or perhaps even lower if the patient is young.

Formula for Min Mask: Expected Spondee Threshold – 40 dB (for inserts) + Largest Significant NTE Air-Bone Gap

As with formula masking for pure tones, guessing the threshold will help you be efficient during testing. The pure-tone average should help you predict the spondee threshold you will eventually obtain. To reiterate, you won’t want to use exactly the predicted threshold in your minimum masking formula. If threshold comes in even a little higher, that would mean you have undermasked, and would need to increase the masking level. It is better to build the “what if the spondee threshold is a bit higher than estimated” into the formula.

Masking noise presented to the non-test ear is attenuated by any conductive loss in that non-test ear, so if you have any reason to believe there is conductive loss, estimate it (and use the worst case scenario – that it is as large as it could realistically be.) Add that into the minimum masking formula. As for pure-tone testing, if the air-bone gap is the 5 dB or so air-bone gap that is due to test-retest variability, those don’t need to be accounted for. When immittance is abnormal, then you should be very concerned, and the safest course is to assume bone-conduction thresholds are normal and calculate the air-bone gap. (As with pure-tone masking, you might want to switch to a MMax approach.)

Minimum Masking Level = Expected SPONDEE THRESHOLD + 10 in case it’s a bit higher + 10 for pad – 60 for IA + the largest significant NTE air-bone gap at any frequency 500-8k Hz.

This formula simplifies to: Expected Spondee Threshold – 40 dB (for inserts) + Largest Significant NTE Air-Bone Gap

TDH: Since the IA value is 10 dB lower, it would be SPONDEE THRESHOLD – 30 + Largest ABG (500 to 8 kHz)

Formula for Max Mask (both Spondee Threshold and Word Recognition): Best TE BC threshold from 500 to 8k Hz + 55 dB

How much masking is absolutely safe? If none of the crossback is audible, that is ideal. Assume a 60 dB IA at each frequency.

We can safely present up to 55 dB above the best bone-conduction threshold. If we go all the way up to 60 we risk some crossback interference.

So, the formula is MMax = Best TE bone-conduction threshold (500 to 8 kHz) + 55 dB

This formula holds the same for both spondee threshold and for word recognition testing, but as an upcoming section will discuss, a little bit of cross back may not hurt. As long as the speech signal is well above the crossback, the speech is still audible.

When thinking about the potential crossback and the number to use as “Best TE bone threshold”, think about cochlear sensitivity rather than the maximum output of the audiometer. For example, in Figure 10-5 below, the cochlear sensitivity at 500 Hz is probably 80 dB HL, which is higher than the audiometer bone-conduction circuit can produce. (Overmasking is dictated by the noise interfering with the test ear’s hearing, so that is what we need to consider. You won’t have overmasking unless the crossback is 80 dB or higher.) It would be a good idea to consider the possibility of a slight conductive loss, however. If you wish to be “extra safe” you could use 10 dB below the guessed best bone-conduction threshold.

Review – Example

Let’s calculate MMIn and MMax for the case shown in Figure 10-5.

MMin = Expected Spondee Threshold (85 dB HL) + 10 (ST could be 95 dB HL) – IA (60 dB) + 10 dB pad.

The short form is MMin = 85 – 40 = 45 dB EM.

MMax = Best test ear BC threshold that could be found in the 500-8k range (80 dB HL), but for safety, calculate MMax based on a threshold that is 10 dB lower than what I really think the cochlear sensitivity will be. MMax = 70 dB HL + 55 = 125 dB EM.

Figure 10-5

Figure 10-5. Assume that the loss is sensorineural in each ear. The estimated spondee threshold is about 85 dB HL. Masking is needed: Crossover could be 25 dB HL and audible at 500 and 1000 Hz.

To reiterate, the maximum formula has us look at the 80 dB 500 Hz threshold, yielding a maximum of 135 dB EM. But let's use a true worst case scenario. Maybe there is a slight conductive overlay – cochlear sensitivity might be as good as 70 dB HL. 70 plus 55 dB: 125 dB EM – a level at or above the audiometer’s output and a level that would be intolerably loud and grossly inappropriate! (Remember, this is going into the left ear which has normal low-frequency hearing and high-frequency recruitment.)

The “Down 20” Formula

There is a simple formula that works well when the loss in each ear is sensorineural. It simply advocates that you estimated the spondee threshold, and use contralateral noise that is 20 dB lower than this level. That will give you a value between minimum and maximum when you have sensorineural hearing loss. You don’t have to guess the spondee threshold as being higher, but do consider that if the need for masking is borderline, it is better to mask when it’s possibly needed. This will prevent you from having to come back and mask should the spondee threshold come in a bit higher than the pure-tone average.

As will be further discussed below, if there is conductive loss in the non-test ear (which lowers the effectiveness of the contralateral masking), this formula may lead you to undermask. When there is conductive loss in the test ear, your stimulation will be loud, and your contralateral masking also needs to be more intense. More intense masking noise means a greater likelihood of crossback. There is a chance of crossback that can go unrecognized with the simple “Down 20” formula. Generally though, to have overmasking the conductive loss needs to be greater that the loss severity that one typically sees.

There is nothing magical about the 20 dB number. Preceptors may prefer a “Down 25” or “Down 30” formula.

In Figure 10-5, MMin = 45 dB EM; MMax = 125 dB EM. “Down 20” for the expected 85 dB HL spondee threshold indicates use of 65 dB EM. Isn’t that a lot easier! But remember – the Down 20 Formula works well for bilateral sensorineural loss, but not as well for conductive loss, as the sections below will describe further.

Spondee Threshold Testing Examples

Asymmetrical Sensorineural Loss Example Reviewed Again

Let’s examine another case of asymmetrical sensorineural loss. See Figure 10-6 and its legend.

Figure 10-6

Figure 10-6. Assume that this loss is sensorineural bilaterally. MMin for spondee threshold testing = 100 dB HL – 40 = 60 dB EM. MMax = 85 dB HL (the 500 Hz bone-conduction threshold of 95 that you would measure if your audiometer let you test at that high an intensity, minus 10 dB in case cochlear sensitivity is slightly better than anticipated) + 55 = the ludicrously loud 140 dB EM. The Down 20 rule would recommend use of 80 dB EM.

NTE Conductive Loss Spondee Threshold Masking Example

Non-test ear conductive loss reduces the effectiveness of masking noise, raising the minimum masking level. This can cause the “Down 20” formula to undermask. Figure 10-7 illustrates.

Figure 10-7

Figure 10-7 Assume that immittance is consistent with left ear conductive loss, and prior audiometric testing indicated that the right ear loss is sensorineural. The NTE largest air-bone gap (at 500 to 8k Hz) is assumed to be 50 dB. (If you want to make it 60 dB, that’s not wrong.)

The minimum masking formula = Expected spondee threshold (~100) – 40 + 50 ABG = 110 dB EM. While that is loud, remember that not all of the sound is audible – it is attenuated by the non-test ear conductive loss

The maximum masking level is again the stratospheric 85 dB (best reasonably expected cochlear hearing sensitivity anticipated in the right ear, estimated as 10 dB better than the 500 Hz AC threshold) + 55 = 140 dB EM.

The Down 20 level – expected 100 dB spondee threshold minus 20 dB – is 80 dB EM. This illustrates that the Down 20 formula can lead to undermasking with NTE conductive loss. The 80 dB EM is lower than the 110 required to minimally mask the left ear

Test Ear Conductive Loss Spondee Threshold Examples

When the test ear loss is conductive, overmasking becomes a concern, and the “Down 20” formula may lead you to overmask. Refer to Figure 10-8.

Figure 10-8

Figure 10-8. Assume that the patient’s symptoms and immittance results are consistent with the finding of conductive loss in the right ear.

The minimum masking level is ~65 dB expected SPONDEE THRESHOLD – 40 = 25 dB EM.

The maximum masking level could be assumed to be 5 dB (best bone-conduction threshold) or 0 dB (if you want to be even more conservative) + 55: either 60 or 55 dB EM. Any level between 25 and 55/60 dB EM would adequately mask.

The Down 20 rule calls for 45 dB EM. This falls within the range of minimum and maximum. A more extreme test ear conductive loss is shown in Figure 10-9.


Figure 10-9

Figure 10-9. To have a right ear conductive loss this severe would be rare – perhaps a congenital absence of a middle ear cavity and/or atresia (that still magically allows you to use insert earphones).

The minimum masking level is the estimated spondee threshold (~95 dB HL) – 40 = 55.

Assuming that bone-conduction threshold is 5 dB, perhaps 0 dB HL, giving a maximum masking level of 55-60 dB EM. Note the very narrow range between minimum and maximum with this truly maximal conductive loss in the test ear.

The Down 20 formula would call for 75 dB EM, which would overmask with this (very rare) true maximum conductive loss in the test ear.

Summary and Discussion of Masking Dilemmas for Spondee Threshold

The “Down 20 rule” works most times – the common exception is NTE significant conductive loss, where undermasking can occur. TE conductive loss can cause the “Down 20” rule to have problems, but you would need a near true maximum conductive loss before overmasking becomes a concern. The “Down 20” formula is simple (and therefore less prone to error) so it is recommended for the “plain vanilla” cases of asymmetrical sensorineural hearing loss, which is what you see most often in most clinical situations.

To review, if you have NTE conductive loss, then “Down 20” can undermask. Base your calculations on Minimum Masking Levels: go above that.

If the test ear has conductive loss, double check. Calculate MMax and make sure that it is below your “Down 20” level is below MMax. The test ear conductive loss has to be very large before overmasking becomes a concern when obtaining a spondee threshold.

If you have significant bilateral conductive loss, you will need to carefully calculate minimum and maximum. Similar to pure-tone testing with bilateral conductive loss, a masking dilemma may occur when trying to obtain the spondee threshold.

Example Spondee Threshold Masking Dilemma

Figure 10-10

Figure 10-10. Classic masking dilemma audiogram. It appears there is bilateral conductive loss. If you were to attempt to (plateau) mask the pure-tone results, you would have the erroneous finding of two dead ears. Consider what you would do if you did NOT yet have bone-conduction thresholds, but assume from immittance and patient report that the loss is conductive, most likely bilaterally. Let’s examine the left ear. I expect a ~65 dB HL unmasked spondee threshold. Can I mask to find that threshold, if it is truly the left ear threshold? (Perhaps the left ear is dead, and I’m measuring crossover from presenting to the left ear, but stimulating the right cochlea.)

The Minimum Masking formula = estimated spondee threshold (65) – IA (60) + 10 (in case the spondee threshold is a bit higher than I guessed) + NTE biggest ABG from 500-8k Hz (75 dB) = 90 dB HL.

The Maximum Masking Level = best bone (0 dB HL) + 55 dB HL = 55 dB EM. This defines the masking dilemma: I need a minimum of 90 but I will potentially overmask with 55 dB EM. If the loss were less severe, for spondee threshold testing (not for word recognition), you could try to plateau mask. Note that the Down 20 rule, if the loss is bilaterally conductive, would be 65 estimated spondee threshold – 20 = 45 dB EM. The “Down 20” rule undermasks dramatically. The 45 dB EM is not even audible!

Word Recognition Testing

Recognizing the Need for Masking

Although you need to mask for word recognition testing more often than for spondee threshold testing, at least you know the presentation level since you choose it! That eliminates that ambiguity you had with spondee threshold testing (what if it comes in a bit worse than I guessed?) To determine the need for masking, take the level of speech you plan to use for word recognition testing, subtract 60 dB. If there is any reason to believe that the non-test ear bone-conduction thresholds (in the 500 to 8k Hz range) are lower than that, then mask.

Formula for Min Masking: Word Recognition Testing Presentation Level – 50 dB + Largest Significant NTE ABG (500 to 8k Hz)

The minimum masking formula changes from what is used for spondee threshold testing. With spondee testing, you entertained the possibility of the spondee threshold coming in a bit higher than your best guess, and adjusted the masking level to account for that. You don’t do that for word recognition testing, since you know exactly the level you will use when testing, so the formula is presentation level – 60 dB IA + 10 dB pad + largest significant NTE air-bone gap (500 to 8k Hz) which simplifies to presentation level – 50 dB + largest significant NTE air-bone gap (500 to 8k Hz).

Formula for Max Mask: Best TE BC threshold from 500 to 8k Hz + 55 dB

The maximum masking formula is the same as for spondee threshold testing: Find the best estimated bone-conduction threshold (and guess low rather than high to be safe, you don’t want to overmask). Add 55 dB. While this is the “ideal” maximum, a little bit of crossback isn’t going to lower the test ear word recognition score. If you can avoid going above MMax, please do so, but if the crossback is well below the perceived loudness of the speech signal, then it will not lower the word recognition score. We will examine this idea further below.

The “Down 20” Formula

The simple “Down 20” formula remains simple: Subtract 20 from the presentation level. As with spondee threshold testing, this formula risks undermasking with NTE conductive loss, and overmasking with TE conductive loss, but as we will explore further below, the overmasking probably won’t interfere with understanding the monosyllabic words presented to the test ear.

Word Recognition Examples

Asymmetrical Sensorineural Hearing Loss

As with spondee threshold testing, the “Down 20” formula works well for word recognition testing. See Figure 10-11.

Figure 10-11

Figure 10-11. This case assumed sensorineural loss bilaterally. Choosing the level for word recognition is challenging. To ensure audibility, I’d like to test at 120 dB HL, but that might be uncomfortable for the patient. Let’s assume the patient can tolerate that level, at least for a 25-word list, and that the audiometer is capable of producing that output level.

The minimum masking level = 120 – 60 IA + 10 pad = 70 dB EM.

(The formula simplifies to 120 – 50 = 70 dB EM.)

The maximum masking level is theoretically 70 dB HL 500 Hz threshold (in case there is an insignificant 10 dB conductive overlay at 500 Hz) plus 55 dB: 125 dB EM.

Down 20 recommends 100 dB EM. Again, for this bilateral sensorineural loss the Down 20 formula recommendation is in between minimum and maximum. We have the “Goldilocks” level – not too loud, not too quiet, just right. What’s not to love about this simple formula? Conductive loss, see next figures.

NTE Conductive Loss Word Recognition Example

As was true for spondee threshold testing, conductive loss in the NTE requires higher masking levels, and the “Down 20” rule can undermask.

Figure 10-12

Figure 10-12. This is a case of left ear (NTE ear) conductive loss. Again, let’s assume the patient can tolerate 120 dB HL word recognition testing.

Min Mask = 120 - 50 + largest NTE ABG of 50 dB = 120 dB EM.

The right ear was assumed to have sensorineural loss. Max Mask = best realistic bone at 500 Hz and above, and we will again consider that it’s possible that there may be a small conductive overlay. Use 85 as the best reasonably possible 500 Hz bone-conduction threshold, add 55. Yazza. If you were able to produce 140 dB EM in the left ear, that would still not be problematic – it would not overmask.

So, how does the “Down 20” rule fair? Not well. But you know that already – NTE conductive loss attenuates the masking noise and can lead to undermasking. Here, the Down 20 rule says 120 presentation level – 20 = 100 dB HL, but the minimum needed was 120 dB EM. That is insufficient masking noise.

Test Ear Conductive Loss Word Recognition Testing Examples

Figure 10-13

Figure 10-13. The right ear loss is presumed to be conductive based on immittance testing. Since conductive losses don’t have recruitment, I would like to test word recognition at 30-40 dB sensation level. Let’s assume that the patient finds 100 dB HL speech comfortable.

Min Mask = 100 – 50 (-60 IA plus the 10 dB pad) = 50 dB EM

Potentially one may find a 0 or 5 dB BC threshold for the right ear, so Max Mask = 55/60.

Down 20 rule indicates that we would use 80 dB EM and that has a potential to cause problems. The crossback that could be as much as 20 dB, and that might start to interfere with speech understanding. (The signal is at about 35 dB sensation level, so the 20 dB of crossback probably would not cause a reduction in the word understanding score.)


Let’s examine a larger test ear conductive loss next – see Figure 10-14.

Figure 10-14

Figure 10-14. Let’s make the right ear conductive loss extreme and do the calculations using the maximum output of the audiometer, 125 dB HL.

Min Mask = 125-50 = 75 dB EM.

Max Mask will remain at 50/55 dB EM since we are assuming that the loss in the right ear is conductive. We know we have to use that 75 dB EM, so let’s think more about the consequences of this overmasking. If the interaural attenuation is 60 dB (remember, most people will have greater IA values), then the crossed back noise level at the test ear cochlea is 15 dB EM. If the spondee threshold had been 95 dB HL, this means the 125 dB HL word recognition stimuli are presented at 30 dB SL. The signal-to-noise ratio is still favorable (I consider above 10 dB favorable). The patient should be able to understand 30 dB sensation level test words even with 15 dB SL crossback at the same cochlea, so I would feel comfortable using the 75 dB EM minimum masking level.

Here the Down 20 formula really doesn’t work: it overmasks. The recommended 105 dB EM could cross back as 45 dB EM in the test ear cochlea, which could eliminate the hearing of the 30 dB sensation level signal.

Some Crossback Doesn’t Matter: Consider the Signal-to-Noise Ratio at the TE Cochlea

As the examples above have introduced, crossback isn’t always “the end of the world.” This concept applies for spondee threshold testing as well, but word recognition testing involves presenting supra-threshold level stimuli, requiring higher masking levels, and greater crossback levels are seen.

If some of the noise crosses BACK to the test ear cochlea, that can interfere with word recognition, but it does not necessarily do so. Examine Figure 10-15.

Figure 10-15a
Figure 10-15b

Figure 10-15. The right ear has a large conductive overlay. Word recognition testing is conducted at 110 dB HL. The crossover can be heard in the left ear. The minimum masking level = 110-50: 60 dB EM. The maximum masking level = 15+55 = 70. Using anything within that narrow range of 60 to 70 dB EM would be fine, but let’s assume the audiologist was having an off day and used the “Down 20” formula, presenting 90 dB of noise to the left ear. As shown on the bottom of the figure, this could cause some crossback to the cochlea, which is audible at and below 750 Hz. Would it degrade word recognition performance? No, that’s unlikely. The sensation level of the speech signal (red arrows on the top figure) is above the sensation level of the crossback (blue arrows on the bottom of the figure.)

Masking Dilemmas with Word Recognition Testing

It is possible to have a masking dilemma for word recognition testing – the same situation as creates a masking dilemma for pure-tone testing: bilateral maximum or near maximum conductive loss. Refer to Figure 10-16.

Figure 10-16

Figure 10-16. Immittance test results are consistent with bilateral conductive loss: pure-tone testing with masking presents with a masking dilemma. So will spondee threshold and word recognition testing. Assume that you test left ear word recognition at 100 dB HL, which will be about 30 dB sensation level. The minimum masking level = 100-50+largest NTE ABG (about 70 dB) = 120. The crossback level (120-60) = 60. Since the loss is conductive, that means that the crossback is at about 60 dB sensation level. The noise that crosses back to the test ear cochlea will be louder than the speech sensation level (30 dB SL). The person won’t hear the words, let alone be able to repeat them. You’ll have the same problem with finding the spondee threshold. If the left ear spondee comes in at 65 dB HL, our formula recommends to use (65-40+~70) at minimum 95 dB EM, which could crossback at 35 dB EM, which will prevent hearing. (Remember, the cochlea does the speech recognition – the level of speech reaching the cochlea after attenuation by the conductive loss is about 0 dB HL. The crossback of 35 is more than enough to prevent you from measuring a 70 dB spondee threshold.) What would happen? You would find an spondee threshold perhaps of 40 dB HL instead – elevated by the crossback. Then you would recognize that you aren’t using enough masking noise since the spondee threshold was not what was expected. You’d increase the noise – and that would cause more overmasking, elevating the spondee threshold further.

Reporting on Results Influenced by Overmasking

When you have a potential masking dilemma, my recommendation is to calculate your minimum and maximum levels. If your minimum is above the maximum, think about the amount of crossback that could be created when you use that minimum level – and think about it frequency by frequency. Compare the crossback to the sensation level of the stimulus. If your sensation level is higher than the crossback by 10 dB or more, then the crossback likely has little or no effect

If there is more crossback, e.g. crossback is at 30 dB above the bone-conduction thresholds of the test ear, and the word recognition signal is 35 dB above the air-conduction thresholds, then the word recognition score is probably lowered. This is particularly true for those with sensorineural loss – their word recognition scores are lowered in the presence of competing signals. If you cannot lower the noise without undermasking, then I suggest that you make a note in your report, e.g. “Contralateral masking noise crossback potentially lowered the word recognition score.” You could also test unmasked and note “Because of the masking difficulties/dilemma potentially reducing the word recognition score, unmasked testing was conducted, but is influenced by the non-test ear participation. True word recognition performance is likely in the range between the masked and unmasked scores.”

Summary of Speech Masking Concepts (Insert Earphone Use)

Need for Masking

When masking for speech, particularly word recognition testing, you must be vigilant about the need to mask. Remember to determine if the signal level – 60 dB is above the NTE best bone-conduction threshold (in the range of 500 to 8000 Hz). If so, masking is needed.

”Down 20” Formula is Ideal for Sensorineural Hearing Loss, Generally Adequate for Test Ear Conductive Loss, but Undermasks Significant NTE Conductive Loss Cases

Stay away from the simple “stimulus level – 20” rule if the non-test ear is conductive. It works wonderfully for sensorineural losses, and is simple to use. When the test ear has conductive loss, consider whether the crossback might interfere. (Noise level – 60: is that significantly above the TE bone-conduction thresholds? If so, determine the stimulus sensation level. You are still OK if the stimulus is at least 10 dB above the crossback level.) Be very careful with significant NTE conductive loss, there it is better to use the minimum masking formula. The Down 20 rule tends to cause undermasking with NTE conductive components

Minimum Masking Formula: Expected Spondee Threshold – 40 + Largest NTE ABG; Word Recognition Testing Level – 50 + Largest NTE ABG

We want to avoid having to raise the masking noise if the spondee threshold is slightly higher than predicted. Take your best estimate of the spondee threshold and adjust the formula by 10 dB more, in case the spondee threshold is a bit higher than your guess, which gives us the formula Minimum Level = Expected Spondee Threshold – 40 + Largest Significant NTE Air-Bone Gap (500 to 8k Hz).

For word recognition testing, you know precisely what the stimulus intensity is, so the formula is Minimum Level = Expected Spondee Threshold – 50 + Largest Significant NTE Air-Bone Gap (500 to 8k Hz).

MMax is the Same for Word Recognition and Spondee Testing, but You Don’t Care about Minor Overmasking, Especially with Word Recognition Testing

Since the minimum speech IA is 60 dB, the maximum masking level formula is Max Mask = Best Test Ear Bone-Conduction Threshold (500 to 8k Hz) + 55 dB. If you want to be conservative, you can lower your estimate of the bone-conduction threshold by 10 dB (in case there are minor air-bone gaps).

Remember that some crossback and overmasking may not be a problem, especially with sloping losses. If the crossback is only audible in the lowest frequencies, then even for spondee threshold testing, the results may not be invalidated. With word recognition testing, you are presenting at a level that is suprathreshold. If the crossback is well below the level of the speech at the test ear cochlea, then the crossback doesn’t affect the word recognition score much if at all. Ideally, you would examine the sensation level of the noise above the test ear bone-conduction thresholds. Compare to the sensation level of the speech above the test ear air-conduction thresholds. You would want the speech sensation level to be at a minimum 10 dB louder. If undermasking problems prevent you from lowering the speech level when this criterion is not met, document that the word recognition scores may be lowered due to overmasking.

References:

  • Cox, R.M. & Moore, J.N. (1988). Composite speech spectrum for hearing aid gain prescriptions. Journal of Speech and Hearing Research, 31, 102-107.

  • Sklare, D.A., & Denenberg, L.J. (1987). Interaural attenuation for Tubephone™ insert earphones. Ear and Hearing, 8(5), 298-300