Auditory Processing

Sound is encoded as action potential trains by the cochlea and transmitted along the 8th nerve into the cochlear nucleus. Central auditory processing exemplifies several general principals of sensory neuronal conduction and control. Information is processed in parallel, with subdivisions of the auditory pathway adapted for specialized feature extraction and it is hierarchical. The temporal properties of sound mean that the preservation of timing information during auditory processing is of fundamental importance.

Keywords: hearing; sound localization; auditory pathway

Figure 1. (a) Section through the human head showing auditory canal, middle ear and inner ear. (b) The cochlea is a fluid-filled tube coiled into a three-turn spiral. (c) In cross-section it has three compartments, with the hair cells arranged in rows running the length of the cochlea within the organ of Corti. Sound vibrations cause displacement of hair cell stereocilia relative to the overlying tectorial membrane.
Figure 2. Scanning electron micrograph of the organ of Corti with the tectorial membrane removed. Three rows of outer hair cells and the single row of inner hair cells can be seen, which extend for the length of the cochlea. Each hair cell is capped by a cluster of stereocilia; small movements of these ‘hairs’ are responsible for sensing sound vibrations. Inner hair cells sense sound and transmit this information via the type I spiral ganglion afferents; the outer hair cells transmit information via smaller type II afferent fibres and are thought to serve as the cochlear amplifier. Micrograph kindly provided by Dr. David Furness, Keele University, UK.
Figure 3. The encoding of sound by the cochlea. (a) Sensory transduction. This figure shows the sequence of events leading to the transduction of sound into a train of action potentials. Sound enters the auditory canal, moving the tympanic membrane, which vibrates the three ossicular bones in the middle ear. The stapes transmits this amplified signal to the oval window in the cochlea. The resulting travelling wave displaces the basilar membrane and causes bending of the inner hair stereocilia (IHCs). This movement changes the activity of ion channels, causing depolarization or hyperpolarization of the hair cell membrane potential, which in turn modulates calcium influx and transmitter release onto the type I spiral ganglion cell afferents. The resultant action potentials propagate into the cochlear nucleus via the 8th nerve. (b) Tonotopy. An uncoiled cochlea is shown (see Figure 2b) with two travelling waves represented above. A high-frequency (orange) sound resonates the base of the cochlea and generates action potentials in the spiral ganglion cells projecting to that region. A low-frequency sound (green) activates a region of the cochlea close to the apex and this stimulates a different group of afferents. The action potential response is similar in both fibres (below), but the different cochlear positions of the hair cells that respond to a given frequency of sound allows this information to be encoded into a particular set of afferent fibres. This tonotopic relationship is preserved at many levels of auditory processing. (c) Characteristic or best frequency. The tonotopic relationship means that each afferent fibre responds best (with the lowest threshold) to sound of a given frequency. Each afferent responds to sound of a characteristic frequency. (d) The volume of a sound is thought to be encoded by the number of action potentials. This probably includes action potentials both within and around a particular frequency range; that is, as the volume increases, additional afferents are recruited (volley principle).
Figure 4. Ascending auditory pathway. The three levels of central auditory processing are illustrated. The brainstem auditory pathway consists of the cochlear nucleus and superior olivary complex; the midbrain contains the inferior and superior colliculi and the auditory thalamic relay (the medial geniculate nucleus), which projects to the primary auditory cortex (AI) and the surrounding secondary auditory centres. Abbreviations: CN, cochlear nucleus; DCN, dorsal cochlear nucleus; aVCN, anterioventral cochlear nucleus; DAS, dorsal acoustic stria; IAS, intermediate acoustic stria; SOC, superior olivary complex; LSO, lateral superior olive; MNTB, medial nucleus of the trapezoid body; MSO, medial superior olive; PON, periolivary nuclei; LL, lateral lemniscus; DNLL, dorsal nucleus of the lateral lemniscus; INLL, intermediate nucleus of the lateral lemniscus; VNLL, ventral nucleus of the lateral lemniscus; IC, inferior colliculus; CIC, central nucleus of the inferior colliculus; BIC, brachium of the inferior colliculus; DCx, dorsal cortex of the inferior colliculus; LN, lateral nucleus of the inferior colliculus; SC, superior colliculus; MGN, medial geniculate nucleus; dMGN, mMGN, vMGN, dorsal, medial and ventral MGN, respectively; AI, primary auditory cortex; Py, pyramids; Tz, trepezoid body.
Figure 5. The cochlear nucleus. The cochlear nucleus (CN) is located on the lateral edges of the brainstem, below the cerebellum; this diagram shows the nucleus as viewed from the side. The spiral ganglion axons that carry the input from the cochlea enter the CN via the 8th nerve. The axons maintain their tonotopic relationship and bifurcate into anterior-ventral and rostrodorsal pathways. The dorsal (DCN) and ventral (VCN) cochlear nuclei form the first central processing of the auditory input. The DCN is a laminar structure concerned with the spectral properties of a sound, while the VCN forms the major inputs to the binaural brainstem pathways concerned with sound localization. The locations of the principal cells are indicated by the symbols and by the dashed lines.
Figure 6. Cells of the anterioventral cochlear nucleus have characteristic morphologies and firing properties. Firing properties are depicted here as poststimulus time histograms (PSTH). These represent the rate of firing plotted against time from the start of a sound stimulus (as indicated by the filled bar). (a) The primary afferents or spiral ganglion axons give a short burst of action potentials at the start of a sound and then a sustained discharge for the duration of the sound. This is known as a ‘primary’ response pattern. (b) Bushy cells are so called because they have just few dendrites, which tend to form a bush-like appearance. They receive their primary afferent input from the endbulbs of Held and follow the firing pattern of the primary afferents very closely; hence they are often referred to as having a ‘primary-like’ firing pattern. (c) Stellate or multipolar cells have a more conventional neuronal appearance, they respond to a tone with short bursts of action potentials with distinct, precisely timed pauses, which are called ‘chopper’ responses. (d) Octopus cells possess thick sparsely branched dendrites that tend to originate from one pole of the soma. They fire predominantly at the onset of a stimulus.
Figure 7. Descending auditory pathways. Each level of the auditory pathway possesses both ascending and descending pathways. The descending projection to the hair cells originates in the superior olivary complex of the brainstem (from the same regions involved in sound localization – see below) and forms an important part of the sensitivity control of the cochlear sense organ. These efferent axons form the olivocochlear bundle (OCB) and originate from either medial or lateral divisions of the superior olivary complex (SOC). The lateral division makes presynaptic connections with primary efferent terminals on the inner hair cells, while the medial division makes direct contact with the outer hair cells.
Figure 8. The brainstem binaural auditory pathway. This diagram shows a transverse section of the brainstem at the level of the seventh nerve. The 8th nerve enters the aVCN and excites the bushy cells. The globular bushy cells send a large diameter axon that crosses the brainstem in a tract called the trapezoid body. The terminals of these axons form the calyx of Held giant synapses on the cell bodies of principal cells in the medial nucleus of the trapezoid body (MNTB). The MNTB in turn gives an inhibitory projection to both the medial superior olive (MSO) and the lateral superior olive (LSO). The MSO receives a bilateral excitatory input from the spherical bushy cells and is responsible for interaural time difference computation. The LSO receives an excitatory input from the ipsilateral bushy cells, which is integrated with the inhibitory input from the MNTB as the first stage of interaural level difference computation.
Figure 9. The Jeffress model for interaural time difference computation in the medial superior olive (MSO). Three MSO cells are shown. Each receives excitatory synaptic inputs from both cochlea onto opposite dendrites. The path length of the axons varies along the length of the nucleus, the longer path lengths will give a longer latency response and hence act as a delay line. The cell labelled ‘Right’ receives the longest delay-line response from the right ear and the shortest delay-line response from the left ear. At zero time a sound originating from the far right-hand side is heard, it generates an excitatory postsynaptic potential (EPSP) in all the MSO neurons, but with a range of latencies. About 600 s later the same sound reaches the other ear and triggers EPSPs from the left side. This sound crosses the brainstem and activates the same population of cells. The latency of the left and right EPSPs is the sum of their respective conduction time (+c, for the sound to cross the head) and the delay introduced by the delay-line (+dl). Only those MSO cells that receive coincident left and right EPSPs will be able to generate an action potential as shown in the lower inset. Cell 3 will only receive a coincident input when the sound originates from the right; this is signalled by firing an AP. By the same mechanism cell 1 will only receive coincident EPSPs when the sound originated from the far left. By setting up a complete range of conduction delays, the 180° range of azimuth locations can be specified in this model.
Figure 10. Sound localization in the barn owl. The barn owl uses ITDs to discriminate azimuth location and ILD to detect vertical location. The auditory field of ‘view’ is shown for timing isotherms (ITD, upper graph measured in microseconds), and intensity (ILD, lower graph measured in dB). Combining these two maps of auditory space pinpoints the origin of the sound as indicated in the overlaid map (centre).
Figure 11. Low-threshold potassium currents are responsible for enabling the MNTB neuron to faithfully follow the pattern of activity in its calyx of Held. Dendrotoxin-I (DTx) degrades the transmission of action potentials across the calyx of Held/MNTB synapse. Using whole-cell patch clamp recordings from the MNTB neuron in an in vitro brain slice preparation, the synaptic responses can be evoked by electrical stimulation of the axons in the trapezoid body, as shown in the inset. The postsynaptic responses are shown below, with single stimuli on the left (a, c) and a train of stimuli at 100 Hz shown on the right (b, d). Normally, a single presynaptic action potential generates one postsynaptic action potential (superimposed on the EPSP). In the same recording a 100 Hz presynaptic stimulus generates a train of action potentials in the postsynaptic MNTB neuron, which closely follows the stimulus train. Addition of 100 nmol L–1 dendrotoxin-I blocks a low threshold potassium current and allows the giant EPSP to generate a train of action potentials in response to a single stimulus. The response to a 100 Hz train is now totally confused with many more action potentials being generated. This evidence and other voltage-clamp data demonstrate that low-threshold potassium currents play an important role in damping excitability, so that the EPSP can generate one precisely timed action potential rather than multiple action potentials that are not coherent with the timing of the auditory input. Reproduced with permission from the Journal of Neuroscience (Brew and Forsythe, 1995).
close
 References
    Barnes-Davies M and Forsythe ID (1995) Pre- and postsynaptic glutamate receptors at a giant excitatory synapse in the rat auditory brain stem. Journal of Physiology 488: 387–406.
    Beckius GE, Batra R and Oliver DL (1999) Axons from anteroventral cochlear nucleus that terminate in medial superior olive of cat: observations related to delay lines. Journal of Neuroscience 19: 3146–3161.
    Belin P, Zatorre RJ, Lafaille P, Ahad P and Pike B (2000) Voice selective areas in the human auditory cortex. Nature 403: 309–312.
    Borst JGG, Helmchen F and Sakmann B (1995) Pre- and postsynaptic whole-cell recordings in the medial nucleus of the trapezoid body of the rat. Journal of Physiology, London 489: 825–840.
    Brew HM and Forsythe ID (1995) Two voltage-dependent  K+ conductances with complementary functions in postsynaptic integration at a central auditory synapse. Journal of Neuroscience 15: 8011–8022.
    Forsythe ID (1994) Direct patch recording from identified presynaptic terminals mediating glutamatergic EPSCs in the rat CNS, in vitro. Journal of Physiology 479: 381–387.
    Forsythe ID, Tsujimoto T, Barnes-Davies M, Cuttle MF and Takahashi T (1998) Inactivation of presynaptic calcium current contributes to synaptic depression at a fast central synapse. Neuron 20: 797–807.
    Funabiki K, Koyano K and Ohmori H (1998) The role of GABAergic inputs for coincidence detection in the neurons of nucleus laminaris of the chick. Journal of Physiology 508: 851–869.
    Gale JE and Ashmore JF (1997) An intrinsic frequency limit to the cochlear amplifier. Nature 389: 63–66.
    Geiger JRP, Melcher T, Koh DS et al. (1995) Relative abundance of subunit mRNAs determines gating and  Ca2+ permeability of AMPA receptors in principal neurons and interneurons in rat CNS. Neuron 15: 193–204.
    Grigg JJ, Brew HM and Tempel BL (2000) Differential expression of voltage-gated potassium channel genes in auditory nuclei of the mouse brainstem. Hearing Research 140: 77–90.
    Grothe B (2000) The evolution of temporal processing in the medial superior olive, an auditory brainstem structure. Progress in Neurobiology 61: 581–610.
    Helfert RH and Schwartz IR (1986) Morphological evidence for the existence of multiple neuronal classes in the cat lateral superior olivary nucleus. Journal of Comparative Neurology 244: 533–549.
    Isaacson JS and Walmsley B (1996) Amplitude and time course of spontaneous and evoked excitatory postsynaptic currents in bushy cells of the anteroventral cochlear nucleus. Journal of Neurophysiology 76: 1566–1571.
    Jones EM, Gray-Keller M and Fettiplace R (1999) The role of  Ca2+-activated  K+ channel spliced variants in the tonotopic organisation of the turtle cochlea. Journal of Physiology 518: 653–665.
    Joris PX, Smith PH and Yin TCT (1998) Coincidence detection in the auditory system: 50 years after Jeffress. Neuron 21: 1235–1238.
    Konishi M (1993) Listening with two ears. Scientific American 268: 66–73.
    Kotak VC, Korada S, Schwartz IR and Sanes DH (1998) A developmental shift from GABAergic to glycinergic transmission in the central auditory system. Journal of Neuroscience 18: 4646–4655.
    Macica CM and Kaczmarek LK (2001) Casein kinase 2, determines the voltage dependence of the Kv3.1 channel in auditory neurons and transfected cells. Journal of Neuroscience 21: 1160–1168.
    Manis PB and Marx SD (1991) Outward currents in isolated ventral cochlear nucleus neurons. Journal of Neuroscience 11: 2865–2880.
    McAlpine D, Jiang D and Palmer AR (2001) A neural code for low frequency sound localisation in mammals. Nature Neuroscience 4: 396–401.
    Oertel D (1983) Synaptic responses and electrical properties of cells in brain slices of the mouse aVCN. Journal of Neuroscience 3: 2043–2053.
    Olsen JF, Knudsen EI and Esterly S (1989) Neural maps of interaural time and intensity differences in the optic tectum of the barn owl. Journal of Neuroscience 9: 2591–2605.
    Otis TS, Raman IM and Trussell LO (1995) AMPA receptors with high  Ca2+ permeability mediate synaptic transmission in the avian auditory pathway. Journal of Physiology 482: 309–315.
    Palmer AR and King AJ (1982) The representation of auditory space in the mammalian superior colliculus. Nature 299: 248–249.
    Perney TM, Marshall J, Martin KA, Hockfield S and Kaczmarek LK (1992) Expression of the mRNAs for the Kv3.1 potassium channel gene in the adult and developing rat brain. Journal of Neurophysiology 68: 756–766.
    Raman IM, Zhang S and Trussell LO (1994) Pathway-specific variants of AMPA receptors and their contribution to neuronal signalling. Journal of Neuroscience 14: 4998–5010.
    Ramanathan K, Michael TH and Fuchs PA (2000) Beta subunits modulate alternatively spliced, large conductance, calcium-activated potassium channels of avian hair cells. Journal of Neuroscience 20: 1675–1684.
    Rudy B, Kentros C, Weiser M et al. (1992) Region-specific expression of a  K+ channel gene in brain. Proceedings of the National Academy of Sciences 89: 4603–4607.
    Schlaug G, Jancke L, Huang Y and Steinmetz H (1995) In vivo evidence of structural brain asymmetry in musicians. Science 267: 699–701.
    Schnupp JWH, King AJ, Smith AL and Thompson ID (1995) NMDA-receptor antagonists disrupt the formation of the auditory space map in the mammalian superior colliculus. Journal of Neuroscience 15: 1516–1531.
    Smith AJ, Owens S and Forsythe ID (2000) Characterisation of inhibitory and excitatory postsynaptic currents of the rat medial superior olive. Journal of Physiology 529: 681–698.
    Smith PH, Joris PX and Yin TCT (1993) Projections of physiologically characterized spherical bushy cell axons from the cochlear nucleus of the cat: evidence for delay lines to the medial superior olive. Journal of Comparative Neurology 331: 245–260.
    book von Békésy G (1960) Experiments in Hearing. New York: McGraw-Hill.
    Wang H, Kunkel DD, Martin TM, Schwartzkroin PA and Tempel BL (1993) Heteromultimeric  K+ channels in terminal and juxtaparanodal regions of neurons. Nature 365: 75–79.
    Wang L-Y, Gan L, Forsythe ID and Kaczmarek LK (1998) Functional contribution of a Shaw-like high-threshold potassium channel (Kv3.1) to the phase-locking capability of neurones in mouse auditory brainstem. Journal of Physiology 509: 183–194.
    Yin TCT and Chan JCK (1990) Interaural time sensitivity in medial superior olive of cat. Journal of Neurophysiology 64: 465–488.
 Further Reading
    book Altschuler RA, Bobbin RP, Clopton BM and Hoffman DW (eds) (1991) Neurobiology of Hearing: The Central Auditory System. New York: Raven Press.
    book Bregman AS (1994) Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, Mass. and London: The MIT Press.
    book Ehret G and Romand R (eds) (1997) The Central Auditory System. New York: Oxford University Press.
    book Geisler CD (1998) From Sound to Synapse: Physiology of the Mammalian Ear. New York: Oxford University Press.
    Kass JH, Hackett TA and Tramo MJ (1999) Auditory processing in the primate cerebral cortex. Current Opinion in Neurobiology 9: 164–170.
    book Lorente de Nó (1981) The Primary Acoustic Nuclei. New York: Raven Press.
    Oertel D (1999) The role of timing in the brain stem auditory nuclei of vertebrates. Annual Review of Physiology 61: 497–519.
    Trussell LO (1999) Synaptic mechanisms for coding timing in auditory neurons. Annual Review of Physiology 61: 477–496.
    book Young E (1998) "Cochlear nucleus". In: Shepard GM (ed.) The Synaptic Organisation of the Brain, pp. 121–158. New York: Oxford University Press.
Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Forsythe, Ian D(Jul 2001) Auditory Processing. In: eLS. John Wiley & Sons Ltd, Chichester. http://www.els.net [doi: 10.1038/npg.els.0000017]