[The pre-lecture quiz is in preparation - instructions will appear here when it is ready]
This week’s lecture will be on the evolution of speech, the default (but not the only) human modality for language. In the lecture I will focus on the evolution of the neural mechanisms underlying speech and vocal learning, and how learning sets up the self-organisation of contrastive systems of speech sounds: for this reading, we’d like you to do relevant background information on the physical apparatus behind speech (mainly the vocal tract), by reading Fitch Chapter 8.
The bulk of this chapter is on the position of the larynx in humans and other animals – Fitch has done important work here, using comparative data to overturn some widely-accepted but incorrect assumptions about the uniqueness of the human descended larynx.
As described in the reading, most animals, and very young humans, have a larynx that is positioned high in the throat, which allows them to engage the larynx directly with the velum, forming a nice tight seal which prevents food/liquid being inhaled.

In many other animals, and young human infants, the larynx engages with the velum and nasal cavity, making it possible to breathe through the nose (red arrows) while swallowing (blue arrows). Image from http://thebrain.mcgill.ca/flash/capsules/outil_bleu21.html.
The resting position of the human larynx is lower in the throat, and too far down to allow the larynx to engage with the velum in this way – the larynx moves to its lower position early in development (around age 3 months), with a second descent to an even lower position occurring during puberty in males only (which turns out to be informative about possible evolutionary functions for the low position of the larynx in humans, see below). This low larynx position increases (at least hypothetically) the risk of choking – every time you swallow, the stuff you are swallowing has to pass over the top of your windpipe and into the oesophagus, the epiglottis pulls down over the opening to the windpipe to cover the opening (see this nice animation). It has long been thought that the risks of the descended larynx must be outweighed by some advantage – maybe related to language?

One possibility is that the lower position of the larynx also drags the tongue root down into the pharynx, which gives us a two-tube vocal tract: we can manipulate the size of the oral cavity with the tongue tip, and independently manipulate the pharyngeal tube by pushing the tongue backwards and forwards – this is really nicely illustrated in the MRI images on this page, under section 2 “Vowel Articulation”. The idea is that this two-chamber vocal tract gives access to a wider range of formant frequencies, boosting the range of distinctive speech sounds we can produce.
However, as explained in section 8.4 of the Fitch reading, it turns out that the low position of the larynx/tongue root in humans isn’t actually very unusual. Firstly, many other mammals can dynamically reconfigure their vocal tract during vocalisation – they pull their larynx low in the throat while vocalising, giving them (temporarily) a two-tube vocal tract very similar in configuration to ours. You can see this in action in the X-ray movies of a vocalising goat and dog that Fitch has made available on his lab website – in case the originals don’t work for you (they don’t work for me), I have included local copies here: goat, dog. Second, it turns out that many other species (e.g. koalas, big cats, deer) have a permanent low larynx position, giving them a two-tube vocal tract similar to humans - you can see the low resting position of the larynx (big lump in the throat), and also the fact that these animals pull it even lower when vocalising, in Fitch’s video of a vocalising red deer, reproduced here.
There are two consequences of this. Firstly, since many mammals can reconfigure their vocal tract while vocalising, previous attempts to figure out the position of the larynx/tongue root in fossil hominids are rather pointless – even if we were able to infer a high resting position, there would still be the possibility that the vocal tract was reconfigured during vocalisation. Second, since a (permanently or temporarily) descended larynx/tongue root is seen in species that don’t have language or even complex vocalisations (e.g. deer, big cats – they basically just roar), there must be some other pressure that explains the convergent evolution of the descended larynx in humans and these other species.
Fitch suggests that the most likely explanation is size exaggeration - the lower your larynx, the longer your vocal tract, and the bigger you sound. This can be ‘faked’ to a certain extent by pulling your larynx lower, but is intrinsically an honest signal – you can’t pull your larynx outside your body, so bigger individuals have longer vocal tracts and sound bigger. Size exaggeration might be useful in sexual displays, in male-male competition (think of the roaring red deer) or more generally in territory defence (both male and female big cats roar, and both have a permanently descended larynx that emphasises their size for anyone hearing it). In humans, the second descent of the larynx in puberty in males suggests a sexual signalling function; the descent of the larynx in both sexes at age 3 months is presumably driven by different pressures, applying to both sexes. The descended larynx may therefore give us a nice vocal tract for producing contrastive speech sounds, but needn’t (initially) have been selected for this function.
The overall picture on the human vocal tract is therefore that, sadly (?), humans aren’t as special as we thought in physical apparatus for speech production. Fitch briefly reviews the evidence suggesting that we also aren’t special in our auditory apparatus for speech perception – we have a standard mammal ear, and other mammals have been shown to exhibit e.g. categorical perception effects that were at one point assumed to be uniquely human. The conclusion is therefore that any human-unique adaptations for speech must be in the brain, not in the peripheral apparatus.
