Vocal fold vibration is caused by air pressure from the lungs pushing the folds apart, followed by a combination of tissue elasticity and aerodynamic suction pulling them back together. This cycle repeats hundreds of times per second during speech and singing, producing the sound that becomes your voice. The folds don’t vibrate because of repeated nerve signals telling them to open and close. Instead, they sustain their own oscillation through a self-reinforcing interaction between airflow and tissue.
The Core Mechanism: Air Pressure Meets Elastic Tissue
The explanation accepted since the late 1950s is called the myoelastic-aerodynamic theory. It works like this: before you make a sound, muscles in your larynx bring the two vocal folds together, closing the gap between them (called the glottis). With the gap sealed, air pressure builds beneath the folds. Once that pressure is high enough, it forces the folds apart and a burst of air escapes upward.
As air rushes through the narrow opening, it speeds up, and faster-moving air creates a drop in pressure along the inner surfaces of the folds. This low-pressure zone, a consequence of the Bernoulli principle, helps pull the folds back toward each other. At the same time, the folds’ own elastic tissue acts like a stretched rubber band snapping back to its resting position. Together, the suction effect and elastic recoil close the glottis again, pressure rebuilds below, and the whole cycle starts over.
In a healthy larynx at a comfortable speaking pitch, it takes only about 4 centimeters of water pressure beneath the folds to kick off this cycle. That’s roughly the pressure you’d feel blowing gently through a straw. People with vocal fold lesions like polyps or scarring typically need nearly double that pressure, around 7 centimeters of water, to start phonation.
Why the Vibration Sustains Itself
A simple open-close cycle wouldn’t keep going on its own. For vibration to be self-sustaining, the airflow has to feed more energy into the tissue than the tissue loses during each cycle. Research using synthetic vocal fold models has confirmed that this energy balance is positive: across a full vibration cycle, the net transfer of energy goes from the airstream into the vocal fold tissue, not the other way around.
The key to this energy transfer is timing. The air pressure pushing on the folds during the opening phase is higher than the pressure acting against them during the closing phase. Because the folds receive a bigger push when they’re moving outward than the resistance they encounter moving inward, each cycle adds a small surplus of energy that keeps the oscillation going. Measurements show that the middle third of the fold length accounts for over 80% of this energy transfer, with the very center point alone contributing about 45%.
The Mucosal Wave: A Ripple Through Layered Tissue
If the vocal folds moved in and out as rigid blocks, the pressure timing described above wouldn’t work. What makes self-sustained vibration possible is that different parts of the fold surface move at slightly different times, creating a ripple-like motion called the mucosal wave. The bottom edge of each fold begins to open before the top edge, and it begins to close before the top edge too. This vertical phase difference means the fold’s cross-sectional shape is constantly changing throughout the cycle, alternating between a shape that converges toward the top and one that diverges. That shifting geometry is what creates the pressure asymmetry between opening and closing.
This wave-like behavior depends on the vocal folds’ layered structure. Each fold has five distinct tissue layers. The outermost is a thin sheet of epithelium, only 50 to 80 micrometers thick (roughly the width of a human hair) but relatively stiff. Beneath it sits the superficial layer of the lamina propria, a gel-like tissue that is dramatically softer, sometimes a thousand times less stiff than the epithelium above it. Deeper still are intermediate and deep layers of the lamina propria, and at the core sits the thyroarytenoid muscle.
This stiffness mismatch is essential. The soft superficial layer slides freely beneath the stiffer epithelium, allowing the surface to ripple in waves rather than move as a single unit. Experiments on excised human vocal folds have shown that removing the epithelium alone is enough to disrupt the mucosal wave, reduce how fully the folds close, and break the left-right symmetry of vibration. In computational and silicone models, including a distinct epithelium layer over a very soft superficial layer produces more realistic vibration, lower pressure requirements to start phonation, and more prominent mucosal waves compared to simplified models that lump these layers together.
How Muscles Control Pitch and Volume
While airflow drives the vibration itself, the muscles of the larynx control the properties of that vibration, particularly pitch. The fundamental frequency of vocal fold vibration, which you perceive as pitch, results from the interplay of three tissue characteristics: length, mass per unit length, and tension. Thinner, longer, tenser folds vibrate faster and produce higher pitches, following the same physics as a guitar string.
Two muscles do most of the work. The cricothyroid muscle, which connects the thyroid and cricoid cartilages on the outside of the larynx, tilts the cartilages relative to each other in a way that stretches the vocal folds longer and makes them thinner. This increases tension and raises pitch. The thyroarytenoid muscle, which forms the bulk of the fold itself, contracts to shorten and thicken the folds, lowering pitch. It also pulls the vocal process (the point where the fold attaches to the arytenoid cartilage) inward and downward during the act of bringing the folds together.
Volume is controlled primarily by subglottal pressure. Pushing more air from the lungs increases the pressure beneath the folds, which forces them farther apart during each cycle and produces stronger bursts of air. The result is a louder sound. Increasing muscle tension in the folds themselves also affects loudness by changing how firmly and quickly the folds snap shut during each closing phase.
How the Vocal Tract Influences Vibration
The vocal folds don’t vibrate in isolation. The air column sitting above them in the throat, mouth, and nasal passages, collectively the vocal tract, pushes back on the folds and can either help or hinder their vibration. When the vocal tract has what physicists call inertive reactance (meaning the air column resists changes in airflow in a way that’s timed to assist the folds), it effectively gives the folds a small aerodynamic boost during each closing phase. This makes vibration more efficient and reduces the muscular effort needed to sustain phonation.
This is part of the reason certain vocal exercises work. Singing or humming through a narrow straw, for example, creates a constriction that raises the pressure above the folds, pushes them slightly apart, and reduces the impact force when they collide. Rather than squeezing the folds harder together to compensate, a trained vocalist learns to balance the resistance at the glottis with the resistance in the vocal tract. When that balance is achieved, the folds can vibrate with less forceful contact, which is gentler on the tissue.
What Disrupts Normal Vibration
Because the mucosal wave depends on precise tissue properties, anything that changes the stiffness, mass, or symmetry of the vocal folds will alter vibration. Vocal fold scarring is one of the most common clinical problems. Scar tissue replaces the soft, pliable superficial layer with stiff, fibrous material, disrupting the mucosal wave and forcing asymmetric vibration. Scarring can result from surgery, voice overuse, chronic inflammation, radiation therapy, or trauma.
Different types of asymmetry tend to map to different conditions. Left-right asymmetry, where one fold vibrates differently from the other, is associated with cysts, unilateral vocal fold paralysis, polyps, and a condition called Reinke’s edema where fluid accumulates in the superficial layer. These conditions change the effective mass or stiffness of one fold relative to the other, throwing off the synchronized vibration that produces a clear voice. The audible result is typically breathiness, roughness, or a strained vocal quality, all reflecting the mechanical reality that the two folds are no longer vibrating as matched partners.