Acoustics.

A term that can embrace all aspects of the science of sound and hearing, but is here treated in two specific senses, that of room acoustics, considered only with reference to the performance of music, and that of sound-source acoustics, limited to various classes of musical instruments and the voice. For other acoustical matters see Hearing and psychoacoustics and Sound; for the history of the subject see Physics of music.

I. Room acoustics

II. String instruments

III. Keyboard string instruments

IV. Wind instruments

V. Percussion instruments

VI. The voice

RONALD LEWCOCK, RIJN PIRN (with JÜRGEN MEYER) (I), CARLEEN M. HUTCHINS (II, 1–6 (8 with JOHN C. SCHELLENG and BERNARD RICHARDSON), 9), J. WOODHOUSE (II, 7), DANIEL W. MARTIN/R (III), ARTHUR H. BENADE/MURRAY CAMPBELL (IV), THOMAS D. ROSSING (V), JOHAN SUNDBERG (VI)

Acoustics

I. Room acoustics

1. Introduction.

2. Reflection.

3. Resonance, reverberation and absorption.

4. Insulation against noise.

5. Radio and television studios.

6. Introduction to the history of acoustics.

7. Classical times.

8. Medieval times.

9. Renaissance and Baroque periods.

10. 18th and 19th centuries.

11. The science of acoustics.

12. The contemporary performance of early music.

BIBLIOGRAPHY

Acoustics, §I: Room acoustics

1. Introduction.

A room that has good acoustics is one in which it is possible to hear each sound clearly in all parts of the room; or, in other words, a room in which the sound is adequately loud and evenly distributed. In addition, it is normally required that the quality of sound being listened to in the room should match the type of sound being produced by the source. Room acoustics are relied on in some cases to sustain the sound in the room after the original source has stopped producing it, thus masking unevennesses in the ensemble, while in other cases sound too much sustained would mask the clarity of individual instruments or small groups. Acoustical problems are further complicated if opera is to be performed, for here every syllable is expected to be clearly heard and understood, and therefore only moderate sustained sound is desirable, yet the large ensemble demands sustained sound. Although scientific study permits a certain degree of accuracy in acoustical design, great difficulty is still experienced in determining the correct specification of the acoustics that ought to be provided.

Acoustics, §I: Room acoustics

2. Reflection.

Sound travels across a room in the form of vibrations in the air. Inevitably the amount of energy is diminished as the sound waves spread across the room, which means that there is a limit to the distance an average sound will travel without becoming faint. For increased loudness one normally relies on reflections from walls, ceiling and floor to augment the direct sound arriving at the ears. These reflections are also the source of the reverberation of sound in the room (see §3 below).

Sound can conveniently be thought of as spreading out from its source along straight paths and, like light, casting a shadow when it meets an obstruction (fig.1a). But the nature of this shadow depends on the relationship between two quantities, the wavelength and the dimension of the obstruction. The waves ‘bend’ at the edges of the obstruction, so that if the wavelength of a sound is large compared with the width of the obstruction, practically no shadow is formed (fig.1b). This condition is easily achieved with low-pitched sounds and the objects or screens in a normal room.

Sounds can be focussed to a point by concave reflectors, in the same way as a headlight beam is focussed, or spread out by convex reflectors so that their effect is diminished (fig.2a and b). Because the wavelengths of sounds are so much longer than those of light, the sizes of the reflectors needed to perform these tasks are quite large. For middle C an adequate size would be about 2·5 m.

Similarly, sound can be reflected by a plane wall in just the way light is reflected by a plane mirror. There is an ‘image’ formed behind the wall, which acts as the imaginary source for all sound reflected from the wall. As with light, the angle of incidence is equal to the angle of reflection (fig.2c). Very small reflectors do not work effectively for fundamental sounds, and reflectors for even the higher instrumental sounds need to be relatively wide; for the lowest range of notes reflectors more than 6 m wide are necessary.

If there is too much reflection in a room the sound may be loud and reverberant, and the endless reflections produce booming effects. Or concave reflectors may focus sounds so that some areas of the room receive little or none. Examples are the use of a curved wall behind an orchestra, which produces ‘sound foci’ in parts of the audience (fig.3a), and the use of a hard domed surface on the ceiling of a ballroom; in fig.3b the curved ceiling of the section produces the maximum focussing effect and by corollary the maximum area of diminished sound, sometimes known as a ‘dead spot’. A further example is that of the curved rear wall in an auditorium, which may concentrate sound back on the source or on the people in the front rows (fig.3c).

Echo is one of the most serious problems introduced by reflections. Fortunately it occurs only when there is a pronounced audible gap between the direct sound and the first reflection (or between two reflections). In other words, an echo is a discrete reflection that stands out over and above the other reflections. Ability to hear echoes varies with the individual, but a time interval of 0·08 seconds can be perceived by most people as an echo in music (as compared to less than 0·04 seconds in speech), and has therefore to be avoided. The distances travelled by the two sound paths would have to differ by 27 m before this time interval would occur in music (13·5 m in speech) (fig.4c). This means that any reflector behind the source or behind the listener and more than 13.5 m away is potentially likely to produce an echo (fig.4a and b). Reflectors in side walls or ceilings can generally be further away before they produce echoes, as the difference in length between direct and reflected sound paths is less than in the former case.

A complicated echo occurs when two reflective walls, or a reflective floor and ceiling, are exactly parallel and opposite each other. The difference in paths of travel of the sounds necessary to produce the echo is then formed again and again, resulting in a multiple or ‘flutter’ echo (fig.4d). This is particularly disturbing to the person producing the sound, but may also be heard by members of the audience. For this reason it is normal practice to ensure that reflectors are not exactly parallel; the deviations from the true parallel need not be so much that they are seen.

Two of the problems in room acoustics may be solved with the aid of properly designed reflecting surfaces. The first is the transmission of sound from the front to the back of the room so that it may be heard with reasonable loudness (yet without introducing any artificial coloration, as would almost inevitably happen with electronic amplification). The second is the problem of uneven distribution of sound, which is dealt with by ‘diffusion’, with the aim of producing a ‘diffuse sound field’.

(i) Transmission and the design of reflectors.

The transmission of sound from the front to the back of a room is normally aided by specially designed reflecting surfaces. As an example, consider a recital room of small size, with a flat floor, fairly low ceiling and a raised podium for the performer. It is shown in fig.5a before the reflectors are designed, in fig.5b after reflectors on walls and ceilings have been calculated and in fig.5c after additional angled reflectors have been added to strengthen the sound. The remaining surfaces are not useful areas for strengthening the loudness of the sound, and indeed may be dangerous if they are left as flat reflectors, introducing echoes or sounds that are too prolonged. For this reason the remaining areas of wall and ceiling are usually treated as absorbent or diffusing surfaces (see below).

In larger rooms the shape of the walls, of the ceilings, and even of the floor, may be determined by acoustic needs. Shaping the floor is usually thought necessary when the audience numbers more than 100, and desirable even when it is only 50. The audience seats are raised on tiers so that sound can travel unobstructed to the ears, passing over the heads of the people in front; this compensates to some extent for the greater distance sound has to travel. As an added improvement the musicians may also be raised on tiers so that they are unobstructed by performers in front. It is an old adage among designers that ‘if one can see well one can hear well’. Fig.6c shows a floor shape thus determined, and fig.6d shows the plan of the seating so that the entire audience has a clear view of the whole source of sound. Three sources are shown (S, S1, S2), together with their images (I, I1, I2) produced by reflection from the various reflecting surfaces, the images being constructed here geometrically.

Electronic amplification may be used when reflection is insufficient to produce a suitable volume, but the argument that there is concomitant coloration and distortion has tended to discourage its use except in the special case of electronic music or when quiet instruments (e.g. harpsichord, guitar) are required to sound well in a large hall.

(ii) Reflectors as diffusing surfaces.

Any rough surface will scatter sound waves, and hence ‘diffuse’ the sound field. Unless the roughness is pronounced, however, the sounds affected will be limited to those at the extreme upper end of the frequency scale. In order to affect sounds over the whole of the frequency range the roughness of the wall has to be of the order of at least 0·75 m and generally it is designed even larger. In the design of diffusing surfaces curved surfaces are often favoured, whether in concave sections, in convex sections or undulating (fig.7). Research has suggested that diffusing surfaces made up of rectangular parallelepipeds are equally efficient, but diffusion can also be achieved in quite different ways, by alternating small areas of absorbing and reflecting materials, or by the use of so-called stepped or profile diffusers, which consist of wells of unequal depth.

It is an ideal in acoustics to produce a ‘diffuse sound field’, so that the sounds reaching the audience are coming from every direction at equal strength. This ideal is never attained, but its approximation is important in producing predictable acoustical behaviour in a room.

Acoustics, §I: Room acoustics

3. Resonance, reverberation and absorption.

The property of sympathetic vibration is encountered in its direct form in room acoustics in the rattling of window panes, light shades and movable panels in the presence of very loud sounds, such as may occasionally be produced by a full organ. As these things rattle (or even if they do not audibly rattle) sound energy is being converted into mechanical energy, and so the sound is absorbed. Wood panelling and anything else that is lightweight and relatively unrestrained have the same effect. Absorptivity is at its highest at the resonant frequency, usually near or below 100 Hz.

Volume resonance occurs when standing waves are created by correspondences between the wavelengths of a fundamental sound and the dimensions of the room, and may result in uneven distribution of sound. This effect is at its worst in small rooms and becomes decreasingly serious in large volumes, where the dimensions are so great that they exceed the fundamental wavelengths of the lowest audible sounds.

A sound that is prolonged by multiple reflections around walls, floor and ceiling is said to have reverberated. The time of reverberation can be used as a simple yardstick to compare the capacities of different rooms for prolonging sound but for the yardstick to be practically serviceable, all the variables have to be specified. These include the frequency at which the reverberation is tested, and the range of loudness over which the decay is measured. Thus, for practical purposes, the ‘reverberation time’ is defined as the time taken for the sound in a room to die from 60 decibels to inaudibility (fig.8). It is customary to compare the reverberation times of rooms at ‘mid-frequency’ (an average of values measured at 500 Hz – just below c'' – and 1000 Hz), but for fuller comparisons reverberation time at 125, 250, 2000 and 4000 Hz are also used, to provide a composite picture of the prolonging characteristics of each room throughout the musical spectrum.

Reverberation is determined by the ability of sounds to bounce around a room for some time, that is, by the number and area of reflecting surfaces. A larger room naturally has sounds travelling for a longer period and the reverberation is more prolonged, though it can be reduced by replacing reflecting surfaces with absorbing ones. A reverberant room offers less clarity but is louder than a non-reverberant room, and vice versa. Analyses have been made of the acoustic characteristics of many concert halls throughout the world that are thought to have ‘good’ acoustics so that they may be compared and a synthesis of the optimum acoustic characteristics determined. The reverberation characteristics are summed up in the graph in fig.9. Using this it is possible to compare the reverberation of a projected room (calculated in advance by means of a standard formula) with the accepted aggregate norm. However, there are more recent additional criteria for satisfactory acoustics, which are discussed below.

Audience size affects reverberation markedly. In concert and recital halls where most surfaces are reflective, people are often the main absorbers of sound. In an endeavour to reduce the effect of this inevitably variable function, the seating is usually designed to provide a maximum of absorption when empty; it is covered with softly padded fibrous material, and the underneath surfaces perforated. But this is only a partial solution to the problem of varying audience size, for at middle frequencies the absorption of the seat is little more than half the absorption provided when a person is sitting in it.

The absorbing surfaces in a room vary in efficiency with the pitch of the sounds reaching them. High frequencies are normally absorbed by fibrous materials – woollen curtains or carpets, or specially designed surfaces incorporating fibrous materials. Sometimes cheap wood fibre blankets are placed behind perforated surfaces to achieve the same end. Glass fibre or slag wool blankets may also be used in this way, or wood fibre may be pressed into boards or tiles (‘acoustic tiles’) that are drilled or otherwise roughened to allow sound to penetrate into the material. Low frequencies are absorbed by using the capacity of resonant surface materials to absorb energy in the manner previously described. Resonant surfaces of this type usually depend partly on a trapped air space behind them; in other words, they are rather like sounding boxes, which, though never activated by enough sound energy to produce audible sounds, continue to resonate whenever small amounts of energy impinge on them. The resonating surface is usually wood or some flexible panel material.

An invention applying this resonating principle to absorb low-frequency sounds is the Helmholtz resonator, which uses the principle of sympathetic vibration of an organ pipe or an open bottle. A container, generally made of concrete or fibrous cement, is fixed behind the ceiling or walls, and connected to the room only by a small opening, or ‘neck’ (fig.10a). Helmholtz resonators are more frequency selective than resonant panels, and a series of them are used to correct specific peaks in the low-frequency spectrum. For this purpose holes are often left in some surfaces in a room when it is being built, enabling Helmholtz resonators to be inserted to correct unevenness in the acoustic spectrum, should that be necessary when the room is completed.

The Helmholtz resonator principle has been used in the design of a special panelled surface that combines the advantages of all three types of absorbent discussed above, absorbing sound over a wide frequency range. The Helmholtz resonator panel surface, or perforated resonating panel, has a dense surface material (compressed hardboard or asbestos cement) perforated with holes usually 3 mm in diameter spaced approximately 25 mm apart; to the volume behind it each perforation acts as the neck of a single Helmholtz resonator (fig.10b). The frictional resistance of each hole is often increased by gluing hessian across the back of the board. Whether this is provided or not, a layer of fibrous material (slag wool or glass wool, usually 2·5 cm in thickness) behind the holes provides considerable frictional resistance and absorbs resonant vibrations as they are set up in the air space and the panel. An important factor is the size of the air space (i.e. the distance between the panel and the wall behind it); this is approximately 13 cm ideally, and the absorption reduces in efficiency as it is decreased.

Another absorbing surface, which has the advantage of improved appearance though it is less efficient, is the strip panel resonator (fig.10c). This is made by fastening narrow strips of wood side by side, leaving small air gaps between them that act as the necks of individual Helmholtz resonators, although these operate only in one dimension of the surface (i.e. at right angles to the direction of the strips). The important dimensions here are those of the width of the strips (not much more than 2·5 cm), the width of the gaps between them, which should be between one fifth and one tenth of the width of the strips, and the depth of the air space, optimally 13 cm as above. Hessian and porous materials are fixed as with perforated resonating panels. The final appearance of the wood strips may be varied considerably; they may be shaped, patterned, painted or varnished without affecting the acoustic absorbent properties of the surface. Also, using the same Helmholtz resonating surface principle, many other absorbent devices are possible. A valuable derivative is the suspended absorbent cone, made of perforated hardboard or fibrous cement, which may be hung in rooms in which the walls and ceiling are difficult to render absorbent. Today, fibrous absorbers of varying thicknesses are widely used.

Besides absorption provided by the walls, floor, ceiling and furnishings of a room, and by the audience, reverberation is also affected by the absorption of the air in the room; in particular, high frequencies are absorbed if they travel considerable distances through air. But the effect of such absorption is only really noticeable in large spaces, especially when the air is dry.

Since an acoustical experience depends not only on the reverberation time, which tends to be fairly uniform throughout a room, but also on the strength, the timing and the direction of arrival of individual reflections, the sound in no two seats, let alone two widely separated areas of an auditorium, is exactly the same. The discerning listener knows this and selects a seat accordingly. Thus ‘perfect acoustics in every seat’ is unlikely to be achieved, yet halls that are known for their good acoustics tend to be accepted as such wherever people sit.

Achieving uniformity is often most difficult in small rooms. Great pains have to be taken to create the diffuse sound field necessary for good acoustics, by scattering the absorbing surfaces so that they alternate with reflecting surfaces in relatively small areas, and by the provision of broken, concave or convex diffusing surfaces. Two spaces connected to each other by an opening, such as the stage tower volume connected to an auditorium by the proscenium opening, may produce curious acoustical affects. Coupled volumes that are more reverberant than the auditorium have been designed to enhance the acoustics of halls in which adequate reverberation is lacking. On the other hand, openings to spaces that are less reverberant than the auditorium act as absorbers.

Reverberation can be measured in completed rooms by a number of methods using physical recording equipment, or by the subjective tests of trained observers. In models it can be measured reasonably accurately, provided care is taken to duplicate materials and surfaces at a smaller scale, or to make allowances for their omission. The patterns of wave distribution may be studied by the use of wave patterns on the surface of water when model sections of the room are placed in test tanks, or by spectrum photographs of the behaviour of sound inside a model placed in a smoke chamber. These time-tested modelling methods are now being replaced by computer programs that allow the designer to study complex reflection patterns and determine parameters, the calculation of which by hand would take many hours.

In severe cases of lack of reverberation, artificial reverberation may be introduced by distributing loudspeakers around the walls, floors and ceiling of a room, and relaying suitably delayed recorded sounds through them into the room. In the analogue era a specially devised tape recorder was used which allowed delays of a fraction of a second to be achieved between recording and replay. Modern digital audio systems not only create a longer and non-colorating reverberation in dry rooms, but add controlled early reflections to enhance the clarity or spaciousness of the sound using complex digital signal processing systems and numerous microphones and loudspeakers. While the result achieved is often a great improvement if too small a room is being used for orchestral, choral or organ music, its use tends to be confined to recording and broadcasting studios, because of the resistance of performers and audiences to artificial alterations of the natural sound in live performance.

At the Royal Festival Hall, London, unexpected deficiencies were found in the frequency spectrum of sound as a result of excessive absorption at certain frequencies. To correct this, artificial resonance was introduced by placing loudspeakers in resonant cavities closely resembling Helmholtz resonators, designed to resonate at the deficient frequencies, and activated by specially placed microphones. In this way the reverberation at many frequencies was increased without audible artificiality or coloration of sound. The many parameters which create an acoustically pleasing environment are constantly revalued. Concert halls and music rooms in general cannot be rank-ordered on an absolute scale of acoustical merit. Like the art for which they are built, they evoke responses that vary from person to person. Some prefer clarity, others a ‘large’ sound that can only be had at the expense of clarity. Musicians expect the room to respond, but they also want to hear each other.

Until recently, the main room-acoustics parameter used by acousticians was reverberation time. While reverberation time is still an important consideration, other, newly developed acoustical measures are thought to be at least as important. These include the clarity index C (the ratio of ‘early’ to ‘late’ energy, in which the boundary between early and late is 0·08 seconds after the arrival of the direct sound), the loudness descriptor G and a quality called ‘spatial impression’ which depends on the directional distribution of the energy. There is continuing discussion among acoustic researchers and consultants as to which measurements are most significant. Besides these objective measures, there are some purely subjective parameters to consider. A good example is the reverence for wood among musicians – a notion that finds limited scientific support, but on which musicians assessing room performance still rely heavily, perhaps through analogy with the rich resonances associated with wood in many musical instruments.

Acoustics, §I: Room acoustics

4. Insulation against noise.

Satisfactory listening is only possible in a room which is relatively quiet. Music rooms should have noise levels approaching inaudibility. Nevertheless, a very low noise level is not always ideal. In offices, for instance, some noise (typically that of the air-conditioning system) is often felt to be preferable to no noise, because in masking other unwanted sounds, such as voices from the next room, the sound contributes to a sense of acoustical privacy. For optimum acoustics in a room for listening to music, it is essential to achieve the lowest possible noise level. As well as quietening the ventilation system, this means insulating the room against two quite different types of noise. The fist is noise reaching the room through the air, whether from inside or outside the building, usually called ‘air-borne sound’. Ideally, external sound should be eliminated, but this is often both difficult and expensive, especially as it involves sealing windows and doors, which in turn necessitates air-conditioning the room. The second is sound generated in the solid material of the building, or in the ground or solid material of neighbouring buildings, known as ‘impact sound’. Of these two, the latter is by far the more difficult to cope with, and treatment of impact sound normally deals with many of the problems of air-borne noise.

Impact sound ranges from the noise of water falling in drainage pipes to slamming doors, footsteps, and the vibration of passing trains or buses. Internally all the floors of the building are usually designed to ‘float’ on insulating pads, so that impact noises are not carried into the structure. Alternatively, the floor material itself becomes the insulator, being soft and resilient, and thick enough to absorb vibrations; materials like thick cork or pile carpeting can be used in this way. Slamming doors and plumbing and drainage noises are either eliminated by careful design, or kept from the structure by thorough insulation. External vibration travelling through the ground and then up into the building through the foundations is much more difficult to deal with. Thick fibrous or rubber pads (when compressed remaining 15 cm thick, or even thicker) may be placed under the foundations to provide some improvement. In the case of the Royal Festival Hall, the whole auditorium was raised three storeys above the ground on tall slender columns that are thought to attenuate some of the solid-borne vibrations within their height (fig.11).

Air-borne noise ranges from sounds made by telephones and instruments playing in other parts of the building to chiming clocks, traffic noises and aeroplanes. In order to insulate the auditorium from all these noises it is sometimes found best to surround it with two completely separate skins of construction supported on separate foundations at ground level (see fig.11); it is thus difficult for any vibrations set up in the outer skin to pass into the inner skin, which serves as the envelope of the room. This insulation is effective only if the gap between the two skins is at least 30 cm, and its effectiveness is sometimes slightly improved by introducing into the gap a fibrous insulating material, such as fibreglass blanket. The points of weakness in such a design are clearly the windows and doors; windows may be sealed and double-glazed with a large gap between the panes (preferably no less than 10 cm), but this is not possible for doors, so they should be made airtight using gaskets, and a ‘sound trap’ in the form of a small sound-absorbent lobby must be provided at all points of access. As a final precaution the whole of the auditorium volume is surrounded by a blanket of other rooms and foyers, all treated with absorbent surfaces so that sounds passing through them are absorbed before reaching the auditorium, as in fig.11. Even with all these precautions, complete sound insulation is never achieved, and the principle of masking referred to above has to be relied on to disguise some external noise and air-conditioning hum. A large audience usually provides a natural masking level of between 20 and 30 decibels.

Acoustics, §I: Room acoustics

5. Radio and television studios.

The acoustics of radio and television studios follow the same general principles as normal room acoustics, with more rigid standards necessitated by: the special problems of recording and transmitting sounds via microphones; the double reverberation problem introduced by having one reverberation in the room of the broadcast or recording and another in the receiving room; and the outside noise, which has an even more serious effect on broadcast than on live sound.

Reverberation becomes a major concern in studio acoustics because too much of it reduces the definition and clarity of broadcast sound. On the other hand, elimination of reverberation would weaken the ‘character’ of the sound, and is, of course, inconceivable for the performance of music in ensemble. Of particular importance is the variation of acoustic behaviour in different parts of the room, which must be avoided if several microphones are to be used at the same time. For these reasons the reverberation is reduced but not eliminated, and great pains are taken to achieve even diffusion of sound.

The commonest type of studio in a radio broadcasting centre is the small announcing studio with a listening room alongside equipped for control, editing or playback. Of domestic room size, such studios provide spoken programmes of all kinds, and from them drama and music programmes are announced and monitored. A broadcasting centre has larger studios for drama, as well as a range of studios of different sizes for the performance of music; the largest are comparable in size with concert halls.

Stereophonic transmission does not greatly alter the acoustic requirements of studios, and as a rule the same studios are used, with different microphone placings. However, noise interference must be guarded against even more stringently, and precautions taken to avoid strong reflections altering the apparent position of a sound source; the latter usually involves increasing the areas of absorption at microphone level.

The design of small studios is affected by their tendency to add colorations to the sound; these disappear if the studio is made larger. A great deal has been learnt about the acoustic correction of this and other defects in small rooms (see Gilford, 1972). Briefly, this is done in two ways: by improving diffusion, ensuring that all walls, the floor and the ceiling have approximately the same absorption; and by careful measurement of the dimensions of a rectangular room so that prominent standing waves are separated from each other by intervals of 20 Hz or more.

Since all the available evidence suggests that the radio audience prefers music with the same balance and character as under live conditions, experiments with microphone placing have gradually ceased in favour of a placing that duplicates the ears of a single listener in the body of a reasonably sized concert room, and at a distance that produces a good blend and a natural reverberation. It has been found that the ratio of intensity of reverberant sound to direct sound should be approximately 5:8, and that the absence of an audience can be compensated for by moving the microphone closer to the source than it would be under normal conditions. With these assumptions, broadcast studios for music generally have the same design problems and solutions as concert halls. Care has to be taken to achieve adequate diffusion by using irregularities to scatter the wavefronts during reflection, and to avoid masking the bass by allowing too long a bass reverberation relative to middle and upper frequency reverberation. Predominant bass is sometimes avoided by placing special absorbers near loud brass and percussion instruments. For live audiences, experience suggests that the early energy, which includes the direct sound, should be a decibel or two lower than the late (reverberant) sound.

Acoustics, §I: Room acoustics

6. Introduction to the history of acoustics.

Although a good deal was known about the propagation of sounds and the acoustics of musical instruments from ancient Greek times onwards, the acoustics of rooms was imperfectly understood until well into the 20th century, and many traditional concepts and remedies are now known to have been false. Acoustic designs were empirical, successful in the evolution of efficient shapes but permitting the growth of a good deal of mystique where ‘resonance’ and ‘absorption’ were concerned. A particular misconception was the nature of resonance in room acoustics; it was thought that the principle of the sounding box of a lute or lyre could be transferred to architectural design, not taking into account the large amounts of mechanical energy needed if such ‘resonant chambers’ were to operate as amplifiers and enrichers of sound, energy far in excess of any that could be transmitted by the travel of sound through air. The result was that in many cases the ‘resonant chambers’ acted instead as absorbers of lower-frequency sounds, quite the opposite of the intended function. Equally fallacious were many of the attempts to absorb sound; it was thought by some designers until quite recently that any material soft to the touch would absorb sound energy, whereas many resilient materials are in fact poor absorbers (e.g. cork and rubber). Wires suspended around a room above head height were used to absorb excess sound energy, although they are now known to have had negligible effect because of their small cross-sectional area. The properties of wood panelling and large volumes of air as absorbers of sound energy were not generally known. Viewed with hindsight, therefore, the acoustics of the rooms for which much great music and opera was written must be expected to vary a great deal from what would now be considered satisfactory conditions. Nevertheless, the acoustics in those rooms that were considered excellent in earlier periods often approximate closely to those that would now be chosen for the performance of the same music. This is becoming clearer as famous churches and music rooms undergo thorough testing with recently developed techniques.

Acoustics, §I: Room acoustics

7. Classical times.

Before classical times there are only the vaguest references to buildings designed to allow music or speech to be clearly heard (e.g. royal courtyards for audiences or theatre in India were to be ‘built so that each sound [svara] and letter [aksara] should be audible’, Manasara, xxxiv, 506).

The basis of the modern science of acoustics was formulated in Greece by the Pythagoreans in the 6th century bce; Aristotle and his followers, the Peripatetics, continued its development as an empirical science. Aristoxenus of Tarentum (4th century bce), whom Horace called musicus idemque philosophus, examined the study of musical sounds from a physical–acoustical point of view, going beyond the origin and propagation of sound to consider the problems of perception by the human ear. The work concerning acoustics of Vitruvius (1st century bce), the Roman writer on architecture, is largely based on Aristoxenus’s writings (see below).

Greater understanding of acoustics at the beginning of the 5th century bce is exemplifed in the design of the first of the great Greek theatres, that of Dionysos at Athens (c498 bce), and its successor, the theatre at Syracuse (475 bce). The seats were arranged in curved rows round the circular orchestra, which provided a large horizontal reflecting surface to transmit sounds made by the chorus and by actors on the raised platform that served as a stage behind. Further high-frequency reflections were provided by the scenes, painted on skins, which are thought to have served as backdrop and wings to the stage. Because of the shape of the seating the furthest distance between the stage and the back row was only about 30 m. The propagation of sound to the audience was aided by the megaphone effect of the masks worn by the actors; besides providing facial expressions of nobility, anger or mirth, these masks improved the mechanical coupling between the vibrations within the actors’ heads and the surrounding air, thus enabling more of the available vocal energy to emerge in the form of sound waves and directing it towards the audience. A notable feature of later masks was the large opening of the mouth, which concealed megaphone-shaped cavities within the lips. Cassiodorus (51st epistle) remarked that the actors’ voices were so strengthened by ‘concavities’ that it was difficult to believe they could issue from the chest of a man. Acoustical devices were used in the design of the theatre to enhance vocal effects. For example, the space under the wooden stage platform, open to the audience and partly enclosed by removable wooden panels, appears to have been intended to act as a kind of resonant chamber. A similar but smaller volume in the middle of the orchestra housed an altar that was raised for thymelic spectacles but also played an acoustical role when the chorus chose to call down into it. It is thought that the narrowness of the stage was necessary to prevent the actors stepping so far back that the orchestra floor ceased to function as the main reflector, or so far forward that the backdrop lost its reinforcing effect on the higher frequencies (fig.12).

Later developments in the shape of theatres are all thought to have been due to attempts at improving the acoustics. In the second half of the 5th century bce, notably at Catania and Magnesia, theatres became more exaggerated in shape, with side walls converging not to the orchestra but to the stage. In the 4th century bce this trend was reversed with the erection of the great semicircular theatre at Epidaurus (fig.13a). The increased length of the seating area brought more of the audience close to the stage and thus improved the acoustics, especially as less of the sound could escape at the sides of the orchestra, and consequently more was retained in the volume of the auditorium. On the other hand, the direction in which the actors were facing became of greater importance, and the height of the stage building was increased and made of stone to provide more reflection from behind and improve the distribution of the sound. In the following centuries the older theatres were enlarged, both by extending the seating at the sides and by building more seats beyond the back rows to emulate and surpass Epidaurus. Eventually the distance from the stage to the rear seats became 70 m at Syracuse and as much as 100 m at Athens. The stages had to be increased in height and further framed to contain the sound.

In the late Hellenistic drama the chorus assumed a reduced role, while action and scenery grew in importance; the stage was enlarged and began to encroach on the orchestra, so that it no longer formed a complete circle. This was the model for the Roman theatre (fig.13b), in which the stage building was for the first time completely joined to the seating. Thus the whole building became one compact form, with a very high skēnē carrying a large slanting roof over the stage, which deflected sound towards the audience. The stage could now become much deeper, since the reflecting properties of the back wall were not of such importance for the clarity of speech. Similarly, a gallery was built over the back seats to deflect sound back to them. There were even examples of parabolic ceiling reflectors, apparently for directing the sound more accurately back into the auditorium (e.g. the theatre at Orange, 1st century ce).

In De architectura (1st century bce) Vitruvius discussed both the fundamental principles of acoustics and their application to the design of theatres (bk v, chaps. 3–8). In chapter 5 he referred for the first time to acoustic vases (echea):

Now in accordance with these researches bronze vases should be made in mathematical proportions [to each other], taking into account the size of the theatre: and they should be designed so that when they are excited they sound a series of notes at intervals of a 4th, 5th, and so on up to two octaves.
Then cubicles should be built among the auditorium seats on the basis of music theory, and the vases placed in them in such a way that they are not in contact with any of the upright stonework, and have a free space around and above; they should he placed upside-down, with wedges not less than [15 cm] high under them, on the side facing the stage. And in line with these cubicles openings should be left [in] the slabs of the lower rows, [62 cm] wide and [15 cm] deep.
The method of marking out the positions in which the jars are to be placed is as follows. If the theatre is not very large, a horizontal line should be marked out, halfway up the slope [of the auditorium], and 13 vaulted cubicles built, with 12 equal intervals between them: then the sounding jars as described above are placed in them [
fig.14].
So by this arrangement, the voice, radiating from the stage as from a centre, spreads itself around [the auditorium]: and, by exciting resonance in particular vases, produces an increased clarity and a series of notes which harmonize with itself.

Chapter 5 elucidates how such a system of acoustic vases might be extended in a larger theatre, with three horizontal rows of cubicles, one for the harmonia, a second for the chromatic and a third for the diatonic (fig.15).

The term for ‘clarity’, claritas, is probably an equivalent for the Greek term lamprotēs, defined by Aristotle in De audibilibus, implying, besides distinctness, loudness and purity; and the context almost certainly implies a singing rather than a speaking voice. The function of the vases would have been to make some sounds louder than others, and to make them purer by stressing their fundamentals and suppressing their harmonics or overtones. The ‘series of notes which harmonize’ with the voice seems to refer to the fact that each vase would resonate and then re-radiate sound after the voice had ceased singing its fundamental note, so that if the concordant scale were sung, a number of the vases might be heard sounding together. In this way a kind of artificial reverberation time (estimated as 0·2–0·5 seconds) of particular quality would be produced in an open-air theatre that otherwise had none.

Although some 20th-century scientists (e.g. Knudsen) have dismissed the efficacy of such acoustic vases, others (e.g. Brüel) have attempted to duplicate their behaviour by direct experiment. As Vitruvius’s original diagrams illustrating the size and shape of the bronze vases were lost in late antiquity, these experimental vases were made in a wine-beaker shape of hard-burnt baked clay and with wider mouths than would be necessary to absorb low frequencies. The experiments showed that they enhanced reverberation at the resonant frequency, although they were not tested in an open-air theatre. There is a clear need here for further practical research. That resonance can be used to augment reverberation is confirmed by acoustical theory (see Gilford, 1972, p.155) although the amount of augmentation is limited. No remains of bronze resonators from antiquity have survived, which is scarcely surprising, since throughout the Middle Ages and the Renaissance ancient bronze was melted down for metal.

12 pairs of compartments corresponding to those described by Vitruvius have been found in the supporting wall of the uppermost row of seats of the Greek theatre at Aizani in Phrygia, eight in the podium of the Roman theatre at Nicopolis, and seven in the Greek theatre at Scythopolis in Syria. There are 20 niches in the upper part of the Greek theatre of Gerasa in Jordan; at Ierapetra and Gortyn in Crete the theatres have 13 niches each; and at Lyttos, also in Crete, there are three rows of 13 niches each (fig.16; Belli, 1854, Müller, 1886).

Vitruvius was at some pains to explain why acoustic vases were not used in the theatres of Rome built in his day. He said it was because of their wooden construction. Singers who wished to sound a loud note could direct their voices towards the scene doors (valvae) and ‘receive help’ from them; when the theatre was to be built of solid materials, such as stone or concrete, which do not resonate, then it should be equipped with the sounding jars described. According to Vitruvius there were many examples of theatres that used them in Greek cities and in the provincial towns. The theatre of Corinth was cited as a classic example; when Lucius Mummius sacked the town (146 bce) he carried off the bronze jars as part of his spoil, and dedicated them at the temple of Luna in Rome, where Vitruvius had seen three of them. Finally, Vitruvius mentioned that ‘many clever architects who have built theatres in small cities, from the want of others have made use of earthen vessels, yielding the proper tones, and have introduced them with considerable advantage’.

Besides bringing a reverberant response into the Greek theatre, it is possible that the acoustic jars helped an unaccompanied singer to keep to proper pitch for long periods. The vases resonating in various parts of the auditorium may also have served to disguise inferior musicianship by giving emphasis to musically important pitches. At the beginning of some early editions of Terence there is a short treatise in which the commentator, whose name is unknown, spoke of brass vases. He assigned to them the same use as Vitruvius, and then added:

I hear that there exists to this day something very like them, in some ancient temples, which have been preserved in their integrity down to our time. At the lower and upper parts of the roof are to be seen holes distributed on both sides, and corresponding diametrically with each other. In these holes are set vases of brass, the opening of which is smaller than the body, and is turned outwards, without projecting. The voices of those who sing in the temple, reverberating in these vases, grow more distinct and harmonious.

Acoustics, §I: Room acoustics

8. Medieval times.

With medieval acoustics one can turn from speculation about acoustic vases to the consideration of their tangible effects, for many examples survive in church buildings throughout Europe, from Russia to Britain (see Harrison, 1967–8); terms for them exist in most European languages. Although little modern research has been done on their effectiveness it appears that earthenware jars were used both as absorbers (see Brüel, 1947) and as resonators.

The methods of use of medieval acoustic jars may be roughly summarized as follows: (a) Areas of spaced jars in two or three rows inserted in the stone walls of the interior above ground level, usually about 2·5 m from the floor, with their mouths opening inwards to the nave or choir (e.g. Fairwell, near Lichfield; and St Clement’s, Sandwich, Kent). According to Viollet le Duc, in his Dictionnaire, many examples in France are placed near angles in the walls. (b) A single or double row of jars inserted in the stone walls just below the ceiling, trusses or vault, often extending down the full length of both side walls (e.g. St Nicholas, Leeds, near Maidstone, Kent – 48 or 52 vases). (c) Jars inserted at regular intervals across the stone barrel vaulting of the choir (e.g. St Martin, Angers; Bjeresjoe, Sweden – about 45 in five rows; and Montréale, France, according to Viollet le Duc). (d) Jars inserted in the sleeper walls below the choir stalls or in pits or cavities (e.g. St Peter Mancroft, Norwich – 40 jars; Fountains Abbey, York – seven jars). These are often separated from the volume of the church by the wooden flooring (e.g. St Peter Permountergate, Norwich – 16 jars) but in other cases a gap is left so that the jars are acoustically coupled to the air in the church (Church of the Cordeliers, Amiens).

The jars used were either specially manufactured (e.g. Leeds, Kent – about 50 jars with their bottoms perforated; and Luppitt, Devon – about six jars flattened on one side; fig.17), or else jars of ordinary domestic type, greatly varying in shape. Most of the jars were between 20 cm and 30 cm in length, and probably resonated at fundamental frequencies of between 90 and 350 Hz. They were 13–15 cm wide at the mouth; the mouths of the wider jars were often reduced in aperture by being placed behind perforated stone or wooden screens (Denford, Northamptonshire; and St Mary-le-Tower, Ipswich) or partly plugged with a wooden block (fig.18), but a number of vases appear never to have had any such constriction of the opening. The former seem to have been intended to act as absorbing resonators to reduce echoes in corners or in the vaults, the latter as resonators to enhance or assist sound. Some of the unconstricted vases in Scandinavia had peat or ashes in them, and it has been thought that they were intended to absorb sound by damping instead of re-radiating resonant sounds. It should be noted that many of the medieval vases were cemented into the masonry, and not placed loose in an air cavity like the Greek vases described by Vitruvius. This would not have prevented them acting as resonators, but it may have reduced their efficacy.

Archaeologists have frequently been sceptical about the function of these vases in spite of the existence of records testifying that they have been known as ‘acoustic’ or ‘sound vases’ since medieval times; they have often been considered relic vases, or their purpose was thought to be the drying of walls to protect fresco paintings, or they were assumed to have some structural purpose. However, such sources as the Metz Chronicle (1432) establish their function clearly: ‘il fit et ordonnoit de mettre les pots au cuer de l’église et pensant qu’il y fesoit milleur chanter et que il ly resonneroit plus fort’. Acoustic jars are therefore a valuable indication of the attitude of medieval designers to acoustics. At least some of the vases were clearly intended to add resonance and amplification to speech and music, although the Chronicler commented, after recording the Metz example: ‘je ne seay si on chante miez que on ne fasoit’.

Vitruvius was certainly the source from which the medieval use of these vases originated. Harrison cited 12 copies of his works known to have existed in England during the Middle Ages, and there were many copies available elsewhere in Europe. They appear to have been used throughout the medieval period and up to the 17th century. In a satire by Claude Pithoys, published in 1662 at Saint Léger, Luxembourg, he reproved the clergy for negligence of their duties: ‘Of 50 singing men that the public maintain in such a house, there are sometimes not more than six present at the service; the choirs are so fitted with jars in the vaults and in the walls that six voices there make as much noise as 40 elsewhere’. Little scientific research has been done on the actual effects of the vases in these churches.

The reverberation times of some important medieval churches have been carefully tested, and this has led to a better understanding of their acoustical characteristics. S Paolo fuori le Mura, Rome (386 ce), an example of a large early Christian basilica with double aisles, an open trussed roof and a transverse bema at the east end (fig.19), has a reverberation time at mid-frequency of 9·1 seconds in the nave. The walls and columns have hard smooth surfaces, leading to close acoustical coupling of all parts of the building, so that it functions to some extent as a single air volume. Nevertheless, some absorption is provided by the depth of the aisles, which scatter the sound so that it does not return to the nave; for this reason there is no echoing from the side walls. The acoustic result is a sustained sound in the church, but also a relatively clear one, with no confusion, echo or fluctuation in intensity. The low-frequency reverberation is more pronounced than reverberation at other frequencies, though the decrease to the higher-pitched sounds is a gradual one (R.S. and H.K. Shankland, 1971). It is likely that in its original form, before its rebuilding after a fire, the diffusion caused by ornament and fluting would have improved the acoustics even further.

S Paolo is an exceptionally large church subdivided by screens of columns, and some of its special acoustical quality is due to its size. Other large basilicas with similar screens that have been tested, such as S Maria Maggiore, Rome (352 ce), have been found to have even better acoustics, with shorter reverberation times, providing almost ideal listening conditions for choral and organ music when full. S Maria Maggiore has only single aisles, but there are chapels beyond the side walls, and a chancel arch separating the apse from the nave. The large volume is therefore broken up into a number of separate volumes coupled together, with the surrounding volumes absorbing sounds made in the nave. The measured reverberation times are 4·9 seconds when empty and 2·5 seconds when full (R.S. and H.K. Shankland, 1971). It must be expected that smaller basilicas did not achieve the same degree of separation between the volumes of nave, aisles, bema and chapels, and therefore the reverberation times were longer, and the acoustics less satisfactory.

With the arrival of the Romanesque style the height of the nave was increased, and stone vaulted ceilings were introduced to protect the interiors against fire. These ceilings increased reflections and reduced diffusion, leading to a significant change in acoustic quality; not merely was the reverberation time lengthened, but the focussing effect of the ceilings brought fluctuations in the reverberant sound, with a resultant decline in clarity. Sounds appeared to pile on top of one another to produce an effect of surging confusion. Gothic cathedrals suffered from the same kind of unruly acoustics, but conditions were often better in large buildings, where the great height of the ceilings reduced the interference of sounds reflected from them. Tests in Durham, Canterbury, Salisbury and York cathedrals have shown that they have remarkably similar acoustics, the reverberation times falling from an average of 8 seconds at low frequencies to 5·5 seconds at mid-frequencies, and continuing to decline as high frequencies are reached. All have volumes of more than 30,000 m3; in such conditions the presence or absence of the congregation has little effect on the acoustic quality (Purkis, 1963). Furthermore, the acoustics of very large volumes are often more satisfactory because there is seldom enough energy produced in them to excite the room.

There seems no doubt that long reverberation times were thought to enhance both music and prayer. Writing in 1535, Francesco Giorgi of Venice recommended that a new church should ‘have all the chapels and the choir vaulted, because the word or song of the priest echoes better from the vault than it would from rafters’. This was still thought to be true for singing in the 17th century (the music of Heinrich Schütz, for example, was carefully written to exploit the long reverberation of the Kapelle in Dresden). But an increased concern with clarity of speech during the sermon led architects as early as the 13th century to omit aisles and transepts altogether, and design churches with single volumes (e.g. S Francesco, Assisi; S Caterina, Barcelona). Even so, it was found that the reverberation continued to be pronounced until the vaults had been lowered and the ceiling flattened. The final improvement was replacing the stone ceiling with a wooden one, which absorbed the predominant low-frequency reverberation, and covering the surface of the ceiling with elaborate decoration of small ribs or coffering, which greatly reduced the fluctuation of sound by increasing uniform diffusion. Giorgi mentioned both these effects in making his recommendations of 1535. ‘In the nave of the church, where there will be sermons, I recommend a ceiling (so that the voice of the preacher may not escape nor re-echo from the vaults). I should like to have it coffered with as many squares as possible … they will be very convenient for preaching: this the experts know and experience will prove it’.

Spacious Gothic churches, such as those of the Netherlands and Germany built without transepts and with nave and aisles of equal height, often have excellent acoustics. On the other hand, late polyphonic music was frequently written to exploit the peculiar acoustics of the older churches. S Marco in Venice had two organs and two choirs by the 15th century; the two choirs, with the accompanying organs, were placed facing one another in the tribunes, halfway up the height of the choir, from which position the unusual acoustical effects of the cathedral could be used without too long an initial delay in reflections from the ceilings. The choral and instrumental groups were gradually multiplied, until as many as four choirs and four organs, with instrumental accompaniment, provided the means for achieving a unique kind of polyphonic vocal and instrumental music. Such a disposition of resources still remained in Salzburg cathedral in the late 17th century, when Biber wrote church music utilizing them.

Acoustics, §I: Room acoustics

9. Renaissance and Baroque periods.

Reformation church builders laid great emphasis on acoustic clarity, which suited sermons but necessitated adjustments in church music. Luther arranged his congregation around the sides of his churches, and later Gothic churches were altered by the Protestants to enable similar focussed seating arrangements; an example is the Thomaskirche, Leipzig, to which galleries and tribunes were added, with an especially wide organ loft gallery at the west end used by Bach for the choir and orchestra (fig.20). As the vaults of this spacious Gothic church were low (8 m), the introduction of the galleries created shallow volumes with short paths of reflection. The reverberation time was quite short when the church was crowded (1·6 seconds), with excellent diffusion and absorption of low frequencies due to wooden panelling, hanging draperies and carving; at the same time the high frequencies were bright and clear (Beranek, 1962).

S Pietro in Rome has a remarkably short reverberation time, caused by the combination of its exceptional size and its complex internal structure; in effect the basilica is five large churches interconnected and acoustically coupled, each damped by the air spaces leading into the others (R.S. and H.K. Shankland, 1971). Thus sound travels from the nave into the side spaces, where it undergoes extensive multiple reflection and delay before returning in a markedly attenuated form. The result is a reverberation time in the nave of 5 seconds at mid-frequencies when a large congregation is present.

During the Counter-Reformation one of the main aims was the design of churches in which every word of the service might be clearly heard. Vignola, the architect of the Gesù in Rome, was instructed to design the church with a nave as wide and short as possible, and without aisles, clearly with the intention of improving the acoustics. This was a quality that does not seem particularly to have concerned the Church of England, for Wren’s St Paul’s Cathedral in London has pronounced reverberation due to its relatively long low vaults and high central dome, and the fact that the nave, choir and dome do not function as acoustically separate volumes. When nearly empty the reverberation time is approximately 12 seconds at mid-frequencies, but it improves steadily as the congregation size increases, until it is 6·5 seconds at maximum capacity (Purkis, 1963).

The earliest Renaissance theatres maintained the forms, and therefore presumably some of the acoustic qualities, of the classical Greek and Roman theatres as described by Vitruvius; examples include Serlio’s Vicenza theatre (1539) and Palladio’s Teatro Olimpico, Vicenza (1588). An orchestra was added to the latter, seated on either side of the proscenium between the actors and the audience, but it apparently played only an occasional accompaniment or interlude. The extensive use of wood in the construction of these theatres, the use of elaborate decoration, and in particular the addition of wooden coffered ceilings, must have ensured good diffusion with brilliant high frequencies and rather dulled low frequencies. Allowing for the dense crowding of audiences which was common, the reverberation times must have been very short.

In the earliest operas, music was subordinated to the clarity of the text. It is known that in the first public performance of Peri’s Euridice (1600) the orchestra was placed behind the scenes. Cavalieri’s instructions for the performance of his Rappresentatione di Anima, et di Corpo in the same year were explicit. It was to be given in a theatre or a hall containing not more than 1000 spectators; the orchestra was to be situated behind the scene and be ‘adapted to the needs of each performance’, the latter presumably referring to acoustic conditions as well as other exigencies.

The masques held in the banqueting hall of Whitehall Palace early in the 17th century used musicians seated on either side of the stage at the front; this position appears to have been a common one, necessitated partly by the fact that the flat floor area between the stage and the raised seats of the audience was used for dance. There is a design for a masque house by Inigo Jones that has the same arrangement with the orchestra partly screened (fig.21). The Duke’s Theatre in Dorset Garden, London (1671), designed by Wren, had a music balcony above Grinling Gibbons’s stage front, proscenium balconies over the stage doors, and galleries for the audience.

The first theatres to be built with ranges of boxes one above the other, the Venetian theatres of S Cassiano and SS Giovanni e Paolo (1637–8) – the latter modified especially for opera with five tiers of boxes in 1654 – were characterized by the crowding of a lay audience into the flat floor area in front of the stage. Boxes were sometimes reserved for the use of musicians on either side of the stage, these boxes being called ‘proscenium boxes’ or ‘trumpet loges’. Often an area in front of the stage was enclosed for the use of other members of the orchestra, later increased in size to become the orchestra pit. Such theatres had surprisingly good acoustics. They were small, with closely packed audiences, were largely made of wood and had flat ceilings. The sound had only a short distance to travel to the audience, and there was little risk of echo because of the large areas of absorption provided by the audience and the boxes. Sound reflected from balcony fronts and ceiling was scattered by decorated surfaces to aid in achieving uniform diffusion. The reverberation time was short, and low frequencies were absorbed by the wooden construction, which reflected high frequencies to preserve brilliance.

Even the great opera houses of the 18th century retained these qualities. The original La Scala in Milan (1778) had about 2300 seats, packed closely together, many with a view of only two thirds of the stage area because of the horseshoe shape of the six tiers of boxes (fig.22). The openings of boxes were relatively small, 1·4 m square, which meant that a greater reflecting surface than usual remained in the box fronts. The acoustics were good for all members of the audience, except those at the rear of the boxes; sound was loud and clear, warm in tone and brilliant. The reverberation time at mid-frequencies was about 1·25 seconds (Beranek, 1962). But it must be remembered that the acoustic quality of the voices fell off markedly if the singers retreated from the forestage into the volume of the scenery behind the proscenium arch.

The literature of acoustics begins with the theatrical treatises of the 17th century. Carini Motta’s study of the design of theatres and stages (1676) mentions the importance of the ceiling as a sound reflector, and recommends that it and the supporting structure should be of wood. Motta believed that rooms used for performances in private palaces should have the same kind of construction.

Acoustics, §I: Room acoustics

10. 18th and 19th centuries.

In 18th-century theatres ingenious methods were often resorted to in order to improve the acoustics. In the theatre in Turin (1740) the architect attempted to overcome the lack of balance between a large chorus and a small string orchestra by constructing a hard-surfaced semi-cylindrical resonant chamber running the full length of the orchestra pit below the wooden floor (fig.23). The dish shape was clearly meant to act as a reflector of sound back to the orchestra, while the floor and the volume of air acted together in resonance. The device was often copied, sometimes, apparently, with grilles opening into the orchestra pit from the resonant chamber. In other cases, as at Turin, there were two tubes connecting the ends of the resonant trough with the front of the stage so that the orchestral sound could be heard better by performers and audience. Patté, in his Essai sur l’architecture théâtrale (1782), stated that the Turin theatre had good acoustics, and attributed this to the housing of the orchestra. The volume of sound from the orchestra was considered so strong in its largely enclosed space that it could activate the floor and the volume of air underneath to cause an amplifying resonance.

The success of these features led to further experiments with shaped sections of masonry. In the Teatro Nuovo in Parma the entire parterre of the theatre was built over an enormous masonry saucer, shallow and semi-elliptical in section, with sound passages entering it from the orchestra pit. It is not clear whether grilles were set into the parterre floor allowing sound to pass into the audience without obstruction by the floor. In the Teatro Argentina, Rome (1732), the acoustic problems of an extremely large house, with six tiers of boxes, are said to have been satisfactorily solved using another original device. Here the problems introduced by the size of the theatre were compounded by the elimination of the forestage, which meant that singers’ voices on the stage could not be clearly heard. The design of the theatre was modified after its opening by the introduction of a channel of water under the parterre running from the stage to the back of the theatre; it appears that sound was reflected from the surface of the water inside a vaulted brick enclosure and thus travelled under the parterre whence it emerged through grilles in the floor.

In his project for the theatre at Besançon (1778), Ledoux proposed both a semi-cylindrical resonant chamber under the orchestra pit floor, and a semi-cylindrical stone dish reflector behind the orchestra (fig.24). This must have had an extraordinary focussing effect on the players themselves, but the result was judged successful, as is proved by the repetition of the same device in other theatres, such as Covent Garden, London (1809). Another part of the theatre considered of great importance for its acoustic effect was the ceiling of the auditorium. Writers continued to recommend that it should be made of wood (Algarotti, 1762, for ‘a full, sonorous and agreeable sound’). The ceiling in the Turin theatre had in addition a ‘resonating’ chamber above it, but here its only effect could have been to increase the absorption of low frequencies. The Bordeaux theatre (1773), which was generally considered to have excellent acoustics, was, like all 18th-century theatres, very compact; the maximum distance from the stage to the boxes was only 19·5 m.

Rooms specifically built for the public performance of music without acting or stage presentation began to appear in the late 17th century. The earliest concert room specifically built as such was probably one erected in York Buildings near the Strand in London in 1675; in this ‘great room’ there were ‘proper decorations for musick’. The music of Purcell was performed there, and its last recorded use was for the first performance of Handel’s Esther in 1732, the first appearance of oratorio in England (Forsyth, 1985, citing North). The oldest music room still in use in Europe is that at Holywell, Oxford, opened in 1748 (fig.25) and designed to satisfy a demand for oratorio and choral works. Before the addition of a curtain to one of the side walls (in 1959) the hall, seating 300, must have had a relatively long reverberation time; the present value is 1·5 seconds at mid-frequencies (Beranek, 1962). That composers of this period considered the reverberation time of theatres and opera houses too short is indicated by, for example, Handel’s remark on hearing on one occasion that his theatre would be half empty: ‘Never mind, the music will sound the better’.

The precise music of the Classical period, particularly, required a predominance of direct over reflected sound. Mozart wrote, after a performance of Die Zauberflöte in 1791: ‘You have no idea how charming the music sounds when you hear it from a box close to the orchestra – it sounds much better than from the gallery’. This implies that a narrow rectangular hall, such as was often used at this time for concerts, enhanced the music better than a wide hall. The narrow Redoutensaal in the Hofburg, Vienna, in which a good deal of Haydn’s, Mozart’s and Beethoven’s music was first performed, is estimated to have had a reverberation time of about 1·4 seconds at mid-frequencies with a full audience of 400. For the study of historical performing practice the acoustical properties of the four halls for which Haydn wrote most of his symphonies, and in which he performed them with the local orchestras (whose exact size is known), are particularly interesting. Measurements have been made in the two existing halls: that of the castle of Eisenstadt (c1700) has a volume of 6800 m3 and a mid-frequency reverberation time of 1·7 seconds with 400 persons; the music room of Esterháza castle (1766) has a volume of 1530 m3 and a reverberation time of 1·2 seconds with 200 persons. Acoustical values have been calculated for two halls in London (both before 1790) that have been destroyed: the Hanover Square Rooms had a volume of 1875 m3 and a reverberation time of 0·95 seconds with 800 persons; the King’s Theatre had a volume of 4550 m3 and a reverberation time of 1·55 seconds with about 900 persons. All these rooms had an increasing reverberation time with low frequencies, as did the concert room in the palace of Prince Lobkowitz in Vienna (where Beethoven’s ‘Eroica’ Symphony was first performed), which has a volume of only 950 m3 and reverberation time of 1·45 seconds with 160 persons.

Meyer (1978) draws a connection between the halls for which Haydn’s symphonies were originally written and their instrumentation and scoring. Many of the middle-period symphonies were first performed in the small Esterháza hall, with a short reverberation time: the orchestral works have a chamber music character, with emphasis on clarity and quick changes in effect. By contrast, the London symphonies were written for the King’s Theatre, which had a longer reverberation time; Haydn allows a bar’s rest between a fortissimo chord and a quiet passage, to permit reverberation to take effect – as in the first movement of Symphony no.102. Further, Haydn appears to have tried to utilize the effect of the space in the same movement: there is a long unison passage, in which a piano section swells to a crescendo followed by a diminuendo; this would have been aided by the large room, a sense of spatial broadening produced by the lateral reflections becoming audible during the crescendo passage, after which they would have receded to inaudibility. (It is significant that the Esterháza works often exist in two versions, one using trumpet and kettledrums for performance in other places, including the open air, and another, without these instruments, apparently the original version written for the intimate acoustics of the music room at Esterháza.)

The much admired Altes Gewandhaus (1780) in Leipzig (fig.26), also a narrow rectangular hall, accommodated an audience of 400 and had a reverberation time of only 1·3 seconds (Beranek, 1962). This building, in which Mendelssohn held his concerts from 1835 to 1846, was entirely constructed of wood, securely jointed, which was thought to lend the hall the quality of an immense musical instrument. But the value of a long narrow hall lay in its inherent ability to combine clarity (due to the quickness of the reflections reaching the ear) with a sense of spaciousness (thanks to the reflections, including the late reflections that are perceived as reverberation, being lateral). Wide halls usually lack the clarity and the reverberance that distinguish the best narrow halls.

The development of the orchestra in the early years of the 19th century seems to have given rise to the desire for more sustained reverberation. When the first large halls were constructed specifically for concerts, in the middle of the century, they had longer reverberation times and a lower ratio of direct to reflected sounds. The old Boston Music Hall (1863) had a reverberation time at mid-frequencies of over 1·8 seconds with an audience of 2400; the Grosser Saal (1870) in the Musikverein in Vienna had a reverberation time of 2 seconds with an audience of 1680. Halls of this type with classical ornament characteristically also had highly diffuse sound fields.

Wagner wished the architects of the Bayreuth Festspielhaus (1876) to create a building that would enhance the orchestral sound but still permit the work to be intelligible. At length the reverberation time of 1·6 seconds at mid-frequencies was arrived at, with a full audience of 1800. An important development was the complete sinking of the orchestra pit so that the musicians could no longer distract the audience by their movements (fig.27). A carefully shaped orchestra chamber projects the sound, but at the same time blends the orchestral tones so that instruments cannot be heard individually. The steeply raked, fan-shaped parterre permits clear vision of the stage and ensures minimum shading of direct sound from the singers by the heads of the audience in front. Also, the paired columns along the sides of the seating towards the stage act as acoustic reflectors, diffusing the sound effectively. The important influence of studies of ancient Greek theatres on the acoustical success of this opera house has been demonstrated by Izenour (1992).

Not all concert halls were acoustically satisfactory. The Royal Albert Hall (1871) in London was regarded as disastrous from its opening when ‘The Prince of Wales’ … welcoming address … in many parts … could be heard twice, a curious echo bringing a repetition of one sentence as the next was begun’. Of immense size, 90,000 m3, and an awkward shape, elliptical in plan with a huge dome, the Albert Hall seats 5000 people. The reverberation time must have exceeded 3 seconds when it was opened, and it remains 2·5 seconds after extensive correction (fig.28). Such a hall only begins to function satisfactorily when orchestral and choral forces are large. Nevertheless the visually unifying shape of the Albert Hall is preferred by many performers and listeners to the more acoustically desirable rectangular shape of other famous concert halls.

The undoubted acoustic success of the Paris Opéra’s building (1869–75) was shrugged off by the architect, Charles Garnier:

I gave myself great pains to master this bizarre science [of acoustics] but … nowhere did I find a positive rule to guide me; on the contrary, nothing but contradictory statements … . I must explain that I have adopted no principle, that my plan has been based on no theory, and that I leave success or failure to chance alone … like the acrobat who closes his eyes and clings to the ropes of an ascending balloon.

Acoustics, §I: Room acoustics

11. The science of acoustics.

The science of acoustics received its name from Sauveur, its first noted exponent, who discovered and studied overtones at the beginning of the 18th century. His work was further developed by Euler, who devised a system of binary logarithms to facilitate musical calculations. Ernst Chladni’s Akustik (1802) contained his studies of the vibration of strings, rods and plates by means of sand figures and his discovery of the modal lines. Charles Delezenne (1776–1866) applied calculus to the solution of acoustic problems, and Félix Savart (1791–1840) made investigations into resonance, especially in string instruments. D.B. Reid of Edinburgh published in 1835 his ‘On the Construction of Public Buildings in Reference to the Communication of Sound’ (Transactions of the British Association). It shows an accurate application of recent discoveries in physics to room acoustics, and contains the earliest clear recognition of reverberation.

Hermann von Helmholtz (1821–94) laid the foundations for much modern physical and physiological research in acoustics. Rudolf Koenig (1832–1901) manufactured instruments for the study of acoustics and conducted extensive research. Others who followed closely behind were John Tyndall (Sound, 1867), Lord Rayleigh (Theory of Sound, 1877–8) and Carl Stumpf (Psychology of Tone, 1883–90). Stumpf’s insistence that the scientific system of music theory depended on the psychological interpretation of acoustic data opened a new discipline and many new avenues for research.

W.C. Sabine pioneered the study of applied acoustics in buildings in the period 1895–1915, publishing his results in a series of important papers. Sound decay was analysed in detail and the prediction of reverberation time in rectangular rooms by calculation was made possible. The impedance method of specifying acoustical materials was developed, and a wide variety of acoustical materials began to be manufactured. At the Bell Telephone Laboratories, Harvey Fletcher studied loudness and masking in the 1920s and 30s, and developed new, more accurate techniques of acoustic analysis and measurement. Other research laboratories, especially those in California, England and Germany, contributed to rapid advances in scientific acoustics.

The first large-scale experiment in the application of the new scientific understanding of acoustics to room design came with the building of the Salle Pleyel in Paris (1927), which seated 3000; it was a ‘notorious disappointment’ (Beranek, 1962). The whole shape of the hall, in plan and section, was designed to send sound to the audience (fig.29). Sound diffusion was poor and uneven, and reverberation short. The result was a room in which a large audience could enjoy recitals and chamber music, but in which orchestral music lacked body and colour. In addition, the seats in the front and middle of the parterre received their first reflections from the ceiling, high at this point, with too long a delay after the direct sound, resulting in the loss of any sense of intimacy (Knudsen, 1932).

After the disappointment of the Salle Pleyel, no major attempt to apply the new scientific knowledge of acoustics to design was made until the building of the Royal Festival Hall in 1948–51 (see London (i), fig.40). Great care was taken to avoid external and internal noise interference (see fig.11); the entire auditorium was raised high in the air and a double construction used to isolate the interior. In this respect the hall was an important and successful experiment. The interior was designed along principles not dissimilar from those of the Salle Pleyel, and with some of the latter’s defects. In particular, concentrating the first reflections at the audience by reflectors over almost the whole ceiling means that a large proportion of the sound energy is absorbed in a much shorter time than the reverberation time of the hall. The effect is to make reverberation much less evident than it is in the older concert halls because it is relatively less loud, and so the hall seems ‘dry’, especially in loud ensemble passages.

Later advances in acoustics have therefore concentrated on finding methods of increasing the amplitude and the length of reverberation while maintaining a high level of direct and first-reflection sound to all the seats. One solution is that adopted with success in the Koussevitzky Music Shed in Lenox, Massachusetts (1959; fig.30); a pattern of suspended ceiling panels reflects a proportion of the sound to the audience from quite a short distance above the orchestra, while the spaces between allow the rest of the sound to travel into the volume above, where it is diffused before returning as prolonged reverberation. A defect of this technique is that sounds of short wavelength are almost completely reflected by the panels, whereas sounds of long wavelength pass almost completely around them, giving an imbalance in first reflections and a further imbalance in reverberation. Such problems led to the initial failure of one of the major concert halls to be built in the 1960s, Philharmonic (now Avery Fisher) Hall at Lincoln Center in New York (1961).

Other recent concert halls have concentrated on relating the volume carefully to the type of music for which the hall is built, achieving ceilings that are more diffuse and providing fewer reflectors of direct sound close to the orchestra. The result has been a considerable increase in the reverberation time and in the relative strength of the late or reverberant sound; the Maltings at Snape, Suffolk (1967; fig.32), for example, has a reverberation time of 2·25 seconds at mid-frequencies, whereas the Royal Festival Hall, with a volume nearly three times as large, has a reverberation time of only 1·47 seconds. Today, many new concert halls are expected both to provide for audiences of varying sizes and to accommodate new demands that far exceed the needs of auditoriums even 50 years ago. Simply referring back to acoustically approved music rooms of earlier times will not deal with these problems. New rooms are being successfully designed which demonstrate the benefit of our increased scientific understanding. Certain new concert halls, such as the Doelen hall in Rotterdam, have more uniform reverberation, with diffusion and reflection on walls and ceiling; all sound paths are within calculated limits, and the balance between low and high frequencies is carefully maintained. Others, like the Segerstrom Hall of the Orange County Performing Arts Center, California (1986), which seats 3000, are multi-purpose halls, yet, by the use of flexible acoustical adjustments, manage to achieve good uniformity and ‘a particularly surprising suitability for solo, chamber and opera performances … an enveloping sound character, but more immediate’ than the normal concert hall (Barron).

Flexible acoustical conditions can be created by physically manipulating the surfaces, by varying the number of coupled volumes, and even by reducing or increasing the overall volume, as has been introduced at the Jesse Jones Hall, Houston, the experimental workshop L’Espace de Projection at IRCAM in the Centre Pompidou, Paris (1978) and the Theater de Maaspoort in Venlo, the Netherlands (1986).

While reverberation time remains a prime consideration in designing concert halls, the importance of early reflections that create a sense of acoustical intimacy has come to be increasingly recognized. Recently, attention has focussed on the spatial qualities of sound and the factors that control these qualities: the early lateral reflections, which make a source appear wider than it is; the late lateral energy, which provides a sense of being enveloped by sound; and loudness, which further adds to a hall’s spaciousness.

Acoustics, §I: Room acoustics

12. The contemporary performance of early music.

Sensitivity to the acoustics of rooms seems always to have affected the composition and performance of music. This is clear in the character of much early music, but only rather late was it actually written about. Quantz, in his book Versuch einer Anweisung, die Flöte traversiere zu spielen (1752), recommends:

In the choice of pieces in which he wishes to be heard in public, the flautist … must adjust … to the place where he plays … . In a large place, where there is much resonance, and where the accompanying body is very numerous, great speed produces more confusion than pleasure. Thus on such occasions he must choose concertos written in the majestic style, and in which many passages in unison are interspersed, concertos in which the harmonic parts change only at whole or half bars. The echo that constantly arises in large places does not fade quickly and only confuses the notes if they succeed one another too quickly, making both harmony and melody unintelligible.

The practice today is to attempt to relate the size of the orchestra not merely to that originally used, but to the room acoustics – to compensate for a weak bass response in a hall, for example, by increasing the bass section relative to the upper parts. Where the volume of the room is considerably larger, and the impact of the original small orchestra would be substantially reduced, some increase in orchestral size is necessary to compensate for differences in loudness and spatial impression. Changes in the composition and style of an orchestra are therefore often judged necessary in order to respond to altered acoustical conditions in particular halls. Where possible, conductors often prefer to select rooms that are naturally appropriate to particular types of early music, but this usually restricts audience size, and may therefore be uneconomic.

The acoustician’s contribution to the problem of matching concert halls to the style and type of performance has been to provide, in a number of recent examples, designs allowing considerable variation of the acoustics of concert halls at will (see §11 above).

Acoustics, §I: Room acoustics

BIBLIOGRAPHY

O. Belli: History of Candia’, Museum of Classical Antiquities (London, 1854)

G.M. Hills: Earthenware Pots (Built into Churches) which have been called Acoustic Vases’, Transactions of the Royal Institute of British Architects (1882), 65

A. Müller: Lehrbuch der griechischen Bühnenalterthümer (Freiburg, 1886), 46

H. Bagenal and A. Wood: Planning for Good Acoustics (London, 1931)

V.O. Knudsen: Architectural Acoustics (New York, 1932)

P.V. Brüel: Panel Absorbants of the Helmholtz Type’, First Summer Symposium of the Acoustics Group: Papers on Resonant Absorbers (London, 1947)

F. Giorgi: Memorandum for S. Francesco della Vigna’, in R. Wittkower: Architectural Principles in the Age of Humanism (London, 1949, 4/1988)

P.H. Parkin and H.R. Humphreys: Acoustics, Noise and Buildings (London, 1958, 4/1979)

L.L. Beranek: Music, Acoustics & Architecture (New York, 1962)

H.J. Purkis: The Reverberation Times of some English Cathedrals’, Bulletin of the Institute of Physics and the Physical Society (1963), no.14, p.8

J.G. Landels: Assisted Resonance in Ancient Theatres’, Greece and Rome, 2nd ser., xiv (1967), 80

K. Harrison: Vitruvius and Acoustic Jars in England during the Middle Ages’, Transactions of the Ancient Monuments Society, new ser., xv (1967–8), 49–58

R. Taylor: Noise (London, 1970)

R.S. and H.K. Shankland: Acoustics of St Peter’s and Patriarchal Basilicas in Rome’, JASA, l (1971), 389–96

C. Gilford: Acoustics for Radio and Television Studios (London, 1972)

A.H. Benade: Fundamentals of Musical Acoustics (New York, 1976, 2/1990)

G.C. Izenour: Theater Design (New York, 1977/R)

J. Meyer: Raumakustik und Orchesterklang in den Konzertsälen Joseph Haydn’, Acustica, xli (1978), 145–62

A. Trochidis: Reverberation Time of Byzantine Churches of Thessaloniki’, Acustica, li (1982), 299–301

M. Forsyth: Buildings for Music (Cambridge, MA, 1985)

G.C. Izenour: Theater Technology (New York, 1988/R)

G.C. Izenour: Roofed Theaters of Classical Antiquity (New Haven, CT, 1992)

M. Barron: Auditorium Acoustics and Architectural Design (London, 1993)

J. Meyer: Akustik und musikalische Aufführungspraxis (Frankfurt, 3/1995)

L.L. Beranek: Concert and Opera Halls: how they Sound (Woodbury, NY, 1996)

Acoustics

II. String instruments

1. Foundations.

2. Differences between viols and violins.

3. Wood and varnish.

4. Acoustical findings important for violin tone.

5. The mute and vibrato.

6. Sound radiation.

7. Bowing.

8. The plucked string.

9. Current research.

BIBLIOGRAPHY

Acoustics, §II: String instruments

1. Foundations.

The essence of the bowed string families (violins and viols) and of the plucked strings (guitars and lutes) is a set of strings mounted on a wooden box containing an almost enclosed air space. Some energy from the vibrations of the strings is communicated through the bridge to the box and air space, in which are set up corresponding vibrations. The loudness and nature of the sound, putting aside the acoustics of the room and the skill of the player, depend on the transfer of vibration from the strings to the sounding box to the air. The following discussion relates chiefly to bowed string instruments; plucked strings are dealt with in §8 below.

The sounding box, or body, of a finished viol or violin consists of a resonant top and a hardwood back (usually spruce and curly maple respectively) which are from 2 to 6 mm thick and firmly glued to the ribs (sides). The latter are 1 mm thick strips of wood, usually of the same species as the back. The ribs are bent to the shape of the instrument by means of moisture and heat, and glued to small blocks at each corner and at the upper and lower ends. The ribs, together with the top and back soundboards, form a sturdy, very thin-walled box able to support the tension of the strings – in the violin, about 25 kg weight. This tension exerts a downward force through the bridge of 7–9 kg. The vibrations from the strings are carried by the bridge to the top plate and thence through the entire structure, including the air of the cavity (fig.33).

The sounding box has two functions without which there would be no violins or viols as we know them today. First, it unites two soundboards possessing different patterns and amplitudes of vibration which support each other within an important range of frequencies. When a single soundboard, such as that of the piano, is smaller than the wavelength of the sounds coming from it, the radiations from its two sides tend to cancel each other when heard from a distance. In the violin and viol, however, the top and back soundboards of the box radiate with sufficient independence that their radiations do not cancel each other. The second function of the box is to create a resonant cavity in which vibrations of the inner air interact with those of the wood of the entire instrument including scroll, neck/fingerboard, chinrest and even pegs. These interacting air and wood vibrations are critical to the tone and playing qualities of the finished instrument.

The soundholes – f-shaped in the violin family, and C-shaped or flame-shaped in the viol family – also play an important role, particularly in the lower range of violins and viols. They not only provide a flexible platform for the rocking motion of the bridge, but allow for the air vibrations to move in and out of the cavity. The lowest of these vibrations is the so-called Helmholtz resonance or mode, which in the violin lies around 270–280 Hz, giving fundamental reinforcement to the mid-range of the G string and low range of the D string of the violin. This resonance is enhanced by a breathing motion of the body in which the top and back flex in opposite directions. The frequency of the Helmholtz mode depends on the volume of the violin cavity, the flexibility (compliance) of its walls, and the area and edge thickness of the f-holes.

The soundpost is a rod of straight-grained spruce thin enough to pass through the soundhole. It is carefully fitted and positioned approximately under the high-string foot of the bridge. The soundpost provides a fulcrum for the rocking of the bridge: the low-string foot of the bridge makes wider up-and-down excursions than the high-string foot, creating asymmetrical motions of the top and back plates. Moreover, a strong rocking motion occurs at the ends of the soundpost itself (Fang and Rodgers, 1992), indicating that the compliance of the post plays an important role in the sound of an instrument.

The insertion of the soundpost reduces the flexibility of the top and back plates. This results in a rise not only of the frequency of the plates, but also of the cavity resonance modes, particularly the Helmholtz mode, depending on how tightly the post is fitted (C.M. Hutchins, 1974). The position of the soundpost between top and back, its fit, wood quality, shape and stiffness are so important to the sound and playing character of the instrument that the Italians call the soundpost the ‘anima’ and the French the ‘âme’ or soul.

The bass-bar is a straight-grained tapered bar of spruce glued to the underside of the top plate and extending about three-quarters of its length, approximately under the string of lowest tuning. It serves not only to help support the down-bearing force from string tension, but also to carry the vibrations from the low-string bridge foot to the upper and lower areas of the top, keeping them in step with each other. Modal analysis has shown that the bending and twisting of the bass-bar help determine the frequencies and resonance modes of the top plate and thus of the whole instrument (Marshall, 1985). The proper shaping or tuning of the bass-bar is critical to the sound of the finished violin.

The bridge acts as a filter, transmitting the vibrations of the strings to the wooden structure of the instrument and to its cavity resonances via the close-fitting feet, which rest on the top at a position indicated by the notches of the soundholes, one foot standing approximately over the soundpost and the other over the bass-bar. The sound of violins and viols is markedly affected not only by the bridge's position on the top, but also by its density, stiffness, mass distribution and acoustical transmission characteristics. All these characteristics are partly determined by the actual cutting of the bridge in the hands of a skilled maker. The important bending modes of a violin bridge up to 6000 Hz, as revealed by hologram interferometry, are shown in fig.34 (Reinicke, 1973). A calibrated study of bridge action has been made by Trott (1987), and the effects of bridge trimming have been measured by finite element analysis (Rodgers and Masino, 1990). Müller (1979) has studied the transmission functions of the bridge, relative to the skills of the violin maker.

The tailpiece not only serves to hold the ends of the strings, but also can have considerable effect on the tone and playing qualities of an instrument. Adjustments are made by changing the tuning of the strings between tailpiece and bridge, the length and stiffness of the tailgut, and the mass and frequency of the tailpiece-plus-tuners. Experiments show that these adjustments are most effective when the tailpiece frequency matches or is a simple partial of one of the cavity or body modes of the violin (C.M. Hutchins, 1993).

Acoustics, §II: String instruments

2. Differences between viols and violins.

Although both viols and violins have arched top plates of spruce (or occasionally another species of wood), with the grain following the longer dimension, the tops of viols tend to have somewhat higher archings and slightly thinner wood than violins. The early master violin makers carefully matched the arch of the top to the arch of the back of a violin. In contrast, viols traditionally have thin flat backs, usually of curly maple, which are reinforced in several places with fairly heavy cross-braces, on one of which the soundpost rests. Thus the back of a viol serves primarily as a support for the soundpost and closure for the sounding box, and traditionally has not been ‘tuned’ as in the case of the violin. Instead of the four strings characteristic of the violin family, viols have five, six or more strings, more slackly tuned than violin strings and supported by a flatter, heavier bridge. Their tailpieces are heavier and more rigidly supported than those of violins. Consequently viols have less brilliance, power and dynamic range than violins.

The viol family consists of a balanced consort of five or more instruments with uniformly designed bodies in graduated sizes from the high-pitched descant viol to the violone. All the voices are adequately represented, with some, such as the treble and tenor, in two or three sizes. All viols are held in a vertical position, the necks are fretted, the bows are held the same way and the fingering technique is uniform.

The violin family as described by Praetorius (1619) consisted of seven or eight Geigen in graduated sizes from a three-string treble instrument to a seven-foot bass. With the development of the violin and violoncello by the master luthiers of the 17th and 18th centuries, the tenors and large altos fell into disuse. In the early 19th century composers such as Berlioz demanded more power from the strings to fill increasingly large concert halls. To achieve this violin makers lengthened the neck of the violin and increased its angle relative to the plane of the body, while at the same time enlarging the bass-bar to provide appropriate stiffness and support.

Since the mid-1950s eight instruments of the violin family, one at each half-octave from the tuning of the double bass to an octave above the violin, have been developed through a combination of mathematics, acoustical theory and testing, and skilled violin making (C.M. Hutchins, 1962, 1967, 1992). Designed to project the resonance characteristics of the violin into seven other tone ranges, these instruments, known as the New Violin Family, provide consistent quality of string tone and increased power covering the musical range, thus bringing to fruition the concept of Praetorius (fig.35).

Acoustics, §II: String instruments

3. Wood and varnish.

It is no accident that spruce (Picea) has traditionally been chosen for practically all instrument soundboards. In addition to having a high ratio of velocity to density and a long decay time (the time it takes for a sound to die away), it possesses the property unique among wood species of being at least ten times stiffer along the grain than across it.

Curly and tiger maple have become the preferred woods for violin backs, although the early master violin makers used a variety of other woods such as poplar (Populus), sycamore (Platanus), beech (Fagus), birch (Betula), and apple and pear (Pyrus). Curly-grain wood is not only beautiful but provides a more nearly isotropic material with a desirable stiffness ratio of along to across the grain of about five to three. Selection of wood with straight grain and a more or less even growth pattern is highly important. Violin and viol makers have long known how to assess these characteristics by the sound and the feel of the wood in their hands (Müller, 1996). According to tradition, at least five years of seasoning in covered outdoor sheds and then indoors for a matter of years is desirable for top wood, and eight to ten years for back wood however, 50 years or more is ideal for fine tone quality. Acoustical studies of these characteristics include Haines (1979), M.A. Hutchins (1981, 1983), McIntyre and Woodhouse (1984–6) and Dunlop (1989); studies using ultrasonics include Bucur (1987) and Bucur and Archer (1984).

Varnish research can be grouped into three categories: (1) historical and archaeological research based on surviving instruments and contemporary literature about the great makers; (2) attempts to prescribe recipes to modern makers for treating wood; and (3) experiments to determine the effects of wood treatment on vibrational behaviour and thus on the sound of the instrument.

In a study that falls in the first category, Barlow and Woodhouse (1989) analyzed several samples from authenticated old instruments and discussed what can be deduced about the materials used by their makers. They were mainly concerned with the ‘ground layers’ of varnish, which they studied by means of a scanning electron microscope combined with X-ray analysis, not only to obtain pictures of the topography of the sample, but also to determine its chemical constituents.

The second category consists of experimental attempts to create a modern equivalent of the ‘old master’ varnishes (Fry, 1904; Michelman, 1950; Condax, 1968; and Fulton, 1974–5).

In the third category, a study of the effects of five years of filler and varnish seasonings on the bending modes of four pairs of free or unattached viola plates (C.M. Hutchins, 1987) showed surprisingly small changes in the modal frequencies of the plates. In a study of the acoustical effects of four different sealer materials, followed by two coats of an oil-type violin varnish added each year over a period of four years, M.A. Hutchins (1991) showed that the greatest changes were in the cross-grain strips of spruce where there was an increase in weight, density and damping, and a concomitant decrease in strip frequency and stiffness.

Acoustics, §II: String instruments

4. Acoustical findings important for violin tone.

Modern acoustical science has been applied to violin making with significant results. The normal bending modes of unattached violin plates have been revealed by hologram interferometry and other vibration methods. These show the frequency sequence and configuration of the normal resonance modes basic to the sounds and stiffnesses in free violin plates that violin makers have long known how to assess by flexing, feeling, tapping and listening. The frequencies and damping characteristics of modes 1, 2 and 5 in both top and back free plates are critical to the sound of the violin in spite of the fact that they are not transferred intact into the finished instrument (fig.36). It has been found that the best instruments result when modes 2 and 5 are tuned to matching frequencies in both top and back free plates and the two pairs are an octave apart, with mode 1 in the top an octave below that of mode 2 (C.M. Hutchins, 1981). This finding has been corroborated by many violin makers. However, its successful application depends on other factors such as wood quality, and the arching and thickness characteristics basic to good violin making. Current thinking is that the free-plate modes provide the violin maker with clues as to the desired local stiffnesses in the plates, which then share in the bending modes of the entire instrument (C.M. Hutchins, 1991). Finite element analyses to quantify free-plate tuning show the effects of wood stiffness, local thickness changes, plate arching and local thickness patterns on free-plate frequencies (Rodgers, 1988, 1990, 1993; and Molin, Lindgren and Jansson, 1988).

The body and cavity modes of the whole violin have been mapped in various ways since the early 19th century (Savart, 1819; Backhaus, 1931; Meinel, 1957; Reinicke and Cremer, 1970; and Stetson, 1970). More recent studies by means of modal analysis (Marshall, 1985) and finite element analysis (Knott, 1987) have elucidated the unsuspected and bewildering body vibrations, up to about 1300 Hz, not only in the top and back of the violin, but also in the ribs, neck, fingerboard and scroll. Fig.37 gives a three-dimensional representation of the three important body modes (B−1, c 145–190 Hz; B0, c 250–300 Hz; and B1, c 500–570 Hz), as well as the two important cavity (air) modes (A0, c 260–290 Hz; and A1, c 430–490 Hz) in the first 1000 Hz of the violin range.

For centuries the ‘Helmholtz’ or breathing mode (A0) of the violin cavity was thought to be the only cavity mode. Research since the 1970s has shown that there are many cavity modes whose frequencies depend on the geometry of the box, the stiffness of its walls and the special characteristics of the f-hole openings.

The interrelations of wood and cavity modes are critical in the final tuning of an instrument in playing condition. Very desirable effects in tone and playing qualities result when the Helmholtz mode and the nearby body mode (B0) are at the same frequency (C.M. Hutchins, 1985). The A0 mode frequency is essentially fixed in the finished violin by the dimensions of the cavity, the compliance of its walls and the characteristics of the f-holes. The B0 mode frequency can be moved up or down by altering the mass and stiffness of the neck, fingerboard, pegs and chinrest. With practice the pitch of the A0 mode can be heard by blowing across an f-hole. The pitch of the B0 mode can be heard by holding the violin at the nut (a node of B0) and tapping on the end of the fingerboard or on the chinrest.

When one hums into an f-hole, a slightly lower pitch can be heard; this is shown in fig.37 as the wood prime bowed tone. Good tone and playing qualities also result when Bo and wood prime are at the same frequency. Wood prime is not a fundamental vibration; it is a strong bowed tone, due to reinforcement from the main wood bowed tone (c 440–570 Hz), which is an octave higher and acts as its second harmonic. These two bowed tones, main wood and wood prime, do not exist in a single frequency (sine-wave) excitation of the violin; however, they are two of the strongest tones in the bowed violin.

The main wood bowed tone usually lies just above the open A string of the violin, and has been found to be a combination of the A1 and B1 modes when the instrument is bowed. The frequency spacing between the A1 and the B1 modes indicates whether the tone qualities of a violin are very harsh (over 100 Hz), brilliant for solos (65–80 Hz), or suitable for orchestral work (45–65 Hz), chamber music in small halls (25–45 Hz) or more intimate Hausmusik (0–25 Hz) (C.M. Hutchins, 1989).

Controlled vibration tests of violins in playing conditions show that over an extended period (more than 1500 hours) the frequency of the B1 mode goes down about 25 Hz, resulting in smoother tone and playing qualities. After a rest period of several months the B1 frequency rises 10–15 Hz, but never regains its original frequency. This effect is due to the fracturing of the microfibrils in the cell walls, as well as to the decoupling of the long-chain cellulose polymers under stress and vibration. During a rest period variations of moisture and temperature slowly reverse these changes. Thus after many years of use the tone of a violin becomes smoother and the instrument becomes easier to play. The Hill brothers estimated that it takes 20–80 years of playing to bring a violin to optimum condition (W.H., A.F. and A.E. Hill, 1931). Experiments show that the B1 mode frequency can be raised or lowered some 10–20 Hz by means of various structural changes (Hutchins and Rodgers, 1992). This process indicates (1) that violins played over many years slowly change tone qualities, losing brilliance and power, and eventually wear out in spite of restorations by excellent violin makers; (2) that violins left unplayed for some time need to be ‘played in’ again, since during the rest period the wood cells tend to reform partially under temperature and moisture changes, thus altering the tone qualities; and (3) that new violins which seem harsh and stiff will improve with playing or some form of vibration, a fact that expert violin makers have known for many years (Otto, 1817).

Acoustics, §II: String instruments

5. The mute and vibrato.

The mute, which consists merely of a suitable mass attached to the top of the bridge, changes both the volume and quality of the sound. Its tendency to immobilize the top of the bridge increases with frequency, so that higher tones are reduced and timbre becomes softer and less brilliant; the loudness of sound is correspondingly reduced. The low partial notes of the instrument are not greatly affected, but the loudness of the low notes is indirectly reduced by virtue of the ‘residue effect’. According to this, the subjective sensation of fundamental pitch produced by the higher harmonics is somewhat reduced.

Of vibrato, much has been written from a musical point of view. Here only the physical characteristics will be considered, namely the changes in frequency level (recognized by the ear as pitch changes) and intensity level, and variations in harmonic structure of the sound. The changes in pitch as the finger moves back and forth on the string are quite familiar. This motion causes all the harmonics to have the same rate of pitch variation as the vibrato rate, typically four to six per second. The intensity level of each harmonic also varies at this rate, but is different for each harmonic, some having a high intensity level and some a very low. Also, for some of the harmonics the intensity level is increasing, while at the same time it is decreasing for others. These variations cause the aurally pleasing changes in the quality of the sound of notes played with vibrato. For further details see Fletcher and Sanders (1967).

Acoustics, §II: String instruments

6. Sound radiation.

The mechanical subsystems of the violin discussed thus far all contribute to how the violin functions as a musical instrument in the hands of the player. The sound waves radiated by a violin activated by the broad-band input from the bowed string contain all the partials of the note played, which are essentially in simple multiples of the fundamental: 1, 2, 3, 4 etc. If any part of the instrument, including the air of the cavity, resonates at one of the partial frequencies, the amplitude and sound quality of that partial are enhanced. The quality of the sound heard depends largely on the strength of the various partials as they are affected by the resonance modes of the instrument.

Modal analysis shows that there are some 41 modes in the first 1000 Hz of the vibrating violin (Marshall, 1985). However, a reciprocal method of vibrating the violin with sound and measuring the velocity of vibration at the bridge shows that of the many modes in the violin below 1000 Hz only six to eight are important for sound radiation (Weinreich, 1983).

Much research on the radiation of violins has been done using a single-frequency (sine-wave) input sweeping across the entire spectrum. The response is picked up by one or more microphones and plotted on a chart against a bowed ‘loudness curve’ (fig.38). By comparing many such charts with the tone and playing qualities of the violins tested, investigators have built up a body of knowledge enabling them to assess the tone qualities of an instrument (Moral and Jansson, 1982; Dünnwald, 1990; and Meyer, 1993). However, variations in methods make it difficult to compare tests carried out by different investigators.

The acoustic radiation field surrounding the violin has been measured in various ways. Fig.39 shows the main directions of radiation in the horizontal plane around the player. Fig.40 gives the measured radiation characteristics of different frequency ranges around the violin in a plane parallel to the bridge. Fig.40a shows wavelength large in comparison to source dimensions (290 and 517 Hz respectively); fig.40b shows wavelength comparable to source dimensions (922 and 950 Hz); and fig.40c shows wavelength small in comparison to source dimensions (2323 and 2630 Hz). A method capable of measuring the entire spherical output of a violin, giving phase and amplitude information, has been developed by Weinreich and Arnold (1980). Weinreich (1997) has also found that the radiation pattern of a violin varies rapidly, not only with direction but also with frequency, typically changing drastically from one semitone to the next. In a concert hall this can produce an illusion that each note played by a solo violin comes from a different direction, endowing fast passages with a special flashing brilliance. This finding has important consequences for the perception and reproduction of violin tone.

Acoustics, §II: String instruments

7. Bowing.

When a bow is drawn across a string, the string appears as a lens-shaped blur. Within this blur the string is vibrating in a remarkable manner, first described by Helmholtz in 1877. If the motion is frozen in a series of snapshots (fig.41) the string is found to take the form of a sharply bent straight line. The ‘corner’ shuttles back and forth along the length of the string at the frequency of the note being played, for example 440 times per second for the open A string of a violin. While the corner travels from the bow to the player's finger and back the string is sticking to the bow hairs, and while it travels the shorter distance from bow to bridge and back the string is slipping rapidly across the hairs of the bow. The arrival of the corner at the bow triggers the transitions between these two states. This is what distinguishes the bowed string from other ‘stick-slip’ vibrations such as squealing brakes: the accurate timekeeping provided by the shuttling Helmholtz corner makes the pitch of the note very stable. When reversing from an up-bow to a down-bow stroke, the Helmholtz corner must be made to run in the opposite direction, one reason why it is hard to perform a completely inaudible bow change in a long note.

The player controls four quantities when bowing: the bow speed, the bow ‘pressure’ (more properly ‘force’), the position of the bow on the string and the degree to which the ribbon of bow hair is tilted relative to the string. A consequence of the ‘Helmholtz motion’ is that, to a first approximation, the steady vibratory motion of the string is the same, regardless of these details of bowing. For ideal Helmholtz motion the waveform of oscillating force exerted by the string on the instrument's bridge, which is ultimately responsible for the sound, takes the form of a ‘sawtooth wave’ . Three factors affect the player's ability to influence the sound quality of a note. First, if the bowing parameters fall outside a certain range the Helmholtz motion is not possible at all, and something else (usually undesirable) happens, Second, within the range for which Helmholtz motion occurs, the fine details of the string motion do depend somewhat on the bowing parameters, enough for audible variations in sound quality to be produced for musical effect. Third, our perception of sound quality is influenced by the length and detailed nature of the transients – the brief sounds made by the way an individual note is started or stopped – and different bowings produce different transients.

The range of bowing parameters for which Helmholtz motion is possible can best be appreciated with the aid of the diagram shown in fig.43. For a given bow position and speed the bow force must lie within a certain range. Below the lower limit the Helmholtz motion gives way to one in which the string slips relative to the bow more than once per cycle, producing what is usually described as ‘surface sound’. Above the upper limit the arrival of the Helmholtz corner is insufficient to ‘unstick’ the string from the bow, the note ceases to be exactly repetitive from cycle to cycle, and the result is no longer a musical tone but a raucous ‘crunch’. To bow nearer to the bridge (sul ponticello) the player must press harder and control the force more accurately: both limits rise but they become closer together. Beginners often fail to control the bow position on the string, and so may inadvertently leave the Helmholtz range by moving horizontally in the diagram.

The loudness of a bowed note is influenced by bow position, force and speed, while the brilliance is influenced mainly by bow force alone. The player must keep all three parameters in mind in order to produce the desired combination of volume and tone quality. Loudness is governed mainly by the amplitude of the string motion, brilliance by the exact shape of the Helmholtz corner: a rounded corner gives a more mellow sound, a sharper one a brilliant sound. The precise details of string motion following a particular bowing gesture are influenced by the material and construction of the string, and by its interaction with the body of the instrument and with the player's finger. For more details of the physics of bowing, see Cremer (1984). For an introduction to more recent research using computer simulation, see Schumacher and Woodhouse (1995).

Bowed string instruments frequently display a disconcerting phenomenon known as the ‘wolf’ note, a narrow range of frequency within which the response tends to stutter or warble. This generally occurs around the frequency of the most prominent resonance of the instrument (typically around f' on a viola G or C string, or f on a cello G or C string). The wood of the body vibrates so vigorously that the bridge does not provide a sufficiently solid support to the string, especially the heavier lower strings. The viola and cello are more prone to this problem than the violin because they have bodies which (for ergonomic reasons) are smaller than one would obtain by scaling up the violin in proportion to the tuning. To compensate, they have flimsier bodies and heavier strings, which exacerbate the wolf. The best way to ameliorate a wolf is to fit a ‘tuned absorber’ to the instrument. This takes the form either of a small resonant cantilever which can be installed inside the instrument by a repairer, or of a weight (a commercial ‘wolf-eliminator’ or a piece of plasticene) attached to one of the strings between bridge and tailpiece. It must be accurately tuned: fit a heavy mute to the bridge and adjust the mass and position of the eliminator until the short string, when plucked, rings at the pitch of the wolf.

Acoustics, §II: String instruments

8. The plucked string.

When the string is plucked, the pull of the finger creates a kink, or discontinuity, that divides the string into two straight sections. On release, a dynamic condition is set up in which two discontinuities travel in opposite directions, one towards the bridge and one towards the nut. These are identical to the modes of motion described for the up-bow and down-bow action in the bowed string (see §7 above). Since they are now both present at the same time, however, the wave shape of the force exerted on the bridge is radically different from that of the bowed string. In the bowed string, the Helmholtz motion indicates a sawtooth wave in which reversal is instantaneous regardless of the position of the bow on the string (fig.44a). With the plucked string, on the other hand, the force at the bridge has a rectangular shape that depends on the point of plucking. If plucking occurs at the middle of the string, the shape is that of a square wave with a minimal content of the higher overtones (fig.44b). If the pluck is near the bridge or nut, a sharp rectangular wave is produced which is exceedingly rich in high-frequency components (fig.44c). Thus a wider range of timbre is possible by changing the point of plucking than by changing the point of bowing.

The plucked-string waveforms of fig.44 are highly idealized. Since energy is supplied once only at the start of the note, the string vibrations die away with time. Decay mechanisms include air damping, internal damping in the string material (dominant in gut and nylon strings) and energy loss to the body of the instrument. Only a small fraction of the string’s vibrational energy is converted to sound. High-frequency string components tend to die out more rapidly than the low frequencies, though the decay rates depend on the precise dynamic response of the body. Thus the force waveforms, and the spectral content of the sound, vary considerably not only with time but also from note to note. In practice the player can exert further control over the spectral content of the sound by varying the plucking position or the angle of release of the string (which modifies the coupling between the string and body), or by modifying the shape and frictional characteristics of the plectrum or fingertip.

An important difference between bowing and plucking is that in the former the phenonemon is periodic, so that the overtones are kept in a strictly harmonic relationship to the fundamental. In a plucked string, which involves ‘free’ rather than ‘forced’ vibrations, stiffness in the string and coupling to the body create inharmonic partials. String stiffness causes higher partials to become slightly sharper than integral multiples of the fundamental. Strong coupling between the string and body, the result of an over-compliant soundboard (as commonly found in the guitar), can cause some partials to become sharp or flat. A little inharmonicity can add interest to the sound, but too much causes notes to sound out of tune or generally imparts poor sound quality.

Acoustics, §II: String instruments

9. Current research.

It is apparent that, in respect to quantitative analysis and the predetermining of the effects of structure, wind instruments possess two important advantages over strings: their shapes are amenable to simpler mathematical description, and the resonating material, air, is homogeneous, with the same elasticity in all directions. By contrast, the shapes of string instruments, while a delight to the connoisseur, are forbidding to the mathematician, and the resonating material, wood, is neither homogeneous nor isotropic, and cannot be standardized. This uncertain property of wood is not a serious difficulty in woodwind instruments because their massive walls do not share vitally in the resonance of the instrument. The result has been that designers of wind instruments have had the possibility, which they have brilliantly used, of forecasting the effects of changes in design, while scientific luthiers have been far more dependent on a series of steps in carving the plates of their instruments, each step guided by the best means at their disposal.

Since the 1950s experimenters around the world have applied modern electronic and optical technologies to the acoustical problems of the violin. Efforts by individuals and by groups such as the Catgut Acoustical Society to relate these technical findings to the actual construction of fine violins are making it possible to build consistently fine-sounding violin-family instruments of any size and tuning. Modern violin makers have a ready market for their violins, violas, cellos and double basses, which are beginning to replace the fine early instruments that are gradually losing tone quality. However, we are still far from understanding fully how these apparently simple yet amazingly complicated instruments can produce such beautiful music.

Acoustics, §II: String instruments

BIBLIOGRAPHY

CASN

Catgut Acoustical Society Newsletter

JCAS

Journal of the Catgut Acoustical Society

C.M. Hutchins, ed.: Musical Acoustics (Stroudsburg, PA, 1976) [MusA]

C.M. Hutchins, ed.: Research Papers in Violin Acoustics 1975–1993 (Woodbury, NY, 1996) [RP]

PraetoriusSM, ii

F. Savart: Memoire sur la construction des instruments à cordes et à archet (Paris, 1819/R)

J.A. Otto: Über den Bau und die Erhaltung der Geige (Halle, 1817; Eng. trans., enlarged, 1848)

G. Fry: The Varnishes of the Italian Violin Makers of the Sixteenth, Seventeenth and Eighteenth Centuries: their Influence on Tone (London, 1904)

W.H., A.F. and A.E. Hill: The Violin Makers of the Guarneri Family (1626–1762) (London, 1931/R)

J. Michelman: Composition of Old Italian Varnishes’, Organic Finishing, xi/5 (1950), 22–3

H.F. Meinel: Regarding the Sound Quality of Violins and a Scientific Basis for Construction’, JASA, xxix (1957), 817–22 [MusA]

C.M. Hutchins: The Physics of Violins’, Scientific American, ccvii/5 (1962), 78–84, 87–92 [MusA]

H. Fletcher and L.C. Sanders: Quality of Violin Vibrato Tones’, JASA, xli (1967), 1534–44

C.M. Hutchins: Founding a Family of Fiddles’, Physics Today, xx (1967), 23–37 [MusA, RP]

L.W. Condax: Examination of the Ground Layer of the Italian Violin Varnish’, CASN, no.10 (1968), 12–13

J.C. Schelleng: Acoustical Effects of Violin Varnish’, JASA, xliv (1968), 1175–83 [MusA, RP]

W. Reinicke and L. Cremer: Application of Holographic Interferometry to the Bodies of String Instruments’, JASA, xlviii (1970), 988–92 [MusA]

K.A. Stetson: A Progress Report on Holographic Testing of String Instruments’, CASN, no.14 (1970), 19–22

J. Meyer: Directivity of the Bowed String Instruments and its Effect on Orchestral Sound in Concert Halls’, JASA, li (1972), 1994–2009 [MusA]

W. Reinicke: Übertragungseigenschaften des Streichinstrumentenstegs’, CASN, no.19 (1973), 26–36

J.C. Schelleng: The Bowed String and the Player’, JASA, liii (1973), 26–41

C.M. Hutchins: A Note on the Function of the Soundpost’, CASN, no.21 (1974), 27–8

W.M. Fulton: Old Italian Varnish’, The Strad, lxxxv (1974–5), 491–501

G. Bissinger: Tuning the Bassbar in a Violin Plate’, CASN, no.26 (1976), 10–12; no.30 (1978), 23–7

D.W. Haines: On Musical Instrument Wood’, CASN, no.31 (1979), 23–32 [RP]

H.A. Müller: The Function of the Violin Bridge’, CASN, no.31 (1979), 19–22

G. Weinreich and E.B. Arnold: Method of Measuring Acoustic Radiation Fields’, JASA, lxviii (1980), 404–11 [RP]

C.M. Hutchins: The Acoustics of Violin Plates’, Scientific American (1981), 170–86 [RP]

M.A. Hutchins: Acoustical Parameters for Violin and Viola Back Wood’, CASN, no.35 (1981), 29–31 [RP]

L.W. Condax: Final Summary Report of Violin Varnish Research Project’, CASN, no.36 (1982), 31–6

J.A. Moral and E.V. Jansson: Eigenmodes, Input Admittance, and the Function of the Violin’, Acustica, l (1982), 329–37 [RP]

M.A. Hutchins: Physical Measurements on Sampling of European Spruce and Maple for Violin Top and Back Wood’, CASN, no.40 (1983), 28–30

G. Weinreich: Violin Radiativity: Concepts and Measurements’, SMAC 83: Proceedings of the Stockholm Music Acoustics Conference: Stockholm 1983, ii, 99–109 [RP]

V. Bucur and R. Archer: Elastic Constants of Wood by an Ultrasonic Method’, Wood Science and Technology, no.18 (1984), 255–65

L. Cremer: The Physics of the Violin (Cambridge, MA, 1984)

M.E. McIntyre and J. Woodhouse: On Measuring Wood Properties’, JCAS, no.42 (1984), 11–15; no.43 (1985), 18–24; no.45 (1986), 14–24

C.M. Hutchins: Effects of an Air-Body Coupling on the Tone and Playing Qualities of Violins’, JCAS, no.44 (1985), 12–15 [RP]

K.D. Marshall: Modal Analysis of a Violin’, JASA, lxxvii (1985), 695–709 [RP]

V. Bucur: Varieties of Resonance Wood and their Elastic Constants’, JCAS, no.47 (1987), 42–8 [RP]

C.M. Hutchins: The Effects of Five Years of Filler and Varnish Seasonings on the Eigenmodes in Four Pairs of Viola Plates’, JCAS, no.48 (1987), 25–6 [RP]

G.A. Knott: A Modal Analysis of the Violin using MSC/NASTRAN and PATRAN (thesis, Naval Postgraduate School, Monterey, CA, 1987) [RP]

W.J. Trott: The Violin and its Bridge’, JASA, lxxxi (1987), 148–54 [RP]

N.-E. Molin, L.-E. Lindgren and E.V. Jansson: Parameters of Violin Plates and their Influence on the Plate Modes’, JASA, lxxxiii (1988), 281–90 [RP]

O.E. Rodgers: The Effect of the Elements of Wood Stiffness on Violin Plate Vibrations’, JCAS, 2nd ser., i/1 (1988), 2–8 [RP]

C.Y. Barlow and J. Woodhouse: Of Old Wood and Varnish: Peering into a Can of Worms’, JCAS, 2nd ser., i/4 (1989), 2–9 [RP]

J.I. Dunlop: The Acoustic Properties of Wood in Relation to Stringed Musical Instruments’, Acoustics Australia, xvii/2 (1989), 37–40 [RP]

C.M. Hutchins: A Measurable Controlling Factor in the Tone and Playing Qualities of Violins’, JCAS, 2nd ser., i/4 (1989), 10–15 [RP]

H. Dünnwald: Ein erweitertes Verfahren zur objectiven Bestimmung der Klangqualität von Violinen’, Acustica, lxxi (1990), 269–76 [RP]

O.E. Rodgers: Influence of Local Thickness Changes on Violin Free Plate Frequencies’, JCAS, 2nd ser., i/5 (1990), 13–16 [RP]

O.E. Rodgers: Relative Influence of Plate Arching and Plate Thickness Patterns on Violin Back Free Plate Tuning’, JCAS, 2nd ser., i/6 (1990), 29–33 [RP]

O.E. Rodgers and T.R. Masino: The Effect of Wood Removal on Bridge Frequencies’, JCAS, 2nd ser., i/6 (1990), 6–10

N.H. Fletcher and T.D. Rossing: The Physics of Musical Instruments (New York, 1991)

C.M. Hutchins: A Rationale for Bi-Tri Octave Plate Tuning’, JCAS, 2nd ser., i/8 (1991), 36–9 [RP]

M.A. Hutchins: Effects on Spruce Test Strips of Four-Year Application of Four Different Sealers plus Oil Varnish’, JCAS, 2nd ser., i/7 (1991), 11–16 [RP]

C.M. Hutchins: A 30-Year Experiment on the Acoustics and Musical Development of Violin-Family Instruments’, JASA, xcii (1992), 639–49 [RP]

C.M. Hutchins and O.E. Rodgers: Methods of Changing the Frequency Spacing (Delta) between the A1 and B1 Modes of the Violin’, JCAS, 2nd ser., ii/1 (1992), 13–19 [RP]

N.J.-L. Fang and O.E. Rodgers: Violin Soundpost Elastic Vibration’, JCAS, 2nd ser., ii/1 (1992), 39–40 [RP]

J.E. Miller: Spectral Measurements of Violins’, JCAS, 2nd ser., ii/4 (1992), 1–4 [RP]

C.M. Hutchins: The Effect of Relating the Tailpiece Frequency to that of Other Violin Modes’, JCAS, 2nd ser., ii/3 (1993), 5–8 [RP]

J. Meyer: The Sound of the Orchestra’, Journal of the Audio Engineering Society, xli (1993), 203–13 [RP]

O.E. Rodgers: Influence of Local Thickness Changes on Violin Top Plate Frequencies’, JCAS, 2nd ser., ii/3 (1993), 14–16 [RP]

V. Bucur: Acoustics of Wood (Boca Raton, FL, 1995)

R.T. Schumacher and J. Woodhouse: Computer Modelling of Violin Playing’, Contemporary Physics, xxxvi (1995), 79–92

H.A. Müller: How Violin Makers choose their Wood and what the Procedure means from a Physical Point of View’, Research Papers in Violin Acoustics 1975–1993, ed. C.M. Hutchins (Woodbury, NY, 1996)

B.E. Richardson: Stringed Instruments: Plucked’, Encyclopedia of Acoustics, ed. M.J. Crocker (New York, 1997)

G. Weinreich: Directional Tone Colour’, JASA, ci (1997), 23–38

J. Woodhouse: Stringed Instruments: Bowed’, Encyclopedia of Acoustics, ed. M.J. Crocker (New York, 1997)

C.M. Hutchins: The Air and Wood Modes of the Violin’, Journal of the Audio Engineering Society, xlvi (1998), 751–65 [RP]

Acoustics

III. Keyboard string instruments

1. Foundations.

2. Clavichord.

3. Harpsichord.

4. Piano.

BIBLIOGRAPHY

Acoustics, §III: Keyboard string instruments

1. Foundations.

Keyboard string instruments are distinguished by key-operated mechanical devices which are used initially to shock the strings into free vibration, and finally to damp these free vibrations. Differences in the shape, size, material and function of these percussive and damping devices create most of the characteristic variations at the beginnings and ends of the notes. These differences also influence the timbre while the strings are vibrating freely, because the initial shock determines the waveform of the ensuing vibrations. There are other acoustical factors, such as the stiffness of the strings, the wrapping of the bass strings, the gradual damping of tone at the string supports, the number of strings per key, the acoustical properties of the structure supporting the strings and, in some cases, the effect of passive sympathetic strings.

The clavichord, harpsichord and piano all have systems of individually tuned strings lying approximately in a plane. Each string is under tension between a hitch-pin, secured in the string-supporting structure, and a tuning-pin which during tuning is rotated to and fro in a wooden pinblock. Between these two pins and generally near each of them the string bears against a point of sliding contact. The distance between the two intermediate contact points defines the length of the tuned vibrating string segment. Each string can vibrate freely at a series of frequencies which would be a harmonic series if the string were perfectly flexible. Because wire strings have some stiffness (remotely resembling a bar) the series of natural frequencies for string vibration is slightly, but audibly, inharmonic. Research has shown that this minute degree of inharmonicity is an important characteristic of the timbre. The inharmonicity also causes the optimum tuning of the instrument scale to be slightly stretched (flat on low notes, sharp on high notes) from the strictly mathematical scale of 2:1 octaves.

In keyboard string instruments the playing-key action causes a dynamic string excitation event. Some impact sound occurs immediately at the point of impact, and more comes through the string vibration system soon after. This is the ‘cause’ sound. The nature of both the impact and the resulting string excitation differs for each of the three major types of instrument, but in each case the shape of at least a portion of the tuned string segment is suddenly and momentarily changed. This string shape change is the pulse wave which travels up and down the string between the two contact points after the event occurs. This is the ‘effect’ sound.

The pulse wave is partially reflected back and forth repeatedly at the string contact points, causing the string vibration and the resulting note to sustain. However, some of the stored wave energy is removed during each reflection, eventually causing the note to die away. This vibratory energy, removed at the end of each round trip of the wave, vibrates the supporting structure, particularly the soundboard from which most of the sound energy enters the air. Only a small part of the sound reaches the air directly from the motion of the strings. The string typically vibrates at many frequencies simultaneously. Because these frequencies lie almost in a harmonic series, the listener hears the combination as a single note having a pitch corresponding to the lowest frequency in the series. The relative amount of sound produced at each frequency within the complex tone largely determines the timbre during the ‘effect’ portion of the tone. This depends on the shape of the pulse given to the string at impact, the response of the contact point of the supporting structure at each frequency of string vibration and the efficiency of the structure at each frequency in the transformation of vibration into sound.

In Table 1 the structural and mechanical differences between the three major types of keyboard string instrument are shown; the acoustical effects are explained below. For diagrams of the mechanisms described below, see articles on individual instruments.

Factor

 

Clavichord

Harpsichord

Piano

 

 

 

 

 

 

strings:

number per key

1 or less

1 per footage

2/3

 

material

brass/steel

brass/steel

steel

 

diameter stiffness

smaller

smaller

larger

 

tension

low

medium

high

 

bass wrapping

fine gauge

heavier gauge

 

 

 

 

 

 

 

 

 

 

 

structure:

string terminations

tangent, bridge

bridge, bridge

cast iron, bridge

 

soundboard size

small

small to medium

medium to large

 

structure material

wood

wood

metal and wood

 

structure mass

light

medium

heavy

 

 

 

 

 

 

 

 

 

 

 

excitation:

mode

strike-hold

pluck

strike-rebound

 

exciter material

metal

quill, leather

felt

 

exciter shape

edge

tapered

curved

 

exciter size

small

small

large to medium

 

 

 

 

 

 

 

 

 

 

 

damping:

mode

threaded through strings

jack weight/spring

damper head

 

damper material

cloth

felt/cloth

felt

 

damper shape

strip

flat

V/flat

 

damper size

continuous

small

large

 

 

 

 

 

 

Acoustics, §III: Keyboard string instruments

2. Clavichord.

The clavichord is typically a rectangular box with the keyboard occupying the left two-thirds of the long side. The strings are stretched in a horizontal plane from hitch-pins near the left end to tuning-pins near the right end. In the right end there is a very small horizontal soundboard supporting a bridge (sometimes segmented) which provides one of the downbearing contact points for each string. The longer bass strings are closest to the keyboard; the shorter treble strings are near the rear. Each playing key is pivoted on a balance-rail, and the key is guided by a rear tongue which rides up and down in a vertical slot. The unique mechanical feature of the clavichord action is the tangent, a wedge-shaped piece of iron or brass borne upon the rear key extension. As the key is depressed, the tangent rises and strikes the string directly, remaining in contact with the string until the key is released. The tangent thus serves two functions: its impact creates the string excitation without the aid of any intermediate action mechanism; and it provides the second downbearing contact point for its associated string, thus defining the length of the vibrating string segment and controlling its pitch frequency. In other words, the striking point and the termination point of the tuned segment are the same.

In many clavichords the same string is used for several notes that would seldom be played in combination (e.g. C and C), by positioning the tangents at different points along the same string. Such instruments are known as fretted clavichords. Some others employ individual strings for greater musical versatility.

The sustained contact between the key-borne tangent and the struck string gives the clavichordist a ‘Bebung’ tonal effect not obtainable with other keyboard string instruments. When the player varies the key force periodically (after striking) the string tension fluctuates, producing a pitch vibrato.

Between the tangent strike points and the hitch-pins there are strips of cloth interwoven among the strings. They quickly damp string vibrations to the left of the strike point where the string segment would not be properly tuned. They also damp the entire string vibration when the tangent contact with the string is released, making a separate damper action unnecessary.

Clavichords have several inherent acoustical advantages and disadvantages. The metal edge striking at the string termination excites a complete frequency series. The pulse shape sharpens with harder blows, giving dynamic range in brilliance. However, both the maximum tangent velocity and the area of the soundboard are so small that the clavichord is limited in output. The direct connection through the key from the player’s finger to the string permits ‘Bebung’ vibrato, but it also provides mechanical damping which shortens note duration.

Acoustics, §III: Keyboard string instruments

3. Harpsichord.

Members of the harpsichord family have one of three orientations for the usually horizontally stretched strings. In the small rectangular virginals the strings run crosswise as in the clavichord; in the small wing-shaped spinet the strings extend obliquely from tuning-pins just behind the keyboard to hitch-pins in a row curving away to the right of the keyboard; and in the large harpsichord the strings extend directly away from the keyboard. This shape difference and the additional long bass strings make the harpsichord a larger instrument. Larger soundboards can provide greater tonal efficiency and fuller timbre.

In all three the strings are excited by the upward plucking action of a quill or plectrum which engages with, then releases the string. The plectrum projects from the central tongue portion of a narrow vertical jack which slides within guides, and is supported on the rear of the pivoted playing key. In large harpsichords there are two or more sets of strings and two or more manuals of keys. Thus different plectra borne by the same key can simultaneously pluck strings related by octave in the different string sets. The same string can also be plucked at different points along its length by plectra from different manuals. The closer the plectrum is to the string termination, the stronger are the higher, brighter partials of the note. The farther away, the stronger are the lower partials. Plectra may be quill, leather, wood, metal or plastic. The harder materials and the sharpest edges give the brightest timbre.

The most important acoustical difference between harpsichords and other keyboard string instruments is the plucking initiation of notes. The smallness and the sharpness of the plectrum edge, and the suddenness of string release as the plectrum passes on, produce a string waveform with ample high-pitched partials, giving the characteristic brilliance of harpsichord tone. In contrast with clavichord action, the string exciter leaves the string immediately and cannot absorb string vibration and so shorten note duration. The return of the plectrum on release of the playing key often produces a weak second excitation of the string. This stroke is minimized by the pivoting of the returning jack tongue. Although the resulting sound is a subtle characteristic of harpsichord tone, it happens so immediately before the jack-borne damper reaches the string that it sounds like part of the damping action.

Acoustics, §III: Keyboard string instruments

4. Piano.

The first Cristofori piano was a harpsichord with the string-plucking action replaced by an upstriking hammer action with escapement. This freed the leather-covered hammer before impact, allowing it to strike the string and bounce back, in contrast with the tangent action of the clavichord. The hammer action gave both piano and forte levels with fully controlled gradations in between, introducing to a concert situation the keyboard dynamic expression previously possessed only by the clavichord, which is too quiet to be played with other instruments. Cristofori’s instrument was the predecessor of the grand piano. The piano hammers were larger and more rounded than the previous tangents and plectra, imparting a larger, rounder pulse shape to the strings. This gave more power in the low partials and less in the higher ones, making piano tone fuller than that of the harpsichord. Hammer sizes and shapes were smaller and more pointed for high notes, where brilliance is needed.

Later a smaller piano was developed by combining a simple, upstriking hammer action with a string vibration system derived from that of the clavichord. Because it was small, this piano was acoustically weak. Many different action mechanisms were invented for this popular instrument, the rectangular or ‘square’ piano.

The substitution of hammers for plectra allowed multiple coplanar unison strings for each note, which had not been feasible in harpsichords. This increased tonal loudness and produced other advantages described below. Forward-striking hammer actions were developed for vertical piano string systems, requiring less floor space than horizontal pianos. Cast iron plates to support string forces and steel piano wire of higher tensile limits permitted larger wire diameters and higher-tension strings, increasing tonal power and brilliance. Compressed felt replaced leather hammer covering, softening the tone and increasing hammer control and durability. Crossing bass strings over tenor strings reduced piano size, and more oblique cross stringing and drop actions led to the small vertical piano.

Multiple strings for each individual note contribute significantly to the distinctiveness of piano tone, adding a choral effect to each tone because of slight differences in string tuning. Research has shown that slight detuning of unison strings (less than 0·1%) is aurally preferred to mathematically exact tuning; experienced piano tuners typically leave this tuning margin. Immediately after piano hammer impact the multiple strings vibrate in close synchronism. At this time the rate of power transfer to the bridge and soundboard is maximal, causing rapid tone diminution initially. Later the strings gradually become asynchronous because their frequencies are slightly different, and the note dies away more slowly. This characteristic dual decay rate in well-tuned pianos lets successive notes stand out clearly over recently played, sustained notes, and this has influenced the development of composition for the piano. The vibrations of struck strings travel along the piano bridge to other strings, which can vibrate sympathetically when the sustaining pedal lifts the dampers, producing a stronger choral effect. In grand pianos the soft pedal moves the action transversely, reducing the number of strings the hammer strikes.

Except in the treble range, each piano note contains many partials, with the strongest lying between 100 and 1000 Hz. Hammer-string impact sound, which spreads throughout the pitch range, is an important characteristic of piano tone but is noticed only in the treble, where the partials are too sparsely spaced to conceal it.

Standard procedures used by piano tuners for tuning octaves produce a stretched scale. Upper notes are higher and lower notes lower than strict equal temperament by about a third of a semitone at each extreme, an effect resulting from slight inharmonicity of the partials of string tone. Research has shown that pianists and listeners prefer the piano scale stretched this way, and attempts to synthesize true piano tone from a strictly harmonic series of partials have had limited success.

The science, engineering and art that combine in the evolution of the piano comprise what is known as ‘scale design’, the configuration of tonally related major structural parts. These include the strings, hammers, dampers, bridges, soundboard, plate and, to some extent, the case, along with their material properties.

Acoustics, §III: Keyboard string instruments

BIBLIOGRAPHY

F. Trendelenburg, E. Thienhaus and E. Franz: Zur Klangwirkung von Klavichord, Cembalo und Flügel’, Akustische Zeitschrift, v (1940), 309

D. Martin: Decay Rates of Piano Tones’, JASA, xix (1947), 535–41

R.E. Kirk: Tuning Preferences for Piano Unison Groups’, JASA, xxxi (1959), 1644–8

D. Martin and W.D. Ward: Subjective Evaluation of Musical Scale Temperament in Pianos’, JASA, xxxiii (1961), 582–5

D. Droysen: Akustische Untersuchungen an Tasteninstrumenten des 18.–20. Jahrhunderts’, GfMKB: Leipzig 1966, 416–23

S. Tomek-Schumann: Akustische Untersuchungen an Hammerflügeln’, JbSIM (1975), 127–72

R.-D. Weyer: Time-Varying Amplitude-Frequency-Structures in the Attack Transients of Piano and Harpsichord Sounds’, Acustica, xxxv (1976), 232–52; xxxvi (1976), 241–58

E.L. Kent, ed.: Musical Acoustics: Piano and Wind Instruments (Stroudsburg, PA, 1977)

V.G. Porvienkov, M.V. Gridněv and N. Čelnokov: Hodnocení hlasitosti a barvy zvuku klávesového nástroje’ [Measuring the volume and colour of the sound of keyboard instruments], Hudební nástroje, xv/1 (1978), 14–17

U.R. Müller: Influence of Ribs on the Acoustic Behaviour of Piano Resonant Plates’, Archives of Acoustics, v (1980), 147–55

K. Wogram: Die Bedeutung nichtstationärer Schwingungsvorgänge für die Bewertung von Musikinstrumenten’, Qualitätsaspekte bei Musikinstrumenten, ed. J. Meyer (Celle, 1988), 23–34

J. Meyer and K. Wogram: Perspektiven der Musikinstrumentenakustik’, Acustica, lxix (1989), 1–12

M. Podlesak and A.R. Lee: Effect of Inharmonicity on the Aural Perception of Initial Transients in Low Bass Tones’, Acustica, lxviii (1989), 61–6

A. Askenfelt, ed.: Five Lectures on the Acoustics of the Piano (Stockholm, 1990)

A. Askenfelt and E. Jansson: From Touch to String Vibrations’, JASA, lxxxviii (1990), 52–62; xc (1991), 2383–93

A.I. de La Campa: Aproximación analítica a la interpretación en el piano (Madrid, 1990)

H. Suzuki and I. Nakamura: Acoustics of Pianos’, Applied Acoustics, no.30 (1990), 147–205

X. Boutillon: Aperçu général sur les modèles physiques de piano (Paris, 1991)

A. Chaigne and A. Askenfelt: ‘Numerical Simulations of Piano Strings’ JASA, xcv (1994), 1112–18, 1631–40

I. Bork, H. Marshall and J. Meyer: On the Radiation of Impact Noises from a Grand Piano’, Acustica, lxxxi (1995), 300–08

H. Conklin: Design and Tone in the Mechanoacoustic Piano’, JASA, xcix (1996), 3268–96; c (1996), 695–708, 1286–98

Acoustics

IV. Wind instruments

1. Introduction.

2. Modes of oscillation of an air column.

3. Maintenance of oscillation.

4. Musically useful air column shapes.

5. Brass instruments.

6. Reed instruments.

7. Flute.

8. Wind instrument tone-colour.

9. Early wind instruments.

BIBLIOGRAPHY

Acoustics, §IV: Wind instruments

1. Introduction.

Every wind instrument consists of a long and carefully shaped duct coupled to an airflow control system that converts the steady wind supply from the player’s lungs, or from the wind chest of a pipe organ, into oscillations of the instrument’s air column. The mechanism for controlling the airflow can be the reed of a clarinet or bassoon, the vibrating lips of a trumpet player, or the less easily visualized steering of a jet of air from the flautist’s lips as it travels across the embouchure hole. In each of these the flow control device sends puffs of air in a regularly varying sequence into the instrument’s mouthpiece to keep the air column oscillating in its longitudinal vibratory motion. The nature and timing of these puffs are in turn controlled by acoustical variations taking place within the mouthpiece, these being a manifestation of the air column’s own oscillations. In order to make this two-part oscillating system useful for musical purposes, the performer must be able to select one or another of the possible sounds that it can generate. A bugler who plays the notes of a call is making such a selection, as is a woodwind player who uses a single fingering to sound notes in the bottom and second registers of the instrument. A musician is able to fill in the remaining gaps left in the musical scale by transforming the original air column of the instrument into a longer or shorter one. These will give alternative sets of sounds from which he can choose according to his needs. In modern brass instruments, changes in air column length are produced by the addition of various lengths of tubing, as exemplified by the slide of a trombone or the valve loops of a french horn. In woodwind instruments these length changes are accomplished by opening a greater or lesser number of tone holes arranged to pierce the air column wall at various points along it.

To be successful the design of a wind instrument must achieve several relationships between the air column and its flow control device: the two must be able to work together to permit the prompt and stable production of each one of the various notes in the scale, and these must be under good control by the player; the pitches of these notes must lie close to those belonging to the musical scale; and the tone-colour of the generated sounds must conform to aesthetic standards, which may vary from period to period and from nation to nation. The nature of these relationships and the way they may be attained is briefly outlined below.

Acoustics, §IV: Wind instruments

2. Modes of oscillation of an air column.

The sloshing of water in a length of rain gutter is made up of oscillatory motions which provide some insight into analogous motions that take place in the air column of a wind instrument. Fig.45 shows a water-filled trough and the first three modes of oscillatory motion that are possible within it; these modes are those having the lowest frequencies of oscillation. Observation of the motion of the water in such a trough makes it quickly apparent that the vertical motion of the water level is very different from the water’s back-and-forth flowing motion: points of small horizontal motion lie at points of maximum vertical motion; conversely, the vertical motion displays its nodes at those points where the horizontal motion is large. Vertical and horizontal motions are inextricably coupled to one another by the fact that the only way to change the water level at some point is to have water flowing towards or away from it.

There are important differences between the sloshing motion of water in a gutter and the oscillatory flow of air in a musical wind instrument: the first is a wave on the surface of an incompressible liquid, while the second is a longitudinal wave in a compressible gas. Nevertheless, a useful analogy can be drawn between the two types of wave motion. The vertical motion of the water (a consequence of a localized inflow or outflow of water) is the cognate of the rise and fall of air pressure at some point within the duct; obviously such changes in air pressure arise from the flow of air. Because woodwind reeds and brass players’ lips are ultimately controlled by pressure variations acting on them within the mouthpiece, and also because eardrums respond directly to the acoustic pressure variations exerted on them by the surrounding air, it is generally convenient to describe the behaviour of air columns in terms of the pressure aspect of their vibrations.

Every air column, regardless of its shape and the way in which its ends are terminated, has its own characteristic collection of vibration modes; each of these modes has its own pattern of flow and pressure and its own frequency of oscillation. Air columns of different shapes not only have different frequencies for their various modes of oscillation but also different ratios between them. It is a straightforward (though sometimes tedious) business to calculate the frequencies of an air column regardless of its shape, or, conversely, to calculate the shape needed to give a specified set of frequencies. As discussed in §4 below, there are only a very few basic air column shapes that can be coupled to a flow control device in order to make a useful note generator. It is a fortunate circumstance that these same shapes are also compatible with the requirements for proper tuning.

Acoustics, §IV: Wind instruments

3. Maintenance of oscillation.

A water analogue to a musical instrument can help one to visualize the way a flow control device maintains a steady oscillation. Fig.46 shows a device that could be called a water trumpet, in which a water supply valve is arranged to open or close progressively in response to the rise and fall of the water level at the shallow end of a tapering channel containing water. The end at which the valve is located is analogous to the mouthpiece end of a trumpet, and the valve itself replaces the player’s lips. A water valve arranged to open and close in this manner acts as a flow controller worked by variations in water height as it maintains oscillations in the duct. In a similar way, the single and double reeds of orchestral woodwind instruments and the lips of brass players function as flow controllers of the pressure-operated type, in that they open and close under the predominating influence of acoustic pressure oscillations within the mouthpiece cavity of the instrument. The generic name ‘reed-valve’ will be used for all these pressure-operated controllers, including those associated with brass instruments.

Any acoustic disturbance has a pattern of flows intertwined with a corresponding pattern of pressure variations. If attention is directed to the flow variations in a wind instrument air column, they should suggest the possibility of another kind of controller that operates by the flow aspect rather than the pressure aspect of the disturbance within the mouthpiece; this other type is found in the flute family.

In 1877 Helmholtz presented the simple theory of the maintenance of oscillation by means of a reed-valve, showing that such oscillations tend to occur at one or other of the natural frequencies of the air column to which the reed-valve is attached (see also Backus, 1963). Wilhelm Weber had already in 1830 elucidated the influence of the reed’s own elasticity on these natural frequencies (see Bouasse, 1929–30). It remained for Bouasse in the late 1920s to recognize that under certain conditions several modes of air column oscillation can act simultaneously on the reed-valve to facilitate the maintenance of a note. The implications of Bouasse’s observations were worked out and given practical application by Worman (1971) and Benade (1973).

The nature of the collaborative effect of several resonances can be summarized in the following terms. The various characteristic air column modes influence one another via the shared excitatory airflow. The valve must therefore come to terms with the oscillatory preferences of these modes to produce a self-consistent oscillation that includes several harmonically related frequency components in setting up what is known as a ‘regime of oscillation’. The name is chosen deliberately to draw attention to what can metaphorically be considered as political negotiations taking place between the air column’s own set of vibrational tendencies and those of the reed-valve, with the alliances changing as varying musical conditions give dominance to different members of the regime. Oscillation is particularly favoured when the air column has two or more natural frequencies arranged to coincide with the harmonics of the note being produced.

The operation of the reed-valve depends crucially on the fact that the relationship between the pressure on the downstream side of the valve and the rate at which air flows through it is non-linear (that is, a change in pressure does not simply produce a proportional change in air flow). It is this non-linearity which allows an initially unidirectional flow of air from the player’s mouth to destabilize the reed, resulting in the development of a periodic oscillation of the coupled system of reed and air column. The non-linearity is also an essential ingredient in the locking together of different air column modes into an oscillation regime with stable phase relationships. In recent years the theory of non-linear dynamics has been fruitfully employed to investigate such issues as the minimum blowing pressure necessary to make a wind instrument sound, and the changes in pitch, loudness and timbre which take place as the blowing pressure is increased beyond this threshold (see McIntyre and others, 1983; Fletcher, 1992; Kergomard, 1995; and Grand and others, 1996).

Acoustics, §IV: Wind instruments

4. Musically useful air column shapes.

We have seen that a stable regime of oscillation is more easily achieved in a wind instrument if it has a set of air column modes whose frequencies are in a harmonic relationship. No real wind instrument air column has exactly harmonic mode frequencies, but a conical tube complete almost to the apex comes quite close to this ideal. For this reason many woodwind instruments have approximately conical tubes: the oboe, bassoon and saxophone come into this category.

A cylindrical tube open at both ends also has a set of modes whose frequencies are, to a good approximation, members of a complete harmonic series. This is the acoustical basis of the flute, which has an effectively open embouchure hole at its upper end. A cylindrical tube closed at one end and open at the other has a set of harmonically related modes, but the even members of the series are missing. This is the case with the clarinet, which has a cylindrical tube effectively closed at the upper end by a cane reed.

In the brass instrument family, the player’s lips form a reed-type control valve which effectively closes the upper end of the tube. The bugle and the Swiss alphorn can be classed as conical instruments; so can the flugelhorn, saxhorn and many types of tuba, at least when the valves are not depressed. A trumpet or trombone, on the other hand, has cylindrical tubing over a substantial fraction of its length, with a final section which flares more and more rapidly into a pronounced bell. If a trombone consisted only of the cylindrical slide-section, it would have an odd-member-only harmonic series of modes, like the clarinet. The addition of the flaring section, together with the cup-shaped mouthpiece at the entrance, not only lowers all the mode frequencies but also reduces the pitch intervals between the modes. On a well-designed trombone or trumpet, the modes from the second upwards have frequencies close to a complete harmonic series, although the first mode frequency is much too low to fit the series.

Extending the slide on a trombone, or depressing valves on a trumpet or horn, increases the length of cylindrical tubing in the instrument. This lowers all the mode frequencies, but also increases the pitch intervals between the modes. It is thus impossible to achieve an ideal harmonic mode relationship for all slide positions or valve combinations. Similarly, opening side holes on a woodwind instrument does not merely shorten the effective length of the tube; it also changes the pitch intervals between the modes of the air column. There is thus no simple solution to the problem of designing a successful wind instrument with a large chromatic compass. Fortunately the non-linear coupling between air column and control valve can provide a strong oscillation regime even when only two or three modes have an approximately harmonic relationship, so the problems are less severe than they might at first appear.

Acoustics, §IV: Wind instruments

5. Brass instruments.

The vibration of a brass player’s lips is controlled by the oscillatory pressure present in the mouthpiece: this pressure is an aspect of the air column’s own oscillation. It is convenient to characterize the air column itself with the help of measurements carried out with an electronically operated pump (a special type of miniature loudspeaker) that produces a sinusoidally varying flow of air in and out of the mouthpiece cavity at any desired frequency. A tiny microphone placed inside the mouthpiece measures the amplitude of the resulting pressure variations. This microphone gives the desired air column response information, which can be displayed on a graph as a function of the pump driving frequency (see Benade, 1973). Such a graph will be called a ‘pressure response curve’; formally it is known to acousticians as an input impedance curve. Fig.47 shows an example of such a curve for a modern B trumpet; its general nature is typical of the pressure response curves of all brass instruments. Each of the peaks on this curve indicates a large pressure variation within the mouthpiece cavity, and each peak corresponds to excitation at the frequency of one of the modes of the air column.

As an illustration of the usefulness of a pressure response curve, consider what fig.47 shows for the playing of the written note c'. The figure indicates that a regime of oscillation is set up for this note involving response peaks 2, 4, 6 and 8 of the air column, which collaborate with the player’s lips to generate a steady oscillation containing many harmonically related partials. The lowest four of these partials get their major sustenance from the peaks named. When the trumpeter plays very softly, peak 2 dominates the oscillation and, because it is not very tall (i.e. the given excitation produces only a mild oscillation in this mode), the note is not well stabilized. As the musician plays louder the other peaks become influential and the note is steadier and better defined. The regime of oscillation for the written note g' is dominated by peak 3 with the cooperation of peaks 6 and 9. Since peak 3 is taller than peak 2, at pianissimo playing levels g' will be steadier than c'. During a crescendo the tall sixth peak enters the regime for g' and greatly stabilizes the oscillation, which gains some help also from peak 9. These are the acoustical reasons why g' is one of the easiest notes to play on a trumpet.

Further examination of fig.47 shows why the notes become increasingly hard to play as one moves up the scale. For example, g'' is still fairly easy to play softly because it is fed by the tall sixth peak; but during a crescendo it becomes progressively more ‘stuffy’, because the increasing dissipation of acoustic energy via the generation of higher partials in the note is inadequately offset by the contributions made by the small 12th peak. The note c''' is difficult at all levels, since it is sustained only by the eighth peak, which is not particularly tall and which, moreover, has no assisting peak in the neighbourhood of its second harmonic (near 1864 Hz).

In every case described above, the tuning of each note is determined not only by the resonance peak closest to the nominal frequency of the note, but also by any other peaks that lie near whole-number multiples of this frequency. If errors in the shape of the air column lead to resonance peaks that are not exactly in harmonic relationship, not only is the steadiness and clarity of the note spoiled by the less than perfect cooperation at forte levels, but also the player must compensate for pitch shifts that take place during crescendos and diminuendos as the misplaced resonances gain or lose their votes in the regime. The fact that misplaced resonances lead to changes of playing pitch during a crescendo provides the basis for an extremely sensitive technique for the adjustment of the proportions of brass instruments.

It has already been shown that the positions of the various resonance peaks are crucial to the proper speech of a wind instrument, and there have been hints that their frequencies may systematically be adjusted by suitable modifications of the air column shape. For example, if a mouthpiece is to work properly on any given brass instrument, there is a certain critical relationship that must be maintained between its total volume (cup plus backbore) and its ‘popping frequency’, which is the frequency of the sound made by slapping the rim closed against the heel of the hand. Similar adjustments can be made at the other end of the instrument. Skilled horn players develop great sensitivity in moving their right hand as they go from note to note. The placement of the hand in the bell ekes out the last bit of perfection in the alignment of the resonances of the instrument. On other brass instruments it is left to the maker to carry out similar but fixed adjustments of the bells.

In an air column’s resonance curve, the positions and alignment of the peaks are important, but so also are their heights. For example, the mouthpiece of a brass instrument has an acoustical duty beyond the simple one of helping to achieve suitable frequency relationships between the peaks: it is responsible also for the increased height of the middle four or five peaks relative to their low- and high-frequency neighbours, the maximum peak height lying roughly in the region of the mouthpiece popping frequency itself. Without this area of added height in the response peaks, even a perfectly aligned brass instrument tends to be difficult to play.

The bell also plays an important role in influencing the height of the resonance peaks, in that it causes the disappearance of the peaks above a certain frequency, determined by its rate of flare. The presence of the horn player’s hand in the bell raises the frequency above which there are no resonances, which allows the pressure response curve to have half a dozen additional resonance peaks. If these additional peaks are properly aligned, they will join with the other peaks to stabilize various regimes of oscillation and will also raise the upper limit of the player’s range. Much of the confusion surrounding the phenomenon of handstopping is resolved when proper account is taken not only of the fact that moving the hand rearranges the peaks which collaborate in producing the note, but also of the fact that handstopping makes additional peaks available to the collaboration.

Acoustics, §IV: Wind instruments

6. Reed instruments.

It was mentioned in §4 above that many reed woodwind instruments are based on the conical air column shape. In such instruments the apical segment of the prototypical cone is replaced by a reed cavity (or mouthpiece) with a staple, neck or bocal, while the lower, large end of the active bore extends down to the first of a row of open tone holes. It is important to recognize that the presence of closed tone holes on the bore significantly alters its acoustical behaviour.

Fig.48 shows the pressure response curves measured on an oboe for the air columns (including staple and reed cavity) used in playing the low-register notes b', f' and b (the curves for intermediate notes follow the trend implied in the figure). These curves are closely similar to those found for notes having similar fingering on the other conical woodwind instruments.

In reed instruments each low-register note is produced by a regime of oscillation involving the first (lowest-frequency) resonance peak, along with one or more other peaks whose frequencies match those of the next higher partials of the note being played. For the note b' there are only two peaks that participate in the regime, the higher-frequency peaks being much less tall besides being inharmonically positioned. On a good instrument f' is a much more stable note, being based on a negotiated agreement between three accurately positioned resonance peaks. Once again it is notable that above about 1300 Hz the peaks are not very tall, and they are irregular in their placement. The note b near the bottom of the oboe’s scale is produced by a regime of oscillation involving four accurately harmonic and fairly tall peaks, with one less tall peak whose position is a little below the frequency of the tone’s fifth partial. The size, spacing and chimney length of the tone holes determine the frequency above which the air column resonance peaks become less tall and more irregular in their position. The behaviour is reminiscent of the manner in which the bell of a brass instrument puts an upper limit on the number of resonance peaks. This explains why the bell of a woodwind instrument (even that of an english horn) can be replaced by an extension of the main bore, if this is provided with a suitably designed set of additional tone holes.

The second register of a conical woodwind instrument’s playing range is produced by regimes of oscillation involving response peak 2, along with peak 4 if it exists. The question arises as to how the reed can be persuaded to operate in such a regime. As a general principle, when one plays pianissimo on a reed instrument, oscillation is favoured at the frequency of the tallest air column response peak, and intermode cooperative effects are relatively unimportant. Fig.48 shows that the fingerings for b' and f' give air columns that favour low-register playing under these conditions, whereas pianissimo playing using the b fingering favours sound production an octave higher at b' because of the deficient height of the first peak (a deficiency characteristic of all nearly complete conical air columns). The tendency of the lowest notes to jump an octave in soft playing plagues every saxophonist, and also causes problems of harshness and instability for the player of double-reed instruments. When the b fingering is used to play loudly, the reed prefers the low-register regime, based on all four peaks, to the two-part regime, based on peaks 2 and 4. The different behaviour of the regimes under loud and soft playing conditions explains the functioning of the register hole of a woodwind instrument. This hole must produce two changes: it must cause peak 1 to become less tall than peak 2 in order to assure second-register playing at a pianissimo level; and it must shift the frequency of peak 1, giving it an inharmonic relationship with the other peaks (an inharmonicity chosen to produce the maximum possible disruption of any cooperative effects) in order to assure second-register playing at the forte level. The dotted curves in the middle segment of fig.48 show how opening a register hole alters the heights and positions of peaks 1 and 3 for the f' fingering, leaving peak 2 unscathed and able to cooperate in the production of the note f'' at all dynamic levels.

As in the case of brass instrument mouthpieces, there is a fixed relationship between the proportions of the air column of a conical instrument and the reed cavity and neck or bocal with which it can function. The active volume of the reed cavity (under playing conditions), with that of the associated tube, must closely approximate the volume of the missing apical segment of the basic cone. Furthermore, the playing frequency of the reed with its cavity and neck must (when sounded using a normal embouchure) agree with that of a doubly open cylindrical pipe whose length is that of the missing part of the cone. Systematic methods are available for the mutual adjustment of the air column, tone holes, reed cavity and neck of conical woodwind instruments. These can be as helpful in the fitting of a proper reed to an ancient instrument as they are in the construction of a modern one. The best instruments of all eras show great consistency with the principles outlined above.

The clarinet family (which uses a basically cylindrical air column) has properties remarkably similar to those of the conical woodwind instruments described above. The fingering used to produce the clarinettist’s e' is analogous to that used for b' on the oboe. Once again it gives an air column with only two response peaks, while the lower notes of the scale are played using air columns with an increasing number of active peaks, exactly as before. As discussed in §4, the replacement of the conical air column by a cylindrical one has the effect of shifting the natural frequency ratios from the 1:2:3 etc. harmonic series to a sequence of the type 1:3:5 etc. made up of the odd members of a harmonic series. This has two noteworthy musical consequences. First the clarinet has an enormously large and easily controlled dynamic range in the low register. As one goes from pianissimo playing (peak 1 alone being active) to a mezzo-piano level, the nascent second harmonic partial in the note occurs at a dip in the resonance curve, preventing a tendency for an abrupt growth of tone which would otherwise result from its entry into the regime of oscillation (e.g. as tends to happen on the saxophone). Harder blowing progressively brings in the influence of the third harmonic partial as it cooperates now with response peak 2. The successive entry of cooperative and anticooperative influences as the odd and even partials become important is what makes a crescendo so easily manageable on a clarinet. The second consequence is that the notes of the clarinet’s second register sound a 12th above the corresponding low-register notes, rather than an octave above as among the conical instruments.

Acoustics, §IV: Wind instruments

7. Flute.

As remarked in §2 above the airstream from a flute player’s lips is steered alternately into and out of the instrument’s embouchure hole under the influence of the flow aspect of the air column’s oscillation. It is worthwhile to maintain the analogy with the reed instruments, referring to the controlled airstream informally as an ‘air-reed’ to distinguish it from the aerodynamicist’s ‘edge tone’, with whose action it is often confused. It will suffice to note that an air-reed sets up regimes of oscillation in conjunction with the dips rather than the peaks of the air column response curves. Apart from this the behaviour is strictly analogous to that of reed instruments.

There are three basic shapes of air column that provide adequate cooperation with an air-reed by giving harmonically related response dips: the cylindrical pipe, the contracting cone and the expanding cone. Only the first two shapes are in common use. Certain subtleties of the cooperative action of an air-reed require a contraction of the bore near the blowing end as compared with the trend of the main bore. Thus a cylindrical tube requires a contracted headjoint, as exemplified by the Boehm flute, while the Baroque flute has its conical taper contracted into a cylinder in the headjoint. In both cases the volume of the small cavity existing between the cork and the embouchure hole can be adjusted to match the proportions of the hole itself to the rest of the instrument and to its player.

Acoustics, §IV: Wind instruments

8. Wind instrument tone-colour.

There are four significant influences on the tone-colour of a wind instrument. First, varying the profile of the reed tip and the mouthpiece tip on single-reed instruments changes the relation between the shapes of the puffs of air that come through the reed and the acoustic stimulus (in the mouthpiece) that controls them, thus causing modifications in the strengths of the generated partials. Similar behaviour is observed in brass instruments, flutes and organ pipes. Second, the number of cooperating peaks in the regime of oscillation and their height directly influence the strengths of the partials of a note generated within the instrument. Partials that are directly sustained by the cooperating peaks have strengths roughly proportional to the height of the peaks. The higher partials, lying at frequencies where the peaks are irregular or nonexistent, are weak because they are by-products of the main oscillation. Third, the transmission of sound out of an instrument into the room via the bell of a brass instrument or via the set of open woodwind tone holes also affects the tone-colour. This transmission is small for the lower partials of the internally generated note, rising steadily to a maximum value at the frequency at which the resonance peaks disappear. The resulting ‘treble boost’, characteristic of the emission process, partially offsets the progressively weaker generation of the higher partials. One hears the aggregate result of both effects. Experiment shows that a rise of only 2–3% in the frequency beyond which there are no resonance peaks makes an easily perceived brightening of the tone-colour on any wind instrument. Fourth, misalignments in the resonances will make many changes in the whole sound, but the acceptability of such misalignments is limited by the accompanying deterioration in the responsiveness of the instrument.

In the family of trumpets and trombones, a further factor comes into play which is responsible for a dramatic increase in brightness of timbre in very loud playing. The adjectives frequently used to describe this tone quality – ‘brassy’ or ‘metallic’ (cuivré in French) – reflect a common misconception that the effect arises from vibration of the metal bell of the instrument. In 1996 Hirschberg and his co-workers showed that the cause is in fact the development of shock waves in the cylindrical section of the air column. At the point in the vibration cycle at which the player’s lips open, a large pressure jump is created in the mouthpiece. This pressure rise becomes steeper and steeper as it travels down the tube; by the time it reaches the bell it has become an extremely short and powerful pulse. The form of this shock wave is similar to that created by the passage of a supersonic aircraft, and the sound of a fortissimo g' on a trombone has been graphically described as ‘four hundred sonic bangs per second’ (Gilbert and Petiot, 1997).

Acoustics, §IV: Wind instruments

9. Early wind instruments.

The last quarter of the 20th century saw a remarkable growth of interest in the performance of music on instruments typical of the period in which the music was composed. Acoustical studies of surviving original instruments have helped to clarify some of the important differences between early and modern instruments, and have provided useful guidance to makers engaged in manufacturing reproductions (see Drinker and Bowsher, 1993; Benade, 1994; Campbell, 1994, 1996; Myers, 1995; and Escalas and others, 1998).

Some instruments have undergone a continuous and relatively subtle evolution. The trombone, which appeared after the mid-15th century, remained a very similar instrument acoustically during the following four centuries. In the 20th century the bore diameter of the typical orchestral trombone increased significantly, as did the diameter of the bell. These changes, together with modifications of the mouthpiece, increased the acoustic power of the instrument, and also tended to reduce the brightness of timbre, especially in loud playing.

The most dramatic change in the acoustics of trumpets and horns occurred in the early 19th century, with the invention of the valve. Before this time the air column length of a trumpet or horn was fixed, and the playing technique relied heavily on natural tones corresponding to the frequencies of the air column modes. On the horn the technique of hand-stopping was used to modify the mode frequencies to provide additional notes, although the changes in pitch were inevitably accompanied by some changes in timbre. Natural trumpet technique relied on the ability of the player to alter the pitches of certain natural notes by changes of embouchure, a technique described as ‘lipping’. It has been noted that the pressure response curves of Baroque natural trumpets have less sharply peaked resonances than do those of modern trumpets, making it easier to lip notes on the older instruments (Smithers and others, 1986).

Some wind instruments of the medieval and Renaissance periods fell completely out of use in the 18th and 19th centuries. Although an evolutionary line can be traced from the conical-bored double-reed shawms to the modern oboe and bassoon, the present-day orchestra contains no descendants of the crumhorn or the racket, which were double reed instruments with narrow-bored cylindrical air columns. Nor can modern equivalents be found for the cornett and the serpent, which were conical tubes with side finger holes, normally fitted with cup mouthpieces and sounded by the lips. The 17th-century serpent, although an instrument of great charm, suffered from serious acoustical problems related to the small size and irregular spacing of the fingerholes. These acoustical difficulties were largely overcome in the fully keyed ophicleide, which enjoyed considerable popularity in the 19th century. The cornett had developed an acoustically satisfactory form by the 16th century; its combination of lip excitation, short tube length and side hole fingering gave it a high degree of flexibility and agility. The cornett became the supreme wind instrument of the early Baroque period and has been successfully revived in the late 20th century by a new generation of virtuoso performers.

Acoustics, §IV: Wind instruments

BIBLIOGRAPHY

H. von Helmholtz: Die Lehre von den Tonempfindungen (Brunswick, 1863, 4/1877; Eng. trans., 1875/R, 2/1885/R, 6/1948, as On the Sensations of Tone)

H. Bouasse: Instruments à vent (Paris, 1929–30/R)

A.H. Benade: On the Mathematical Theory of Woodwind Finger Holes’, JASA, xxxii (1960), 1591–1608

J. Backus: Small-Vibration Theory of the Clarinet’, JASA, xxxv (1963), 305–13

A.H. Benade and J. French: Analysis of the Flute Head Joint’, JASA, xxxvii (1965), 679–91

J. Coltman: The Sounding Mechanism of the Flute and Organ Pipe’, JASA, xliv (1968), 983–92

W. Worman: Self-Sustained Nonlinear Oscillations of Medium Amplitude in Clarinet-Like Systems (diss., Case Western Reserve U., Cleveland, 1971)

K. Wogram: Ein Beitrag zur Ermittlung der Stimmung von Blechbläsinstrumenten (diss., Technical U. of Brunswick, 1972)

A.H. Benade: The Physics of Brasses’, Scientific American, ccxxix/1 (1973), 24–35

N.H. Fletcher: Nonlinear Interactions in Organ Flue Pipes’, JASA, lvi (1974), 645–52

E.V. Jansson and A.H. Benade: On Plane and Spherical Waves in Horns of Non-Uniform Flare’, Acustica, xxxi (1974), 80–98, 185–202

A.H. Benade: Fundamentals of Musical Acoustics (New York, 1976, 2/1990)

N.H. Fletcher: Mode Locking in Non-Linearly Excited Inharmonic Musical Oscillators’, JASA, lxiv (1978), 1566–9

J.W. Coltman: Acoustical Analysis of the Boehm Flute’, JASA, lxv (1979), 499–506

S.J. Elliot and J.M. Bowsher: Regeneration in Brass Wind Instruments’, Journal of Sound and Vibration, lxxxiii (1982), 181–217

M.E. McIntyre, R.T. Schumacher and J. Woodhouse: On the Oscillations of Musical Instruments’, JASA, lxxiv (1983), 1325–45

D. Smithers, K. Wogram and J. Bowsher: Playing the Baroque Trumpet’, Scientific American, ccliv/4 (1986), 108–15

M. Campbell and C. Greated: The Musician’s Guide to Acoustics (London, 1987/R)

N.H. Fletcher: Autonomous Vibration of Simple Pressure-Controlled Valves in Gas Flows’, JASA, xciii (1992), 2172–80

P.A. Drinker and J.M. Bowsher: The Application of Noninvasive Acoustic Measurements to the Design, Manufacture and Reproduction of Brass Wind Instruments’, HBSJ, v (1993), 107–31

A. Benade: Woodwinds: the Evolutionary Path since 1700’, GSJ, xlvii (1994), 63–110

D.M. Campbell: The Sackbut, the Cornett and the Serpent’, Acoustics Bulletin (1994), May/June, 10–14

J.P. Dalmont and others: Some Aspects of Tuning and Clean Intonation in Reed Instruments’, Applied Acoustics, xlvi (1995), 19–60

A. Hirschberg, J. Kergomard and G. Weinreich, eds.: Mechanics of Musical Instruments (Vienna, 1995) [incl. J. Kergomard: ‘Elementary Considerations on Reed-Instrument Oscillations’, 229–90; A. Hirschberg: ‘Aeroacoustics of Wind Instruments’, 291–369]

A. Myers: Characterisation and Taxonomy of Historical Brass Musical Instruments from an Acoustical Standpoint (diss., U. of Edinburgh, 1995)

D.M. Campbell: Cornett Acoustics: Some Experimental Studies’, GSJ, xlix (1996), 180–96

N. Grand, J. Gilbert and F. Laloe: Oscillation Threshold of Woodwind Instruments’, Acustica, lxxxii (1996), 137–51

A. Hirschberg and others: Shock Waves in Trombones’, JASA, xcix (1996), 1754–8

J. Gilbert and J.-F. Petiot: Brass Instruments: Some Theoretical and Experimental Results’, Proceedings of the International Symposium on Musical Acoustics: Edinburgh 1997 [Proceedings of the Institute of Acoustics, xix (1997)], 391–400

N.H. Fletcher and T.D. Rossing: Physics of Musical Instruments (New York, 2/1998)

C.J. Nederveen: Acoustical Aspects of Woodwind Instruments (DeKalb, IL, 2/1998)

R. Escalas, A. Barjau and V. Gibiat: Les instruments de ménétriers de la cathédrale de Salamanque’, Acoustique et instruments anciens: Paris 1998

Acoustics

V. Percussion instruments

Percussion instruments generally use one or more of the following basic types of vibrators: bars, membranes, plates, air columns or air chambers. Except for air columns, the frequencies of the modes of vibrations in these components are not related harmonically; therefore percussion instruments are characterized by inharmonic partials in their sound. Another characteristic is the constant change in amplitude of their sounds, rising rapidly at the onset and immediately beginning to die away without reaching a steady state, as do the sounds of most string and wind instruments.

1. Drums.

Drums are probably our oldest musical instruments (with the exception of the human voice). The sounds of some drums, such as kettledrums, tablā and boobams, convey a strong sense of pitch but others do not; in the latter category are the bass drum, snare drum, tenor drum, tom-toms, bongos, congas and many drums of African and East Asian origin. As vibrating systems, drums can be divided into three categories: those consisting of a single membrane coupled to an enclosed air cavity (such as kettledrums), those consisting of a single membrane open to the air on both sides (such as some tom-toms and congas), and those consisting of two membranes coupled by an enclosed air cavity (such as bass drums and snare drums).

The first 12 vibrational modes of an ideal membrane are shown in fig.49. Their frequencies depend upon the radius, the tension and the mass per unit area. The normal mode frequencies of real membranes in a drum may be quite different from those of an ideal membrane, however. The principal effects acting to change the mode frequencies are air mass loading, membrane stiffness and the pressure of air enclosed within the drum, if any. In general air loading lowers the modal frequencies, while the other two effects tend to raise them. In thin membranes air loading is usually the dominant effect.

Although the modes of vibration of an ideal membrane are not harmonically related, a carefully tuned kettledrum is known to sound a strong principal note with two or more harmonic overtones. In the 19th century the physicist Lord Rayleigh recognized the principal note as coming from the (1,1) mode (at frequency f1) and identified overtones about a perfect 5th (f/f1 = 1·50), a major 7th (f/f1 = 1·88) and an octave (f/f1 = 2·00) above the principal tone. He correctly identified these overtones as originating from the (2,1), (3,1) and (1,2) modes respectively, which in an ideal membrane should have frequencies of 1·34, 1·6 and 1·83 times the frequency of the (1,1) mode. Modern experiments have verified Rayleigh's results (see Rossing, 1982).

Air mass loading, which lowers the low-frequency modes more than those of higher frequency, is mainly responsible for establishing the harmonic relationship of kettledrum modes. Other effects, such as membrane stiffness and the size and shape of the kettle, merely fine-tune the frequencies, although they may have considerable effect on the rate of decay of the sound.

Harmonic mode tuning on Indian drums, such as the tablā and mrdangam, takes place in a different way than in the kettledrum. In these drums, with their small membranes, the effect of air mass loading is quite small, and so many layers of black paste are skilfully applied to load the drumheads by the required amount.

The physicist C.V. Raman studied the acoustical properties of tablā and correctly identified five harmonic partials as originating from nine normal modes of vibration, several of which have the same frequencies. The fundamental tone is from the (0,1) mode; the 2nd harmonic is from the (1,1) mode; the (2,1) and (0,2) modes provide the 3rd harmonic; the (3,1) and (1,2) modes supply the 4th harmonic; and the (4,1), (0,3) and (2,2) modes contribute to the 5th harmonic (see Raman, 1934).

In double-headed drums, such as the snare drum and the bass drum, there is considerable coupling between the two heads as they vibrate, especially at low frequency. This coupling, which takes place mechanically through the drum shell and acoustically through the enclosed air, leads to the formation of mode pairs. In fig.50, mode pairs in a freely suspended snare drum based on the (0,1) and (1,1) modes of each membrane are shown. When the drum is placed on a stand, further mechanical coupling on the modes of the support structure occur (see Rossing, 1992).

2. Mallet instruments.

Marimbas, xylophones, vibraphones and glockenspiels employ tuned bars of wood, metal or synthetic material. These bars can vibrate by bending (transverse modes), twisting (torsional modes) or elongating (longitudinal modes). Although longitudinal and torsional modes in bars or beams of uniform cross section have nearly harmonic frequencies, transverse modes are quite inharmonic. Since transverse modes are mainly used in mallet percussion instruments, harmonic tuning must be accomplished by shaping the bars appropriately.

The modes of transverse vibration in a bar or rod depend upon the end conditions. Three different end conditions are commonly considered: free, simply supported (hinged) and clamped. There are six different combinations of these end conditions, each leading to a different set of vibrational modes. Three of the more common combinations are shown in fig.51.

A deep arch is cut in the underside of marimba bars, particularly in the low register. This arch serves two purposes: it reduces the length of the bar required to reach the low pitches, and it allows tuning of the overtones (the 1st overtone is normally tuned two octaves above the fundamental). Marimba resonators are cylindrical pipes tuned to the fundamental mode of the corresponding bars. A pipe with one closed end and one open end resonates when its acoustical length is a fourth of a wavelength of the sound. The purpose of the tubular resonators is to emphasize the fundamental and also to increase the loudness, which is done at the expense of shortening the decay time of the sound.

Xylophone bars are also cut with an arch on the underside, but the arch is not as deep as that of the marimba, since the first overtone is tuned to a 12th above the fundamental (that is, three times the frequency of the fundamental). Since a pipe closed at one end can also resonate at three times its fundamental resonance frequency, a xylophone resonator reinforces the 12th as well as the fundamental. This overtone boost, plus the hard mallets used to play it, give the xylophone a much crisper, brighter sound than the marimba.

Vibraphones or vibraharps have deeply arched bars, so that the first overtone is two octaves above the fundamental, as in the marimba. The aluminium bars tend to have a much longer decay time than the wood or synthetic bars of the marimba or xylophone, so ‘vibes’ are equipped with pedal-operated dampers. The most distinctive feature of vibe sound is the vibrato introduced by motor-driven discs, known as ‘vanes’, at the top of the resonators, which alternately open and close the tubes. The vibrato produced by these rotating vanes consists of rather substantial fluctuations in amplitude (intensity vibrato) and a barely detectable change in frequency (pitch vibrato).

Chimes, or tubular bells, are usually fabricated from lengths of brass tubing 3–4 cm in diameter. Although they are tubular, the modes of vibration excited by striking with a mallet are essentially those of a beam or bar with two free ends. An interesting acoustical property of chimes is that there is no mode of vibration with a frequency at, or even near, the pitch of the strike tone one hears. Modes 4, 5 and 6 appear to determine the strike tone. These modes are nearly in the ratio 2:3:4, so the ear considers them as overtones of a missing fundamental an octave below mode 4.

3. Cymbals and gongs.

The vibrational modes of a circular plate are similar in shape to those of the circular membrane shown in fig.49, although the frequencies are quite different. Using holographic interferometry, more than 100 plate modes have been observed in an orchestral cymbal.

The level below about 700 Hz shows a rather rapid decrease during the first 0·2 second after striking; this is apparently due to conversion of energy into modes of higher frequency. Sound energy in the range of 3–5 kHz, which gives the cymbal its ‘shimmer’ or aftersound, peaks about 0·05–0·1 seconds after striking, and may become the most prominent feature in the sound spectrum of a cymbal (see Rossing and Shepherd, 1983).

Gongs play a very important role in East Asian as well as Western music. Gongs used in symphony orchestras are usually 0·5–1 metre in diameter, cast of bronze with a deep rim and a protruding dome. When they are struck near the centre with a massive soft mallet, the sound builds up relatively slowly and continues for a long time.

Tam-tams are similar to gongs in appearance, but they do not have the dome of the gong, their rim is not as deep and their metal is thinner. Tam-tams a sound much less definite pitch than do gongs; their sound may be characterized as somewhere between the sounds of a gong and a cymbal.

4. Steel drums.

Steel drums or pans are fabricated from 55-gallon oil drums. The first step in making a steel drum is to hammer the end of the oil barrel to the shape of a shallow basin; then a pattern of grooves is cut with a nail punch to define the individual note areas, which may range from 28 in a single tenor to only three in a bass pan. Modern steel bands span five octaves, from around G' to g'''. A skilled pan maker tunes at least one mode of vibration in each note to a harmonic of the fundamental (usually the octave) and, if possible, another mode to the 3rd or 4th harmonic. Additional harmonics in the sound spectrums of steel drum notes result from sympathetic vibration of nearby note areas and from non-sinusoidal motion of the note area itself.

5. Bells.

When struck by its clapper a bell vibrates in a complex way, which can be described in terms of vibrational modes resembling those of a circular plate, with the nodal diameters replaced by nodal meridians. The first five modes of a church bell or carillon bell are shown in fig.52. Dashed lines indicate the locations of the nodes. The numbers (m, n) indicate the numbers of complete nodal meridians extending over the top of the bell (half the number of nodes observed at the mouth) and the numbers of nodal rings respectively. Since there are two modes with m=2 and n=1, one with a nodal ring at the waist and one with a nodal ring near the mouth, we denote the second one as (2,1#); likewise for (3,1#).

Handbells are much thinner and lighter than church bells and carillon bells. They have no thickened soundbow, and they employ relatively soft clappers to give a delicate sound. In recent years handbell choirs have become popular in schools and churches; there are an estimated 40,000 handbell choirs in the USA alone.

Generally the first and second vibrational modes of a handbell are tuned to a 3:1 frequency ratio. Each of these modes radiates a double-frequency partial as well, however, so the sound spectrum of a handbell includes a fundamental, a second harmonic, a third harmonic and a sixth harmonic (see Rossing and Sathoff, 1980).

BIBLIOGRAPHY

C.V. Raman: The Indian Musical Drum’, Proceedings of the Indian Academy of Sciences, section A, i (1934), 179–88

T.D. Rossing and H.J. Sathoff: Modes of Vibration and Sound Radiation from Tuned Handbells’, JASA, lxviii (1980), 2225–6

T.D. Rossing: The Physics of Kettledrums’, Scientific American, ccxlvii (1982), 172–8

T.D. Rossing and R.B. Shepherd: Acoustics of Cymbals’, Proceedings of the 11th International Congress on Acoustics: Paris 1983 (Paris, 1983), 329–33

T.D. Rossing: Percussion Instruments’, The Science of Sound (Reading, MA, 2/1990), 257–86

T.D. Rossing: Acoustics of Drums’, Physics Today, xlv/3 (1992), 40–47

C.-R. Schad and G. Frik: Klangfiguren einer Glocke’, Acustica, lxxviii (1993), 46–54

T.D. Rossing: Modes of Vibration and Sound Production in Percussion Instruments’, Modèles physiques, création musicale et ordinateur, no.1 (1994), 75–112

C.-R. Schad and G. Frik: Über den Schlagklang von Glocken’, Acustica, lxxx (1994), 232–7

T.D. Rossing, D.S. Hampton and U.J. Hansen: Music from Oil Drums: the Acoustics of the Steel Pan’, Physics Today, xlix/3 (1996), 24–9

N.H. Fletcher and T.D. Rossing: The Physics of Musical Instruments (New York, 2/1998)

Acoustics

VI. The voice

1. Introduction.

2. Air pressure supply.

3. Oscillator.

4. Resonator.

5. The singing voice.

BIBLIOGRAPHY

Acoustics, §VI: The voice

1. Introduction.

The voice organ can be regarded as a wind instrument consisting of an air pressure supply driving an oscillator, the output signal of which is fed into a resonator from which the sound is radiated to the air outside the instrument (see fig.53). The air pressure supply is the respiratory system (i.e. the lungs and the respiratory muscles). In the case of voiced sounds, the oscillator is the set of vocal folds (earlier also called cords); they convert the airstream from the lungs into a complex sound built up by harmonic partials. For voiceless sounds the oscillator is a narrow slit through which the airstream is forced; the laminar airstream is then converted into a turbulent airstream which generates noise. The sound generated by the oscillator is called the ‘voice source’. It propagates through the resonator constituted by the cavities separating the oscillator from the free air outside the instrument. In resonators the ability to transmit sound varies considerably with the frequency of the transmitted sound. At certain frequencies (the resonance frequencies), this ability reaches maximum. Thus in the case of the voice, those voice source partials that lie closest to a resonance are radiated with higher amplitudes than other partials. In this way the spectral form of the radiated sound mirrors the properties of the resonator. The resonances and the resonance frequencies of the vocal tract are called ‘formants’ and ‘formant frequencies’ respectively.

Acoustics, §VI: The voice

2. Air pressure supply.

In singing, the air pressure is much more carefully regulated than in normal speech, by a skilled control of the inspiratory and expiratory muscles. The air pressure provided by the respiratory system in singing varies with pitch and vocal effort, generally between 5 and 40 cm of water. The resulting air flow depends also on the glottal conditions. Air flow rates of 0.1–0.3 litres per second have been observed in singers. These air pressure and air flow ranges do not appear to deviate appreciably from values observed in untrained speakers.

Acoustics, §VI: The voice

3. Oscillator.

(i) Voiced sounds.

The vocal folds originate at the angle of the thyroid cartilage, course horizontally backwards and are inserted into each of the arytenoid cartilages. By adduction (i.e. drawing these cartilages towards each other), the slit between the folds, called the ‘glottis’, is narrowed, and an airstream can set the folds into vibration. A vibration cycle can be described as follows. When the glottis is slightly open an airstream from the lungs can pass through it. This airstream throws the vocal folds apart and at the same time generates a negative pressure along the edges of the folds. The sucking effect of this negative pressure along with the elasticity and other mechanical properties of the folds closes the glottis again. Then the air pressure difference across the glottis throws the folds apart, thus starting the next vibratory cycle. The frequency of a vibration is determined by the transglottal air pressure difference and the mechanical properties of the folds. A high pressure difference or tense and thin vocal folds, or both, give a high vibration frequency; converse states give a low frequency. The mechanical properties of the folds are regulated by a series of muscles that vary the length and stiffness of the folds by manipulating the positions of the laryngeal cartilages. Thus these muscles are used to regulate the vibration frequency. As the vibration frequency determines the pitch perceived, these muscles are often referred to as the ‘pitch regulating muscles’. An increase of the subglottal pressure raises the amplitude of the sound produced and also increases the vibration frequency, raising the pitch. Thus, in order to perform a crescendo at a constant pitch a singer has to raise the subglottal pressure and simultaneously compensate for the pitch increase by adjusting the pitch-regulating muscles.

By vibrating, the vocal folds repeatedly interrupt the airstream from the respiratory system. Thus they act as a valve oscillating between open and closed positions: the result is a chopped airstream corresponding to a complex sound, the fundamental frequency of which is equal to the vibration frequency of the folds. The glottis is schematically shown as a function of time in fig.54. The horizontal portion of the curve corresponds to the closed phase of the glottal vibration cycle, and the triangular portion is the open phase. As the air flow generally increases more slowly than it decreases, the triangular part of the curve is asymmetrical in the figure. In trained voices the glottal closure is often observed to be more efficient than in untrained voices. Also, the vibration pattern appears to vary considerably less with pitch and vocal intensity in trained voices than in untrained ones (see Sundberg, Andersson and Hultqvist, 1999).

The sound generated by the chopped transglottal airstream is built up by a great number of harmonic partials whose amplitudes generally decrease monotonically with frequency, roughly by 12 dB per octave at neutral loudness. It is noteworthy that this holds as an approximation for all voiced sounds. Partials of measurable amplitude in the source spectrum are generally found up to 4–6 kHz. This means that a tone with a fundamental frequency of 100 Hz may contain between 40 and 60 partials of appreciable amplitude. However, the amplitudes of the source spectrum partials vary with pitch and vocal intensity (see Sundberg, Andersson and Hultqvist, 1999).

(ii) Voiceless sounds.

The sound source in this case is noise generated by a turbulent airstream. The narrow slit required for the noise generation can be formed at various places along the vocal tract, the lowest position being at the glottis itself, which can be kept wide enough to prevent the folds from vibrating and narrow enough to make the airstream turbulent. This is the oscillator used in the ‘h’ sound. Another place used in some languages is the velar region, which can be constricted by the tongue hump. The resulting sound is used as the voice source in the German ‘ach’ sound. In most remaining unvoiced sounds the tongue tip constricts the vocal tract in the palatal, alveolar or dental regions as in the initial phonemes of ‘sheep’, ‘cheap’ and ‘sip’. In the ‘f’ sound the upper incisors and the lower lip provide the slit.

Acoustics, §VI: The voice

4. Resonator.

The frequencies of the formants are determined by the shape of the resonator. In the case of non-nasalized sounds the resonator consists of the pharynx and mouth cavities. In vowels these cavities constitute a tube resonator which may be regarded as closed at the glottal end and open at the lip end. The average vocal tract length for males is generally considered to be 17·5 cm. A tube of that length and having a uniform cross-sectional area would display a series of resonances falling close to the odd multiples of 500 Hz. However, as the cross-sectional area of the vocal tract is not constant, the formants deviate from these frequencies. The vocal tract shape is determined by the positions of the articulators (i.e. the lips, the jaw, the tongue, the velum and the larynx). The positions of these articulators are continuously varied in singing and in speech, so that the formants are tuned to various target frequencies. Thus each vowel sound corresponds to a certain pattern of articulator positions.

The dependence of the formant frequencies on the articulatory configuration is rather complex. Only a few factors have the same type of effect on all formant frequencies; for instance, all formants drop in frequency more or less when the vocal tract length is increased, by protrusion of the lips or lowering of the larynx or both, and when the lip opening area is decreased. Moreover, certain formants are more dependent on the position of a specific articulator than are others. The first formant frequency is particularly sensitive to the jaw opening: the wider the jaw opening, the higher the first formant frequency. The second and third formant frequencies are especially sensitive to the position of the tongue body and tongue tip respectively. The highest frequencies of the second formant (2–3 kHz) are obtained when the tongue body constricts the vocal tract in the palatal region, as in the vowel of ‘keep’. The lowest values of the third formant (around 1500 Hz) are associated with a tongue tip lifted in a retroflex direction. Fig.55 provides examples of articulatory configurations associated with some vowels.

These guidelines apply to oral sounds; in nasalized sounds the dependence of the formants on the articulator positioning becomes considerably more complex. The nasal tract introduces minima in the sound transfer of the vocal tract resonator. The acoustical effect of nasalization varies between vowels, but a general feature is that the lowest partials are emphasized.

For both oral and nasalized sounds the two lowest formant frequencies are generally decisive in the vowel quality perceived. Frequencies typical of male speakers are given in fig.56. Females have shorter vocal tracts and therefore higher formant frequencies. On average for vowels, the three lowest formant frequencies of female voices are 12, 17 and 18% higher, respectively, than those of male voices. Children, having still shorter vocal tracts, possess formant frequencies that are 35–40% higher than those of males (see Fant, 1973).

The amplitudes of the partials emitted from the lip opening depend on the sound transfer ability of the vocal tract. This ability depends not only on the partials’ frequency distance from the closest formant, but also on the frequency distance between formants. Thus a halving of the frequency distance between two formants increases the sound transfer ability by 6 dB at the formant frequencies and by 12 dB midway between the formant frequencies, other things being equal. Another factor important to the amplitudes of the radiated partials is the sound radiation properties of the lip opening, which boosts the entire spectrum envelope by 6 dB per octave. For this reason, the amplitudes of all spectrum partials tend to increase with the pitch even when there is no change in vocal effort.

Acoustics, §VI: The voice

5. The singing voice.

Basically the voice organ seems to be used in the same way in singing as in speech. In both cases the sound produced is entirely determined by the properties of the sound source and the vocal tract resonances. In other words, there seems to be no reason to assume that in non-nasalized vowels, resonance outside the vocal tract, such as in the maxillary sinuses or the lungs, contributes to the acoustic output to any appreciable extent. In singing, however, the possibilities inherent in the normal voice organ are used in quite special ways.

(i) Breathing.

The demands on the breathing apparatus differ significantly between speech and singing. There are two main reasons for this. Firstly, phrases in neutral speech are generally short, typically using no more than 15–20% of lung capacity. In singing, on the other hand, phrases tend to be considerably longer, using twice as much and occasionally nearly 100% of lung capacity. As the recoil forces of the respiratory apparatus vary with lung volume, a singer needs to supply different degrees of respiratory muscle force depending on lung volume. Secondly, the mean over-pressure of air in the lungs, which controls the loudness of phonation, is basically constant in neutral speech, although it is raised for emphasized syllables. In singing, as higher pitches require higher pressures, this air pressure needs to be varied with pitch. As lung pressure affects pitch, failures to reach target pressures result in singing off the pitch. Singers generally use the diaphragm muscle for inhalation, which is reflected in an expansion of the abdominal wall. However, the strategy used for achieving the necessary control of the respiratory apparatus differs between singers. Some contract the abdominal wall, thus raising the level of the diaphragm in the trunk before phonation, while others keep the abdominal wall expanded and thus the diaphragm low in the trunk at the initiation of a phrase. Some even contract both abdominal wall muscles and diaphragm during singing. It is frequently assumed that these different strategies affect the function of the vocal folds and hence the voice timbre (see Thomasson and Sundberg, 1997).

(ii) Vibrato.

One of the typical peculiarities of opera and concert singing is vibrato. In Western operatic singing its acoustical correlate is an undulation of the frequencies and amplitudes of the partials (fig.57). The undulation is almost sinusoidal and has a rate of about 5–7 Hz in good voices. The rate is generally constant within a singer, although it tends to slow down with advanced age. The magnitude of the frequency excursions is of the order of ±70 cents, but greater variation occurs for expressive purposes and at advanced age. Vibrato tends to increase in regularity as voice training proceeds successfully. The frequency and amplitude undulations are synchronous but not necessarily in phase, depending on the frequency distance between the strongest spectrum partial and the nearest formant. If the strongest partial is slightly below the strongest formant, an increase in frequency will cause the amplitude to increase, so that frequency and overall amplitude will vary in phase. The opposite occurs if the partial is slightly higher than the frequency of the strongest formant.

The physiological origin of vibrato is not well understood. EMG (electromyographic) measurements in laryngeal muscles have revealed rhythmical contractions, synchronous with the vibrato undulations, of the pitch-raising cricothyroid muscle. This suggests that the laryngeal muscles produce the vibrato. The neural origin of these rhythmical contractions is unknown. Possibly as a consequence of this, the transglottal air flow varies with the frequency variations, and the resulting vibrato notes tend to consume more air than vibrato-free notes (see Large and Iwata, 1971). In popular singing subglottal pressure seems to be the vibrato-generating mechanism. In some singers the variations in the muscle activity affect the larynx height and even other parts of the voice organ. Pitch seems to be perceived with comparable accuracy regardless of the presence of vibrato for a single note. The perceived pitch agrees within a few cents with the pitch of a vibrato-free note with a fundamental frequency equal to the average frequency of the vibrato note.

(iii) Register.

The term ‘register’ is used for groups of adjacent notes that sound similarly and are felt to be produced in a similar way. However, there are a great number of conflicting terms and definitions in common use. In untrained voices in particular a change from one register to another may be accompanied not only by a marked shift in tone quality but also by a ‘register break’, a sudden jump in pitch. In both male and female adults register shifts typically occur in the range of approximately 300–450 Hz. The register above this shift is mostly referred to as ‘falsetto’ in male voices and ‘middle register’ in female voices, while the register below the shift is known as ‘chest register’ or ‘modal register’. A further shift occurs below 100 Hz; this register is called ‘vocal fry’. Registers are associated with certain vocal fold configurations. Thus, in chest/modal register the folds are thick while in falsetto they are thinner. Acoustically, the lowest spectrum partial, other things being equal, has been found to be more dominating in falsetto than in chest/modal register. Also, the ‘heavy’ register in male and female voices has been reported typically to contain stronger high partials than the ‘light’ register. The physiological origin of register is confined to the voice source. According to some experts, a difference between the falsetto and the normal voice in males is that the vocal folds never reach full contact with each other during the vibration cycle in falsetto. Transitions between registers have been found to be accompanied by changes in the EMG signals from laryngeal muscles, and by changes in transglottal air flow. There is reasonable agreement on the importance of the laryngeal muscles to registers, though it has been suggested that a purely acoustical interaction between the glottal oscillator and the resonator is a contributory factor.

(iv) Singer’s formant.

The ‘singer’s formant’ is a peak in the spectrum envelope typically appearing near 3 kHz in all voiced sounds as sung by Western operatic singers except sopranos. It corresponds acoustically to a high spectrum envelope peak which is present in all vowels and generally centred at a frequency of 2500–3500 Hz. In vocal pedagogy it is often referred to as ‘singing in the mask’, ‘focussing’ etc. Mainly a resonatory phenomenon, the singer’s formant is achieved by clustering formants 3, 4 and 5 into a rather narrow frequency band. This seems to explain why sopranos lack a singer’s formant: they mostly sing at high fundamental frequencies, i.e. the frequency distance between adjacent partials is typically quite wide, equalling the frequency of the fundamental. This means that a partial would fall into the frequency band of the formant cluster producing the singer’s formant only for certain pitches, causing a salient timbre difference between different pitches. If the pharynx is wide enough, the larynx tube can act as a separate resonator, the resonance frequency of which is rather independent of the rest of the vocal tract; it may be tuned to a frequency lying between those of the third and fourth formants in normal speech. The condition of a widening of the pharynx seems to be met when the larynx is lowered, a gesture occurring typically in male professional Western operatic singing. At high pitches the demands on a wide pharynx are increased, and extreme lowering of the larynx is frequently observed when males sing high-pitched notes. In such cases the term ‘covering’ is sometimes used. The widening of the pharynx and the lowering of the larynx affect the frequencies not only of the higher formants, but also those of the lower formants. As an acoustical consequence of such articulatory modifications, the frequency of the second formant drops in front vowels. This alters the vowel quality to some extent, so that, for instance, the vowel in ‘sheep’ is ‘coloured’ towards the German ‘ü’ sound.

The perceptual function of the ‘singer’s formant’ seems to be to make the voice easier to hear above a loud orchestral accompaniment. It has also been suggested that it helps the singer to be more audible in large auditoriums.

(v) High-pitched singing.

Vowel quality is associated with specific combinations of the two lowest formant frequencies, and these frequencies are maintained regardless of the fundamental frequency. Normally the fundamental frequency is lower than the frequency of the first formant, which varies between about 250 Hz (close to c') and 1000 Hz, depending on the vowel. When the fundamental frequency is higher than the normal frequency value of the first formant, singers tend to increase the latter so that it remains higher in frequency than the fundamental. This partial is the strongest in the source spectrum, and, if it coincides with the first formant frequency, its amplitude will be maximized without raising extreme demands on vocal effort. The degrees of tongue constriction and, in particular, of jaw opening represent important articulatory tools for achieving the necessary increases of the first formant frequency. Though this increase affects vowel quality, this disadvantage is limited since in high-pitched singing the vowel quality cannot be maintained even with correct formant frequencies owing to the great frequency distance between the partials as compared with the number of formants (see Sundberg and Skoog, 1997).

(vi) Voice categories.

Male and female voices tend to differ significantly with regard to their formant frequencies as well as pitch range, and this factor seems also to be significant in differentiating tenors, baritones and basses. Thus when singing the same pitch, voices of these types can be distinguished by their vowel formant frequencies. In most vowels a bass is likely to show the lowest formant frequencies and a tenor the highest; and all formants, not only the two lowest, are relevant. The formant frequency differences between male and female voices resemble closely those observed between bass and tenor voices, which suggests that the dimensions of the resonating system are of major importance. In addition, the centre frequency of the singer’s formant seems to be typically higher in voices with a high pitch range than in voices with a lower pitch range. Thus, centre frequencies at about 2400 and 3000 Hz tend to give a bass-baritone-like and a tenor-like voice quality respectively.

(vii) Overtone singing.

In some Inner Asian cultures the voice is used in a rather special manner, in that the tones produced are perceived as possessing two different pitches. This can be explained as follows. If the frequency of a formant coincides with that of a partial, this partial is likely to be much stronger than the adjacent spectrum partials, other things being equal. If two formants are tuned to the near vicinity of a partial, the effect can be greatly enhanced, so that the partial is perceived as a second pitch of the tone along with the fundamental. This strategy of tuning two formants to a partial is applied in overtone singing. The second and third formants (sometimes the first and second) are tuned to closely spaced frequencies, thus enhancing a specific partial. The fundamental frequency is either low, <100 Hz, produced with a growl or vocal fry quality, or is higher, often with a pressed quality. By these means the amplitude of the fundamental is reduced, and hence the dominance of the amplified overtone is enhanced. In tuning formants the lip opening, the position and elevation of the tongue tip and, in some cases, nasalization seem to play important roles (see Bloothooft and others, 1992; see Overtone singing).

Acoustics, §VI: The voice

BIBLIOGRAPHY

G.E. Peterson and H.L. Barney: Control Methods used in the Study of Vowels’, JASA, xxiv (1952), 175–84

G. Fant: Acoustic Theory of Speech Production (The Hague, 1960)

J. Large and S. Iwata: Aerodynamic Study of Vibrato and Voluntary “Straight Tone” Pairs in Singing’, Folia phoniatrica, xxiii (1971), 50–65

G. Fant: Speech Sounds and Features (Cambridge, MA, 1973)

R.T. Sataloff, ed.: The Professional Voice: the Science and Art of Clinical Voice Care (San Diego, 1977)

W. Seidner and J. Wendler: Die Sängerstimme (Wilhelmshaven, 1978, 3/1997)

J. Large, ed.: Contributions of Voice Research to Singing (Houston, 1980)

J. Sundberg: The Science of the Singing Voice (DeKalb, IL, 1987)

G. Bloothooft and others: Acoustics and Perception of Overtone Singing’, JASA, xcii (1992), 1827–36

I. Titze: Principles of Voice Production (Englewood Cliffs, NJ, 1994)

J. Sundberg and J. Skoog: Dependence of Jaw Opening on Pitch and Vowel in Singers’, Journal of Voice, xi (1997), 301–6

M. Thomasson and J. Sundberg: Lung Volume Levels in Professional Classical Singing’, Logopedics, Phoniatrics, Vocology, xxii (1997), 61–70

K.N. Stevens: Acoustic Phonetics (Cambridge, MA, 1998)

J. Sundberg, M. Andersson and C. Hultqvist: Effects of Subglottal Pressure Variation on Professional Baritone Singers’ Voice Sources’, JASA, cv (1999), 1965–71