+ All Categories
Home > Documents > the legacy of Helmholtz continues

the legacy of Helmholtz continues

Date post: 08-Feb-2022
Category:
Upload: others
View: 4 times
Download: 0 times
Share this document with a friend
56
Tuning and Timbre: A Perceptual Synthesis Bill Sethares IDEA: Exploit psychoacoustic studies on the perception of consonance and dissonance. The talk begins by showing how to build a device that can measure the “sensory” consonance and/or dissonance of a sound in its musical context. Such a “dissonance meter” has implications in music theory, in synthesizer design, in the con- struction of musical scales and tunings, and in the design of musical instruments. ...the legacy of Helmholtz continues... 1
Transcript

Tuning and Timbre: A Perceptual SynthesisBill Sethares

IDEA: Exploit psychoacoustic studies on theperception of consonance and dissonance. Thetalk begins by showing how to build a device thatcan measure the “sensory” consonance and/ordissonance of a sound in its musical context.Such a “dissonance meter” has implications inmusic theory, in synthesizer design, in the con-struction of musical scales and tunings, and inthe design of musical instruments.

...the legacy of Helmholtz continues...1

Some Observations. . .

Why do we tune our instruments the way we do?

Some tunings are easier to play in than others.

Some timbres work well in certain scales, but not in others.

What makes a sound easy in 19-tet but hard in 10-tet?

“The timbre of an instrument strongly affects what tuning and scale soundbest on that instrument.” – W. Carlos

2

What are Tuning and Timbre?

frequency in Hz

0 1000 2000 3000 4000sample:

time: 0 0.23 0.45 0.68

0 10000 20000 30000

196

384

589

787

ma

gn

itu

de

am

plit

ud

e

Tuning = pitch of the fundamental (in this case 196 Hz)

Timbre involves (a) pattern of overtones (Helmholtz )

(b) temporal features

3

Some intervals “harmonious” and others “discordant.” Why?

X

X

X X

X X X X

X

X

X X

1.06:1 2:1

1.89:1 3:2

1.414:1 4:3

4

Theory #1: (Pythagoras ) Humans naturally like the sound of intervals de-fined by small integer ratios.

small ratios imply short period of repetition

short = simple = sweet

Theory #2: (Helmholtz ) Partials of a sound that are close in frequencycause beats that are perceived as “roughness” or dissonance.

absence of beats is called “consonance.”

5

A Short History of “Consonance” and “Dissonance” (after James Tenney )

• CDC#1: melodic consonance (e.g., of successive tones)

• CDC#2: polyphonic consonance (e.g., intervals between notes, “soundsgood”)

• CDC#3: contrapuntal consonance (defined by role in counterpoint)

• CDC#4: functional consonance (relationship with “tonic” or “root”)

• CDC#5: psychoacoustic consonance (intrinsic to a sound)

6

What are Beats? (beats1-2-3.avi)

0 0.2 0.4 0.6 0.8 1.0 1.2

-2

2

time (in seconds)

-1

1

-1

1

+

=

am

plit

ude

am

plit

ude

am

plit

ude

envelope

constructive

interference

destructive destructive

7

What should we expect to hear?

What happens when frequency ofbeats enters audio range?

100 Hz + 105 Hz = 5 Hz beats100 Hz + 150 Hz = 50 Hz beats

Fre

qu

en

cy o

f Be

ats

Frequency of Difference

Experiment: (Plomp & Levelt ) Fixw1 and let w2 scan through all fre-quencies. Ask listeners what theyperceive.

+W1

W2

8

Two Sine Waves: One Fixed, One Sweeping (sinediss.avi)

slow, pleasant beating rough, rapid beatingsound separates intotwo distinct tones

am

plit

ud

e

time

"beats"

"roughness"

"two tones"

0

1

Interval: 1/1 4/3 3/2 2/1

12-tet scale steps: fourth fifth octave

sensory

dis

sonance

Hertz: 400 424 449 476 504 534 566 596 635 673 713 755 800 848

9

Parameterizing the Sensory Dissonance Curve

as the difference between a sum of exponentials

d(x) = e−ax − e−bx

100 Hz

200 Hz

400 Hz

600 Hz

1000 Hz

0

1

frequency of lower tone

frequency interval

12-tet scale steps: fourth fifth octave

se

nso

ry

dis

so

na

nce

10

Some Implications. . .

Spectrum of a sound determines which intervals are most consonant

By using different kinds of spectra, can make almost any set of intervalssound consonant.

How to make a large variety of (organic, natural sounding) inharmonic tim-bres that work together? Using spectral mappings, can often maintainmuch of the original integrity of the sound. Examples:

consonant tritone — consonant pseudo-octaves

n-tone equal-tempered sounds

11

A Tritone Sound

Can use this parameterization tomake predictions. Consider asound composed of two partials:

f 2 f

am

plit

ud

e

frequency

Summing all the dissonances be-tween all pairs of partials gives the“dissonance curve” for this tone:

0 1 2 3 4 5 6 7 8 9 10 11 12

0

1

semitones

sensory

dis

sonance

12

Tritone Sound II

Using the tritone sound, the dissonance curve predicts that the tritoneinterval

√2 will be more consonant than the musical fifth or fourth.

fifths

fourths

tritones

& # ˙̇̇ ˙̇̇ ˙̇̇

(trichime.avi)

13

Harmonic Dissonance Curve

Many musical sounds have har-monic partials, i.e., are (approxi-mately) periodic with partials at

f,2f,3f,4f,5f,6f,7f, . . .

Dissonance curves for harmonictones have many minima that oc-cur at simple integer ratios, whichare close to the tuning of the key-board. 1:1 2:13:24:3

6:5 7:4

5:4

7:5

5:3

Sensory

Dis

sonance

14

Helmholtz’s Dissonance Curve

c' e'b e' f' g' a'b a' b'b c''

1:1

2:2

3:3

4:4

2:3 2:3

1:2

2:4

4:83:6

3:4 3:44:5

4:5

6:8

5:6

5:6 4:6

3:53:5

4:75:95:8

6:9

5:76:7

8:97:8

Two pitches are sounded simultaneously. The regions of roughness dueto pairs of interacting partials are plotted over one another, leaving only afew narrow valleys of relative consonance. The figure is redrawn from On

the Sensation of Tone.

15

Just Intonation

is a family of musical scales thatcontain many of the simple in-teger ratios (such as 3:2, 4:3,5:3, etc.). A body of music ex-ists in JI by composers such asHarry Partch (using his 43-toneper octave scale), Lou Harrison,D. Doty, Larry Polansky, and oth-ers. (* ⇒ just thirds, and <> ⇒just fifths.) (Paradigms Lost)

ratio cents

1/1 0

16/15 112

9/8 204

6/5 316

5/4 386

4/3 498

45/32 590

3/2 702

8/5 814

5/3 884

16/9 996

15/8 1088

2/1 1200

****

*

**

<>

*

<>

<>

<>

<>

<>

<>

<>

<>

<>

C

D

E

F

G

A

B

C

A Just Intonation Scale in C and

extension to a 12-note scale

16

Relating Spectra and Scales

• Dissonance curves are drawn using the spectrum of a sound by sum-ming the dissonances between all pairs of partials over a range offrequencies.

• Given a sound, the related scale is defined by the minima of the dis-sonance curve

• “Just Intonation” is the scale related to harmonic partials.

• What about other, inharmonic tones? Can dissonance curves be usedto make predictions about what will sound “good”?

17

Stretched and Compressed Tones

have partials at fj = fA log2(j)

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

harmonicspectrum at f

harmonicspectrum at 2f

stretchedspectrum at f

stretchedspectrum at 2.1f

1 2.1 3.2 4.4 5.6 6.8 8.0 9.3 10.5 11.8 13. 14.3 15.6 16.9 18.1

A = 2 gives harmonic tones, A > 2 are called “stretched,” A < 2 arecalled “compressed”

18

Stretched and Compressed Tones II

1

12-tet scale steps

2.1 stretched scale steps

2.1

Sen

sory

Dis

sona

nce

Predictions:(simpletun1-2-3-4)

• harmonic tones in 12 tone = OK• harmonic sounds in stretched scale = not OK• stretched sounds in stretched scale = OK• stretched sounds in 12 tone = not OK

19

Musical Implications

Olson: It is an established fact that the most pleasing combination of twotones is one in which the frequency ratio is expressible by two integers,neither of which is large.

Piston: Two sounds are said to be an octave apart when their frequencyratio is 2:1... The octave is the most consonant of intervals.

Such statements are found throughout the literature. Examples such asthe stretched octaves show that either

(A) octave means a frequency ratio of 2:1, or

(B) octave means the most consonant interval (other than the unison).

(A) and (B) are the same only for sounds with harmonic spectra20

Spectrum of the Tingshaw Bell

0 5000 10000 15000

2368

4784

5553

7921 10103

frequency in Hz

ma

gn

itu

de

is simplified to aline spectrum

2368 4784 5553 7921 101030

frequency in Hz

magnitude

21

0

1

frequency ratio

octave 1 octave 2

sensory

dis

sonance

Dissonance curve for the ting-shaw bell has minima shown bytick marks. The mimimum at2.02 serves as a pseudo-octave,because minima in the secondpseudo-octave align with those inthe first. (tingshaw)

ratio cents

1.0 0 1.16 257 1.29 441

1.43 619 1.56 770 1.66 877 1.81 1027

2.02 1200

Tingshaw Scale

22

Spectral Mappings

can be used to create new “instruments” that are consonant with adesired timbre yet retain much of the character of familiar instruments

Transform Spectral Mapping

InverseTransform

input signal

time

f f ... f 1 2 m

frequency frequency

d d ..d 1 2 m

output signal

time

source spectrum destination spectrumm

agnitude

magnitude

am

plit

ude

am

plit

ude

23

Spectral Mappings changefrequencies, preservingmagnitude and phase re-lationships, which helps topreserve the “character” ofthe original.

f 2.1f 2.9f 3.8f 5.4f 5.8f 7f 8.4f 8.9f ...

f 2f 3f 4f 5f 6f 7f 8f 9f ...source

destination

0 500 1000 1500 2000

frequency

spectrum of original sound

spectrum of transformed soundm

agnitude

magnitude

spectr

al

mappin

g

24

Sounds for 10-tone equal temperament

A timbre designed to be played in 10-tet has partials atf, fα10, fα16, fα20, fα26, fα29, fα30, fα33, fα36, fα39, fα40 where

α = 10√2. The dissonance curve for this timbre is: 12-tet scale steps: octave

10-tet scale steps: octave

E F G H I J A B C D E

sensory

dis

sonance

Minima of the curve coincide with steps of the 10-tet scale and not withsteps of 12-tet. (Ten Fingers, MysteryX)

25

10-tet Circle of Thirds

D E G I J B D F

F H A C E

D E G I J B D F

F H A C E

D E G I J B D F

F H A C E

D E G I J B D F

F H A C E

D E G I J B D F

F H A C E

E->B->I->F->C->J->G

->D->A->H->E

There are many such music theories. You can invent new chords, newscales, new progressions – Bach didn’t use them all up. (Circle of Thirds)

26

10-tet Chords

1

2

12

se

nso

ry d

isso

na

nce

interval between the first and second notes

inte

rval b

etw

een the

first

and third n

ote

sA B C

1interval between first and second notes

1, r3, r6

1, r4, r7

1, r3, r7

1, r3, r9

1, r3, 2

1, r4, 2

B

1, r6, 2

1, r6, r91, r7, 2

C

A

inte

rva

l be

twe

en

the

first a

nd

third

no

tes

2

1

2

27

Sounds for 11-tet

• trumpet→ 11-tet trumpet

• bass→ 11-tet bass

• guitar→ 11-tet guitar

• pan flute→ 11-tet pan flute

• moog synth→ 11-tet moog synth

• phase synth→ 11-tet phase synth

Musical interludes played:

• In 11-tet scale with 11-tet timbres

• In 11-tet scale with regular (unmapped, harmonic) timbres

(tim11tet.avi, tim11vs12.avi, Turquoise Dabo Girl)28

Create sounds consonantwith the spectrum of a tom-tom: Harmonic spectra atg, 2g, 3g, 4g, 5g, . . . aremapped into the tom-tomspectrum and played usingthe related scale.

603

0 500 1000 1500 2000

410

934786

frequency

g 2g 3g 4g 5gharmonic spectrum

spectrum of the tom tomma

gn

itu

de

245

1 1.3 1.55 1.98 2.46 3.83 1.19 1.51 1.63 2.35 3.21

frequency interval

sensory

dis

sonance

29

Sounds consonant with the spectrum of a tom-tom

Several instruments and their transformation into the spectrum of a tomtom. (tomspec.avi)• guitar→ tom tom guitar• bass→ tom tom bass• trumpet→ tom tom trumpet• flute→ tom tom flute

A musical passage illustrates the transformed instruments played in therelated scale. (Glass Lake)

Mild transformations (like the 2.1 stretched and 10-tet) retain much of thecharacter of the instrument from which they were derived.

Severe transformations (like the tom tom example) lose their tonal integrity.This does not mean that such sounds are musically useless!

30

The two sides. . .

Given a timbre, what is the related scale?

Just draw dissonance curve and find minima.

Given a desired scale, what are related timbres?

Solvable via iterative optimization methods

31

More Formally

Suppose the timbre T has n partials at f1, f2, . . . , fn. Let αT be the timbrewith partials at αf1, αf2, . . . , αfn. The dissonance curve generated by Tis defined to be a plot of the sensory dissonance between T and αT overall intervals α of interest.

(1) Number of minima: dissonance curves have up to 2n(n− 1) minima.

(2) Global minimum: the unison (α = 1) is the global minimum.

(3) Asymptotic value: as the intervals grow large (as α → ∞), the disso-nance approaches a value that is no more than the intrinsic dissonance ofthe timbre itself.

(4) Principle of coinciding partials: up to half of the local minima occur atintervals α for which α = fi

fjwhere fi and fj are arbitrary partials of T .

32

Timbre Selection as an Optimization Problem

Often one wishes to specify a desired scale. How can related timbres befound? Suppose there are m scale tones.

Try # 1: Choose a set of n partials and n amplitudes to minimize sum ofdissonances over all m intervals.

Problems: (a) vanishing amplitudes (b) runaway frequencies

Fix by adding constraints: (a) chose set of amplitudes a priori (b) insure allpartials lie in some predetermined range.

Revised problem: With the amplitudes fixed, select a set of n frequencieslying in range to minimize:

w1 (sum of dissonances over all intervals) + w1 (number of local minima)33

Audio Signal Processing

These examples demonstrate that dissonance curves really do capturesomething crucial about our perceptions of desirable and undesirable sounds.

• predictions of good/bad sounding pieces are consistent with calcula-tions

• minima of dissonance curve are good scale/chord indicators

Thus:

We can use the notion of consonance as a basis for audio signal process-ing devices.

Build into our machines knowledge of how our perceptions work.

What would such machines look like?34

Idea: A keyboard “knows” what sounds you have played. Suppose youchoose

The keyboard could automatically adjust the tuning so as to minimize thedissonance by “sliding down” the dissonance curve.

35

Adaptive Tuning

A “Dynamic” or “Adaptive” tuning strategy can be implemented in a gradientstyle algorithm

newfrequencyvalues

=

old

frequencyvalues

− {stepsize}{gradient}that finds the nearest local minimum of dissonance curve to each com-manded note.

Musical implications:

• a way to automatically play in JI when using harmonic timbres

36

• automatically play in related scale using nonharmonic timbres

• no knowledge of key or tonal center required (method operates onsound rather than on musical theory)

• many musicians (singers, violinists, horns) adjust their intonation in re-sponse to musical situation - provides a way for keyboardists to imitatethis

• can act as a kind of intelligent portamento or “elastic tuning”

Listening to Adaptation

Three notes are played, each with timbre f,1.414f,1.7f,2f

Initial ratios of fundamentals are: 1.0, 1.335, 1.587 (i.e., 12-tet notes C, F,G#). Final adapted ratios are 1.0, 1.414, 1.703

Example is played three times, with (a) extremely slow adaptation (b) slowadaptation, and (c) medium adaptation. (listenadapt)

• adaptation removes most prominant beats

• adaptation retunes all three notes

• remaining quicker beats are inherent to sound

• remaining slow beats (≈ 1 per second) due to resolution of the equip-ment

37

(local anomaly,aerophonius intent)

38

JI vs. 12-tet vs. Adaptive Tuning

An example of drift in Just In-tonation: the fragment endsabout 21 cents lower than it be-gins. 12-tet maintains the pitchby distorting the simple integerratios. The adaptive tuning mi-crotonally adjusts the pitches ofthe notes to maintain simple ra-tios and to avoid the wander-ing pitch. Frequency values arerounded to the nearest 0.5 Hz.(sytonJIdrift, synton12tet, syn-tonadapt)

Frequencies when played in 12-tet:

392 440 440 392 392

329.5 329.5 293.5 293.5 329.5

261.5 261.5 293.5 247 261.5

131 110 87.5 98 131

Frequencies when played in adaptive tuning:

392.5 440 438.5 391 392.5

327 330 292 294 327

261.5 264 292 245 261.5

131 110 87.5 98 131

Frequencies whenplayed in JI with held notes:

392.5 436---436 387.5--387.5

327 327 290.5-290.5 323

261.5-261.5 290.5 242 258.5

131 109 87 96.5 129

Ratios when played in adaptive tuning and in JI:

6/5 4/3 3/2 4/3 6/5

5/4 5/4 1/1 6/5 5/4

2/1 6/5 5/3 5/4 2/1

39

Pitch/Intonation Correction

By making sounds more consonant, they become more “in-tune”

TransformConsonance

BasedAdaptation

InverseTransform

input signal

time

f f ... f 1 2 m

frequency frequency

d d ..d 1 2 m

output signal

time

source spectrum destination spectrumm

agnitude

magnitude

am

plit

ude

am

plit

ude

40

Consonance Based Sound Synthesis

Combines character of two (or more) waveforms in a musically intelligent

(consonant) manner

output

input

reference

ConsonanceBasedAdaptation

....

FrequencyDomain Filter

....

InverseTransform

osc2

source 2

Transform

osc1

source 1

Transform

.... ....

EnvelopesFilters,LFO's, etc.

A method of sound synthesis that incorporates a model of the listener.41

Special Effects Device

Algorithm adjusts partials of a sound to maximize consonance with refer-ence.

• use of inharmonic reference

• voice with spectrum of xylophone

• snare drum made consonant with a flute

• use “backwards” to increase dissonance or to precisely control amountof consonance

(maxdiss)42

Traditional Thai Music

is played on a collection of in-struments (the gong circle) fea-turing the (xylophone-like) renat.(Sudsaboun)

Sorrell: Theoretically, the Thai scale has seven equidistant notes, whichmeans that the intervals are “in the cracks” between our semitone andwhole tone, and are equal. . .

43

Traditional Thai Music II

The spectrum of a typical key of the renat

0 1000 2000 3000 4000

436

642

1246

2393

3873

frequency

magnitude

is very close to the spectrum of an ideal bar.

frequency Hz: 436 642 1246 2393 3873ratio: f 1.47f 2.85f 5.48f 8.88fideal bar: f − 2.76f 5.4f 8.9f

44

Traditional Thai Music III

But the dissonance curve for the ideal bar looks nothing like 7-tet. What’swrong? Observe that Thai music uses both inharmonic instruments (like

the renat) and harmonic instruments (reeds, voice). Drawing thedissonance curve for both sounds simultaneously yields:

7-tet scale steps

2 1 2r s

se

nso

ry

dis

so

na

nce

45

Traditional Indonesian Music

is played on a collection of met-allophones including the bonang.(Kebyar Duduk)

There are two kinds of scales: slendro is very close to 5-tet and pelog is aseven tone scale with unequal intervals (sometimes anotated S S L S S SL).

46

Traditional Indonesian Music II

Since the bonang has a uniquebell-like shape, there is no idealshape to which it can be com-pared. The spectrum of three dif-ferent bonang kettles is shown,and a good generic bonangspectrum is

f, 1.52f, 3.46f, 3.92f.

0 1000 2000 3000 4000

frequency

6891090

23962709

595

904 2030

2326

326476

514

1130

1209 1343

1462

47

Traditional Indonesian Music III

As with the Thai instruments, the inharmonic bonang is often playedtogether with harmonic instruments (flutes, voices). Drawing the

dissonance curve for both sounds simultaneously yields5-tet scale steps

12-tet scale steps

2 1 2r s

se

nso

ry

dis

so

na

nce

which has minima at or near all of the 5-tet scale steps. Similar analysiscan be done for the pelog scale.

48

Current Work

• Last time we saw ways to adjust timbre/spectrum to fit a desired spec-trum by minimizing sensory dissonance.

• Idea: take a melody (e.g., a clarinet) and dynamically re-align the tim-bre so that it “harmonizes” with a collection of sounds (e.g., multiphon-ics produced by the clarinet).

• a way to use arbitrary sounds in the role normally occupied by of a“chord pattern”

Example:

• the melody line (clarimel)

• the multiphonics (allmultiphonics)

49

Current Work II

How to align the multiphonics with the melody?

• one multiphonic all the way through (merge01)

• one multiphonic chosen randomly at each beat (clariphonicsrand2)

• divide into groups and choose from a small number (clariphonics-group2.mp3) (clariphonicsgroup2b.mp3)

Comments:

• clarinet-ness lost: it’s not a clarinet or a multiphonic

• a bizzare kind of harmonization

• need to synchronize changes in “harmony” with sections of melody

50

Legend of Spectral Phollow

chooses at each instant a multiphonic to accompany the performed melody.The multiphonic is transposed, and this generates a kind of (inharmonic)harmonization.

What are the “rules” of multiphonic harmonization?

Use other sounds to derive other kinds of harmonies: gongs, bells, drums,anything with a rich/complex timbre will do.

51

Summary

Consonance and dissonance can be described in concrete terms and thepredictions of the theory are readily verifiable.

Given a timbre, it is easy to find the related scale in which the sound canbe played most consonantly.

Given a scale, it is possible to find related timbres that can be played con-sonantly in that scale.

Problem with unrelated scales and timbres is that there is little opportunityfor contrast between consonant and dissonant passages - related scalesand timbres allow composer/performer control.

52

Familiar “music theory” breaks down for inharmonic sounds. The consonance-based approach can help to give structure to inharmonic musical realms.

For harmonic sounds, the Western 12-tet tuning can be viewed as approx-imating related JI scales. The same ideas can be used to describe the7-tet scales of traditional Thai music as well as Indonesian scales. Themusical scales of these cultures can be derived from the timbre of pairs ofinstruments (rather than of a single instrument, as in the West).

Ideas lead to several consonance-based audio signal processing devices:adaptive keyboards, pitch/intonation correction, sound synthesis, timbralmanipulation

Throughout, we have seen the enduring influence of the contributions ofHelmholtz; the basic characterization of timbre in terms of spectrum, andgeneralizations of physical beating as a paradigm for sensory consonance.

Areas for Further Thought

Variable bandwidth transformations? wavelets or short FFTs?

Are there simple time domain operators that do interesting spectral maps?

How to (re)design acoustic instruments for play in other tunings by adjust-ing: mass and density of string — contour and geometry of tubes — shapeand topology of resonator

Exploit other psychoacoustic phenomena in audio devices

• masking effects

• fusion/fissioning of sounds

• auditory illusions

53

Tuning and Timbre: A Perceptual SynthesisBill Sethares

IDEA: Exploit psychoacoustic studies on theperception of consonance and dissonance. Thetalk begins by showing how to build a device thatcan measure the “sensory” consonance and/ordissonance of a sound in its musical context.Such a “dissonance meter” has implications inmusic theory, in synthesizer design, in the con-struction of musical scales and tunings, and inthe design of musical instruments.

...the legacy of Helmholtz continues...54


Recommended