Date post: | 05-Jul-2018 |
Category: |
Documents |
Upload: | juan-perez-arrikitaun |
View: | 214 times |
Download: | 0 times |
of 27
8/15/2019 fpaper
1/27
Algorithmic Composition
Andrew Pascoe
December 7, 2009
1 Introduction
Advances in computing technology have enabled composers to use the computer as
a means of generating musical content. Some of this musical content is presented
as original, and some of it is presented as extending particular composers’ styles.
Regardless of how it is presented, algorithmic processes have caused controversy in
traditional music circles. How is algorithmic music authentic? How can algorithmic
music have meaning? Are human composers destined to become obsolete? How do
these techniques relate to the question of man versus machine?
This paper begins with a discussion of the long history regarding the systematiza-
tion of music. These systematic approaches to music theory provide a foundation for
the possibility of computers to produce “good” music. Only some brief examples are
provided. As such, understanding the role of systematization in music is necessary to
have an intelligent dialogue about this one direction music is heading.
From there, the paper covers some recent methods and approaches to producing
algorithmic compositions. In particular, the works of Benjamin Carson, David Cope,
Peter Elsea, and Microsoft’s MySong application are examined. Each of these attacks
the problem of automatically generating music in different ways.
Last, the paper discusses musical meaning and the philosophical questions at-
tached to this concept. These are broad questions with perhaps no correct answer.
1
8/15/2019 fpaper
2/27
However, they serve to provide at least some notion of why algorithmic composition
is a controversial subject. The role and intent of the composer form the basis for this
discussion.
2 Historical Systematization of Music
2.1 The Sixteenth Century
Consider the following piece of music:
2 4
Figure 1: Bad counterpoint.
This fragment is considered to be poor compositional form, even today.[9] The
technical reason is the hidden octave on the second beat. While there is no question
that composers in the sixteenth century functioned under at least the intuition of
such rules, the real formalization of these practices was completed by Johann Fux in
his Gradus ad Parnassum in 1725.[15] As Taruskin describes it:
[Musicians] used it to gain facility in “the first principles of harmony and
composition,” which were regarded by teachers as an eternal dogma in
its own right, a bedrock of imperishable lore that “remains unaltered, let
taste change as it will.”
The text itself divides counterpoint into five species , along with set of rules con-
cerning prohibition against perfect parallels, hidden parallels (as above), consonances,
dissonances, and also general aesthetics such as restrictions on sequential parallel
thirds and sixths, and a tendency away from parallel motion. These were derived
2
8/15/2019 fpaper
3/27
from studying the works of Palestrina. Modern composition students still receive
instruction in these rules and aesthetics.
Already one can begin to see how a computer can begin to use these notions of good
composition versus bad composition. It is ultimately simply a matter of eliminatingpossibilities and ranking the remaining possibilities in terms of their adherence to the
given aesthetic.
2.2 The Common Practice Period
The common practice period encompasses most styles associated with the term “clas-
sical music:” Baroque, Classical, and Romantic period styles. The theoretical ground-
work for these styles is based not only in the counterpoint and voice-leading consid-
erations of Fux, but also in chord progressions. Kostka and Payne write:[11]
[Students] must learn which chord successions are typical of tonal harmony
and which ones are not. Why is it that some chord successions seem to
“progress,” to move forward toward a goal, while others tend to wander,
to leave our expectations unfulfilled?
They provide the following examples:
43
43
Figure 2: Tonal harmonic progression.
3
8/15/2019 fpaper
4/27
43
43
Figure 3: Random harmonic progression.
The implication is that there exists a set of guidelines for writing good, tonal music.
These guidelines are what is typically referred to as “music theory,” in imprecise,
casual speech. Kostka and Payne summarize some of these guidelines with a diagram
of possible chord progressions, but are quick to point out:
. . . [B]e aware that Bach and Beethoven did not make use of diagrams
such as these. They lived and breathed the tonal harmonic style and had
no need for the information the diagrams contain. Instead, the diagrams
represent norms of harmonic practice observed by theorists over the years
in the works of a large number of tonal composers.
That is to say, these rules and guidelines have been imposed after the fact, much asFux’s Gradus ad Parnassum did for the sixteenth century style. Once again, this push
toward systematization continues to reduce the art of composition to an algorithmic
framework.
2.3 The Twentieth Century
The twentieth century is unique in at least one respect to previous centuries: In ad-
dition to theorists imposing theoretical structures on previous composers, composers
themselves were creating systems for generating music. This section provides a couple
examples from each camp.
4
8/15/2019 fpaper
5/27
2.3.1 Charles Seeger
Charles Seeger, in a twist of Fux’s work, devised a systematic approach to composition
he dubbed “dissonant counterpoint.”[14] Nicholls offers the following example:
Figure 4: Dissonant counterpoint.
Here, the focus is on dissonance as opposed to consonance. This example has no
consonances between the two parts. Moreover, the individual lines themselves have
a tendency toward dissonant intervals, e.g. the B to F in the top line is a tritone.
Repetition of pitches is to be avoided unless they are sufficiently far apart. As the
number of voices increased, rhythmic dissonance also becomes a necessity. All in all,
these rules and guidelines codify a means of creating a new type of music.
2.3.2 Arnold Schoenberg
Arnold Schoenberg created a technique that is known as “serial composition,” “twelve-
tone technique,” or “dodecaphony.”[16] This atonal process begins with a tone row of
all twelve diatonic pitches. The motivation for this practice is to have the pitches not
relate to any particular key, but only to themselves. From here, only certain musical
transformations are allowed: transposition and inversion. P is used to denote the
“prime” tone row. Transposition is denoted through the use of subscripts, so, for
example, a transposition of six semitones would be written as P 6
. Inversion is denotedby I , and inversions can also be transposed. Thus, an inversion transposed by six
semitones would be written as I 6. Taruskin gives the following examples using a tone
row from Schoenberg’s Suite, Op. 25 :
5
8/15/2019 fpaper
6/27
(a) P 0
(b) I 0
(c) P 6
(d) I 6
Figure 5: Twelve-tone processes.
These processes are extremely mathematical in nature. Besides just being a gen-
eral sense of what is “good” composition versus what is “bad” composition, the rules
laid forth by Schoenberg comment on what is possible composition. Moreover, there
is no question that Schoenberg intended this system to function as hard and fast
rules. Taruskin writes, and perhaps provides a snide comment toward algorithms as
a compositional practice:
Indeed, the use of an exhaustive twelve-note series makes the Grundgestalt
function, and the organic unity thus guaranteed, virtually automatic. And
that, of course, was the great breakthrough, the “principle capable of
serving as a rule,” which allowed the composition of large-scale, abstract,
and autonomous atonal music of constant and at-all-time-demonstrable
motivic coherence despite its renunciation of predefined tonal hierarchies,
and despite its frequent “athematicism.” But beware! As soon as any
musical characteristic becomes the automatic result of a method, it stops
being a compositional achievement.
Regardless of Taruskin’s personal opinions, Schoenberg’s serialism does provide
a rigorous framework that a computer can be easily made to emulate. Whether a
computer can make a “compositional achievement” is a question saved for a later
section of this paper.
6
8/15/2019 fpaper
7/27
2.3.3 Leonard Bernstein
Leonard Bernstein began to pose new theoretical question related to the structure
of music, and in that sense his theory imposes interpretations on compositions from
the past.[2] His theory has its roots in the linguistics of Noam Chomsky. The Chom-
skian framework of “transformational linguistics” is applied to music. Essentially,
Bernstein equates melody with basic musical meaning, chords as adjectival modifiers,
rhythm as verbs, and mode as negation. He continues this reasoning and explores
the syntactic structures of music by suggesting an underlying “deep structure” to
musical content that goes through a sort of transformational grammar through not
only these modifiers, but also grammatical aspects of deletion and combination. He
freely admits that his ideas are not fully formed, but here are some basic examples
with potential linguistic equivalents:
(a) Jack loves Jill.
(b) Jack doesn’t love Jill.
(c) Does Jack love Jill?
(d) Doesn’t Jack love Jill?
Figure 6: Transformational musical grammar.
This notion is very different from the forms of music theory we have thus far
considered. It does not comment on notes or chords in particular, but rather focuses
on the construction in form of a piece of music. Bernstein likens this to elevating
prose into poetry.
Bernstein’s analysis lends a different type of insight into the process of composi-
tion. This potentially affords computers the opportunity to replicate human creativity
on a separate level. By using musical fragments and applying transformational gram-
7
8/15/2019 fpaper
8/27
mars to them, new creative structures may emerge. This question is related to natural
language processing.
2.3.4 David Lewin
David Lewin made advances in music theory with his work in generalized musical
intervals and transformations.[12] In many ways it is an extension of Schoenberg’s
atonal practice, but through this extension finds relevance to tonal composers, such
as Chopin. Lewin’s theorization of music relies heavily on the mathematical field of
abstract algebra. Indeed, Lewin’s book reads more like a math text than a treatise on
music, containing theorems, corollaries, and proofs. In the foreword, Edward Gollin
sums it up succinctly:
The work, a methodical examination of the concept of a musical interval,
explores how the familiar notion of interval as “a distance extended be-
tween pitches in a Cartesian space” is merely one specific case of a more
general idea, one that can embrace different kinds of musical objects (du-
rations, meters, Klangs , timbres, and so on), different (i.e. non-Euclidean)
geometries, and different orientational perspectives (interval as action or
gesture rather than as simply measurement of distance between things).
By far the most mathematical example covered in this paper, Lewin’s work finds
a fit in the world of computation. Not just composition, but sound itself can be
analyzed rigorously within Lewin’s framework. This opens the door for computers
to not only compose well, but to also construct interesting collections of frequencies
that evolve over time for synthetic instrument creation—aspects of music that are
hardly intuitive for traditional composition practitioners. Thus, the computer can
more easily express musical ideas than a human can.
8
8/15/2019 fpaper
9/27
3 Recent Methods
This section of the paper now looks at approaches to algorithmic composition from a
few perspectives, namely Benjamin Carson, David Cope, Peter Elsea, and Microsoft’s
MySong application.
3.1 Peter Elsea
Peter Elsea has explored algorithmic composition through the use of fuzzy logic
systems.[8, 7] Pitch classes are represented as sets that denote membership in a par-
ticular chord or scale. For example:
(1 0 1 0 1 1 0 1 0 1 0 1)
Figure 7: Fuzzy logic set of notes.
This example in the major scale does not contain any real fuzzy logic (though the
set still can function as a fuzzy logic set). The example of the minor scale, in which
the seventh can be raised, provides a fuzzier set:
(1 0 1 1 0 1 0 1 1 0 0.7 0.6)
Figure 8: Fuzzier logic set of notes.
Note that these are not probabilities, but merely demonstrate a preference for the
flatted seventh over the naturalized one.
This fuzzy logic system affords a host of mathematical operations that can be per-
formed. Transposition is accomplished by circularly rotating the set. Multiplication
9
8/15/2019 fpaper
10/27
between sets is available. Set unions and intersections are accomplished by taking the
maximum and minimum, respectively, of two numbers with the same index. These
operations allow for simple processing in tonal harmony applications.
Elsea’s software functions by reading in a MIDI note and outputting a harmoniza-tion for that note. The basic set of rules are, with regards to chord inversions:
If root position keeps common tones, then root position.
If first inversion keeps common tones, then first inversion.
If second inversion keeps common tones, then second inversion.
If last position was root, then first inversion or second inversion.
If there have been too many firsts in a row, then root or second.
If there have been too many seconds in a row, then root or first.
If last position was not root, then root.
The first three rules ensure that the harmonization does not fly around too wildly.
The next three rules ensure that the harmonization is not too static. Of particular
note for these rules is the use of the fuzzy concept of “too many.” Thus, programmed
into Elsea’s code is a sense of what “too many” actually means. The last rule has a
relatively low weight of being implemented so that it does not interfere with the main
rules too much.
The question, once we move beyond simple inversions, is how to harmonize melodic
notes given their context not only in the key, but also in the context of the preceding
chord. Elsea constructs three candidate chords by using these aforementioned fuzzy
logic sets: one with the note as the root, one with the note as the third, and one
with the note as the fifth in the chord. Then, the actual roots of these chords are
compared with more fuzzy logic sets that are based on the tonal harmony (the dia-
grams) provided by Kostka and Payne. The best candidate, based merely on taking
the maximum of the resultant computations, is then constructed and played. So for
10
8/15/2019 fpaper
11/27
example, in the key of C, if the user plays a G, there are three basic triadic chords in
the key of C that harmonize G, not including their inversions:
V
I iiiFigure 9: Triadic chords in C that harmonize G.
Which chord to choose is, once again, dependent on the context. If the preceding
chord is a root position I chord, then the following is a possibility:
I(6)
I
Figure 10: One possible progression.
But if the preceding chord is a ii chord, the next two examples show a “good”
progression and a “bad” progression:
V(6,4)
ii
(a) Good progression.
iii
ii
(b) Bad progression.
Figure 11: Progressions of varying favoritism.
In short, Elsea takes a mathematical approach to the tonal theory of the common
practice period, and thus the computer is able to make informed, real-time compo-
sitional decisions based on melodic user input. This would not have been possible
without a systematization of the music of the period in the first place.
11
8/15/2019 fpaper
12/27
3.2 Benjamin Carson
Benjamin Carson, in his compositional practice, takes an algorithmic approach to
rhythm.[4] The software he has developed in collaboration with a software engineer
focuses on syncopation over an underlying music pulse, as opposed to the practice of
new complexity featuring such methods as nested tuplets. The software is designed
to give the user a fine-tuned control of multiple aspects of the composition process.
Fundamentally, the software develops an aesthetic based on the complexity of
durations of musical events. For example, a series of musical events may have metric
times of 416
, 716
, and 1316
. These durations are randomly generated, and depend upon
the composers’ input as to how complex he wishes a certain series of rhythmic events
to be.
Beyond this, the software allows a composer to view his work from a variety of dif-
ferent dimensions. Rhythmic complexity can be tied to pitch classes or dynamics, for
example. Thus, even though the underlying algorithmic production of these durations
is inherently random, the composer is free to choose some sort of logical framework for
how rhythmic complexity interacts with the rest of the composition’s aspects. This
affords a unique mix of both the computer’s input and the compositional goals and
ideas of the composer.
These syncopated rhythmic grammars can be variably juxtaposed to the underly-
ing ordered pulse of the music. This can be accomplished by shifting the syncopation
over the underlying pulse, or by spreading out the underlying pulse in clock time.
These processes generate psychological musical effects for the listener. Once again, it
is at the composer’s whim when to produce such rhythmic effects as part of the cre-
ative process. In particular, Carson describes this feature of his software as a means of
maintaining interesting rhythmic flows that do not grow stale, but still maintain some
sort of logical structure. The goal is not to be completely random, but instead to have
a certain level of ordered randomness. Thus, the piece has a concrete development
over its course without devolving into either banal repetition or pure chaos.
12
8/15/2019 fpaper
13/27
Carson also makes a distinction between metric time and clock time. That is, a
32nd note does not lose its “32nd-ness” under tempo changes. Its musical function
does not change despite its variation in actual clock time. To this extent, Carson also
experiments with an algorithmic approach to changes in tempi.Carson’s software is in at least part a practice in “compositional economy.” By
automating certain musical processes, he finds himself free to focus on other aspects
of his composition he finds either more interesting or more important. However, the
software allows for a large range of input by the composer, allowing him to express
his “musical impulses” without necessarily having the computer completely control
the output. Carson’s idea of musical impulses will be covered in the next section of
this paper.
3.3 David Cope
David Cope has long been involved in the field of algorithmic composition, and much
of his focus has been on making computer models of musical creativity.[6] His pro-
gram, Experiments in Musical Intelligence (EMI), employs a variety of techniques to
accomplish not only algorithmic composition in the styles of particular composers,
but also to produce new, “creative” output.
Cope’s approach is fundamentally database driven. He compiles musical sources
from a particular composer (or particular composers), all of which go through a series
of analyses. From here, the program sets to work using models of recombinance,
allusion, learning, form and structure, and influence. This paper presents a summary
of this ideas and their function.
3.3.1 Recombinance
Recombinance is essentially taking musical material from the database and replacing
sections of it with other material acquired from the database. Cope’s book provides
a simple example of this process:
13
8/15/2019 fpaper
14/27
43
8
43
43
43
(a) Original Bach.
43
43
43
8
43
(b) EMI output.
Figure 12: Recombinance in action.
The EMI output replicates the original Bach Chorale no. 188 on the first beat,
but diverges from there, instead replacing it with a fragment from Bach’s Chorale
no. 157. EMI accomplishes this by forming examining the database and searching for
alternative candidates following from the given first beat. This process is continued
throughout the algorithmically generated composition.
However, this process will ultimately result in free-flowing melodies without any
sense of direction. Cope writes:
In order to provide some sense of this logic and larger structure, and again
wanting to avoid coding my own knowledge of musical form as rules, I
rewrote the program to inherit more of the structural aspects of the music
in its database. This inheritance involved extending the analysis process
so that the program could store other information about the music being
analyzed in each grouping along with destination notes. For example, I
had the program analyze the original music’s distance to cadence, position
of groupings in relation to meter, and other context-sensitive features.
The program is also sensitive to “compositional signatures,” which are larger frag-
ments of music that give a particular composer a sense of musical identity. Thus, there
14
8/15/2019 fpaper
15/27
is an aspect of the program that resists the complete fragmentation of raw recombi-
nance. Due to similar concerns of over-fragmentation, EMI also tries to maintain a
cohesion in musical texture by comparing density of notes and rhythmic characteris-
tics between groupings, and searching the database for passages of similar quality intexture.
3.3.2 Allusion
Allusion refers to the practice of composers to borrow material from other composers.
Cope breaks down allusion into five categories: quotations, paraphrases, likenesses,
frameworks, and commonalities. He freely admits that the boundaries between these
categories are not necessarily clear.
Quotations are as one would expect: direct pitch and rhythm copying. Para-
phrases use similar intervallic structures to the source material and can also incor-
porate rhythmic differences. Likenesses are similar to paraphrases, but allow more
freedom in the differences, as well as changes in harmonic content. Frameworks take
source material and insert notes between the pitches and rhythms of the source mate-
rial. Such processes can be illuminated through textural reductions. Commonalities
are simplistic musical materials, not unlike Bernstein’s concept of “deep structures.”
Cope developed a secondary program called Sorcerer that can sniff out these allu-
sions. A user provides Sorcerer with a database of musical material and a work that
potentially contains allusions that are in the database. The program compares not
only pitches but intervals between pitches, allowing for variation in these parameters,
and also allows for a certain number of notes to fall within the source material (the
framework idea). Users can specify how sensitive they wish Sorcerer to perform, pick-
ing out only quotations, or running the whole gamut up to commonalities. Bounds
on the length of patterns must also be set by the user.
EMI, because of its construction in accessing databases, is naturally inclined to at
least include some elements of allusion. Cope tells a story about how he recognized
15
8/15/2019 fpaper
16/27
in one of EMI’s pieces an allusion to Vivaldi, despite the database being entirely
comprised of works by Bach. Bach himself transcribed a series of Vivaldi pieces. Did
EMI reveal a hidden allusion in Bach to the music of which he himself had intimate
knowledge?
3.3.3 Learning
Cope describes another program he worked on called Gradus (named after Fux’s
Gradus ad Parnassum ) which attempts to learn the rules of first species counterpoint.
Other programs in the past have codified solutions to this problem, but they do not
learn in particular.
A melodic line is given to these programs (called a cantus firmus ). The goal is to
write suitable counterpoint against this line. Previous programs begin with the first
note and proceed to the next. As any student of counterpoint can attest to, this naı̈ve
approach can quickly run into serious problems. Some beginning solutions resolve in
dead ends where there are no possible notes to write that will conform to the rules.
Therefore, such a program needs to backtrack to the last good note and attempt to
replace it with another possible solution, and then continue on from there, potentially
running into the same problem all over again.
Cope’s program stores its mistakes and avoids replicating them again in the future.
So the program starts by having to backtrack, but the more the program “practices,”
the less it must backtrack to find a solution to the given cantus firmus. While Cope’s
example has limited application to sixteenth century first species counterpoint, the
underlying philosophy has large implications for algorithmic composition as a whole.
The ability of programs to learn what to avoid can not only increase efficiency, but
also produce higher quality work.
16
8/15/2019 fpaper
17/27
3.3.4 Form and Structure
Cope considers structure in the form of SPEAC analysis: statement, preparation,
extension, antecedent, and consequent. These all refer to relatively brief musical
gestures that find their characteristics in the amount of tension they provide to the
listener’s ear. The interactions between these different aspects of music create logical
surface structures. Alas, Cope’s book does not go into detail about how such an
analysis is implemented in his programs. He writes, “I simply ask readers to imagine
the possibilities, and to investigate the realities on their own.”
As for form, SPEAC analysis is applicable. Cope outlines another program called
Alice that analyzes a database searching for broad compositional forms. It begins
by attempting to identify themes. If this attempt is unsuccessful, it analyzes texture
changes to deduce the form. Failing that, it relies on simple cadences. If that process
also fails, Cope says Alice “takes its best guess.” From what little details Cope gives
in his book, it appears Alice searches for “expectation, fulfillment, and deception of
themes or progressions,” and tries to replicate these structures in its database.
3.3.5 Influence
Influence is separate from relying on a database or merely composing in another’s
style. Instead, influence is about creating a new style that has its roots in a previous
style. Cope gives three examples of his experiments with EMI and musical influence.
The first example is a feedback loop. EMI begins with a database of a particular
composer’s work. As has already been described, it produces compositions based
on this database. Once a composition is completed, it is inserted back into the
database. By repeating this process thousands of times, Cope found that the output
showed obvious “influence” from the original source material, but the overall style
was exceptionally different.
The second example uses multiple EMI programs all using databases from different
composers. Here, the instances of EMI communicate with one another about their
17
8/15/2019 fpaper
18/27
compositions. Cope programmed in a metric for what a “good” composition is: the
kinetic energy of the music. Each program has a slightly different metric, and hands
over to the other programs music it evaluates as aesthetically acceptable. In this way,
the programs are attempting to influence each other, resulting in compositions thatdemonstrate disparate sources. The author finds a similarity between this idea and
Bohm Dialogues.
Last, Cope describes influence as a form of exploration. He created a web-crawling
application called Serendipity that would download MIDI files (and text files) that
would be added to databases. At first, this process resulted in a mish-mash of styles
in one composition that was not aesthetically pleasing. So Cope produced filters
that would determine if a given MIDI file was good enough, or even appropriate,
for inclusion in EMI’s database. This resulted in much more cohesive and enjoyable
compositions.
3.4 MySong
MySong is a consumer application that has been produced by a Microsoft research
team.[13] While its application is not as serious as what has been previously exam-
ined, the author felt its inclusion to be beneficial as an example of giving a “taste
of composition” to musically untrained individuals. That is to say, algorithmic com-
position is becoming a tool for users to create their own content. It also functions
differently than other work presented in this paper.
However, there are some similarities to the other work. Like Elsea’s work, MySong
attempts to harmonize melodic input by users, and like Cope’s work, it does so
by using a database of music. The distinctions lie in the implementation of these
practices.
To begin with, users specify a tempo and are provided with a metronome track
that delimits bar lines, or measure boundaries, in their melodic input. Users then sing
a melodic line into a microphone. The software then uses a pitch detection algorithm,
18
8/15/2019 fpaper
19/27
and takes all of the notes in a particular measure and compiles them into a set. Here
is an illustration:
(a) Sung line. (b) Pitch classes.Figure 13: Pitch sets derived from melodic lines.
The database MySong employs is a collection of about three hundred lead sheets.
Lead sheets contain melodic content along with chord progressions. These lead sheets
come from a variety of popular music genres, including pop, rock, and country. As
such, these styles of music usually find changes in harmony at the bar line. Here is
an example of what a lead sheet looks like:
G
C
F
Figure 14: Simple lead sheet example.
The method for algorithmic composition is based in Hidden Markov Models (HMM).
Essentially, a Markov model represents a stochastic process through transition prob-
abilities. Based on an analysis of the database, we might conclude that a C major
chord will move to a G major chord 30% of the time, and move to an F major chord
20% of the time. What makes the Markov model hidden in this application is that
the chord progression is unknown—all that is available are the pitch classes derived
from the user input. Thus, there is another level of analysis involving the probability
of a chord for the measure given the pitch class.
The way to find the optimal solution given a set of observances in an HMM is to
use the Viterbi algorithm. A mathematical discussion of the Viterbi algorithm and
why it works is beyond the scope of this paper, but it is important to know that there
19
8/15/2019 fpaper
20/27
is an input parameter k to which the users of MySong are given access.
This k is exhibited in MySong as the “Jazz Factor.” If k = 0, then the program
effectively ignores changes in the melodic line altogether and produces the most typical
chord progression deriving from the first measure. If k = 1, then the program ignoresstandard chord progressions and the harmonization for each measure is determined
solely by the pitch class for that measure. That is, when the Jazz Factor is increased,
the program is more likely to make bold harmonic choices. If k = 0.5, then for each
measure the program equally accounts for the user’s input and the chords in the
preceding and succeeding bars.
The other main user control in MySong is the “Happiness Factor,” which also
ranges from 0 to 1. Chord progressions for major and minor modes differ, and the
application accounts for this. When h = 0, the chord progression is guaranteed to
be in the minor mode, and when h = 1, the chord progression is guaranteed to be in
the major mode. As can probably be deduced, when h = 0.5, the program will freely
choose the best chord based on the Viterbi algorithm and the value of k, whether or
not it is a major or minor chord.
Once the Viterbi algorithm is complete, users are then free to tweak their chord
progressions if they so choose. From there, the user can select a particular style of
accompaniment that adds instrumentation and rhythm to the final chord progression,
and output their song to a sound file. Morris did not elaborate on how this stylization
is accomplished.
MySong is one foray into the public sphere for algorithmic composition. Morris
notes that given the input of a melody from a famous pop song, the program will at
least sometimes produce the same chord progression in the studio version of the song.
He also touts the ability of MySong to produce interesting variations on common
songs by tweaking the Jazz and Happiness Factor controls.
20
8/15/2019 fpaper
21/27
4 Musical Meaning
Let us consider the following musical fragments:
42 ff Figure 15: Opening theme from Beethoven’s Symphony no. 5.
42 p
Figure 16: Subject from Shostakovich’s Fugue no. 5, Op. 87.
43
Figure 17: Opening output of EMI of a Bach-style lute suite.[6]
What, if anything, do these bits of music mean? Taking a cue from MySong ’s
use of a Happiness Factor, should we conclude that Beethoven was having a bad day,
and Shostakovich happened to be feeling jubilant when they wrote their respective
pieces? One of the author’s past professors suggested that such interpretations were
dangerous, but Aldwell and Schachter suggest that our connotations of the major
mode being “happy” and the minor mode being “sad” should not be dismissed too
carelessly.[1] Of course, this begs the question, “What was EMI feeling?”
Hofstadter finds these questions, given the quality of EMI’s output, disturbing.[10]
He writes:
. . . I was going to grapple with this strange program that was threatening
to upset the apple cart that held many of my oldest and most deeply
21
8/15/2019 fpaper
22/27
cherished beliefs about the sacredness of music, about music being the
ultimate inner sanctum of the human spirit, the last thing that would
tumble in AI’s headlong rush towards thought, insight, and creativity.
Having a personal affinity for Chopin and being surprised at how well EMI emu-
lated Chopin, Hofstadter boils his worries down to three possible outcomes, each of
which he finds unsatisfactory as a lover of music and as a human being:
(1) Chopin (for example) is a lot shallower than I had ever thought.
(2) Music is a lot shallower than I had ever thought.
(3) The human soul/mind is a lot shallower than I had ever thought.
These concerns are strange since Hofstadter claims he has long been convinced
that humans are effectively machines anyway. He claims his surprise lies in the fact
that the machines producing EMI’s output are many orders of magnitude simpler than
human biological construction. Regardless, the author is not particularly surprised
by such developments in the production of machine music—the long historical trend
of the systematization of music lends itself to mechanical replication.
But Hofstadter is not alone. Cope has found resistance in music circles to havingEMI’s output performed by trained human musicians, saying responses were mostly
negative.[5] Why do people have such revulsion to the idea that a computer can
compose well? Do people read that much meaning into music?
Bernstein rejects the notion of any sort of extra meaning in music.[3, 2] For him,
the meaning of music is contained entirely within the notes themselves. He declares
that music does not have the same function as language, and any meaning it has is
ineffable. Hofstadter’s claim that music functions as “direct soul-to-soul messages”
does not find a place in Bernstein’s philosophy of music. As Bernstein says:
. . . [L]anguage has a communicative function and an aesthetic function.
Music has an aesthetic function only. For that reason, musical surface
structure is not equitable with linguistic surface structure.
22
8/15/2019 fpaper
23/27
Cope also rejects Hofstadter’s objections.[6] He writes:
Interestingly, one of my favorite analogies is that mathematics is to physics
as music is to language. The first instances—mathematics and music—
are abstract, while the second instances—physics and language—relate
more to the real world. Mathematics and music also deal in proportions,
while physics and language attempt to develop meaning. . . Though sim-
plistic, this analogy nonetheless serves to emphasize music’s reliance on
relationships rather than on meaning and representations of meaning.
Carson, too, suggests that music has no inherent meaning beyond its notes, and
says he agrees with Bernstein.[4] He states that music is nothing more than a surface
structure, and it has no need for semiotics. Beyond this, Carson is glad that the study
of algorithmic composition is finally bringing these issues to light, that the concept
of “depth” in music is completely misguided, in that it necessarily implies a mystical
and mysterious element to the composition process that is just simply not present.
Carson claims that the use of algorithmic processes for compositional practices
does indeed distance the composer from his work. He also states that the highestlevel of intimacy a composer can have with his composition is when the composer
specifies each element in the work himself. However, this is not necessarily beneficial.
For one, as already mentioned previously in this paper, is the issue of economy.
Carson says that with modern technology, a composer would have to be “crazy” not
to automate parts of a composition that have a fundamental structure that could be
mechanically composed. This enables the composer to focus on other aspects of the
work. Second, a là John Cage, removing the ego of the composer grants opportunities
for musical exploration to which the composer’s mind may never have led him.
In response to a follow-up question relating to economy, that perhaps the most
economical action to take is to simply rely on algorithmic composition for all parts
of the process, Carson says that he sees no real issue in this. Composition, he states,
23
8/15/2019 fpaper
24/27
is the act of recording “musical impulses.” These musical impulses function not only
consciously, but subconsciously. Using an algorithm, or even developing an algorithm,
inherently incorporates musical impulses.
Will human composers become obsolete? Carson does not think so. Humans havea natural tendency and demand to relate to other humans, and thus have a desire
to experience compositions produced by human composers. However, as stated, even
an algorithm expresses some element of a composer’s will and aesthetic. Carson even
extends the musical impulse to a user merely “pushing play” on some automatic
composition device.
He also does not take a post-modern stance of all music being equal. In response
to Hofstadter, Chopin is still great whether or not his style is replicable by machines.
Yes, music is a shallow surface structure, but Chopin’s greatness can be attributed
to his ability to produce surface structures of enormous musical complexity. Car-
son concludes that Chopin himself is essentially an algorithm of both conscious and
subconscious processes—but he is a very good algorithm.
The shallowness of music, according to Carson, is not a fundamental problem.
Music still has value. People associate with music because it relates to them on a
multitude of levels, including cultural and emotional levels. Audiences may incorpo-
rate meaning into their musical experiences, and there is nothing inherently wrong
with this. However, there is a benefit in increasing musical understanding by dis-
pelling these myths about composition as some sort of divine, mystical, or spiritual
process. Knowledge of actual compositional processes should aid audiences in their
appreciation of music as an art form.
The question about what EMI was “feeling” when it output the music above is
perhaps a misguided question. The real question could easily be, “What were David
Cope’s musical impulses when he wrote EMI, compiled a database of Bach lute suites,
and pressed the ‘generate’ button?”
24
8/15/2019 fpaper
25/27
5 Conclusion
Algorithmic composition is still a relatively nascent field given the long course of
music history. Its full potential is yet to be explored, and it has yet to completely
break into the public consciousness. Hofstadter asks:[10]
Where will we have gotten in twenty years of hard work? In fifty? What
will be the state of the art in 2084? Who, if anyone, will be able to tell
“the right stuff” from rubbish? Who will know, who will care, who will
loudly protest that the last (though tiniest) circle at the center of the
style-target has still not been reached (and may never be reached)? What
will such nitpicky details matter, when new Bach and Chopin masterpieces
applauded by all come gushing out of silicon circuity at a rate faster than
H2O pours over the edge of Niagara? Will that wondrous new golden age
of music not be “truly a thing of beauty?”
And he concludes:
. . . [T]he day when music is finally and irrevocably reduced to syntacticpattern and pattern alone will be, to my old-fashioned way of looking at
things, a very dark day indeed.
The author shares a curiosity in Hofstadter’s questions, but applies a separate
moral judgment on the future of music. Sharing in Carson’s idea of “musical impulse,”[4]
the future looks bright for composers to continue to develop systems, as theoreticians
and composers alike have done throughout history, that will explore new avenues of musical expression and insight. Algorithmic processes do not detract from our hu-
manity in the slightest. When all is said and done, an algorithm is still a creation
founded in the human mind, and creations are inherently imbued with the wills of
their creators.
25
8/15/2019 fpaper
26/27
References
[1] Edward Aldwell and Carl Schachter. Harmony and Voice Leading . Schirmer,
New York, New York, third edition, 2002.
[2] Leonard Bernstein. The Unanswered Question . Harvard University Press, Cam-
bridge, Massachusetts, 1999.
[3] Leonard Bernstein. The Joy of Music . Amadeus Press, LLC, Pompton Plains,
New Jersey, 2004.
[4] Benjamin Carson. Personal Interview, December 4, 2009.
[5] Jacqui Cheng. Virtual composer makes beautiful music—and stirs
controversy . http://arstechnica.com/science/news/2009/09/
virtual-composer-makes-beautiful-musicand-stirs-controversy.ars,
Accessed December 6, 2009.
[6] David Cope. Computer Models of Musical Creativity . The MIT Press, Cambridge,
Massachusetts, 2005.
[7] Peter Elsea. Fuzzy Logic and Musical Decisions , 1995. ftp://arts.ucsc.edu/
pub/ems/FUZZY/Fuzzy_Logic_And_Music.pdf , Accessed December 6, 2009.
[8] Peter Elsea. Musical Applications of Fuzzy Logic , 1995. ftp://arts.ucsc.edu/
pub/ems/FUZZY/MusAppFuzzy.pdf , Accessed December 6, 2009.
[9] Robert Gauldin. A Practical Approach to Sixteenth-Century Counterpoint . Wave-
land Press, Inc., Long Grove, Illinois, 1995.
[10] Douglas Hofstadter. Sounds Like Bach . http://www.unc.edu/~ mumukshu/
gandhi/gandhi/hofstadter.htm , Accessed December 6, 2009.
26
8/15/2019 fpaper
27/27
[11] Stefan Kostka and Dorothy Payne. Tonal Harmony, With an Introduction to
Twentieth-Century Music . McGraw-Hill, 1221 Avenue of the Americas, New
York, New York, fifth edition, 2004.
[12] David Lewin. Generalized Musical Intervals and Transformations . Oxford Uni-
versity Press, Inc., 198 Madison Avenue, New York, New York, 2007.
[13] Daniel Morris. MySong: Automatic Accompaniment for Vocal
Melodies , 2008. Online Lecture. http://academicearth.org/lectures/
mysong-automatic-accompaniment-for-vocal-melodies, Accessed Decem-
ber 6, 2009.
[14] David Nicholls. American Experimental Music: 1890-1940 . The Press Syndicate
of the University of Cambridge, New York, New York, 1990.
[15] Richard Taruskin. The Oxford History of Western Music , volume I, The Earliest
Notations to the Sixteenth Century . Oxford University Press, Inc., 198 Madison
Avenue, New York, New York, 2005.
[16] Richard Taruskin. The Oxford History of Western Music , volume IV, The Early Twentieth Century . Oxford University Press, Inc., 198 Madison Avenue, New
York, New York, 2005.