+ All Categories
Home > Documents > the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and...

the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and...

Date post: 24-Jun-2020
Category:
Upload: others
View: 3 times
Download: 1 times
Share this document with a friend
333
Transcript
Page 2: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

This page intentionally left blank

Page 3: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

L I E G R O U P S , P H Y S I C S , A N D G E O M E T R Y

An Introduction for Physicists, Engineers and Chemists

Describing many of the most important aspects of Lie group theory, this bookpresents the subject in a ‘hands on’ way. Rather than concentrating on theoremsand proofs, the book shows the relation of Lie groups with many branches ofmathematics and physics, and illustrates these with concrete computations. Manyexamples of Lie groups and Lie algebras are given throughout the text, with appli-cations of the material to physical sciences and applied mathematics. The relationbetween Lie group theory and algorithms for solving ordinary differential equa-tions is presented and shown to be analogous to the relation between Galois groupsand algorithms for solving polynomial equations. Other chapters are devoted todifferential geometry, relativity, electrodynamics, and the hydrogen atom.

Problems are given at the end of each chapter so readers can monitor theirunderstanding of the materials. This is a fascinating introduction to Lie groupsfor graduate and undergraduate students in physics, mathematics and electricalengineering, as well as researchers in these fields.

Robert Gilmore is a Professor in the Department of Physics at Drexel Univer-sity, Philadelphia. He is a Fellow of the American Physical Society, and a Memberof the Standing Committee for the International Colloquium on Group TheoreticalMethods in Physics. His research areas include group theory, catastrophe theory,atomic and nuclear physics, singularity theory, and chaos.

Page 4: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …
Page 5: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

LIE GROUPS,PHYSICS, AND GEOMETRY

An Introduction for Physicists, Engineersand Chemists

ROBERT GILMOREDrexel University, Philadelphia

Page 6: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

CAMBRIDGE UNIVERSITY PRESS

Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo

Cambridge University PressThe Edinburgh Building, Cambridge CB2 8RU, UK

First published in print format

ISBN-13 978-0-521-88400-6

ISBN-13 978-0-511-37752-5

© R Gilmore 2008

2008

Information on this title: www.cambridge.org/9780521884006

This publication is in copyright. Subject to statutory exception and to the provision of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press.

Cambridge University Press has no responsibility for the persistence or accuracy of urls for external or third-party internet websites referred to in this publication, and does not guarantee that any content on such websites is, or will remain, accurate or appropriate.

Published in the United States of America by Cambridge University Press, New York

www.cambridge.org

eBook (EBL)

hardback

Page 7: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Contents

Preface page xi1 Introduction 1

1.1 The program of Lie 11.2 A result of Galois 21.3 Group theory background 31.4 Approach to solving polynomial equations 81.5 Solution of the quadratic equation 101.6 Solution of the cubic equation 111.7 Solution of the quartic equation 151.8 The quintic cannot be solved 171.9 Example 181.10 Conclusion 211.11 Problems 22

2 Lie groups 242.1 Algebraic properties 242.2 Topological properties 252.3 Unification of algebra and topology 272.4 Unexpected simplification 292.5 Conclusion 292.6 Problems 30

3 Matrix groups 343.1 Preliminaries 343.2 No constraints 353.3 Linear constraints 363.4 Bilinear and quadratic constraints 393.5 Multilinear constraints 423.6 Intersections of groups 433.7 Embedded groups 43

v

Page 8: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

vi Contents

3.8 Modular groups 443.9 Conclusion 463.10 Problems 47

4 Lie algebras 554.1 Why bother? 554.2 How to linearize a Lie group 564.3 Inversion of the linearization map: EXP 574.4 Properties of a Lie algebra 594.5 Structure constants 614.6 Regular representation 624.7 Structure of a Lie algebra 634.8 Inner product 644.9 Invariant metric and measure on a Lie group 664.10 Conclusion 694.11 Problems 69

5 Matrix algebras 745.1 Preliminaries 745.2 No constraints 745.3 Linear constraints 755.4 Bilinear and quadratic constraints 785.5 Multilinear constraints 805.6 Intersections of groups 805.7 Algebras of embedded groups 815.8 Modular groups 815.9 Basis vectors 815.10 Conclusion 835.11 Problems 83

6 Operator algebras 886.1 Boson operator algebras 886.2 Fermion operator algebras 896.3 First order differential operator algebras 906.4 Conclusion 936.5 Problems 93

7 EXPonentiation 997.1 Preliminaries 997.2 The covering problem 1007.3 The isomorphism problem and the covering group 1057.4 The parameterization problem and BCH formulas 1087.5 EXPonentials and physics 114

Page 9: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Contents vii

7.6 Conclusion 1197.7 Problems 120

8 Structure theory for Lie algebras 1298.1 Regular representation 1298.2 Some standard forms for the regular representation 1298.3 What these forms mean 1338.4 How to make this decomposition 1358.5 An example 1368.6 Conclusion 1368.7 Problems 137

9 Structure theory for simple Lie algebras 1399.1 Objectives of this program 1399.2 Eigenoperator decomposition – secular equation 1409.3 Rank 1439.4 Invariant operators 1439.5 Regular elements 1469.6 Semisimple Lie algebras 1479.7 Canonical commutation relations 1519.8 Conclusion 1539.9 Problems 154

10 Root spaces and Dynkin diagrams 15910.1 Properties of roots 15910.2 Root space diagrams 16010.3 Dynkin diagrams 16510.4 Conclusion 16810.5 Problems 168

11 Real forms 17211.1 Preliminaries 17211.2 Compact and least compact real forms 17411.3 Cartan’s procedure for constructing real forms 17611.4 Real forms of simple matrix Lie algebras 17711.5 Results 18111.6 Conclusion 18211.7 Problems 183

12 Riemannian symmetric spaces 18912.1 Brief review 18912.2 Globally symmetric spaces 19012.3 Rank 19112.4 Riemannian symmetric spaces 192

Page 10: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

viii Contents

12.5 Metric and measure 19312.6 Applications and examples 19412.7 Pseudo-Riemannian symmetric spaces 19712.8 Conclusion 19812.9 Problems 198

13 Contraction 20513.1 Preliminaries 20513.2 Inonu–Wigner contractions 20613.3 Simple examples of Inonu–Wigner contractions 20613.4 The contraction U (2) → H4 21113.5 Conclusion 21613.6 Problems 217

14 Hydrogenic atoms 22114.1 Introduction 22114.2 Two important principles of physics 22214.3 The wave equations 22314.4 Quantization conditions 22414.5 Geometric symmetry SO(3) 22714.6 Dynamical symmetry SO(4) 23014.7 Relation with dynamics in four dimensions 23314.8 DeSitter symmetry SO(4, 1) 23514.9 Conformal symmetry SO(4, 2) 23814.10 Spin angular momentum 24314.11 Spectrum generating group 24514.12 Conclusion 24914.13 Problems 250

15 Maxwell’s equations 25915.1 Introduction 25915.2 Review of the inhomogeneous Lorentz group 26115.3 Subgroups and their representations 26215.4 Representations of the Poincare group 26415.5 Transformation properties 27015.6 Maxwell’s equations 27315.7 Conclusion 27515.8 Problems 275

16 Lie groups and differential equations 28416.1 The simplest case 28516.2 First order equations 28616.3 An example 290

Page 11: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Contents ix

16.4 Additional insights 29516.5 Conclusion 30216.6 Problems 303Bibliography 309Index 313

Page 12: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …
Page 13: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Preface

Many years ago I wrote the book Lie Groups, Lie Algebras, and Some of TheirApplications (New York: Wiley, 1974). That was a big book: long and difficult. Overthe course of the years I realized that more than 90% of the most useful materialin that book could be presented in less than 10% of the space. This realization wasaccompanied by a promise that some day I would do just that – rewrite and shrinkthe book to emphasize the most useful aspects in a way that was easy for studentsto acquire and to assimilate. The present work is the fruit of this promise.

In carrying out the revision I have created a sandwich. Lie group theory has itsintellectual underpinnings in Galois theory. In fact, the original purpose of whatwe now call Lie group theory was to use continuous groups to solve differential(continuous) equations in the spirit that finite groups had been used to solve alge-braic (finite) equations. It is rare that a book dedicated to Lie groups begins withGalois groups and includes a chapter dedicated to the applications of Lie grouptheory to solving differential equations. This book does just that. The first chapterdescribes Galois theory, and the last chapter shows how to use Lie theory to solvesome ordinary differential equations. The fourteen intermediate chapters describemany of the most important aspects of Lie group theory and provide applicationsof this beautiful subject to several important areas of physics and geometry.

Over the years I have profited from the interaction with many students throughcomments, criticism, and suggestions for new material or different approaches toold. Three students who have contributed enormously during the past few yearsare Dr. Jairzinho Ramos-Medina, who worked with me on Chapter 15 (Maxwell’sequations), and Daniel J. Cross and Timothy Jones, who aided this computer illit-erate with much moral and ebit ether support. Finally, I thank my beautiful wifeClaire for her gracious patience and understanding throughout this long creationprocess.

Robert Gilmore

xi

Page 14: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …
Page 15: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1

Introduction

Lie groups were initially introduced as a tool to solve or simplify ordinaryand partial differential equations. The model for this application wasGalois’ use of finite groups to solve algebraic equations of degree two,three, and four, and to show that the general polynomial equation ofdegree greater than four could not be solved by radicals. In this chapterwe show how the structure of the finite group that leaves a quadratic,cubic, or quartic equation invariant can be used to develop an algorithmto solve that equation.

1.1 The program of Lie

Marius Sophus Lie (1842–1899) embarked on a program that is still not complete,even after a century of active work. This program attempts to use the power of thetool called group theory to solve, or at least simplify, ordinary differential equations.

Earlier in nineteenth century, Evariste Galois (1811–1832) had used group theoryto solve algebraic (polynomial) equations that were quadratic, cubic, and quartic.In fact, he did more. He was able to prove that no closed form solution could beconstructed for the general quintic (or any higher degree) equation using only thefour standard operations of arithmetic (+, −, ×, ÷) as well as extraction of the nthroots of a complex number.

Lie initiated his program on the basis of analogy. If finite groups were requiredto decide on the solvability of finite-degree polynomial equations, then “infinitegroups” (i.e., groups depending continuously on one or more real or complex vari-ables) would probably be involved in the treatment of ordinary and partial differen-tial equations. Further, Lie knew that the structure of the polynomial’s invariance(Galois) group not only determined whether the equation was solvable in closedform, but also provided the algorithm for constructing the solution in the casethat the equation was solvable. He therefore felt that the structure of an ordinary

1

Page 16: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2 Introduction

differential equation’s invariance group would determine whether or not the equa-tion could be solved or simplified and, if so, the group’s structure would also providethe algorithm for constructing the solution or simplification.

Lie therefore set about the program of computing the invariance group of ordinarydifferential equations. He also began studying the structure of the children he begat,which we now call Lie groups.

Lie groups come in two basic varieties: the simple and the solvable. Simplegroups have the property that they regenerate themselves under commutation.Solvable groups do not, and contain a chain of subgroups, each of which is aninvariant subgroup of its predecessor.

Simple and solvable groups are the building blocks for all other Lie groups.Semisimple Lie groups are direct products of simple Lie groups. Nonsemisimple Liegroups are semidirect products of (semi)simple Lie groups with invariant subgroupsthat are solvable.

Not surprisingly, solvable Lie groups are related to the integrability, or at leastsimplification, of ordinary differential equations. However, simple Lie groups aremore rigidly constrained, and form such a beautiful subject of study in their ownright that much of the effort of mathematicians during the last century involved theclassification and complete enumeration of all simple Lie groups and the discussionof their properties. Even today, there is no complete classification of solvable Liegroups, and therefore nonsemisimple Lie groups.

Both simple and solvable Lie groups play an important role in the study of differ-ential equations. As in Galois’ case of polynomial equations, differential equationscan be solved or simplified by quadrature if their invariance group is solvable.On the other hand, most of the classical functions of mathematical physics arematrix elements of simple Lie groups, in particular matrix representations. Thereis a very rich connection between Lie groups and special functions that is stillevolving.

1.2 A result of Galois

In 1830 Galois developed machinery that allowed mathematicians to resolve ques-tions that had eluded answers for 2000 years or longer. These questions includedthe three famous challenges to ancient Greek geometers: whether by ruler andcompasses alone it was possible to

• square a circle,• trisect an angle,• double a cube.

Page 17: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.3 Group theory background 3

His work helped to resolve longstanding questions of an algebraic nature: whetherit was possible, using only the operations of arithmetic together with the operationof constructing radicals, to solve

• cubic equations,• quartic equations,• quintic equations.

This branch of mathematics, now called Galois theory, continues to provide pow-erful new results, such as supplying answers and solution methods to the followingquestions.

• Can an algebraic expression be integrated in closed form?• Under what conditions can errors in a binary code be corrected?

This beautiful machine, applied to a problem, provides important results. First, itcan determine whether a solution is possible or not under the conditions specified.Second, if a solution is possible, it suggests the structure of the algorithm that canbe used to construct the solution in a finite number of well-defined steps.

Galois’ approach to the study of algebraic (polynomial) equations involved twoareas of mathematics, now called field theory and group theory. One useful state-ment of Galois’ result is the following (Lang, 1984; Stewart, 1989).

Theorem A polynomial equation over the complex field is solvable by radicalsif and only if its Galois group G contains a chain of subgroups G = G0 ⊃ G1 ⊃· · · ⊃ Gω = I with the properties:

(i) Gi+1 is an invariant subgroup of Gi ;(ii) each factor group Gi/Gi+1 is commutative.

In the statement of this theorem the field theory niceties are contained in the term“solvable by radicals.” This means that in addition to the four standard arithmeticoperations +, −, ×, ÷ one is allowed the operation of taking nth roots of complexnumbers.

The principal result of this theorem is stated in terms of the structure of the groupthat permutes the roots of the polynomial equation among themselves. Determiningthe structure of this group is a finite, and in fact very simple, process.

1.3 Group theory background

A group G is defined as follows. It consists of a set of operations G = {g1, g2, . . . },called group operations, together with a combinatorial operation, ·, called groupmultiplication, such that the following four axioms are satisfied.

Page 18: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4 Introduction

(i) Closure: if gi ∈ G, g j ∈ G, then gi · g j ∈ G.(ii) Associativity: for all gi ∈ G, g j ∈ G, gk ∈ G,

(gi · g j ) · gk = gi · (g j · gk)

(iii) Identity: there is a group operation, I (identity operator), with the property that

gi · I = gi = I · gi

(iv) Inverse: every group operation gi has an inverse (called g−1i ):

gi · g−1i = I = g−1

i · gi

The Galois group G of a general polynomial equation

(z − z1)(z − z2) · · · (z − zn) = 0

zn − I1zn−1 + I2zn−2 + · · · + (−1)n In = 0 (1.1)

is the group that permutes the roots z1, z2, . . . , zn among themselves and leaves theequation invariant:

z1

z2...

zn

−→

zi1

zi2

...zin

(1.2)

This group, called the permutation group Pn or the symmetric group Sn , has n!group operations. Each group operation is some permutation of the roots of thepolynomial; the group multiplication is composition of successive permutations.

The permutation group Sn has a particularly convenient representation in termsof n × n matrices. These matrices have one nonzero element, +1, in each rowand each column. For example, the 6 = 3! 3 × 3 matrices for the permutationrepresentation of S3 are

I →1 0 0

0 1 00 0 1

(123) →

0 1 0

0 0 11 0 0

(321) →

0 0 1

1 0 00 1 0

(12) →0 1 0

1 0 00 0 1

(23) →

1 0 0

0 0 10 1 0

(13) →

0 0 1

0 1 01 0 0

(1.3)

Page 19: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.3 Group theory background 5

The symbol (123) means that the first root, z1, is replaced by z2, z2 is replaced byz3, and z3 is replaced by z1

z1

z2

z3

(123)−→

z2

z3

z1

(1.4)

The permutation matrix associated with this group operation carries out the samepermutation

z2

z3

z1

=

0 1 0

0 0 11 0 0

z1

z2

z3

(1.5)

More generally, a matrix representation of a group is a mapping of each groupoperation into an n × n matrix that preserves the group multiplication operation

gi · g j = gi · g j

↓ ↓ ↓ ↓�(gi ) × �(g j ) = �(gi · g j )

(1.6)

Here · represents the multiplication operation in the group (i.e., composition of sub-stitutions in Sn) and × represents the multiplication operation among the matrices(i.e., matrix multiplication). The condition (1.6) that defines a matrix representa-tion of a group, G → �(G), is that the product of matrices representing two groupoperations (�(gi ) × �(g j )) is equal to the matrix representing the product of theseoperations in the group (�(gi · g j )) for all group operations gi , g j ∈ G.

This permutation representation of S3 is 1:1, or a faithful representation of S3,since knowledge of the 3 × 3 matrix uniquely identifies the original group operationin S3.

A subgroup H of the group G is a subset of group operations in G that is closedunder the group multiplication in G.

Example The subset of operations I, (123), (321) forms a subgroup of S3. Thisparticular subgroup is denoted A3 (alternating group). It consists of those oper-ations in S3 whose determinants, in the permutation representation, are +1. Thegroup S3 has three two-element subgroups:

S2(12) = {I, (12)}S2(23) = {I, (23)}S2(13) = {I, (13)}

as well as the subgroup consisting of the identity alone. The alternating subgroupA3 ⊂ S3 and the three two-element subgroups S2(i j) of S3 are illustrated in Fig. 1.1.

Page 20: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6 Introduction

S3

A3 S2(12) S2(13) S2(23)

I

Figure 1.1. Subgroups of S3.

The set of operations I, (123), (12) does not constitute a subgroup because productsof operations in this subset do not lie in this subset: (123) · (123) = (321), (123) ·(12) = (23), etc. In fact, the two operations (123), (12) generate S3 by takingproducts of various lengths in various order.

A group G is commutative, or abelian, if

gi · g j = g j · gi (1.7)

for all group operations gi , g j ∈ G.

Example S3 is not commutative, while A3 is. For S3 we have

(12)(23) = (321)(123) �= (321)

(23)(12) = (123)(1.8)

Two subgroups of G, H1 ⊂ G and H2 ⊂ G are conjugate if there is a groupelement g ∈ G with the property

gH1g−1 = H2 (1.9)

Example The subgroups S2(12) and S2(13) are conjugate in S3 since

(23)S2(12)(23)−1 = (23) {I, (12)} (23)−1 = {I, (13)} = S2(13) (1.10)

On the other hand, the alternating group A3 ⊂ S3 is self-conjugate, since anyoperation in G = S3 serves merely to permute the group operations in A3 amongthemselves:

(23)A3(23)−1 = (23) {I, (123), (321)} (23)−1 = {I, (321), (123)} = A3 (1.11)

A subgroup H ⊂ G which is self-conjugate under all operations in G is calledan invariant subgroup of G, or normal subgroup of G.

Page 21: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.3 Group theory background 7

S3

A3 S2

I

Figure 1.2. Subgroups of S3, combining conjugate subgroups.

In constructing group-subgroup diagrams, it is customary to show only one ofthe mutually conjugate subgroups. This simplifies Fig. 1.1 to Fig. 1.2.

A mapping f from a group G with group operations g1, g2, . . . and group multi-plication · to a group H with group operations h1, h2, . . . and group multiplication× is called a homomorphism if it preserves group multiplication:

gi · g j = gi · g j

↓ ↓ ↓ ↓f (gi ) × f (g j ) = f (gi · g j )

(1.12)

The group H is called a homomorphic image of G. Several different group ele-ments in G may map to a single group element in H . Every element hi ∈ H hasthe same number of inverse images g j ∈ G. If each group element h ∈ H has aunique inverse image g ∈ G (h1 = f (g1) and h2 = f (g2), h1 = h2 ⇒ g1 = g2) themapping f is an isomorphism.

Example The 3:1 mapping f of S3 onto S2 given by

S3f−→ S2

I, (123), (321) −→ I(12), (23), (31) −→ (12)

(1.13)

is a homomorphism.

Example The 1:1 mapping of S3 onto the six 3 × 3 matrices given in (1.3) is anisomorphism.

Remark Homomorphisms of groups to matrix groups, such as that in (1.3), arecalled matrix representations. The representation in (1.3) is 1:1 or faithful, sincethe mapping is an isomorphism.

Remark Isomorphic groups are indistinguishable at the algebraic level. Thus,when an isomorphism exists between a group and a matrix group, it is often

Page 22: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8 Introduction

preferable to study the matrix representation of the group since the properties ofmatrices are so well known and familiar. This is the approach we pursue in Chapter3 when discussing Lie groups.

If H is a subgroup of G, it is possible to write every group element in G as aproduct of an element h in the subgroup H with a group element in a “quotient,”or coset (denoted G/H ). A coset is a subset of G. If the order of G is |G| (S3

has 3! = 6 group elements, so the order of S3 is 6), then the order of G/H is|G/H | = |G|/|H |. For example, for subgroups H = A3 = {I, (123), (321)} andH = S2(23) = {I, (23)} we have

G/H · H = G{I, (12)} · {I, (123), (321)} = {I, (123), (321), (12), (13), (23)}

{I, (12), (321)} · {I, (23)} = {I, (23), (12), (123), (321), (13)}(1.14)

The choice of the |G|/|H | group elements in the quotient space is not unique. Forthe subgroup A3 we could equally well have chosen G/H = S3/A3 = {I, (13)}or {I, (23)}; for S2(23) we could equally well have chosen G/H = S3/S2(23) ={I, (123), (321)}.

In general, it is not possible to choose the group elements in G/H so that theyform a subgroup of G. However, if H is an invariant subgroup of G, it is alwayspossible to choose the group elements in the quotient space G/H in such a waythat they form a subgroup in G. This group is called the factor group, also denotedG/H . Since A3 is an invariant subgroup of S3, the coset S3/A3 is a group, andthis group is isomorphic to S2. More generally, if H is an invariant subgroup of G,then the group G is the direct product of the invariant subgroup H with the factorgroup G/H : G = G/H × H .

1.4 Approach to solving polynomial equations

The general nth degree polynomial equation over the complex field can be expressedin terms of the kth order symmetric functions Ik of the roots zi as follows:

(z − z1)(z − z2) · · · (z − zn) = zn − I1zn−1 + I2zn−2 − · · · + (−)n In = 0

I1 =n∑

i=1

zi = z1 + z2 + · · · + zn

I2 =n∑

i< j

zi z j = z1z2 + z1z3 + · · · + z1zn + z2z3 + · · · + zn−1zn

......

... (1.15)

In =n∑

i< j<···<k

zi z j · · · zk = z1z2 · · · zn

Page 23: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.4 Approach to solving polynomial equations 9

The n functions Ik (k = 1, 2, . . . , n) of the n roots (z1, z2, . . . , zn) are symmetric:this means that they are invariant under the Galois group Sn of this equation. Further,any function f (z1, z2, . . . , zn) that is invariant under Sn can be written as a functionof the invariants I1, I2, . . . , In . The invariants are easily expressed in terms ofthe roots (see Eq. (1.15)). The inverse step, that of expressing the roots in termsof the invariants, or coefficients of the polynomial equation, is the problem ofsolving the polynomial equation.

Galois’ theorem states that a polynomial equation over the complex field can besolved if and only if its Galois group G contains a chain of subgroups (Lang, 1984;Stewart, 1989)

G = G0 ⊃ G1 ⊃ · · · ⊃ Gω = I (1.16)

with the properties

(i) Gi+1 is an invariant subgroup of Gi ,(ii) Gi/Gi+1 is commutative.

The procedure for solving polynomial equations is constructive. First, the lastgroup-subgroup pair in this chain is isolated: Gω−1 ⊃ Gω = I . The charactertable for the commutative group Gω−1/Gω = Gω−1 is constructed. This liststhe |Gω−1|/|Gω| inequivalent one-dimensional representations of Gω−1. Linearcombinations of the roots zi are identified that transform under (i.e., are basisfunctions for) the one-dimensional irreducible representations of Gω−1. Thesefunctions are

(i) symmetric under Gω = I ,(ii) not all symmetric under Gω−1.

Next, the next pair of groups Gω−2 ⊃ Gω−1 is isolated. Starting from the set offunctions in the previous step, one constructs from them functions that are

(i) symmetric under Gω−1,(ii) not all symmetric under Gω−2.

This bootstrap procedure continues until the last group-subgroup pair G = G0 ⊃G1 is treated. At this stage the last set of functions can be solved by radicals.These solutions are then fed down the group-subgroup chain until the last pairGω−1 ⊃ Gω = I is reached. When this occurs, we obtain a linear relation betweenthe roots z1, z2, . . . , zn and functions of the invariants I1, I2, . . . , In .

This brief description will now be illustrated by using Galois theory to solvequadratic, cubic, and quartic equations by radicals.

Page 24: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10 Introduction

S2 = I, (12)

I

Figure 1.3. Group chain for the Galois group S2 of the general quadratic equation.

1.5 Solution of the quadratic equation

The general quadratic equation has the form

(z − r1)(z − r2) = z2 − I1z + I2 = 0

I1 = r1 + r2 (1.17)

I2 = r1r2

The Galois group is S2 with subgroup chain shown in Fig. 1.3.The character table for the commutative group S2 is

I (12) Basis functions�1 1 1 u1 = r1 + r2

�2 1 −1 u2 = r1 − r2

(1.18)

Linear combinations of the roots that transform under the one-dimensional irre-ducible representations �1, �2 are[

u1

u2

]=

[1 11 −1

] [r1

r2

]=

[r1 + r2

r1 − r2

](1.19)

That is, the function r1 − r2 is mapped into itself by the identity, and into its negativeby (12)

I−→ +(r1 − r2)(r1 − r2)

}(1.20)

(12)−→ (r2 − r1) = −(r1 − r2)

As a result, (r1 − r2) is not symmetric under the action of the group S2. It transformsunder the irreducible representation �2, not the identity representation �1.

Since the square (r1 − r2)2 is symmetric (transforms under the identity repre-sentation of S2), it can be expressed in terms of the two invariants I1, I2 as follows

(r1 − r2)2 = r21 − 2r1r2 + r2

2

= r21 + 2r1r2 + r2

2 − 4r1r2 = I 21 − 4I2 = D (1.21)

Page 25: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.6 Solution of the cubic equation 11

where D is the discriminant of the quadratic equation. Since (r1 − r2) = ±√D,

we have the following linear relation between roots and symmetric functions:[1 11 −1

] [r1

r2

]=

[I1

±[I 21 − 4I2]1/2

](1.22)

Inversion of a square matrix involves a sequence of linear operations. We find[r1

r2

]= 1

2

[1 11 −1

] [I1

±√D

](1.23)

The roots are

r1, r2 = 1

2(I1 ±

√D) (1.24)

We solve the quadratic equation by another procedure, which we use in thefollowing two sections to simplify the cubic and quartic equations. This method isto move the origin to the mean value of the roots by defining a new variable, x , interms of z (see Eq. (1.15)) by a Tschirnhaus transformation

z = x + 1

2I1 (1.25)

The quadratic equation for the new coordinate is

x2 − I ′1x + I ′

2 = x2 + I ′2 = 0

I ′1 = 0 (1.26)

I ′2 = I2 −

(1

2I1

)2

The solutions for this auxiliary equation are constructed by radicals

x = ±√

−I ′2 (1.27)

from which we easily construct the roots of the original equation

r1,2 = 1

2

(I1 ±

√I 21 − 4I2

)(1.28)

1.6 Solution of the cubic equation

The general cubic equation has the form

(z − s1) (z − s2)(z − s3) = z3 − I1z2 + I2z − I3 = 0I1 = s1 + s2 + s3

I2 = s1s2 + s1s3 + s2s3 (1.29)I3 = s1s2s3

Page 26: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12 Introduction

S3

A3 S2

I

Figure 1.4. Group chain for the Galois group S3 of the general cubic equation.

The Galois group is S3 with subgroup chain shown in Fig. 1.4.Since A3 is an invariant subgroup of S3 and I is an invariant subgroup of A3, the

first of the two conditions of the Galois theorem (there exists a chain of invariantsubgroups) is satisfied. Since S3/A3 = S2 is commutative and A3/I = A3 is com-mutative, the second condition is also satisfied. This means that the general cubicequation can be solved.

We begin the solution with the last group-subgroup pair in this chain: A3 ⊃ I .The character table for the commutative group A3 is

I (123) (321) Basis functions�1 1 1 1 v1 = s1 + s2 + s3

�2 1 ω ω2 v2 = s1 + ωs2 + ω2s3

�3 1 ω2 ω v3 = s1 + ω2s2 + ωs3

(1.30)

where

ω3 = +1 ω = e2π i/3 = −1 + i√

3

2(1.31)

Linear combinations of the roots that transform under each of the three one-dimensional irreducible representations are easily constructed

v1

v2

v3

=

1 1 1

1 ω ω2

1 ω2 ω

s1

s2

s3

=

s1 + s2 + s3

s1 + ωs2 + ω2s3

s1 + ω2s2 + ωs3

(1.32)

For example, the action of (123)−1 on v2 is

(123)−1v2 = (321)v2 = (321)(s1 + ωs2 + ω2s3)

= s3 + ωs1 + ω2s2 = ω(s1 + ωs2 + ω2s3) = ωv2 (1.33)

Page 27: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.6 Solution of the cubic equation 13

Since v1 is symmetric under both A3 and S3, it can be expressed in terms of theinvariants Ik :

v1 = I1 (1.34)

The remaining functions, v2 and v3, are symmetric under I but not under A3.We now proceed to the next group-subgroup pair: S3 ⊃ A3. To construct func-

tions symmetric under A3 but not under S3 we observe that the cubes of v2 and v3

are symmetric under A3 but not under S3:

(12)(v2)3 = (12)(s1 + ωs2 + ω2s3)3 = (s2 + ωs1 + ω2s3)3

= ω3(s1 + ω2s2 + ωs3)3 = (v3)3

(1.35)(12)(v3)3 = (12)(s1 + ω2s2 + ωs3)3 = (s2 + ω2s1 + ωs3)3

= ω6(s1 + ωs2 + ω2s3)3 = (v2)3

Since S2 = S3/A3 permutes the functions v32 and v3

3, it is the Galois group of theresolvent quadratic equation whose two roots are v3

2 and v33. This equation has the

form

(x − v32)(x − v3

3) = x2 − J1x + J2 = 0

J1 = v32 + v3

3 (1.36)

J2 = v32v

33

Since J1, J2 are symmetric under S3, they can be expressed in terms of the invariantsI1, I2, I3 of the original cubic. Since J1 has order 3 and J2 has order 6, we can writethe invariants of the quadratic equation (1.36) in terms of the invariants I1, I2, I3

(of orders 1, 2, 3) of the original cubic equation (1.29) as follows:

J1 =∑

i+2 j+3k=3

Ai jk I i1 I j

2 I k3

(1.37)J2 =

∑i+2 j+3k=6

Bi jk I i1 I j

2 I k3

These relations can be computed, but they simplify considerably if I1 = s1 + s2 +s3 = 0. This can be accomplished by shifting the origin using a Tschirnhaus trans-formation as before, with

z = y + 1

3I1 (1.38)

Page 28: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14 Introduction

The auxiliary cubic equation has the structure

y3 − 0y2 + I ′2 y − I ′

3 = 0

I ′1 = s ′

1 + s ′2 + s ′

3 = 0

I ′2 = s ′

1s ′2 + s ′

1s ′3 + s ′

2s ′3 = I2 − (1/3)I 2

1

I ′3 = s ′

1s ′2s ′

3 = I3 − (1/3)I2 I1 + (2/27)I 31

(1.39)

The invariants J1 = v32 + v3

3 and J2 = v32v

33 can be expressed in terms of I ′

2, I ′3

as follows

J1 = v32 + v3

3 = −27I ′3

J2 = v32v

33 = −27I ′3

2

(1.40)

The resolvent quadratic equation whose solution provides v32, v

33 is

x2 − (−27I ′3)x + (−27I ′3

2 ) = 0 (1.41)

The two solutions to this resolvent quadratic equation are

v32, v

33 = −27

2I ′3 ± 1

2

[(27I ′

3)2 + 4 × 27I ′32

]1/2(1.42)

The roots v2 and v3 are obtained by taking cube roots of v32 and v3

3.

v2

v3=

{−27

2I ′3 ± 1

2

[(27I ′

3)2 + 4 × 27I ′32

]1/2}1/3

Finally, the roots s1, s2, s3 are linearly related to v1, v2, v3 by

1 1 1

1 ω ω2

1 ω2 ω

s1

s2

s3

=

v1

v2

v3

(1.43)

Again, determination of the roots is accomplished by solving a set of simultaneouslinear equations

s1

s2

s3

= 1

3

1 1 1

1 ω2 ω

1 ω ω2

I1

v2

v3

= 1

3

v1 + v2 + v3

v1 + ω2v2 + ωv3

v1 + ωv2 + ω2v3

(1.44)

Page 29: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.7 Solution of the quartic equation 15

1.7 Solution of the quartic equation

The general quartic equation has the form

(z − t1)(z − t2)(z − t3)(z − t4) = z4 − I1z3 + I2z2 − I3z + I4 = 0

I1 = t1 + t2 + t3 + t4

I2 = t1t2 + t1t3 + t1t4 + t2t3 + t2t4 + t3t4 (1.45)

I3 = t1t2t3 + t1t2t4 + t1t3t4 + t2t3t4

I4 = t1t2t3t4

For later convenience we will construct the auxiliary quartic by shifting the originof coordinates through the Tschirnhaus transformation z = z′ + 1

4 I1

(z′ − t1)(z′ − t2)(z′ − t3)(z′ − t4) = z′4 − I ′1z′3 + I ′

2z′2 − I ′3z′ + I ′

4 = 0

I ′1 = 0

I ′2 = I2 − 3

8 I 21

I ′3 = I3 − 1

2 I2 I1 + 18 I 3

3 (1.46)

I ′4 = I4 − 1

4 I3 I1 + 116 I2 I 2

1 − 344 I 4

1

The Galois group is S4. This has the subgroup chain shown in Fig. 1.5. Thealternating group A4 consists of the twelve group operations that have determinant+1 in the permutation matrix representation. The four-group (vierergruppe, Kleingroup, Klein four-group) V4 is {I, (12)(34), (13)(24), (14)(23)}. The chain

S4 ⊃ A4 ⊃ V4 ⊃ I

S4

V8 A4 S3

V4 A3 S2

I

Figure 1.5. Group chain for the Galois group S4 of the general quartic equation.

Page 30: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16 Introduction

satisfies both conditions of Galois’ theorem. In particular

(i) A4 is invariant in S4 and S4/A4 = S2,(ii) V4 is invariant in A4 and A4/V4 = C3 = {I, (234), (432)},

(iii) I is invariant in V4 and V4/I = V4 = {I, (12)(34), (13)(24), (14)(23)}.

We again begin at the end of the chain with the commutative group V4 whosecharacter table is

I (12)(34) (13)(24) (14)(23) Basis functions�1 1 1 1 1 w1 = t1 + t2 + t3 + t4�2 1 1 −1 −1 w2 = t1 + t2 − t3 − t4�3 1 −1 1 −1 w3 = t1 − t2 + t3 − t4�4 1 −1 −1 1 w4 = t1 − t2 − t3 + t4

(1.47)

The linear combinations of these roots that transform under each of the irreduciblerepresentations are

w1

w2

w3

w4

=

1 1 1 11 1 −1 −11 −1 1 −11 −1 −1 1

t1t2t3t4

=

t1 + t2 + t3 + t4t1 + t2 − t3 − t4t1 − t2 + t3 − t4t1 − t2 − t3 + t4

(1.48)

These basis vectors are symmetric under I but the basis vectors w2, w3, w4 are notsymmetric under V4.

We now advance to the next group-subgroup pair: A4 ⊃ V4. It is a simple matterto construct from these linear combinations functions that are

(i) symmetric under V4,(ii) permuted among themselves by A4 and the group A4/V4.

These functions are w1 = I1 and w22, w

23, w

24. In the coordinate system in which the

sum of the roots is zero, the three functions w22, w

23, w

24 are

w22 = (t ′

1 + t ′2 − t ′

3 − t ′4)2 = 22(t ′

1 + t ′2)2 = −4(t ′

1 + t ′2)(t ′

3 + t ′4)

w23 = (t ′

1 − t ′2 + t ′

3 − t ′4)2 = 22(t ′

1 + t ′3)2 = −4(t ′

1 + t ′3)(t ′

2 + t ′4)

w24 = (t ′

1 − t ′2 − t ′

3 + t ′4)2 = 22(t ′

1 + t ′4)2 = −4(t ′

1 + t ′4)(t ′

2 + t ′3)

(1.49)

It is clear that the three w2j ( j = 2, 3, 4) are permuted among themselves by the

factor group C3 = A4/V4, which is a subgroup of the Galois group of a resolvent

Page 31: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.8 The quintic cannot be solved 17

cubic equation whose three roots are w22, w

23, w

24:

(y − w22)(y − w2

3)(y − w24) = y3 − J1 y2 + J2 y − J3 = 0

J1 = w22 + w2

3 + w24

J2 = w22w

23 + w2

2w24 + w2

3w24 (1.50)

J3 = w22w

23w

24

Since the three Jk are invariant under C3, they can be expressed in terms of thesymmetric functions (coefficients) of the original quartic equation (1.45) or (1.46).We find by direct calculation

J1 = (−4)1(2I ′2)

J2 = (−4)2(I ′22 − 4I ′

4) (1.51)

J3 = (−4)3(−I ′23 )

This cubic equation is solved by proceeding to the first group-subgroup pair in thechain: S4 ⊃ A4, with S4/A4 = S2. The cubic is solved by introducing the resolventquadratic, as described in the previous section.

If the three solutions of the resolvent cubic equation are called y2, y3, y4, thenthe functions w2, w3, w4 are

w2 = ±√y2

w3 = ±√y3 (1.52)

w4 = ±√y4

A simple computation shows that w2w3w4 = 8I ′3. The signs ±√

y j are chosen sothat their product is 8I ′

3. The simple linear relation between the roots ti and theinvariants I1 and functions w j (I ′) is easily inverted:

t1t2t3t4

= 1

4

1 1 1 11 1 −1 −11 −1 1 −11 −1 −1 1

I1

w2

w3

w4

(1.53)

where the w j are square roots of the solutions of the resolvent cubic equation whosecoefficients are functions (1.51) of the auxiliary quartic equation.

1.8 The quintic cannot be solved

To investigate whether the typical quintic equation is solvable (and if so, how), itis sufficient to study the structure of its Galois group S5. The alternating subgroup

Page 32: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

18 Introduction

A5 of order 60 is an invariant subgroup. S5 has no invariant subgroups except A5

and I . Further, A5 has only I as an invariant subgroup. The only chain of invariantsubgroups in S5 is

S5 ⊃ A5 ⊃ I (1.54)

Although S5/A5 = S2 is commutative, A5/I = A5 is not. Therefore the quinticequation does not satisfy the conditions of Galois’ theorem, so cannot be solved byradicals. General polynomial equations of degree greater than five also cannot besolved by radicals.

1.9 Example

To illustrate the solution of a polynomial equation by radicals using the machineryintroduced above, we begin with a quartic equation whose roots are: −2, −1, 2, 5.We will carry out the algorithm on the corresponding quartic equation. As weproceed through the algorithm, we indicate the numerical values of the functionspresent. Those values that would not be available at each stage of the computationare indicated by arrows.

The fourth degree equation is

(z + 2)(z + 1)(z − 2)(z − 5) = z4 − 4z3 − 9z2 + 16z + 20 = 0

I1 = 4

I2 = −9

I3 = −16

I4 = 20

(1.55)

We now center the roots by making a Tschirnhaus transformation

z = z′ + 1

4I1 = z′ + 1

The new roots are −3, −2, 1, 4 and the auxiliary quartic equation is

(z′ + 1)4 − 4(z′ + 1)3 − 9(z′ + 1)2 + 16(z′ + 1) + 20

= (z′ + 3)(z′ + 2)(z′ − 1)(z′ + 4) = z′4 − 15z′2 − 10z′ + 24 = 0

I ′1 = 0

I ′2 = −15

I ′3 = 10

I ′4 = 24

(1.56)

Page 33: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.9 Example 19

Next, we introduce linear combinations of the four roots t ′1 = −3, t ′

2 = −2, t ′3 =

1, t ′4 = 4

w1

w2

w3

w4

=

1 1 1 11 1 −1 −11 −1 1 −11 −1 −1 1

t ′1

t ′2

t ′3

t ′4

0−10−4

2

(1.57)

Observe at this stage that w2w3w4 = 8I ′3.

Now we compute the squares of these numbers

w22 = y2 → (−10)2 = 100

w23 = y3 → (−4)2 = 16

w24 = y4 → (+2)2 = 4

(1.58)

From the auxiliary quartic (1.56) the resolvent cubic equation can be constructed

y3 − J1 y2 + J2 y − J3 = 0

J1 = (−4)1[2I ′2] = (−4)(−30) = 120

J2 = (−4)2[I ′22 − 4I ′

4] = 16(225 − 4 × 24) = 2064

J3 = (−4)3[−I ′23 ] = (−64)(−100) = 6400

(1.59)

Note that these are the coefficients of the equation

(y − 22)(y − 42)(y − 102) = y3 − 120y2 + 2064y − 6400 = 0 (1.60)

Now we construct the cubic equation auxiliary to this cubic. This is done by definingy = y′ + 1

3 J1 = y′ + 13 (4 + 16 + 100) = y′ + 40. The roots are now

y′1 = y1 − 40 → 4 − 40 = −36

y′2 = y2 − 40 → 16 − 40 = −24

y′3 = y3 − 40 → 100 − 40 = 60

(1.61)

The auxiliary cubic is

y′3 − J ′1 y′2 + J ′

2 y′ − J ′3 = 0

J ′1 = 0

J ′2 = −2736

J ′3 = 51840

(1.62)

We note that these are the coefficients of the equation

(y′ + 36)(y′ + 24)(y′ − 60) = 0 (1.63)

Page 34: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

20 Introduction

These coefficients are obtained directly from the coefficients of the resolvent cubic,in principle without knowledge of the values of the roots.

Next we construct the functions v1, v2, v3v1

v2

v3

=

1 1 1

1 ω ω2

1 ω2 ω

s1

s2

s3

−→

s1 = −24s2 = −36s3 = 60

0

−36 − i48√

3−36 + i48

√3

(1.64)

We can express v32 + v3

3, v32v

33 in terms of J ′

2, J ′3:

v32 + v3

3 = 27J ′3 = 27 × 518400 = 1399680

v32v

33 = −27J ′3

2 = −27 × (−2736)3 = 552983334912(1.65)

The quadratic resolvent for the auxiliary cubic is

x2 − 1399680x + 552983334912 = 0

K1 = 1399680K2 = 552983334912

(1.66)

A Tschirnhaus transformation x = x ′ + 12 K1 produces the auxiliary quadratic

x ′2 + 63207309312 = 0

K ′1 = 0

K2 = 63207309312(1.67)

The square of the difference between the two roots of this equation is easily deter-mined:

x ′1 − x ′

2 = x1 − x2 = ±2√−K2 = ±2i

√K2

= ±2i × 145152√

3 = ±i × 290304√

3(1.68)

Now we work backwards. The solutions of the resolvent quadratic are given bythe linear equation[

x1

x2

]= 1

2

[1 11 −1

] [K1 = 1399680

2√−K2 = i × 290304

√3

]

= 699840 ± i × 145152√

3 (1.69)

These solutions are the values of v32 and v3

3:

v32 = 699840 + i 145152

√3

v33 = 699840 − i 145152

√3

(1.70)

Page 35: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.10 Conclusion 21

Next, we take cube roots of these quantities. These are unique up to a factor of ω

v2 = −36 + i48√

3

v3 = −36 − i48√

3(1.71)

The values y1, y2, y3 of the resolvent cubic are complex linear combinations ofv2, v3

y1

y2

y3

= 1

3

1 1 1

1 ω2 ω

1 ω ω2

J1 = 120

v2 = −36 + i 48√

3v3 = −36 − i 48

√3

=

16

1004

(1.72)

w22 = y1 w2 = ±4

w23 = y2 w3 = ±10

w24 = y3 w4 = ±2

(1.73)

Since w2w3w4 = 8I ′3 = 80, an even number of these signs must be negative. The

simplest choice is to take all signs positive. This is different from the results shownin (1.57); this choice of signs serves only to permute the order of the roots. In thefinal step, the roots of the original quartic are linear combinations of w2, w3, w4

and the linear symmetric function w1 = I1

x1

x2

x3

x4

= 1

4

1 1 1 11 1 −1 −11 −1 1 −11 −1 −1 1

I1 = 4w2 = 4w3 = 10w4 = 2

=

20/4 = +5−4/4 = −18/4 = +2

−8/4 = −2

(1.74)

We have recovered the four roots of the original quartic equation using Galois’algorithm, based on the structure of the invariance group S4 of the quartic equation.

1.10 Conclusion

One of the many consequences of Galois’ study of algebraic equations and thesymmetries that leave them invariant is the proof that an algebraic equation can besolved by radicals if and only if its invariance group has a certain structure. Thisproof motivated Lie to search for analogous results involving differential equationsand their symmetry groups, now called Lie groups. We have described in this chapterhow the structure of the discrete symmetry group (Galois group) of a polynomialequation determines whether or not that equation can be solved by radicals. If theanswer is “yes,” we have shown how the structure of the Galois group determinesthe structure of the algorithm for constructing solutions. This algorithm has beendeveloped for the cubic and quartic equations, and illustrated by example for aquartic equation.

Page 36: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

22 Introduction

1.11 Problems

1. Compute S4/A4, A4/V4, V4 and show that they are commutative.

2. Construct the group V8 with the property S4 ⊃ V8 ⊃ V4 (see Fig. 1.5). (Hint: includea cyclic permutation).

3. For the cubic equation z3 − 7z + 6 = 0 ((z − 1)(z − 2)(z + 3) = 0) show

I1 = 0 J1 = 162I2 = −7 J2 = 9261I3 = −6

Show that the resolvent equation for v32, v

33 is (x − v3

2)(x − v33) = x2 − 162x +

9261 = 0. Solve this quadratic to find v32, v

33 = 81 ± i30

√3, so that v2, v3 = 1

2 (3 ±i5

√3). Invert Eq. (1.43) to determine the three roots of the original equation:

(1, 2, −3).

4. Ruler and compass can be used to construct an orthogonal pair of axes in the plane(Euclid). A compass is used to establish a unit of length 1. Then by ruler and compass itis possible to construct intervals of length x , where x is integer. From there it is possibleto construct intervals of lengths x + y, x − y, x y and x/y using ruler and compass.It is also possible to construct intervals of length

√x by these means. The set of all

numbers that can be constructed from integers by addition, subtraction, multiplication,division, and extraction of square roots is called the set of constructable numbers. Thisforms a subset of the numbers x + iy = (x, y) in the complex plane. If a number is(is not) constructable the point representing that number can (cannot) be constructedby ruler and compass alone. Since repeated square roots can be taken, a constructablenumber satisfies an algebraic equation of degree K with integer coefficients, whereK = 2n must be some power of two.The three geometry problems of antiquity are as follows.a. Square a circle? For the circle of radius 1 the area is π . Squaring a circle means

finding an interval of length x , where x2 − π = 0. This is of degree 2 but π is notrational (not even algebraic). Argue that it is impossible to square the circle byruler and compass alone.

b. Double the cube? A cube with edge length 1 has volume 13 = 1. A cube withtwice the volume has edge length x , where x satisfies x3 − 2 = 0. Although thecoefficients are integers this equation is of degree 3 �= 2n for any integer n. Arguethat it is impossible to double the volume of a cube by ruler and compass alone.

c. Trisect an angle? If 3θ is some angle, the trigonometric functions of 3θ and13 (3θ ) = θ are related by

ei3θ = (eiθ )3

cos(3θ ) + i sin(3θ ) = (cos3(θ ) − 3 cos(θ ) sin2(θ ))

+ i(3 cos2(θ ) sin(θ ) − sin3(θ ))

Page 37: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

1.11 Problems 23

In particular

cos(3θ ) = 4 cos3(θ ) − 3 cos(θ )

Whether cos(3θ ) is rational or irrational, the equation for cos(θ ):

4 cos3(θ ) − 3 cos(θ ) − cos(3θ ) = 0

is cubic. Argue that it is impossible to trisect an angle unless cos(3θ ) is such that thecubic factors into the form (x2 + ax + b)(x + c) = 0, where a, b, c are rational.For example, if cos(3θ ) = 0, c = 0 so that a = 0 and b = −3/4. Then cos(θ ) = 0or ±√

3/2 for 3θ = π/2 (+), 3π/2 (0), or 5π/2 (−).

Page 38: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2

Lie groups

Lie groups are beautiful, important, and useful because they have one footin each of the two great divisions of mathematics – algebra and geometry.Their algebraic properties derive from the group axioms. Their geometricproperties derive from the identification of group operations with pointsin a topological space. The rigidity of their structure comes from thecontinuity requirements of the group composition and inversion maps. Inthis chapter we present the axioms that define a Lie group.

2.1 Algebraic properties

The algebraic properties of a Lie group originate in the axioms for a group.

Definition A set gi , g j , gk, . . . (called group elements or group operations)together with a combinatorial operation ◦ (called group multiplication) form agroup G if the following axioms are satisfied.

(i) Closure: if gi ∈ G, g j ∈ G, then gi ◦ g j ∈ G.(ii) Associativity: gi ∈ G, g j ∈ G, gk ∈ G, then

(gi ◦ g j ) ◦ gk = gi ◦ (g j ◦ gk)

(iii) Identity: there is an operator e (the identity operation) with the property that forevery group operation gi ∈ G

gi ◦ e = gi = e ◦ gi

(iv) Inverse: every group operation gi has an inverse (called g−1i ) with the property

gi ◦ g−1i = e = g−1

i ◦ gi

24

Page 39: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2.2 Topological properties 25

Example We consider the set of real 2 × 2 matrices SL(2; R):

A =[

α β

γ δ

]det(A) = αδ − βγ = +1 (2.1)

where α, β, γ, δ are real numbers. This set forms a group under matrix multiplica-tion. This is verified by checking that the group axioms are satisfied.

(i) Closure if A and B are real 2 × 2 matrices, and A ◦ B = C (where ◦ now representsmatrix multiplication), then C is a real 2 × 2 matrix. If det(A) = +1 and det(B) = +1,then det(C) = det(A) det(B) = +1.

(ii) Associativity: (A ◦ B) ◦ C and A ◦ (B ◦ C) are given explicitly by

∑k

(∑j

Ai j B jk

)Ckl

?=∑

j

Ai j

(∑k

B jkCkl

)∑

k

∑j

Ai j B jkCklok=

∑j

∑k

Ai j B jkCkl

(2.2)

(iii) Identity: the unit matrix is the identity

e −→ I2 =[

1 00 1

](iv) Inverse: the unique matrix inverse of A is[

A11 A12

A21 A22

]→

[A11 A12

A21 A22

]−1

= 1

A11 A22 − A12 A21

[A22 −A12

−A21 A11

]

2.2 Topological properties

The geometric structure of a Lie group comes from the identification of each elementin the group with a point in some topological space: gi → g(x). In other words, theindex i depends on one or more continuous real variables.

The topological space that parameterizes the elements in a Lie group is a mani-fold. A manifold is a space that looks Euclidean on a small scale everywhere. Forexample, every point on the surface of a unit sphere S2 ⊂ R3: x2 + y2 + z2 = 1,has a neighborhood that looks, over small distances, like a piece of the plane R2

(see Fig. 2.1). Locally, the two spaces S2 and R2 are topologically equivalent butglobally they are different (Columbus).

Definition An n-dimensional differentiable manifold Mn consists of the follow-ing.

(i) A topological space T . This includes a collection of open sets Uα (a topology) thatcover T : ∪αUα = T .

Page 40: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

26 Lie groups

p

S2R2

•q

Figure 2.1. Every point p on a sphere S2 is surrounded by an open neighborhoodthat is indistinguishable from an open neighborhood of any point in the plane R2.Locally the two spaces are indistinguishable. Globally they are distinguishable.

(ii) A collection of charts φα , with φα(Uα) = Vα ⊂ Rn . Each φα is a homeomorphism ofUα to Vα .

(iii) Smoothness conditions. The homeomorphisms φα ◦ φ−1β : φβ(Uα ∩ Uβ) → φα(Uα ∩

Uβ) of open sets in Rn to open sets in Rn are 1:1, invertible, and differentiable.

Remarks The charts φα allow construction of coordinate systems on the opensets Uα. It is often not possible to find a single coordinate system on the entiremanifold, as the example of the sphere in Fig. 2.1 shows. Since the “transitionfunctions” φα ◦ φ−1

β map Rn → Rn , all the definitions of elementary multivariablecalculus are applicable to them. For example, the adjective “differentiable” can bereplaced by other adjectives (Ck , smooth, analytic, . . .) in the definition above.

Example Real 2 × 2 matrices are identified by four real variables. The unimodularcondition det(A) = +1 places one constraint on these four real variables. Thereforeevery group element in SL(2; R) is determined by a point in some real three-dimensional space. One possible parameterization is

(x1, x2, x3) −→

x1 x2

x31 + x2x3

x1

x1 �= 0 (2.3)

Parameterization of the operations in a group by real numbers is a nontrivial prob-lem, as is clear when one asks: “what happens as x1 → 0?” We will consider thisquestion in Chapter 5.

The manifold that parameterizes the group SL(2; R) is the direct product mani-fold R2 (plane) × S1 (circle) (see Fig. 2.2). This is not at all obvious, but will becomeclear when we discuss the infinitesimal properties of Lie groups in Chapter 4.

The dimension of the manifold that parameterizes a Lie group is the dimensionof the Lie group. It is the number of continuous real parameters required to describeeach operation in the group uniquely.

Page 41: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2.3 Unification of algebra and topology 27

R3

H2

S1

z

yx+

Figure 2.2. Every matrix in SL(2; R) can be written as the product of a sym-metric matrix and a rotation matrix, both unimodular. The symmetric matrixis parameterized by a two-dimensional manifold, the two-sheeted hyperboloidz2 − x2 − y2 = 1. The rotation matrix is parameterized by a point on a circle. Theparameterization manifold, H 2 × S1, is three dimensional.

It is useful at this point to introduce the ideas of compactness and noncompact-ness. Roughly speaking, a compact space is in some sense finite and a noncompactspace is not finite.

Definition A topological space T is compact if every open cover (set of open setsUα) has a finite subcover: ∪finite

α T ⊂ Uα.In spaces Rn with a Euclidean notion of distance (|x − x ′|2 = |x1 − x

′1|2 + · · · +

|xn − x′n|2), this definition is equivalent to an older definition of compact spaces:

a space is compact if every infinite sequence of points has a subsequence thatconverges to a point in the space.

Example In Fig. 2.1 the sphere S2 is compact and the plane R2 is not compact.In Fig. 2.2, the circle is compact and the hyperboloid is not compact.

Remark In Rn every bounded closed subset is compact. “Closed” means that theset contains all its limit points.

Remark Compactness is an important topological property because it means thatthe space is in some sense like a bounded, closed space. For Lie groups it is importantbecause all irreducible representations of compact Lie groups are finite dimensionaland can be constructed by rather simple means (tensor product constructions).

2.3 Unification of algebra and topology

The rigidity of Lie group structures comes from combining the algebraic and topo-logical properties through smoothness (differentiability) requirements.

Page 42: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

28 Lie groups

Definition A Lie group consists of a manifold Mn that parameterizes the groupoperations (g(x), x ∈ Mn) and a combinatorial operation defined by g(x) ◦ g(y) =g(z), where the coordinate z ∈ Mn depends on the coordinates x ∈ Mn and y ∈ Mn

through a function z = φ(x, y).

There are two topological axioms for a Lie group.

(i) Smoothness of the group composition map The group composition map z = φ(x, y),defined by g(x) ◦ g(y) = g(z), is differentiable.

(ii) Smoothness of the group inversion map The group inversion map y = ψ(x), definedby g(x)−1 = g(y), is differentiable.

It is possible to combine these two axioms into a single axiom, but there is noadvantage to this.

Example For SL(2; R) with parameterization given by (2.3) the compositionfunction z = φ(x, y) is constructed easily by matrix multiplication g(x) ◦ g(y) =g(φ(x, y))

g(x1, x2, x3) ◦ g(y1, y2, y3) = g(z1, z2, z3)

x1 x2

x31 + x2x3

x1

×

y1 y2

y31 + y2 y3

y1

=

z1 z2

z31 + z2z3

z1

where

g (φ(x1, x2, x3; y1, y2, y3)) = g(z1, z2, z3)

x1 y1 + x2 y3 x1 y2 + x21 + y2 y3

y1

x3 y1 + 1 + x2x3

x1y3 ∗

=

z1 z2

z31 + z2z3

z1

(2.4)

The result is easily read off, matrix element by matrix element:

z1 = φ1(x1, x2, x3; y1, y2, y3) = x1 y1 + x2 y3

z2 = φ2(x1, x2, x3; y1, y2, y3) = x1 y2 + x21 + y2 y3

y1(2.5)

z3 = φ3(x1, x2, x3; y1, y2, y3) = x3 y1 + 1 + x2x3

x1y3

The function φ is analytic in its two pairs of arguments provided x1 and y1 arebounded away from the x2–x3 plane x1 = 0 and the y2–y3 plane y1 = 0. In theneighborhood of these values an alternative parameterization of the group is needed.

Page 43: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2.5 Conclusion 29

It is also useful to determine the mapping that takes a group operation into itsinverse. We can determine the coordinates (y1, y2, y3) of [g(x1, x2, x3)]−1 by setting(z1, z2, z3) = (1, 0, 0) and solving for (y1, y2, y3) in terms of (x1, x2, x3). Or moresimply we can compute the inverse of the matrix (2.3)[

x1 x2

x3 (1 + x2x3)/x1

]−1

=[

(1 + x2x3)/x1 −x2

−x3 x1

](2.6)

The inverse mapping [g(x)]−1 = g(y) = g(ψ(x)) is

ψ1(x1, x2, x3) = y1 = (1 + x2x3)/x1

ψ2(x1, x2, x3) = y2 = −x2

ψ3(x1, x2, x3) = y3 = −x3

(2.7)

This mapping is analytic except at x1 = 0, where an alternative parameterization isrequired. The parameterization shown in Fig. 2.2 handles this problem quite well.Every matrix in SL(2; R) can be written as the product of a symmetric matrix and arotation matrix, both 2 × 2 and unimodular. The symmetric matrix is parameterizedby a two-dimensional manifold, the two-sheeted hyperboloid z2 − x2 − y2 = 1.The rotation matrix is parameterized by a point on a circle. Two points (x, y, |z|, θ )and (−x, −y, −|z|, θ + π ) map to the same matrix in SL(2; R). The manifold thatparameterizes SL(2; R) is three dimensional. It is H 2+ × S1, where H 2+ is theupper sheet of the two-sheeted hyperboloid.

2.4 Unexpected simplification

Almost every Lie group that we will encounter is either a matrix group or elseequivalent to a matrix group. This simplifies the description of the algebraic, topo-logical, and continuity properties of these groups. Algebraically, the only groupoperations that we need to consider are matrix multiplication and matrix inversion.Geometrically, the only manifolds we encounter are those manifolds that can beconstructed from matrices by imposing algebraic constraints (algebraic manifolds)on the matrix elements. The continuity properties on the matrix elements are simpleconsequences of matrix multiplication and inversion.

2.5 Conclusion

Lie groups lie at the intersection of the two great divisions of mathematics: alge-bra and topology. The group elements are points in a manifold, and as such areparameterized by continuous real variables. These points can be combined by anoperation that obeys the group axioms. The combinatorial operation φ(x, y) de-fined by g(x) ◦ g(y) = g(z) = g(φ(x, y)) is differentiable in both sets of variables.

Page 44: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

30 Lie groups

In addition, the mapping y = ψ(x) of a group operation to its inverse [g(x)]−1 =g(y) = g (ψ(x)) is also differentiable.

Unexpectedly, almost all of the Lie groups encountered in applications are matrixgroups. This effects an enormous simplification in our study of Lie groups. Almostall of what we would like to learn about Lie groups can be determined by studyingmatrix groups.

2.6 Problems

1. Construct the analytic mapping φ(x, y) for the parameterization of SL(2; R) illus-trated in Fig. 2.2.

2. Construct the inversion mapping for the parameterization of SL(2; R) given inFig. 2.2. Show that

x ′

y′

θ ′

= −

cos(2θ ) − sin(2θ ) 0

sin(2θ ) cos(2θ ) 00 0 1

x

3. Convince yourself that every matrix M in the group SL(n; R) can be written as theproduct of an n × n real symmetric unimodular matrix S and an orthogonal matrixO in SO(n): M = SO . Devise an algorithm for constructing these matrices. ShowS = (M Mt )1/2 and O = S−1 M . How do you compute the square root of a matrix?Show that O is compact while S and M are not compact.

4. Construct the most general linear transformation (x, y, z) → (x ′, y′, z′) that leavesinvariant (unchanged) the quadratic form z2 − x2 − y2 = 1. Show that this lineartransformation can be expressed in the form

x ′

y′

z′

=

M1

ab

a b M2

SO(2)

00

0 0 1

x

yz

where the real symmetric matrices M1 and M2 satisfy

M21 = I2 +

[ab

][ a b ] =

[1 + a2 ab

ba 1 + b2

]and

M22 = I1 + [ a b ]

[ab

]= [ 1 + a2 + b2 ]

5. Construct the group of linear transformations [SO(1, 1)] that leaves invariant thequantity (ct)2 − x2. Compare this with the group of linear transformations [SO(2)]that leaves invariant the radius of the circle x2 + y2. (This comparison involves map-ping trigonometric functions to hyperbolic functions by analytic continuation.)

Page 45: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2.6 Problems 31

6. Construct the group of linear transformations that leaves invariant the quantity(ct)2 − x2 − y2 − z2. This is the Lorentz group O(3, 1). Four disconnected mani-folds parameterize this group. These contain the four different group operations

±1 0 0 00 ±1 0 00 0 1 00 0 0 1

where the ± signs are incoherent.

7. The group of 2 × 2 complex matrices with determinant +1 is named SL(2; C). Ma-trices in this group have the structure [ α β

γ δ], where α, β, γ, δ are complex numbers

and αδ − βγ = 1. Define the matrix X by

X = H (x, y, z, ct) =[

ct + z x − iyx + iy ct − z

]= ct I2 + σ · x

where x is the three vector x = (x, y, z) and σ = (σ1, σ2, σ3) = (σx , σy, σz) are thePauli spin matrices.a. Show that X is hermitian: X † ≡ (Xt )∗ = X .b. Show that the most general 2 × 2 hermitian matrix can be written in the form used

to construct X .c. If g ∈ SL(2; C), show that g†Xg = X ′ = H (x ′, y′, z′, ct ′).d. How are the new space-time coordinates (x ′, y′, z′, ct ′) related to the original

coordinates (x, y, z, ct)? (They are linearly related by coefficients that are bilinearin the matrix elements α, β, γ, δ of g and α∗, β∗, γ ∗, δ∗ of its adjoint matrix g†.)

e. Find the subgroup of SL(2; C) that leaves t ′ = t . (It is SU (2) ⊂ SL(2; C)).f. For any g ∈ SL(2; C) write g = kh, where h ∈ SU (2), h† = h−1, h has the

form h = EXP( i2σ · θ ) and k ∈ SL(2; C)/SU (2), k† = k+1, k has the form

k = EXP( 12σ · b). The three vector b is called a boost vector. The three vectors θ

and b are real. Construct k†H (x, y, z, ct)k = H (x ′, y′, z′, ct ′). If this is too diffi-cult, choose b along the z-axis, b = (0, 0, b).

g. Show that the usual Lorentz transformation law results.h. Applying k(b′) after applying k(b) results in (a) k(b′ + b), (b) two successive

Lorentz transformations. Show that the velocity addition law for colinear boostsresults.

i. If b and b′ are not colinear, k(b′)k(b) = k(b′′)h(θ ). Compute b′′, θ . The angle θ isrelated to the Thomas precession (Gilmore 1974b).

8. The circumference of the unit circle is mapped into itself under the transformationθ → θ ′ = θ + k + f (θ ), where k is a real number, 0 ≤ k < 2π , and f (θ ) is periodic,f (θ + 2π ) = f (θ ). The mapping must be 1:1, so an additional condition is imposedon f (θ ): d f (θ )/dθ > −1 everywhere. Does this set of transformations form a group?What are the properties of this group?

Page 46: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

32 Lie groups

9. Rational fractional transformations (a, b, c, d) map points on the real line (real pro-jective line R P1) to the real line as follows:

x → x ′ = (a, b, c, d)x = ax + b

cx + d

The transformations (a, b, c, d) and (λa, λb, λc, λd) = λ(a, b, c, d) (λ �= 0) generateidentical mappings.a. Compose two successive rational fractional transformations

(A, B, C, D) = (a′, b′, c′, d ′) ◦ (a, b, c, d)

and show that the composition is a rational fractional transformation. Compute thevalues of A, B, C, D.

b. Show that the transformations (λ, 0, 0, λ) map x to itself.c. Construct the inverse transformation x ′ → x , and show that it is λ(d, −b, −c, a)

provided λ �= 0. Such transformations exist if D = ad − bc �= 0.d. Show that the transformation degeneracy x ′ = (a, b, c, d)x = λ(a, b, c, d)x can

be lifted by requiring that the four parameters a, b, c, d describing these transfor-mations satisfy the constraint D = ad − bc = 1.

e. It is useful to introduce homogeneous coordinates (y, z) and define the real projec-tive coordinate x as the ratio of these homogeneous coordinates: x = y/z. If thehomogeneous coordinates transform linearly under SL(2; R) then the real projec-tive coordinates x transform under rational fractional transformations:[

y′

z′

]=

[a bc d

] [yz

]⇒ x ′ = y′

z′ = a(y/z) + b

c(y/z) + d= ax + b

cx + d

f. Show that a rational fractional transformation can be constructed that maps threedistinct points x1, x2, x3 on the real line to the three standard positions (0, 1, ∞),and that this mapping is

x → x ′ = (x − x1)(x2 − x3)

(x − x3)(x2 − x1)

What matrix in SL(2; R) describes this mapping? (Careful of the condition D = 1.)g. Use this construction to show that there is a unique mapping of any triple of distinct

points (x1, x2, x3) to any other triple of distinct points (x ′1, x ′

2, x ′3).

10. The real projective space R Pn is the space of all straight lines through the originin Rn+1. The group SL(n + 1; R) maps x = (x1, x2, . . . , xn+1) ∈ Rn+1 to x ′ ∈ Rn+1,with x ′ �= 0 ↔ x �= 0 and x ′ = 0 ↔ x = 0. A straight line through the origin containsx �= 0 and y �= 0 if (and only if) y = λx for some real scale factor λ �= 0. The scalefactor can always be chosen so that y is in the unit sphere in Rn+1: y ∈ Sn ⊂ Rn+1.In fact, two values of λ can be chosen: λ = ±1/(

∑n+1i=1 x2

i )1/2. In R3 the straightline containing (x, y, z) can be represented by homogeneous coordinates (X, Y ) =(x/z, y/z) if z �= 0. Straight lines through the origin of R3 are mapped to straight linesin R3 by x → x ′ = Mx , M ∈ SL(3; R). Show that the homogeneous coordinates

Page 47: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

2.6 Problems 33

representing the two lines containing x and x ′ are related by the linear fractionaltransformation[

XY

]→

[X ′

Y ′

]

=([

m11 m12

m21 m22

] [XY

]+

[m13

m23

]) / ([ m31 m32 ]

[XY

]+ m33

)Generalize for linear fractional transformations R Pn → R Pn .

11. The hyperbolic two-space SL(2; R)/SO(2) � [ z+x yy z−x ] consists of the algebraic sub-

manifold in the Minkowski 2 + 1 dimensional space-time with metric (+1, −1, −1)

z2 − (x2 + y2) = 1

This submanifold inherits the metric

ds2 = dz2 − (dx2 + dy2)

a. Show that

−ds2 = dx2 + dy2 − (d√

1 + x2 + y2)2

= 1

1 + x2 + y2( dx dy )

[1 + y2 −xy−yx 1 + x2

] (dxdy

)b. Introduce polar coordinates x = r cos φ, y = r sin φ, and show

−ds2 = dr2

1 + r2+ (r dφ)2

c. Show that the volume element on this surface is

dV = r dr dφ√1 + r2

d. Repeat this calculation for SO(3)/SO(2). This space is a sphere S2 ⊂ R3: thealgebraic manifold in R3 that satisfies z2 + (x2 + y2) = 1 and inherits the metricds2 = dz2 + (dx2 + dy2) from this Euclidean space. Show that the metric andmeasure on S2 are obtained from the results above for H 2 by the substitutions1 + r2 → 1 − r2. Show that the disk 0 ≤ r ≤ 1, 0 ≤ φ ≤ 2π maps onto the up-per hemisphere of the sphere, with r = 0 mapping to the north pole and r = 1mapping to the equator. Show that the geodesic length from the north pole to theequator along the longitude φ = 0 is s = ∫ 1

0 dr/√

1 − r2 = π/2 and the volume of

the hemisphere surface is V = ∫ r=1r=0

∫ φ=2π

φ=0 dV (r, φ) = ∫ 10 rdr/

√1 − r2

∫ 2π

0 dφ =2π .

Page 48: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3

Matrix groups

Almost all Lie groups encountered in the physical sciences are matrixgroups. In this chapter we describe most of the matrix groups that aretypically encountered. These include the general linear groups GL(n; F)of nonsingular n × n matrices over the fields F of real numbers, complexnumbers, and quaternions, and various of their subgroups obtained byimposing linear, bilinear and quadratic, and n-linear constraints on thesematrix groups.

3.1 Preliminaries

It is first useful to state a simple theorem.

Definition A subgroup H of G (also H ⊂ G) is a subset of G that is also a groupunder the group multiplication of G.

Example The set of matrices [a b0 1

a

](3.1)

is a subgroup of SL(2; R).

Theorem If H1 ⊂ G and H2 ⊂ G are subgroups of G then their intersectionH12 = H1 ∩ H2 is a subgroup of G.

Proof Verify that the four group axioms are satisfied for all operations in H1 ∩ H2.

Example If H1 is the two-dimensional subgroup of SL(2; R) described in (3.1)above and H2 is the one-dimensional subgroup of 2 × 2 orthogonal matrices

H2 = SO(2) =[

cos θ sin θ

− sin θ cos θ

]θ ∈ [0, 2π ) (3.2)

34

Page 49: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.2 No constraints 35

then the intersection H1 ∩ H2 is the zero-dimensional subgroup containing the twodiscrete group operations ±I2.

The matrix groups that we consider are defined over the fields of real numbers(F = R), complex numbers (F = C), and quaternions (F = Q). The complex num-bers can be constructed from pairs of real numbers by adjoining a square root of−1. Their multiplication properties can be analyzed by mapping the pair of realnumbers into 2 × 2 matrices

c = (a, b) = a + ib a ∈ R, b ∈ R, i2 = −1

(a, b) −→[

a b−b a

]i = (0, 1) −→

[0 1

−1 0

](3.3)

In an analogous way, the quaternions can be constructed from pairs of complexnumbers by adjoining another square root of −1, and their multiplication propertiesanalyzed by mapping the pair of complex numbers into 2 × 2 matrices

q = (c1, c2) = c1 + jc2

c1 = a1 + ib1 ∈ C

c2 = a2 + ib2 ∈ C

i2 = −1, j2 = −1, i j + j i = 0

(c1, c2) −→[

c1 c2

−c∗2 c∗

1

](3.4)

The mapping of two complex numbers into a 2 × 2 matrix representing a quaternioncan also be expressed as a mapping of four real numbers into a 2 × 2 matrixrepresenting a quaternion:

q0 + q1I + q2J + q3K →[

q0 + iq3 iq1 + q2

iq1 − q2 q0 − iq3

]The four basis vectors 1, I, J, K for this map are related to the four Pauli spinmatrices, and i is the usual square root of −1 introduced above in Eq. (3.3). Thedetails are presented in Problem 1 at the end of this chapter.

We list, in order, matrix groups on which no constraints are imposed (1), onwhich only linear constraints are imposed ((2)–(7)), on which bilinear and quadraticconstraints are imposed ((8)–(11)), and on which n-linear or multilinear constraints[det(M) = +1] are imposed (12).

3.2 No constraints

1. GL(n; F). General linear groups consist of nonsingular n × n matrices overthe real, complex, or quaternion fields. The group GL(1; Q) consists of 1 × 1

Page 50: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

36 Matrix groups

quaternion, or 2 × 2 complex matrices that satisfy

det

[a1 + ib1 a2 + ib2

−a2 + ib2 a1 − ib1

]= a2

1 + b21 + a2

2 + b22 �= 0 (3.5)

The determinant of an n × n matrix A with matrix elements A ji is defined by

det(A) =∑

I

∑J

1

n!εi1i2···in A j1

i1A j2

i2· · · A jn

inε j1 j2··· jn

Here εi1i2···in and its covariant version are the Levi–Civita symbols: +1 for an evenpermutation of the integers 1, 2, . . . , n; −1 for an odd permutation; and 0 if twoor more values of the indices i∗ are equal. With this definition there is no difficultycomputing the determinant of a matrix containing matrix elements that do notcommute (quaternions).

All remaining matrix groups in this list are subgroups of GL(n; F).

3.3 Linear constraints

These matrix groups all have a block structure or an echelon block structure. Thelinear constraints simply require specific blocks of matrix elements to vanish, orrequire some diagonal matrix elements to be +1. The structures of all these matrixgroups are exhibited in Fig. 3.1.

2. U T (p, q). Upper triangular groups. The n × n (n = p + q) matrix is parti-tioned into block form and an off-diagonal block is constrained to be zero

miα = 0p + 1 ≤ i ≤ p + q

1 ≤ α ≤ p(3.6)

Example The action of transformations in U T (1, 1) on the plane R2 is asfollows: [

x ′

y′

]=

[a b0 d

] [xy

]=

[ax + by

dy

](3.7)

The x-axis y = 0 remains invariant. It is an invariant subspace (y = 0 → y′ = 0),mapped into itself by all group operations in U T (1, 1). The y-axis x = 0 is notinvariant. More generally, if U T (p, q) acts on the direct sum vector space Vp ⊕ Vq ,the subspace Vq is invariant while Vp is not. For lower triangular matrices reversep and q .

Page 51: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.3 Linear constraints 37

l1

l2

ln

l3

11

1

1

Ip

Iq

p

5.

0 00

6. 7.

q

Sol (n) Nil (n) A (p,q)

• • •• • •

11

1•

••1

p

UT ( p,q) HT ( p,q)

2.

0

00

0

3.

q

4.

0

0 0

pv

r

= ∩q

0

0

UT ( p,q,r) UT (p,q+r) UT (p+q,r)= ∩

Figure 3.1. Structure of the matrix groups defined by linear constraints.

3. H T (p, q). This is a subgroup of U T (p, q) obtained by imposing the additionallinear constraints on the matrix elements of a diagonal block

mi j − δi j = 0p + 1 ≤ i ≤ p + qp + 1 ≤ j ≤ p + q

(3.8)

Example Affine transformations in H T (1, 1) (m22 = 1) act on the x-axis by x →x ′ = ax + b: [

x ′

1

]=

[a b0 1

] [x1

]=

[ax + b

1

](3.9)

4. U T (p, q, r ). This matrix group consists of upper triangular matrices that are theintersection of the matrix groups U T (p, q + r ) ∩ U T (p + q, r ).

Page 52: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

38 Matrix groups

Example We consider 4 × 4 complex matrices with the structure

1 ∗ ∗ ∗00

SU (1, 1)∗∗

0 0 0 1

(3.10)

where the 2 × 2 matrix SU (1, 1) is defined below in (3.30). Matrix groups with thestructure (3.10) are encountered in treatments of squeezed states of the electromag-netic field and scattering of projectiles from simple diatomic molecules (Gilmoreand Yuan, 1987, 1989).

5. Sol(n) = U T (1, 1, 1, . . . , 1). Solvable groups are strictly upper triangular.

Example We consider the subgroup of 3 × 3 matrices in U T (1, 1, 1) of theform

1 l d0 η r0 0 1

(3.11)

These matrices have the same structure as the group generated by exponentialsof the photon number operator (n = a†a), the creation (a†) and annihilation (a)operators, and their commutator (I = aa† − a†a = [a, a†]). We will use this iden-tification between operator and matrix groups to develop some powerful operatordisentangling theorems.

6. Nil(n). Nilpotent groups are subgroups of Sol(n) whose diagonal matrixelements are all +1.

Example Matrices in Nil(3) of the form1 l d

0 1 r0 0 1

(3.12)

are closely related to the photon creation and annihilation operators (a†, a, I ) andthe group generated by the exponentials of the position and momentum operators(p and q) and their commutator [p, q] = �/ i . This 3 × 3 matrix group is called theHeisenberg group. (It is technically the covering group of the Heisenberg group.)The set of change of basis transformations 〈p|q〉 = 1√

2e2π i pq/h encountered in

quantum mechanics is a unitary representation of this group of 3 × 3 matrices.

7. A(p, q). This group consists of matrices that are the sum of an identity matrixand the upper right-hand off-diagonal block of a (p, q) blocked matrix. Its matrix

Page 53: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.4 Bilinear and quadratic constraints 39

elements satisfy

Ai, j = δi, j 1 ≤ i, j ≤ pAα,β = δα,β p + 1 ≤ α, β ≤ p + qAα, j = 0Ai,β = arbitrary

This group is abelian or commutative: AB = B A for all elements (matrices) in thisgroup.

Example We consider the translation subgroup A(1, 1) of the affine group oftransformations of the x-axis (3.9): x → x ′ = x + a. Successive transformationsof this type commute[

1 a0 1

] [1 b0 1

]=

[1 a + b0 1

]=

[1 b0 1

] [1 a0 1

](3.13)

3.4 Bilinear and quadratic constraints

In (8)–(11) we treat groups that preserve a metric, represented by a matrix G. Theyall satisfy the bilinear or quadratic constraint condition M†G M = G. If G is sym-metric positive-definite we can set G = In (8). If G is nonsingular and symmetricbut indefinite we can set G = Ip,q (9). If G is nonsingular and antisymmetric, wecan take (10)

G =[

0 In

−In 0

]

These are the groups that leave Hamilton’s equations of motion invariant in form.A large spectrum of interesting groups occurs if G is singular (11). The matrixelements in these cases are defined by both bilinear and linear conditions.

8. Compact metric-preserving groups Matrices M in these groups satisfy thequadratic condition M†G M = G, where G is symmetric positive-definite, andwhich we can take as In

R O(n) orthogonal groupG = In C U (n) unitary group

Q Sp(n) symplectic group(3.14)

These are groups of rotations that leave invariant a positive-definite metric in a real,complex, or quaternion valued n-dimensional linear vector space. The manifoldsthat parameterize these groups are compact because the condition M†G M = G

Page 54: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

40 Matrix groups

defines matrices that form closed bounded subsets of the manifolds that parame-terize the matrix groups GL(n; F), F = R, C, Q.

Example As examples we introduce real 3 × 3 matrices of rigid rotations (andinversions) in R3, complex 2 × 2 matrices that preserve inner products in a complextwo-dimensional linear vector space C2 (of spin states, for example), and quaternionvalued 1 × 1 matrices that preserve length in a one-dimensional linear vector spaceover Q

M† I3 M = I3 M ∈ O(3) F = R

M† I2 M = I2 M ∈ U (2) F = C

M† I1 M = I1 M ∈ Sp(1) F = Q

(3.15)

The group SU (1; Q) is the subgroup of GL(1; Q) (3.5) subject to the condition

a21 + b2

1 + a22 + b2

2 = 1 (3.16)

This group is geometrically equivalent to the three-dimensional sphere embeddedin R4

SU (1; Q) ∼ S3 ⊂ R4 (3.17)

We will see many other relations between groups and geometry.

9. Noncompact metric-preserving groups Matrices in these groups leaveinvariant a nonsingular symmetric but indefinite metric G, which we take asG = Ip,q , p + q = n. This is a diagonal matrix with p elements +1 and q ele-ments −1 along the diagonal. Matrices M in these groups satisfy the quadraticcondition M†G M = G, where

R O(p, q) orthogonal groupG = Ip,q C U (p, q) unitary group

Q Sp(p, q) symplectic group(3.18)

The manifolds that parameterize these groups are noncompact when p �= 0, q �= 0.These noncompact groups are related by analytic continuation to correspondingcompact metric-preserving groups.

Example The Lorentz group preserves the invariant x2 + y2 + z2 − (ct)2 and isthus defined by the condition

Mt I3,1 M = I3,1[At Ct

Bt Dt

] [I3 00 −1

] [A BC D

]

=[

At A − CtC At B − Ct DBt A − DtC Bt B − Dt D

]=

[I3 00 −1

](3.19)

Page 55: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.4 Bilinear and quadratic constraints 41

There are much better ways to parameterize this group. These involve exponenti-ating its Lie algebra.

10. Antisymmetric metric-preserving groups The metric G is an N × N non-singular antisymmetric matrix

Mt G M = G F ={

R Sp(N , R)C Sp(N , C)

(3.20)

Since det(G) = det(Gt ) = det(−G) = (−)N det(G), N must be even: N = 2n. Themetric matrix can be chosen to have the canonical forms

G =[

0 In

−In 0

]or

G =n∑

α=1

⊕ [iσy

This consists of n copies of the matrix iσy = [ 0 1−1 0 ] along the diagonal. Symplectic

transformations in Sp(2n; R) leave invariant the form of the classical hamiltonianequations of motion.

Example The symplectic group Sp(2; R) ⊂ GL(2; R) satisfies the constraint[a cb d

] [0 1

−1 0

] [a bc d

]=

[0 ad − bc

bc − ad 0

]=

[0 1

−1 0

](3.21)

The constraint is ad − bc = +1. Thus, Sp(2; R) = SL(2; R).

11. General metric-preserving groups Matrices in these groups leave invarianta singular metric G.

R O(n; G)C U (n; G)Q Sp(n; G)

(3.22)

Example We consider 4 × 4 real matrices and choose

G =[

I3 00 0

](3.23)

Partitioning M into blocks and imposing the condition MG Mt = G, we find[A BC D

] [I3 00 0

] [At Ct

Bt D

]=

[AAt ACt

C At CCt

]=

[I3 00 0

](3.24)

Page 56: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

42 Matrix groups

This results in the conditions

AAt = I3 quadratic constraints, A ∈ O(3)C = 0 linear constraintsB, D arbitrary no constraints

The subgroup obtained by setting the 1 × 1 submatrix D equal to +1 is theEuclidean group E(3) whose action on the coordinates (x, y, z) of a point inR3 is

x ′

y′

z′

1

=

A

t1t2t3

0 0 0 1

xyz1

=

A

x

yz

+ t1

+ t2+ t3

1

(3.25)

That is, the coordinates are rotated by the matrix A and translated by the vector t. Byclosely similar arguments the Poincare group, consisting of Lorentz transformations(A ∈ SO(3, 1), AI3,1 At = I3,1 (3.17)) and space-time displacements is isomorphicto the real 5 × 5 matrix group

Poincare group

O(3, 1) t

0 1

(3.26)

The Galilei group consists of rotations in R3, transformations to a coordinate systemmoving with velocity v, and displacements of space (t) and time (t4) coordinates.It is isomorphic to the group of 5 × 5 matrices with the structure

Galilei group

O(3) v t

0 1 t40 0 1

(3.27)

3.5 Multilinear constraints

It is possible to impose trilinear, four-linear, . . . , constraints on n × n matrices.This requires a great deal of effort, and leads to few results, principal among whichare the five exceptional Lie groups that we will meet in Chapter 10. The onlymultilinear constraint that leads systematically to a large class of Lie groups is then-linear constraint, defined by the determinant.

Page 57: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.7 Embedded groups 43

12. Special linear groups or unimodular groups These are defined by the condi-tion

detM = +1 F =

R SL(n, R)C SL(n, C)Q SL(n, Q)

(3.28)

Example The group SL(2; R) has previously been encountered. The subset of ma-trices [ a b

c d ] ∈ SL(2; R) ⊂ GL(2; R) satisfies the constraint ad − bc = +1, whichis bilinear.

3.6 Intersections of groups

Some important groups are intersections of those listed above

SO(n) = O(n) ∩ SL(n; R)SO(p, q) = O(p, q) ∩ SL(p + q; R)SU (n) = U (n) ∩ SL(n; C)SU (p, q) = U (p, q) ∩ SL(p + q; C)

(3.29)

Example We construct the three-dimensional noncompact group SU (1, 1) bytaking the intersection of U (1, 1) with SL(2; C):

SU (1, 1) = U (1, 1) ∩ SL(2; C) →[

a bb∗ a∗

](3.30)

where a∗a − b∗b = +1.

3.7 Embedded groups

The unitary group U (n) consists of n × n complex matrices that obey the constraintU †U = In . For some purposes it is useful to represent this group as a group of realmatrices. This is done by replacing each of the complex entries in U (n) by a real2 × 2 matrix according to the prescription given in Eq. (3.3). The resulting matrixis a real 2n × 2n matrix M . This matrix inherits the constraint that comes with theunitary group, U †U = In . This constraint now appears in the form Mt M = I2n .We have been able to replace † by t since the matrices are real, and must replaceIn by I2n since the matrices are 2n × 2n. In other words, the matrices M obey thecondition that determines orthogonal groups. This group of 2n × 2n matrices formsan orthogonal representation of the unitary group. It is a subgroup of SO(2n). Thismatrix group is called OU (2n). Symbolically,

U (n)C→2×2 R−→ OU (2n) ⊂ SO(2n) (3.31)

Page 58: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

44 Matrix groups

There is an even more compelling reason to carry out the same type of replace-ment of quaternions by 2 × 2 complex matrices. Quaternions do not commute, as doreal and complex numbers. Rather than worry about the order in which quaternionsare written down in carrying out computations (such as constructing the determinantof a matrix), it is usually safer and more convenient to replace each quaternion in ann × n matrix by a 2 × 2 complex matrix using the embedding shown in Eq. (3.4).For the metric-preserving quaternion group U (n; Q) = Sp(n) whose matrices obeyU †U = In , this process generates 2n × 2n complex matrices M that inherit theconstraint in the form M†M = I2n . In other words, the matrices M obey the condi-tion that determines unitary groups (over C). This group of 2n × 2n matrices formsa unitary representation of the symplectic group. It is a subgroup of SU (2n). Thismatrix group is called U Sp(2n). Symbolically,

Sp(n)Q→2×2 C−→ U Sp(2n) ⊂ SU (2n) (3.32)

The groups OU (2n) and U Sp(2n) will appear in Chapter 11 (see, Table 11.1)in the classification of the real forms of the simple Lie groups.

3.8 Modular groups

We close with a useful aside. We have not considered matrices over the integersbecause they lack the geometric structure contributed by the continuous fields R,C, and Q. However, matrices over the integers play an important role in some areasof Lie group theory (representation theory of noncompact unimodular groups).

There are in fact three distinct groups over the integers that are sometimes con-fused

(i) GL(n; Z): if m ∈ GL(n; Z), det(m) = ±1.(ii) SL(n; Z): if m ∈ SL(n; Z), det(m) = +1.

(iii) P SL(n; Z), n even: P SL(n; Z) = SL(n; Z)/ {In, −In}.

For n = 2 these groups of matrices have the form [ a bc d ], with a, b, c, d all integers.

If det(m) = n, with n an integer, then det(m−1) = 1/n. Since the determinant ofany matrix composed of integers must be an integer, the condition is that det(m) =±1. The subset of GL(2; Z) with determinant +1 forms the subgroup SL(2; Z) ⊂GL(2; Z). The modular group P SL(2; Z) is obtained by identifying each pair ofmatrices in SL(2; Z) of the form [ −a −b

−c −d ] � [ a bc d ].

As a hint of the useful properties of these groups, we consider the matrix[1 11 0

]∈ GL(n; Z) (3.33)

Page 59: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.8 Modular groups 45

Then [1 11 0

]n

=[

F(n + 1) F(n)F(n) F(n − 1)

](3.34)

where F(n) is the nth Fibonacci number, defined recursively by

F(n) = F(n − 1) + F(n − 2)

n 0 1 2 3 4 5 6 7 · · ·F(n) 0 1 1 2 3 5 8 13 · · ·

The proof by induction is simple. It proceeds by computation

[1 11 0

]n+1

=[

1 11 0

] [F(n + 1) F(n)

F(n) F(n − 1)

]

=[

F(n + 1) + F(n) F(n) + F(n − 1)F(n + 1) F(n)

]

=[

F(n + 2) F(n + 1)F(n + 1) F(n)

](3.35)

and by comparison of initial conditions for n = 1 (F(0) = 0, F(1) = 1). Manyother recursive relations among the integers are possible using different matricesin the groups GL(2; Z), GL(3; Z), etc.

The group GL(n; Z) has important subgroups defined by imposing linear,quadratic, and multilinear constraints on the matrix elements, in exact analogywith GL(n; R).

Imposing linear constraints generates subgroups with the structures given inExamples (2) through (7) above. The only remark necessary is that for the analogsof Example (5) (solvable groups) the diagonal matrix elements can only be ±1.

Imposing quadratic constraints, for example Mt In M = In , generates a sub-group for which the sum of the squares of the matrix elements in each row orcolumn is +1. Since the matrix elements themselves can only be ±1, 0, thisgroup, O(n; Z), consists of n × n matrices in which all but one matrix elementin every row or column is zero, and the nonzero matrix element is ±1. An im-portant subgroup of O(n; Z) is Sn , in which the nonzero matrix elements are all+1. This is the n × n faithful permutation representation Pn of the symmetricgroup Sn .

Finally, the multilinear condition det(m) = +1 defines the unimodular subgroupSL(n; Z) of GL(n; Z).

Page 60: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

46 Matrix groups

Additional important groups are intersections of those just described. For exam-ple, the alternating group An consists of unimodular matrices in Pn:

An = Pn ∩ SL(n; Z) (3.36)

Example The group O(2; Z) consists of the 8 = 22 × 2! matrices

±[

1 00 1

[1 00 −1

[0 11 0

[0 1

−1 0

](3.37)

The group O(3; Z) has order 23 × 3! = 48. Its subgroup S3 of order 6 = 3! consistsof the six matrices

1 0 00 1 00 0 1

0 1 0

0 0 11 0 0

0 0 1

1 0 00 1 0

0 1 0

1 0 00 0 1

0 0 1

0 1 01 0 0

1 0 0

0 0 10 1 0

(3.38)

Its alternating subgroup A3 ⊂ S3 ⊂ O(3; Z) consists of the three matrices withpositive determinant, contained in the first row.

3.9 Conclusion

In this chapter we have taken advantage of a surprising observation: most of theLie groups encountered in applied (as well as pure) mathematics, the physicalsciences, and the engineering disciplines are matrix groups. Most of the matrixgroups typically encountered have been listed here. They consist of the general lin-ear groups of n × n nonsingular matrices over the fields of real numbers, complexnumbers, and quaternions, as well as subgroups obtained by imposing linear con-ditions, bilinear and quadratic conditions, and multilinear conditions on the matrixelements of the n × n matrices. Lie groups not encountered in the simple construc-tion presented here consist primarily of some real forms (analytic continuations,encountered in Chapter 11) of those encountered here, the exceptional Lie groupsG2, F4, E6, E7, E8 and their real forms (encountered in Chapters 10 and 11), andcovering groups of noncompact simple Lie groups such as SL(2; R) (encounteredin Chapter 7). We have in addition opened a door to analogs of Lie groups over theintegers, GL(n; Z), SL(n; Z), and P SL(n; Z). Matrix groups over finite fields arealso of great interest, but fall outside the scope of our discussions.

Page 61: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.10 Problems 47

3.10 Problems

1. Use the mapping (3.4) to construct a 2 × 2 matrix representation of the quaternionsover the field of complex numbers. In particular, make the following associations,where I J = −K:

1 =[

1 00 1

]I = i

[0 11 0

]J = i

[0 −ii 0

]K = i

[1 00 −1

]I2 iσx iσy iσz

(3.39)

Here σx , σy, σz are the Pauli spin matrices, and i is the usual square root of −1. Showthat any pair of the unit quaternions anticommute: i.e., {I,J } = IJ + J I = 0.

2. Show that the unit quaternions I, J, K generate a group of order 8 under multi-plication. Show that this group is isomorphic to O(2; Z). Exhibit this isomorphismexplicitly.

3. Show that SU (1; Q) ∼ SU (2; C).

4. Show that the dimensionalities (over the real field) of the general linear groups andtheir special linear subgroups are

GL(n; R) = n2 SL(n; R) = n2 − 1GL(n; C) = 2n2 SL(n; C) = 2n2 − 2GL(n; Q) = 4n2

5. Show that if the n × n metric matrix G is symmetric, nonsingular, and positive definite,then we can set G = In in the definitions in Example (8). If the n × n metric matrix Gis symmetric, nonsingular, and indefinite, then we can set G = Ip,q in the definitionsin Example (9), for suitable positive integers p and q, with p + q = n.

6. Show that it is possible to define subgroups SLi (n; C) of GL(n; C) by the con-ditions

SL1(n; C) det(M) = eiφ 2n2 − 1SL2(n; C) det(M) = eλ 2n2 − 1SL3(n; C) det(M) = r 2n2 − 1SL(n; C) det(M) = +1 2n2 − 2

where φ, λ, r are real and r �= 0. Show that the dimensions of these three subgroupsare 2n2 − 1 and that SL3(n; C) is disconnected. It consists of two topologically iden-tical copies of a 2n2 − 1 dimensional manifold, one of which contains the identity.Show that SL(n; C) = SL1(n; C) ∩ SL2(n; C). Do these results extend under the fieldrestriction C → R? and the field extension C → Q?

7. A subgroup of U T (1, 1) includes matrices of the form [ −1 a0 1 ], a ∈ R. Show that the

underlying group manifold consists of two copies of the real line R1. If matrices ofthe form [ 1 a

0 −1 ] are also included, then the parameterizing manifold consists of howmany copies of R1?

Page 62: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

48 Matrix groups

8. Compute the dimensions of the real matrix groups in Examples (2)–(7) over the realfield and show:

Group DimensionU T (p, q) p2 + q2 + pqH T (p, q) p(p + q)U T (p, q, r ) p2 + q2 + r2 + pq + pr + qrSol(n) n(n + 1)/2Nil(n) n(n − 1)/2A(p, q) pq

What happens to these dimensions if the matrix groups are over the field of complexnumbers? Quaternions?

9. Newton’s equations of motion are F = dp/dt . In the Lorentz gauge Maxwell’s equa-tions can be written in the form(

∇2 − 1

c2

∂2

∂t2

)Aµ(x, t) = −4π

cjµ

These equations can be expressed in a different coordinate system usisng eitherGalilean or Poincare transformations. Verify that the equations do or do not remaininvariant in form under these transformations, as follows:

Transformation F = dp/dt (∇2 − 1c2

∂2

∂t2 )Aµ = − 4πc jµ

Galilean invariant not invariantPoincare not invariant invariant

How do you reconcile these results?

10. Show that the group of 2 × 2 matrices SU (2) is parameterized by two complex num-bers c1 = a1 + ib1 and c2 = a2 + ib2, so that

SU (2) =[

c1 c2

−c∗2 c∗

1

]

subject to the condition a21 + b2

1 + a22 + b2

2 = 1. Convince yourself (a) that topolog-ically this group (i.e., its parameterizing manifold) is equivalent to a three-sphereS3 ⊂ R4; and (b) algebraically it is equivalent to SU (1; Q) (cf. (3.16)).

11. The group of 2 × 2 matrices SU (1, 1) is parameterized by two complex numbersc1 = a1 + ib1 and c2 = a2 + ib2, so that

SU (1, 1) =[

c1 c2

c∗2 c∗

1

]

subject to the condition a21 + b2

1 − a22 − b2

2 = 1. Identify the parameterizing manifold.

12. The group SO(2) is one dimensional. Show that every matrix in SO(2) can be writtenin the form [ m11 x

m21 m22], where m2

11 + x2 = 1, so that m11 = ±√1 − x2. The second row

Page 63: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.10 Problems 49

is orthogonal to the first, so that m21m11 + m22x = 0. As a result, we find

SO(2) −→[±√

1 − x2 x−x ±√

1 − x2

]The ± signs are coherent. Each choice of sign (±) covers half the group.

13. The group SO(3) is three dimensional. Show that every matrix in SO(3) can bewritten in the form

SO(3) −→ m11 x y

m21 m22 zm31 m32 m33

Use arguments similar to those used in Problem 12 to express the matrix elementsmi j i ≥ j in terms of the three parameters (x, y, z).

14. An alternative parameterization of SO(3) is given by

SO(3) −→ Z2

xy

−x −y Z1

×

±√

1 − z2 z 0−z ±√

1 − z2 00 0 1

Express the 2 × 2 and 1 × 1 submatrices Z2 and Z1 in terms of the coordinates (x, y).Determine the range of the parameters (x, y, z). How many square roots (“sheets”)are necessary to cover SO(3) completely?

15. If M ∈ GL(n; Z), show that det(M) must be ±1.

16. Show that the orders of O(n; Z) ⊃ Sn ⊃ An are 2n × n!, n!, 12 n!.

17. Estimate the Fibonacci number F(n) from the eigenvalues λ± = 12 (1 ± √

5) of thegenerating matrix (3.33). What happens to this sequence if different initial conditions(other than F(0) = 0, F(1) = 1) are introduced?

18. Derive other Fibonacci-type series using other symmetric generating matrices inGL(2; Z) (for example, [ 2 −1

−1 1 ]) and other initial conditions.

19. The energy levels |nlm〉 of the nonrelativistic hydrogen atom exhibit an n2-folddegeneracy under the Lie group SO(4). All bound states with the same principalquantum number n have the same energy E(nlm) = −E0/n2 (E0 = 13.6 eV). If theCoulomb symmetry is broken by placing one or more electrons in the Coulombpotential, the overall symmetry reduces to that of the rotation group: there is asymmetry reduction SO(4) ↓ SO(3). The representations of SO(4) that enter intothe description of the hydrogen atom bound states are indexed by the principalquantum number n (n = 1, 2, 3, . . . ). The SO(4) representation with quantum num-ber n splits into angular momentum representations that are indexed with quantumnumber l, l = 0, 1, 2 . . . , n − 1, with

∑l=n−1l=0 (2l + 1) = n2. The SO(3) multiplet

with quantum number l is 2l + 1-fold degenerate. An empirical hamiltonian withSO(4) ↓ SO(3) broken symmetry that describes the filling order when electrons areintroduced into a Coulomb potential established by a central charge +Ze can be

Page 64: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

50 Matrix groups

chosen to have the form:

E = −E0 Z2 {1 + δ ∗ (n − l − 1)} /n2

This hamiltonian depends only on the quantum numbers of the representations ofSO(4) and its subgroup SO(3). Show that this phenomenological energy spec-trum with δ = 0.28 provides the filling ordering that accounts for Mendeleev’speriodic table of the chemical elements: (n, l) → 1s; 2s, 2p; 3s, 3p; 4s, 3d, 4p;5s, 4d, 5p; 6s, 4 f, 5d, 6p; 7s, 5 f, 6d, 7p; 8s, 6 f, 7d, 8p; . . . .

20. Symmetries Show the following equivalences:

U T (p, q) = U T (q, p) SO(p, q) = SO(q, p)A(p, q) = A(q, p) U (p, q) = U (q, p)

Sp(p, q) = Sp(q, p)

21. G1 and G2 are two metrics on a real 2n-dimensional linear vector space that aredefined by

G1 =[

In 00 In

]G2 =

[0 In

−In 0

]

Show that the 2n × 2n matrices M that satisfy the bilinear constraints Mt Gi M = Gi

are:

G1 G2 G1 and G2

O(2n; R) Sp(2n; R) OU (2n; R)

22. In an n-dimensional linear vector space two coordinate systems x and y are relatedby a linear transformation: y j = xi M j

i . Show that the derivatives are related by thesame transformation (covariance–contravariance)

∂xi= ∂y j

∂xi

∂y j= M j

i

∂y j

As a result, a transformation that preserves a metric when acting on the coordinatespreserves the same metric when acting on the derivatives.

23. The Poisson brackets between two functions f (q, p) and g(q, p) on a classical phasespace of dimension 2n are defined by

{ f, g} =∑

k

∂ f

∂qk

∂g

∂pk− ∂g

∂qk

∂ f

∂pk

a. Show that these relations can be written in simple matrix form as

{ f, g} = (D f )t G(Dg) where G =[

0 In

−In 0

]and (Dg) =

[∂g/∂q∂g/∂p

]

Page 65: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.10 Problems 51

b. Introduce a new coordinate system (Q, P), related to the original by a lineartransformation of the form[

∂g/∂ Q∂g/∂ P

]=

[A BC D

] [∂g/∂q∂g/∂p

]

Find the conditions on this 2n × 2n matrix that preserves the structure of thePoisson brackets. Show At C and Bt D must be symmetric and At D − Bt C = In .

c. Show that the same conditions hold for linear transformations and the quantumnmechanical commutator bracket: [q j , qk] = [p j , pk] = 0 and [q j , pk] = i�δ jk .

Note: The transformation from classical mechanics to quantum mechanics is madeby identifying the classical Poisson bracket {, } with the quantum commutator bracket[, ] according to

{u(q, p), v(q, p)} ↔ [u(q, p), v(q, p)]

i�

The hatˆindicates an operator.

24. Transfer matrices Figure 3.2 shows a potential in one dimension. The wavefunctionto the left of the interaction region has the form

ψL (x) = ALe+ikx + BLe−ikx = [e+ikx e−ikx

] [AL

BL

]

with a similar expression for the wavefunction on the right. The exponential e+ikx

describes a particle of mass m moving to the right (+) with momentum �k andenergy E = (�k)2/2m. The complex number AL is the probability amplitude forfinding a particle moving to the right with this momentum. The expected value of themomentum in the left-hand region is 〈 p〉 = (|AL |2 − |BL |2)�k, where the operatorp = �

id

dx .a. Show that conservation of momentum leads to the equation

|AL |2 − |BL |2 = |AR|2 − |BR|2

VL VR

AB

V(x)

R

AB L

( ) R R( )e e e e( ) ( )= =

Figure 3.2. The potentials to the left and right of the interaction region areconstant, with VL = VR . The wavefunctions to the left and right of thisregion are represented in the form ψσ (x) = Aσ e+ikx + Bσ e−ikx , whereσ = L , R.

Page 66: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

52 Matrix groups

when the asymptotic value of the potential to the left of the interaction region isthe same as the value on the right.

b. Since the Schrodinger equation is second order the four amplitudes AL , AR,

BL , BR are not independent. Only two are independent. Two linear relations existamong them. Show that they can be expressed in terms of a matrix relation of theform [

AL

BL

]=

[t11 t12

t21 t22

] [AR

BR

]

The 2 × 2 matrix T is called a transfer matrix. The transfer matrix is a functionof energy E . Show that T (E) ∈ U (1, 1).

c. Show that T ∈ SU (1, 1) by appropriate choice of phase.

25. Crossing symmetry: A transfer matrix T for a one-dimensional potential relates am-plitudes for the wavefunction on the left of the interaction region with the amplitudeson the right. A scattering matrix (S-matrix) S relates the incoming amplitudes withthe outgoing amplitudes:[

AL

BL

]= T

[AR

BR

] [AR

BL

]= S

[AL

BR

]

a. Invoke conservation of momentum arguments to conclude S ∈ U (2).b. Show that the matrix elements of S and T are related by

[s11 s12

s21 s22

]=

1

t11− t12

t11

t21

t11

t11t22 − t12t21

t11

c. Show that the poles of S(E) are the zeroes of T (E), specifically of t11(E). Polesalong the real energy axis describe bound states. Poles off the real axis of the formr j/[(E − E j ) + i(� j/2)] describe resonances at energy E j with characteristic de-cay time � j/�.

26. Two interaction regions V1 and V2 on the line are characterized by transfer matricesT1 and T2, and also by S-matrices S1 and S2 (see Fig. 3.3). The outputs of one regionare inputs to the other, as follows:[

i2

i3

]=

[0 11 0

] [o1

o4

]

a. The transfer matrices for the two regions are defined by[i1

o2

]= T1

[o1

i2

] [i3

o4

]= T2

[o3

i4

]

Page 67: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

3.10 Problems 53

( ) ( ) ( ) ( )

( )( )( )( )

( ) ( ) ( )

AL

BL

AM

BL

AL

BM

AR

BM

AM

BR

AR

BR

AM

BM

AM

BM

ABM

M

T1

S1

T2

S2

ABL

L ABR

R

=

= =

=

VL VM VRV1 (X) V2 (X)

Figure 3.3. Two potentials on the line are characterized by their T and S matrices.

Show that the transfer matrix for the entire interaction region is[i1o2

]= TTot

[o3i4

]TTot = T1T2

b. The S-matrices for the two regions relate inputs to outputs as follows

o1

o2

o3

o4

s11 s12 0 0s21 s22 0 00 0 s33 s34

0 0 s43 s44

i1

i2

i3

i4

Show that the scattering matrix for the entire region is

STot =[

0 s34

s21 0

]+ 1

1 − s12s43

[s33s22 s33s112s44

s22s43s11 s22s44

]

c. Show that STot is unitary.d. Interpret the matrix STot in terms of a Feynman-like sum over all paths. Do this

by expanding the fraction 1/(1 − s12s43) as a geometric sum and interpreting eachterm in this expansion as a path through the two scattering potentials.

27. If the potentials V1 and V2 are modified to V1′ and V2

′ their transfer matrices andtheir scattering matrices will also be modified Ti (E) → T ′

i (E) and Si (E) → S′i (E),

i = 1, 2. It is possible that for some energy E , S′Tot(E) = STot(E). Find the set of

all modified scattering matrices S′1(E) and S′

2(E) with the property that the modi-fied pair maps into the original total S-matrix STot(E). In fancy terms, find the fiberin U (4) ⊃ U (2) ⊗ U (2) ↓ U (2). (Hint: if this seems daunting, note that to satisfyT1(E)T2(E) = TTot(E) = T ′

1(E)T ′2(E) we can take T ′

1(E) = T1(E)R and T ′2(E) =

R−1T2(E) for any R ∈ U (1, 1). The fiber in U (2, 2) ⊃ U (1, 1) ⊗ U (1, 1) ↓ U (1, 1)over TTot(E) consists of the matrices (T1(E)R, R−1T2(E).) Now map this into thefiber (S′

1(E), S′2(E)) over STot(E).

Page 68: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

54 Matrix groups

28. A passive linear device, classical or quantum, can be described by an S matrix. Ifthe device has n external leads the scattering matrix is an n × n matrix. Deviceswith n1, n2, . . . , nk leads can be connected together by soldering some of the leadstogether. The leads that are soldered together are the internal leads. The remainderof the leads are external leads. We distinguish between internal and external leadsby subscripts i and e. The S matrix that describes the original set of k devices is adirect sum of k S matrices of sizes n j × n j ( j = 1, 2, . . . , k). Through appropriatepermutation of the rows and columns of this direct sum of S matrices the input-outputrelations can be expressed in the form[

oi

oe

]=

[A BC D

] [ii

ie

][ii ] = � [oi ]

The matrix � that relates internal outputs to internal inputs describes the topology, orconnectivity, of the network.a. Show that the S matrix that describes the network, defined by [oe] = SNetwork [ie],

is given by (cf., Problem 3.26c)

SNetwork = D + C�(I − A�)−1 B

b. Show that SNewtork is unitary: S†Newtork = SNewtork, SNewtork ⊂ U (d).

c. Expand SNewtork to show that

SNetwork = D + C�B + C�A�B + C�A�A�B + C�A�A�A�B + · · ·Interpret this expansion in terms of a Feynman sum over all possible scatteringpaths through the network.

29. A mathematical description of the preceeding problem involves a subgroup restrictionU (

∑kj=1 n j ) ⊃ �k

j=1 ⊗ U (n j ) and a projection to the total network scattering matrixin U (d), where d is the number of the network’s external leads. The connectivity isdetermined by the permutation matrix �. Determine the fiber in �k

j=1 ⊗ U (n j ) overeach group operation in U (d).

30. All the matrices in this problem are square n × n, with: H hermitian; U unitary; Aantihermitian. Show the right-hand column follows from the definition in the left-handcolumn.

H2 = H1 + In

H1 − In[H1, H2] = 0 H1 = H2 + In

H2 − In

U = In + i H

In − i H[H, U ] = 0 H = i

In − U

In + U

A = In + iU

In − iU[U, A] = 0 U = i

In − A

In + A

H = In − i A

In + i A[A, H ] = 0 A = i

H − In

H + In

Page 69: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4

Lie algebras

The study of Lie groups can be greatly facilitated by linearizing the groupin the neighborhood of its identity. This results in a structure called a Liealgebra. The Lie algebra retains most, but not quite all, of the propertiesof the original Lie group. Moreover, most of the Lie group propertiescan be recovered by the inverse of the linearization operation, carriedout by the EXPonential mapping. Since the Lie algebra is a linear vectorspace, it can be studied using all the standard tools available for linearvector spaces. In particular, we can define convenient inner products andmake standard choices of basis vectors. The properties of a Lie algebrain the neighborhood of the origin are identified with the properties of theoriginal Lie group in the neighborhood of the identity. These structures,such as inner product and volume element, are extended over the entiregroup manifold using the group multiplication operation.

4.1 Why bother?

Two Lie groups are isomorphic if:

(i) their underlying manifolds are topologically equivalent;(ii) the functions defining the group composition laws are equivalent.

Two manifolds are topologically equivalent if they can be smoothly deformedinto each other. This requires that all their topological indices, such as dimension,Betti numbers, connectivity properties, etc., are equal.

Two group composition laws are equivalent if there is a smooth change of vari-ables that deforms one function into the other.

Showing the topological equivalence of two manifolds is not necessarily an easyjob. Showing the equivalence of two composition laws is typically a much moredifficult task. It is difficult because the group composition law is generally nonlinear,and working with nonlinear functions is notoriously difficult.

55

Page 70: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

56 Lie algebras

The study of Lie groups would simplify greatly if the group composition lawcould somehow be linearized, and if this linearization retained a substantial part ofthe information inherent in the original group composition law. This in fact can bedone.

Lie algebras are constructed by linearizing Lie groups.A Lie group can be linearized in the neighborhood of any of its points, or group

operations. Linearization amounts to Taylor series expansion about the coordinatesthat define the group operation. What is being Taylor expanded is the group com-position function. This function can be expanded in the neighborhood of any groupoperation.

A Lie group is homogeneous – every point looks locally like every other point.This can be seen as follows. The neighborhood of group element a can be mappedinto the neighborhood of group element b by multiplying a, and every element inits neighborhood, on the left by group element ba−1 (or on the right by a−1b). Thismaps a into b and points near a into points near b.

It is therefore necessary to study the neighborhood of only one group operationin detail. Although geometrically all points are equivalent, algebraically one pointis special – the identity. It is very useful and convenient to study the neighborhoodof this special group element.

Linearization of a Lie group about the identity generates a new set of operators.These operators form a Lie algebra. A Lie algebra is a linear vector space, by virtueof the linearization process.

The composition of two group operations in the neighborhood of the identityreduces to vector addition. The construction of more complicated group products,such as the commutator, and the linearization of these products introduces additionalstructure in this linear vector space. This additional structure, the commutationrelations, carries information about the original group composition law.

In short, the linearization of a Lie group in the neighborhood of the identity toform a Lie algebra brings about an enormous simplification in the study of Liegroups.

4.2 How to linearize a Lie group

We illustrate how to construct a Lie algebra for a Lie group in this section. Theconstruction is relatively straightforward once an explicit parameterization of theunderlying manifold and an expression for the group composition law is available. Inparticular, for the matrix groups the group composition law is matrix multiplication,and one can construct the Lie algebra immediately for the matrix Lie groups.

We carry this construction out for SL(2; R). It is both customary and convenientto parameterize a Lie group so that the origin of the coordinate system maps to the

Page 71: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.3 Inversion of the linearization map: EXP 57

identity of the group. Accordingly, we parameterize SL(2; R) as follows

(a, b, c) −→ M(a, b, c) =[

1 + a bc (1 + bc)/(1 + a)

](4.1)

The group is linearized by investigating the neighborhood of the identity. This isdone by allowing the parameters (a, b, c) to become infinitesimals and expandingthe group operation in terms of these infinitesimals to first order

(a, b, c) → (δa, δb, δc) → M(δa, δb, δc)

=[

1 + δa δbδc (1 + δbδc)/(1 + δa)

](4.2)

The basis vectors in the Lie algebra are the coefficients of the first order infinitesi-mals. In the present case the basis vectors are 2 × 2 matrices

(δa, δb, δc) → I2 + δa Xa + δbXb + δcXc =[

1 + δa δbδc 1 − δa

](4.3)

Xa =[

1 00 −1

]= ∂ M(a, b, c)

∂a

∣∣∣∣(a,b,c)=(0,0,0)

Xb =[

0 10 0

]= ∂ M(a, b, c)

∂b

∣∣∣∣(a,b,c)=(0,0,0)

Xc =[

0 01 0

]= ∂ M(a, b, c)

∂c

∣∣∣∣(a,b,c)=(0,0,0)

(4.4)

Lie groups that are isomorphic have Lie algebras that are isomorphic.

Remark The group composition function φ(x, y) is usually linearized in one of itsarguments, say φ(x, y) → φ(x, 0 + δy). This generates a left-invariant vector field.The commutators of two left-invariant vector fields at a point x are independent ofx , so that x can be taken in the neighborhood of the identity. It is for this reason thatthe linearization of the group in the neighborhood of the identity is so powerful.

4.3 Inversion of the linearization map: EXP

Linearization of a Lie group in the neighborhood of the identity to form a Liealgebra preserves the local group properties but destroys the global properties –that is, what happens far from the identity. It is important to know whether thelinearization process can be reversed. Can one recover the Lie group from its Liealgebra?

Page 72: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

58 Lie algebras

To answer this question, assume X is some operator in a Lie algebra – such asa linear combination of the three matrices spanning the Lie algebra of SL(2; R)given in (4.4). Then if ε is a small real number, I + εX represents an element inthe Lie group close to the identity. We can attempt to move far from the identity byiterating this group operation many times

limk→∞

(I + 1

kX

)k

=∞∑

n=0

Xn

n!= EXP(X ) (4.5)

The limiting and rearrangement procedures leading to this result are valid not onlyfor real and complex numbers, but for n × n matrices and bounded operators aswell.

Example We take an arbitrary vector X in the three-dimensional linear vectorspace of traceless 2 × 2 matrices spanned by the generators Xa, Xb, Xc of SL(2; R)given in (4.4)

X = a Xa + bXb + cXc =[

a bc −a

](4.6)

The exponential of this matrix is

EXP(X ) = EXP(a Xa + bXb + cXc) =∞∑

n=0

1

n!

[a bc −a

]n

= I2 cosh θ + Xsinh θ

θ

=[

cosh θ + a sinh(θ )/θ b sinh(θ )/θc sinh(θ )/θ cosh θ − a sinh(θ )/θ

]θ2 = a2 + bc (4.7)

The actual computation can be carried out using either brute force or finesse.With brute force, each of the matrices Xn is computed explicitly, a pattern is

recognized, and the sum is carried out. The first few powers are X0 = I2, X1 = X(given in (4.6)), and X2 = θ2 I2. Since X2 is a multiple of the identity, X3 = X2 X1

must be proportional to X (= θ2 X ), X4 is proportional to the identity, and so on.Finesse involves use of the Cayley–Hamilton theorem, that every matrix satisfies

its secular equation. This means that a 2 × 2 matrix must satisfy a polynomialequation of degree 2. Thus we can replace X2 by a function of X0 = I2 and X1 = X .Similarly, X3 can be replaced by a linear combination of X2 and X , and then X2

replaced by I2 and X . By induction, any function of the 2 × 2 matrix X can bewritten in the form

F(X ) = f0(a, b, c)X0 + f1(a, b, c)X1 (4.8)

Page 73: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.4 Properties of a Lie algebra 59

Furthermore, the functions f0, f1 are not arbitrary functions of the three parameters(a, b, c), but rather functions of the invariants of the matrix X . These invariants arethe coefficients of the secular equation. The only such invariant for the 2 × 2 matrixX is θ2 = a2 + bc. As a result, we know from general and simple considerationsthat

EXP(X ) = f0(θ2)I2 + f1(θ2)X (4.9)

The two functions are f0(θ2) = 1 + θ2/2! + θ4/4! + θ6/6! + · · · = cosh θ andf1(θ2) = 1 + θ2/3! + θ4/5! + θ6/7! + · · · = sinh(θ )/θ . These arguments are ap-plicable to the exponential of any matrix Lie algebra.

The EXPonential operation provides a natural parameterization of the Lie groupin terms of linear quantities. This function maps the linear vector space – the Liealgebra – to the geometric manifold that parameterizes the Lie group. We can expectto find a lot of geometry in the EXPonential map.

Three important questions arise about the reversibility of the process representedby

Lie groupln�

EXPLie algebra (4.10)

(i) Does the EXPonential function map the Lie algebra back onto the entire Lie group?(ii) Are Lie groups with isomorphic Lie algebras themselves isomorphic?

(iii) Is the mapping from the Lie algebra to the Lie group unique, or are there other waysto parameterize a Lie group?

These are very important questions. In brief, the answer to each of these questionsis “No.” However, as is very often the case, exploring the reasons for the negativeresult produces more insight than a simple “Yes” response would have. They willbe treated in more detail in Chapter 7.

4.4 Properties of a Lie algebra

We now turn to the properties of a Lie algebra. These are derived from the propertiesof a Lie group. A Lie algebra has three properties:

(i) the operators in a Lie algebra form a linear vector space;(ii) the operators close under commutation: the commutator of two operators is in the Lie

algebra;(iii) the operators satisfy the Jacobi identity.

If X and Y are elements in the Lie algebra, then g1 = I + εX is an element inthe Lie group near the identity for ε sufficiently small. In fact, so also is I + εαX

Page 74: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

60 Lie algebras

for any real number α. We can form the product

(I + εαX )(I + εβ X ) = I + ε(αX + βY ) + higher order terms (4.11)

If X and Y are in the Lie algebra, then so is any linear combination of X and Y .The Lie algebra is therefore a linear vector space.

The commutator of two group elements is a group element:

commutator of g1 and g2 is g1g2g−11 g−1

2 (4.12)

If X and Y are in the Lie algebra, then for any ε, δ sufficiently small, g1(ε) =EXP(εX ) and g1(ε)−1 = EXP(−εX ) are group elements near the identity, as areg2(δ)±1 = EXP(±δY ). Expanding the commutator to lowest order nonvanishingterms, we find

EXP(εX )EXP(δY )EXP(−εX )EXP(−δY )

= I + εδ (XY − Y X ) = I + εδ [X, Y ] (4.13)

Therefore, the commutator of two group elements, g1(ε) = EXP(εX ) and g2(δ) =EXP(δY ), which is in the group G, requires the commutator of the operators X andY , [X, Y ] = (XY − Y X ), to be in its Lie algebra g

g1g2g−11 g−1

2 ∈ G ⇔ [X, Y ] ∈ g (4.14)

The commutator (4.12) provides information about the structure of a group. Ifthe group is commutative then the commutator in the group (4.12) is equal to theidentity. The commutator in the algebra vanishes

g1g2g−11 g−1

2 = I ⇒ [X, Y ] = 0 (4.15)

If H is an invariant subgroup of G, then g1 Hg−11 ⊂ H . This means that if X is in

the Lie algebra of G and Y is in the Lie algebra of H

g1 Hg−11 ∈ H ⇒ [X, Y ] ∈ Lie algebra of H (4.16)

If X, Y, Z are in the Lie algebra, then the Jacobi identity is satisfied

[X, [Y, Z ]] + [Y, [Z , X ]] + [Z , [X, Y ]] = 0 (4.17)

This identity involves the cyclic permutation of the operators in a double com-mutator. For matrices this identity can be proved by opening up the commutators([X, Y ] = XY − Y X ) and showing that the 12 terms so obtained cancel pairwise.This proof remains true when the operators X, Y, Z are not matrices but operatorsfor which composition (e.g., XY is well defined, as are all other pairwise products)

Page 75: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.5 Structure constants 61

is defined. When operator products (as opposed to commutators) are not defined,this method of proof fails but the theorem (it is not an identity) remains true. Thistheorem represents an integrability condition on the functions that define the groupmultiplication operation on the underlying manifold.

To summarize, a Lie algebra g has the following structure.

(i) It is a linear vector space under vector addition and scalar multiplication. If X ∈ g andY ∈ g then every linear combination of X and Y is in g:

X ∈ g, Y ∈ g, αX + βY ∈ g

(ii) It is an algebra under commutation. If X ∈ g and Y ∈ g then their commutator is in g:

X ∈ g, Y ∈ g, [X, Y ] ∈ g

This property is called “closure under commutation.”(iii) The Jacobi identity is satisfied. If X ∈ g, Y ∈ g, and Z ∈ g, then

[X, [Y, Z ]] + [Y, [Z , X ]] + [Z , [X, Y ]] = 0

Example The three generators (4.4) of the Lie group SL(2; R) obey the commu-tation relations

[Xa, Xb] = 2Xb

[Xa, Xc] = −2Xc

[Xb, Xc] = Xa

(4.18)

It is an easy matter to verify that the Jacobi identity is satisfied for this Lie algebra.

4.5 Structure constants

Since a Lie algebra is a linear vector space we can introduce all the usual conceptsof a linear vector space, such as dimension, basis, inner product. The dimension ofthe Lie algebra g is equal to the dimension of the manifold that parameterizes theLie group G. If the dimension is n, it is possible to choose n linearly independentvectors in the Lie algebra (a basis for the linear vector space) in terms of whichany operator in g can be expanded. If we call these basis vectors, or basis operatorsX1, X2, . . . , Xn , then we can ask several additional questions such as: Is there anatural choice of basis vectors? Is there a reasonable definition of inner product(Xi , X j )? We return to these questions shortly.

Since the linear vector space is closed under commutation, the commutator ofany two basis vectors can be expressed as a linear superposition of basis vectors

[Xi , X j ] = C ki j Xk (4.19)

Page 76: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

62 Lie algebras

The coefficients C ki j in this expansion are called structure constants. The structure

of the Lie algebra is completely determined by its structure constants. The antisym-metry of the commutator induces a corresponding antisymmetry in the structureconstants

[Xi , X j ] + [X j , Xi ] = 0 C ki j + C k

ji = 0 (4.20)

Under a change of basis transformation

Xi = A ri Yr (4.21)

the structure constants change in a systematic way

C ′ trs = (A−1) i

r (A−1) js C k

i j A tk (4.22)

(second order covariant, first order contravariant tensor). This piece of informationis surprisingly useless.

Example The only nonzero structure constants for the three basis vectorsXa, Xb, Xc (4.4) in the Lie algebra sl(2; R) for the Lie group SL(2; R) are, from(4.18)

C bab = −C b

ba = +2 C cac = −C c

ca = −2 C abc = −C a

cb = +1 (4.23)

4.6 Regular representation

A better way to look at a change of basis transformation is to determine how thechange of basis affects the commutator of an arbitrary element Z in the algebra

[Z , Xi ] = R(Z ) ji X j (4.24)

Under the change of basis (4.21) we find

[Z , Yr ] = S(Z ) sr Ys (4.25)

where

S sr (Z ) = (A−1) i

r R(Z ) ji A s

j (4.26)

In this manner the effect of a change of basis on the structure constants is reducedto a study of similarity transformations.

The association of a matrix R(Z ) with each element of a Lie algebra is calledthe regular representation

regularZ −→ R(Z )

representation(4.27)

Page 77: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.7 Structure of a Lie algebra 63

The regular representation of an n-dimensional Lie algebra is a set of n × n matrices.This representation contains exactly as much information as the structure constants,for the regular representation of a basis vector is

[Xi , X j ] = R(Xi )kj Xk = C k

i j Xk (4.28)

so that

R(Xi )kj = C k

i j (4.29)

The regular representation is an extremely useful tool for resolving a number ofproblems.

Example The regular representation of the Lie algebra sl(2; R) is easily con-structed, since the structure constants have been given in (4.23)

R(X ) = R(a Xa + bXb + cXc) = a R(Xa) + bR(Xb) + cR(Xc)

= 0 −2b 2c

−c 2a 0b 0 −2a

(4.30)

The rows and columns of this 3 × 3 matrix are labeled by the indices a, b and c,respectively.

4.7 Structure of a Lie algebra

The first step in the classification problem is to investigate the regular representationof the Lie algebra under a change of basis. We look for a choice of basis that bringsthe matrix representative of every element in the Lie algebra simultaneously toone of the three forms shown in Fig. 4.1. The first term (nonsemisimple, . . .) isapplied typically to Lie groups and algebras while the second term (reducible, . . .)is typically applied to representations.

nonsemisimplereducible

0 0

0

semisimple fully reducible

simple irreducible

Figure 4.1. Standard forms into which a representation can be reduced.

Page 78: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

64 Lie algebras

Example It is not possible to reduce simultaneously the regular representatives ofthe three generators Xa, Xb, and Xc of sl(2; R) to either the nonsemisimple or thesemisimple form. This algebra is therefore simple. However, the Euclidean groupE(2) with structure

E(2) = cos θ sin θ t1

− sin θ cos θ t20 0 1

(4.31)

has a Lie algebra with three infinitesimal generators

Lz = 0 1 0

−1 0 00 0 0

Px =

0 0 1

0 0 00 0 0

Py =

0 0 0

0 0 10 0 0

(4.32)

and regular representation

R(θ Lz + t1 Px + t2 Py) = 0 −θ 0

θ 0 0−t2 t1 0

(4.33)

where the rows and columns are labeled successively by the basis vectors Px , Py , andLz . This regular representation has the block diagonal structure of a nonsemisimpleLie algebra. The algebra, and the original group, are therefore nonsemisimple.

There is a beautiful structure theory for simple and semisimple Lie algebras.This will be discussed in Chapter 9. A structure theory exists for nonsemisimpleLie algebras. It is neither as beautiful nor as complete as the structure theory forsimple Lie algebras.

4.8 Inner product

Since a Lie algebra is a linear vector space, we are at liberty to impose on it all thestructures that make linear vector spaces so simple and convenient to use. Theseinclude inner products and appropriate choices of basis vectors.

Inner products in spaces of matrices are simple to construct. A well-known andvery useful inner product when A, B are p × q matrices is the Hilbert–Schmidtinner product

(A, B) = tr A†B (4.34)

This inner product is positive-definite, that is

(A, A) =∑

i

∑j

|A ji |2 ≥ 0 = 0 ⇒ A = 0 (4.35)

Page 79: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.8 Inner product 65

If we were to adopt the Hilbert–Schmidt inner product on the regular representationof g, then

(Xi , X j ) = tr R(Xi )†R(X j ) =

∑r

∑s

R(Xi )s∗

r R(X j )s

r =∑

r

∑s

C s∗ir C s

jr

(4.36)

This inner product is positive-semidefinite on g: it vanishes identically on thosegenerators that commute with all operators in the Lie algebra (Xi , where C ∗

i� = 0)and also on all generators that are not representable as the commutator of twogenerators (Xi , where Ci

∗� = 0).The Hilbert–Schmidt inner product is a reasonable choice of inner product from

an algebraic point of view. However, there is an even more useful choice of innerproduct that provides both algebraic and geometric information. This is definedby

(Xi , X j ) = Tr R(Xi )R(X j ) =∑

r

∑s

R(Xi )s

r R(X j )r

s =∑

r

∑s

C sir C r

js

(4.37)

This inner product is called the Cartan–Killing inner product, or Cartan–Killingform. It is in general an indefinite inner product. It is used extensively in theclassification theory of Lie algebras.

The Cartan–Killing metric can be used to advantage to make further refinementson the structure theory of a Lie algebra. The vector space of the Lie algebra canbe divided into three subspaces under the Cartan–Killing inner product. The innerproduct is positive-definite, negative-definite, and identically zero on these threesubspaces:

g = V+ + V− + V0 (4.38)

The subspace V0 is a subalgebra of g. It is the largest nilpotent invariant subal-gebra of g. Under exponentiation, this subspace maps onto the maximal nilpotentinvariant subgroup in the original Lie group.

The subspace V− is also a subalgebra of g. It consists of compact (a topologicalproperty) operators. That is to say, the exponential of this subspace is a subset ofthe original Lie group that is parameterized by a compact manifold. It also forms asubalgebra in g (not invariant).

Finally, the subspace V+ is not a subalgebra of g. It consists of noncompactoperators. The exponential of this subspace is parameterized by a noncompactsubmanifold in the original Lie group.

Page 80: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

66 Lie algebras

In short, a Lie algebra has the following decomposition under the Cartan–Killinginner product

Cartan–Killingg −→

V0 nilpotent invariant subalgebraV− compact subalgebraV+ noncompact operators

inner product(4.39)

We return to the structure of Lie algebras in Chapter 8 and the classification ofsimple Lie algebras in Chapter 10.

Example The Cartan–Killing inner product on the regular representation (4.30)of sl(2; R) is

(X, X ) = tr R(X )R(X ) = tr

0 −2b 2c

−c 2a 0b 0 −2a

2

= 8(a2 + bc) (4.40)

From this we easily drive the form of the metric for the Cartan–Killing inner product:

8(a2 + bc) = (a b c

) 8 0 0

0 0 40 4 0

a

bc

(4.41)

A convenient choice of basis vectors is one that diagonalizes this metric matrix: Xa

and X± = Xb ± Xc. In this basis the metric matrix is8 0 0

0 8 00 0 −8

Xa

X+X−

(4.42)

In this representation it is clear that the operator X− spans a one-dimensionalcompact subalgebra in sl(2; R) and the generators Xa, X+ are noncompact.

We should point out here that the inner product can also be computed even moresimply in the defining 2 × 2 matrix representation of sl(2; R)

(X, X ) = tr

[a bc −a

]2

= 2(a2 + bc) (4.43)

This gives an inner product that is proportional to the inner product derived from theregular representation. This is not an accident, and this observation can be used tocompute the Cartan–Killing inner products very rapidly for all matrix Lie algebras.

4.9 Invariant metric and measure on a Lie group

The properties of a Lie algebra can be identified with the properties of the corre-sponding Lie group at the identity.

Page 81: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.9 Invariant metric and measure on a Lie group 67

Once the properties of a Lie group have been determined in the neighborhoodof the identity, these properties can be translated to the neighborhood of any othergroup operation. This is done by multiplying the identity and its neighborhood onthe left (or right) by that group operation.

Two properties that are useful to define over the entire manifold are the met-ric and measure. We assume the coordinates of the identity are (α1, α2, . . . , αn)and the coordinates of a point near the identity are (α1 + dα1, α2 + dα2, . . . , αn +dαn). If (x1, x2, . . . , xn) represents some other group operation, then the point(α1 + dα1, α2 + dα2, . . . , αn + dαn) is mapped to the point (x1 + dx1, x2 +dx2, . . . , xn + dxn) under left (right) multiplication by the group operation associ-ated with (x1, x2, . . . , xn). The displacements dx and dα are related by a position-dependent linear transformation

dxr = M(x)ri dαi (4.44)

Suppose now that the distance ds between the identity and a point with coordi-nates αi + dαi infinitesimally close to the identity is given by

ds2 = gi j (Id)dαi dα j (4.45)

Any metric can be chosen at the identity, but the most usual choice is the Cartan–Killing inner product. Can we define a metric at x , grs(x), with the property thatthe arc length is an invariant?

grs(x)dxr dxs = gi j (Id)dαi dα j (4.46)

In order to enforce the invariance condition, the metric at x , g(x), must be relatedto the metric at the identity by

g(x) = M−1(x)t g(Id)M−1(x) (4.47)

The volume elements at the identity and x are

dV (Id) = dα1 ∧ dα2 ∧ · · · ∧ dαn

dV (x) = dx1 ∧ dx2 ∧ · · · ∧ dxn =‖ M ‖ dα1 ∧ dα2 ∧ · · · ∧ dαn (4.48)

The two volume elements can be made equal by introducing a measure over themanifold and defining an invariant volume

dµ(x) = ρ(x)dV (x) = ρ(x) ‖ M(x) ‖ dV (Id) ⇒ ρ(x) =‖ M(x) ‖−1 (4.49)

Example Under the simple parameterization (4.1) of the group SL(2; R) theneighborhood of the identity is parameterized by (4.3). We move a neighborhood ofthe identity to the neighborhood of the group operation parameterized by (x, y, z)

Page 82: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

68 Lie algebras

using left multiplication as follows1 + x y

z1 + yz

1 + x

×

[1 + dα1 dα2

dα3 1 − dα1

]

=

1 + (x + dx) y + dy

z + dz1 + (y + dy)(z + dz)

1 + (x + dx)

=

(1 + x)(1 + dα1) + y dα3 (1 + x)dα2 + y(1 − dα1)

z(1 + dα1) + (1 + yz)dα3

(1 + x)zdα2 + (1 + yz)(1 − dα1)

(1 + x)

(4.50)

The linear relation between the infinitesimals (dα1, dα2, dα3) in the neighborhoodof the identity and the infinitesimals (dx, dy, dz) in the neighborhood of the groupoperation (x, y, z) can now be read off, matrix element by matrix element

dx

dy

dz

=

1 + x 0 y

−y 1 + x 0

z 01 + yz

1 + x

dα1

dα2

dα3

(4.51)

From this linear transformation we immediately compute the invariant measure bytaking the inverse of the determinant

dµ(x) = ρ(x, y, z)dx ∧ dy ∧ dz = dx ∧ dy ∧ dz

1 + x(4.52)

The invariant metric is somewhat more difficult, as it involves computing the inverseof the linear transformation (4.51). The result is

2 0 0

0 0 10 1 0

left translation−→

by (x,y,z)

2(1 + yz)

(1 + x)2− z

(1 + x)− y

(1 + x)

− z

(1 + x)0 1

− y

(1 + x)1 0

(4.53)

The invariant measure (4.52) can be derived from the invariant metric (4.53) in theusual way (see Problem 4.11).

Page 83: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.11 Problems 69

4.10 Conclusion

The structure that results from the linearization of a Lie group is called a Liealgebra. Lie algebras are linear vector spaces. They are endowed with an addi-tional combinatorial operation, the commutator [X, Y ] = (XY − Y X ), and obey theJacobi identity. Since they are linear vector spaces, many powerful tools are avail-able for their study. It is possible to define an inner product that reflects not onlythe algebraic properties of the original Lie group, but its topological propertiesas well. The properties of a Lie algebra can be identified with the properties ofthe parent Lie group in the neighborhood of the identity. These structures can bemoved to neighborhoods of other points in the group manifold by a suitable groupmultiplication.

The linearization procedure is more or less invertible (a little less than more).The inversion is carried out by the EXPonential mapping.

4.11 Problems

1. Carry out the commutator calculation for g1 = (I + εX ), g−11 = (I + εX )−1 = I −

εX + ε2 X2 − · · · , with similar expressions for g2, to obtain the same result as in(4.13). In other words, this local result is independent of the parameterization in theneighborhood of the identity.

2. The inner product of two vectors X and X ′ in a linear vector space can be computedif the inner product of a vector with itself is known. This is done by the method ofpolarization. For a real linear vector space the argument is as follows:

(X ′, X ) = 1

2

[(X ′ + X, X ′ + X ) − (X ′, X ′) − (X, X )

]a. Verify this.b. Extend to complex linear vector spaces.c. Use the result from Eq. (4.43) that (X, X ) = 2(a2 + bc) to show

(X ′, X ) = 2a′a + b′c + c′b

3. Suppose that the n × n matrix Y is defined as the exponential of an n × n matrix X ina Lie algebra: Y = eX . Show that “for Y sufficiently close to the identity” the matrixX can be expressed as

X = −∞∑

n=1

(I − Y )n

n(4.54)

Show that this expansion converges when X and Y are symmetric if the real eigenval-ues λi of Y all satisfy 0 < λi < +2. Show that if Y ∈ SL(2; R) and tr Y < −2 this

Page 84: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

70 Lie algebras

expansion does not converge. That is, there is no 2 × 2 matrix X ∈ sl(2; R) with theproperty tr eX < −2.

4. The Lie algebra of SO(3) is spanned by three 3 × 3 antisymmetric matrices L =(L1, L2, L3) = (X23, X31, X12), with

θ · L = 0 θ3 −θ2

−θ3 0 θ1

θ2 −θ1 0

=

0 θ12 θ13

θ21 0 θ23

θ31 θ32 0

= X (4.55)

Use the Cayley–Hamilton theorem to show

eθ ·L = I3 f0(θ ) + X f1(θ ) + X2 f2(θ ) (4.56)

where θ is the single invariant that can be constructed from the matrix X = θ · L:θ2 = θ2

1 + θ22 + θ2

3 . Show

f0(θ ) = cos θ or f0(θ ) = cos θ

f1(θ ) = sin(θ )/θ θ f1(θ ) = sin(θ )f2(θ ) = (1 − cos(θ ))/θ2 θ2 f2(θ ) = 1 − cos(θ )

5. The Lie algebra for the matrix group SO(n) consists of antisymmetric n × n matrices.Show that a useful set of basis vectors (matrices) consists of the n(n − 1)/2 matricesXi j = −X ji (1 ≤ i �= j ≤ n) with matrix elements (Xi j )αβ = δiαδ jβ − δiβδ jα .a. Show that these matrices satisfy the commutation relations

[Xi j , Xrs] = Xisδ jr + X jrδis − Xirδ js − X jsδir (4.57)

b. Show that the operators Xi j = xi∂ j − x j∂i satisfy isomorphic commutation rela-tions.

c. Show that bilinear products of boson creation and annihilation operators Bi j =b†

i b j − b†j bi (1 ≤ i �= j ≤ n) satisfy isomorphic commutation relations.

d. Show that bilinear products of fermion creation and annihilation operators Fi j =f †i f j − f †j fi (1 ≤ i �= j ≤ n) satisfy isomorphic commutation relations.

6. The Jacobi identity for operators D, Y, Z (replace X by D in Eq. (4.17)) can berewritten in the form

[D, [Y, Z ]] = [[D, Y ], Z ] + [Y, [D, Z ]] (4.58)

Show this. Compare with the expression for the differential operator

d( f ∧ g) = (d f ) ∧ g + f ∧ (dg)

It is for this reason that the Jacobi identity is sometimes called a differential identity.

7. For the matrix Lie algebra so(4) the defining matrix representation consists of 4 × 4antisymmetric matrices while the regular representation consists of 6 × 6 antisym-metric matrices. Construct the defining and regular matrix representations for the

Page 85: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.11 Problems 71

element ai j Xi j in the Lie algebra:

X =∑

i j

ai j Xi j →{

def(X ) = ∑ai jdef(Xi j )

reg(X ) = ∑ai j reg(Xi j )

(4.59)

Construct the Cartan–Killing inner product using these two different matrix repre-sentations:

tr def(X )def(X ) ← (X, X ) → tr reg(X )reg(X ) (4.60)

Show that the two inner products are proportional. What is the proportionality con-stant? How does this result extend to SO(n)? to SO(p, q)? Is there a simple relationbetween the proportionality constant and the dimensions of the defining and regularrepresentations?

8. Assume a Lie algebra of n × n matrices is noncompact and its Cartan–Killing formsplits this Lie algebra into three subspaces:

g → V0 + V− + V+

Show that the subspace V− exponentiates onto a compact manifold. Do this by showingthat the basis matrices in V− have eigenvalues that are imaginary or zero, so thatEXP(V−) is multiply periodic. Apply this construction to the noncompact groupsSO(3, 1) and SO(2, 2). Show EXP(V−) is a two-sphere S2 for SO(3, 1) and a two-torus T 2 for SO(2, 2).

9. Construct the infinitesimal generators for the group SO(3) using the parameterizationsproposed in Problems 13 and 14 in Chapter 3.

10. Use the exponential parameterization of SO(3) to construct the linear transformationM (Eq. 4.44) describing displacements from the identity to displacements at the groupoperator eθ ·L ∈ SO(3). From this construct the invariant density ρ(θ ) and the metrictensor gµν(θ ). Give a reason for the strange behavior (singularities) that these invariantquantities exhibit.

11. Compute the determinant of the metric tensor (4.53) on the group SL(2; R). Showthat the square root of the determinant is equal to the measure, in accordance with thestandard result of Riemannian geometry that dV (x) =‖ g(x) ‖1/2 dn x . Discuss theadditional factors of 2 and −1 that appear in this calculation.

12. An inner product is (x, x) is imposed on a real n-dimensional linear vector space.It is represented by a real symmetric nonsingular n × n matrix grs = (er , es), wherex = ei x i . The inverse matrix, grs , is well defined.a. Lie group G preserves inner products. If y = Gx, (y, y) = (x, x). Show Gt gG = g.b. Show the Lie algebra H of G satisfies H t g + gH = 0.c. Show that the infinitesimal generators of G are Xrs = grt x t∂s − gst x t∂r .

Page 86: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

72 Lie algebras

d. Show that the operators Xrs satisfy commutation relations[Xi j , Xrs

] = +Xis g jr + X jr gis − Xir g js + X js gir

13. Every real unimodular 2 × 2 matrix M can be written in the form M = SO , where Sis a real symmetric unimodular matrix and O is a real orthogonal matrix.

In group Relation In algebraSt = S+1 det(S) = +1 S = e� Tr � = 0 �t = +�

Ot = O−1 det(O) = +1 O = eA Tr A = 0 At = −A

a. Show that M Mt = S2 = e2� .b. Show O = S−1 M = e−� Mc. Write S as a power series expansion in �.d. Write � as a power series expansion in S − I2.Under what conditions are these expansions valid?

14. Extend the result of the previous problem to complex n × n matrices M = HU , withM arbitrary but nonsingular, H † = H+1 hermitian and U † = U−1 unitary.

15. Transfer matrices have been described in Chapter 3, Problem 24. In one dimensionthe transfer matrix for a scattering potential, with free particles incident from the leftor right with momentum �kL or −�kR , has the form (Gilmore, 2004)[

αR + iαI βR + iβI

βR − iβI αR − iαI

](4.61a)

The matrix elements are given explicitly by

2αR = +m11 + kR

kLm22 2αI = +m12kR − k−1

L m21

2βR = +m11 − kR

kLm22 2βI = −m12kR − k−1

L m21

(4.61b)

The real quantities mi j are the four matrix elements of a group operation in SL(2; R).They are energy dependent. By appropriate choice of �kL = �kR and the matrixelements mi j , construct three infinitesimal generators for the group of the transfermatrix for scattering states. Show that they are[

i 00 −i

] [0 i−i 0

] [0 −1

−1 0

](4.61c)

Show that these three matrices span the Lie algebra of the group SU (1, 1).

16. The transfer matrix for a potential that possesses bound states has the form[α1 + α2 β1 + β2

β1 − β2 α1 − α2

](4.62a)

Page 87: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

4.11 Problems 73

The matrix elements are given explicitly by

2α1 = +m11 + κR

κLm22 2α2 = −m12κR − κ−1

L m21

2β1 = +m11 − κR

κLm22 2β2 = +m12κR − κ−1

L m21

(4.62b)

The parameters κR and κL describe the decay length of the exponentially decayingwavefunction in the asymptotic left- and right-hand regions of the potential. The realquantities mi j are the four matrix elements of a group operation in SL(2; R). They areenergy dependent. By appropriate choice of κL = κR and the matrix elements mi j ,construct the infinitesimal generators for the group of the transfer matrix for boundstates. Show that they are[

1 00 −1

] [0 1

−1 0

] [0 11 0

](4.62c)

Show that these three matrices span the Lie algebra of the group SL(2; R). Arguethat there ought to be interesting relations (e.g., analytic continuations) between thescattering states (e.g., resonances) and bound states through the relation betweenthe groups SL(2; R) and SU (1, 1), which are isomorphic. How are the matrices(4.61a) and (4.62a), the matrix elements (4.61b) and (4,62b), and the infinitesi-mal generators (4.61c) and (4.62c) related to each other by analytic continuation?(Hint: k∗ =

√2m(E − V∗)/�2 for E > V∗ and κ∗ =

√2m(V∗ − E)/�2 for E < V∗,

∗ = L , R.)

Page 88: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5

Matrix algebras

The Lie algebras of the matrix Lie groups described in Chapter 3 areconstructed. This is done by linearizing the constraints defining thesematrix groups in the neighborhood of the identity operation.

5.1 Preliminaries

Lie algebras for the matrix groups treated in Chapter 3 are computed by linearizingthe defining conditions in the neighborhood of the identity. The general lineargroups GL(n; F) have no defining condition (the only condition is det(M) �= 0),while Examples (2)–(7) are already defined by linear constraints. Examples (8)–(11) are defined by bilinear and quadratic constraints that are linearized by applyingthe constraint to matrices infinitesimally close to the identity: I + ε A. The matricesin the Lie algebra are subject to easily derived linear constraints:

(I + ε A)†G(I + ε A) = G

G + ε(A†G + G A) + O(ε2) = G

A†G + G A = 0

(5.1)

The special linear groups are defined by the n-linear constraint

det(I + ε A) = 1 + ε tr(A) + O(ε2) = 1

tr(A) = 0(5.2)

The matrix Lie algebras of the matrix Lie groups given in Chapter 3 are summa-rized below.

5.2 No constraints

1. gl(n; F). This algebra consists of arbitrary n × n matrices over the field F.All remaining matrix algebras in this list are subalgebras of gl(n; F).

74

Page 89: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.3 Linear constraints 75

p

3.2.

0

ut ( p,q) ht ( p,q)

0 0q

4.

0

0 0

pv

r

=

=

∩q

0

0

ut ( p,q,r) ut ( p,q + r) ut(p + q,r)∩

5. 6.

0 0

00

0

00

0

0

7.p

q

λ1λ2

λn

λ3

sol (n) nil (n) a (p,q)

Figure 5.1. Structure of the matrix algebras for groups defined by linear constraints.

5.3 Linear constraints

The Lie algebras of the matrix groups have the same structures as the matrix groups.The only difference is that matrix elements that are constrained to be +1 in thegroups are replaced by 0 in the algebra. All matrix algebras of matrix groupsdefined by linear constraints are summarized in Fig. 5.1.

2. ut(p, q). Upper triangular algebras. The matrix algebra has the same structureas the group U T (p, q):

miα = 0p + 1 ≤i ≤p + q

1 ≤α ≤p(5.3)

3. ht(p, q). The algebra for this class of groups is defined by the condition

mi j = 0p + 1 ≤i ≤p + q

1 ≤ j ≤p + q(5.4)

Page 90: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

76 Matrix algebras

Example The group of affine transformations of the straight line consists of ma-trices

M(a, b) =[

a b0 1

](5.5)

The identity is at (a, b) = (1, 0). Its algebra is spanned by the two operators

∂ M

∂a

∣∣∣∣(1,0)

= Xa =[

1 00 0

]∂ M

∂b

∣∣∣∣(1,0)

= Xb =[

0 10 0

](5.6)

The commutation relations are given by [Xa, Xb] = Xb.

4. ut(p, q, r ). This matrix algebra is identical in structure to the parent group.

Example A very useful six-parameter subalgebra of ut(1, 2, 1) is given by

0 l r −2δ

0 η 2R −r0 −2L −η l0 0 0 0

= ηXη + R X R + L X L + r Xr + l Xl + δXδ (5.7)

The commutation relations of the six infinitesimal generators of this matrix Liealgebra are summarized in the table below. The operator in the i th row and j thcolumn is

[Xi , X j

].

Xη X R X L Xr Xl Xδ

Xη 0 2X R −2X L Xr −Xl 0X R 0 −4Xη 0 −2Xr 0X L 0 2Xl 0 0Xr 0 −Xδ 0Xl 0 0Xδ 0

n + 12 I a†a† aa a† a I

n + 12 I 0 2a†a† −2aa a† −a 0

a†a† 0 −4(n + 12 I ) 0 −2a† 0

aa 0 2a 0 0a† 0 −I 0a 0 0I 0

(5.8)

The table inherits the antisymmetry of the commutator, so only one half has beenpresented. It is clear that there is an isomorphism between this matrix algebraand the algebra of the photon energy operator n = a†a + 1

2 , two-photon creation

Page 91: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.3 Linear constraints 77

and annihilation operators a†a† and aa, single-photon creation and annihilationoperators a† and a, and the identity operator I = [a, a†]. We observe that the 4 × 4matrix Xδ representing the operator I = [a, a†] is not diagonal. It need not be, aslong as it obeys the appropriate commutation relations.

5. sol(n) = ut(1, 1, 1, . . . , 1). This matrix algebra is also identical in structure toits parent group. A very useful four-parameter subalgebra of ut(1, 1, 1) is given bymatrices of the following form

0 l δ

0 η r0 0 0

= ηXη + l Xl + r Xr + δXδ (5.9)

The following commutation properties are easily verified

[Xη, Xr ] = +Xr [a†a, a†] = +a†

[Xη, Xl] = −Xl [a†a, a] = −a

[Xl, Xr ] = Xδ [a, a†] = I

[Xη, Xδ] = 0 [a†a, I ] = 0

(5.10)

6. nil(n). Nilpotent matrices have an upper triangular structure, with +1 along thediagonal in the group and zeroes along the diagonal in the algebra. The three gen-erators of the algebra of nilpotent 3 × 3 matrices have structure and commutationrelations

0 l δ

0 0 r0 0 0

= l Xl + r Xr + δXδ (5.11)

[Xl, Xr ] = Xδ [a, a†] = I

[Xl, Xδ] = 0 [a, I ] = 0

[Xr , Xδ] = 0 [a†, I ] = 0

(5.12)

These commutation relations are isomorphic to Heisenberg commutation relations.This is easily seen by setting η = 0 in (5.9). As a result, a number of difficultcomputations involving this algebra can be replaced by much simpler computationsinvolving only 3 × 3 matrices.

7. a(p, q). The matrix algebra for the commutative group of Example (7) in Chap-ter 3 (see (3.13)) consists of matrices having the form shown in Fig. 5.1.

Page 92: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

78 Matrix algebras

5.4 Bilinear and quadratic constraints

The nonlinear constraints that define the metric-preserving matrix Lie groups areeasily converted to linear constraints that define their Lie algebras following theprocedure described in (5.1) above.

8. Compact metric-preserving groups Matrices M for the algebras of these groupssatisfy

R o(n) orthogonalM† + M = 0 C u(n) unitary

Q sp(n) symplectic(5.13)

The algebras of the orthogonal, unitary, and symplectic groups consist of n × nantihermitian matrices. The Lie algebras for the groups O(3) and U (2) are

o(3) = 0 θ3 −θ2

−θ3 0 θ1

θ2 −θ1 0

=

∑i

θi Li

(5.14)

u(2) = 1

2

[i x0 + i x3 i x1 + x2

i x1 − x2 i x0 − i x3

]= i

2

∑µ

xµσµ

The four 2 × 2 matrices σµ are called Pauli spin matrices.

9. Noncompact metric-preserving groups Matrices M for the algebras of thesegroups satisfy

R o(p, q)M† Ip,q + Ip,q M = 0 C u(p, q)

Q sp(p, q)(5.15)

The algebras of groups that leave invariant a nonsingular symmetric indefinitemetric are most simply treated by determining their block diagonal structure. Forexample, the algebra so(2, 1) for the Lorentz group in the plane is[

At Ct

Bt D

] [I2 00 −1

]+

[I2 00 −1

] [A BC D

]= 0 (5.16)

From this, we conclude

A = −At

D = −D = 0Bt = C

(5.17)

Page 93: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.4 Bilinear and quadratic constraints 79

The matrix algebra of so(2, 1) is explicitly 0 θ v1

−θ 0 v2

v1 v2 0

(5.18)

By an identical argument the matrix Lie algebra for the Lorentz group so(3, 1) is

0 θ3 −θ2 v1

−θ3 0 θ1 v2

θ2 −θ1 0 v3

v1 v2 v3 0

(5.19)

10. Antisymmetric nonsingular metric-preserving groups Matrices M for thealgebras of these groups satisfy

M†G + G M = 0 Gt = −G F ={

R sp(G; R)C sp(G; C)

(5.20)

Since G is nonsingular, M = −G−1 M†G and

tr M = −tr G−1 M†G = −tr M† = −tr M∗ = 0 (5.21)

Therefore the trace of these matrices is imaginary.

11. Singular metric-preserving groups Matrices M for the algebras of thesegroups satisfy

R o(n; G)MG + G M† = 0 C u(n; G)

Q sp(n; G)(5.22)

In the case that the (p + q) × (p + q) matrix G has singular block diagonal struc-ture [ g 0

0 0 ] with det(g) �= 0, this constraint reduces to[A BC D

] [g 00 0

]+

[g 00 0

] [A† C†

B† D†

]=

[0 00 0

]⇒ Ag + g A† = 0 C = C† = 0 B, D arbitrary (5.23)

In particular, in the case of real 4 × 4 matrices with singular symmetric metricdiag(1, 1, 1, 0) the Lie algebra is

0 θ3 −θ2 t1

−θ3 0 θ1 t2θ2 −θ1 0 t30 0 0 s4

(5.24)

Page 94: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

80 Matrix algebras

Here the parameters θi describe rotations about the i th coordinate axis and theti describe displacements of the origin along the i th coordinate direction. Theparameter s4 describes “scaling” of the time axis: t ′ = es4 t . If s4 is set to zero(traceless condition, see the following Section 5.5) the Lie algebra is that of theEuclidean group E(3) = I SO(3) (inhomogeneous rotation group in R3).

The matrix Lie algebra for the Poincare group I SO(3, 1) (3.26) is obtained bysimilar arguments using a singular 5 × 5 metric G = diag(1, 1, 1, −1, 0). The Liealgebra is (setting the trace equal to zero):

0 θ3 −θ2 v1 t1

−θ3 0 θ1 v2 t2θ2 −θ1 0 v3 t3v1 v2 v3 0 t40 0 0 0 0

(5.25)

The Galilei group (3.27) has the following 5 × 5 matrix Lie algebra, obtained by“contraction” (cf., Chapter 13) from the Lie algebra of I SO(3, 1):

0 θ3 −θ2 v1 t1

−θ3 0 θ1 v2 t2θ2 −θ1 0 v3 t30 0 0 0 t40 0 0 0 0

(5.26)

5.5 Multilinear constraints

12. Special linear groups have algebras that satisfy the zero trace condition

tr M = 0 F =

R sl(n, R)C sl(n, C)Q sl(n, Q)

(5.27)

The exponential of a matrix with zero trace is a matrix with determinant +1:

det(eM

) = etrM (5.28)

5.6 Intersections of groups

The Lie algebra for the intersection of two groups is the intersection of the groups’Lie algebras. The important algebra su(n) is obtained from the intersection of u(n)and sl(n; C) (cf. (5.14)). For example

su(2) = u(2) ∩ sl(2; C) = i

2

[x3 x1 − i x2

x1 + i x2 −x3

](5.29)

Page 95: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.9 Basis vectors 81

5.7 Algebras of embedded groups

The Lie algebras of the embedded groups are constructed in a straightforward way.The Lie algebra of U (n) consists of n × n antihermitian matrices M :

M ∈ u(n) ⇒ (M†)i j = −M∗j i (5.30)

The Lie algebra of OU (2n) is obtained from the Lie algebra of U (n) by replacingeach of the n(n − 1)/2 complex matrix elements Mi j (i < j) (M ∈ u(n)) above thediagonal of M by a 2 × 2 real matrix, and each of the diagonal matrix elementsMii by a real 2 × 2 matrix representing an imaginary complex number (a = 0,b arbitrary in Eq. (3.3)). The matrix elements Mi j below the diagonal (i > j)are obtained from the antihermiticity condition. The result is a real antisymmetric2n × 2n matrix with the propertyu(n) → ou(2n) ⊂ o(2n). The dimension ofou(2n)is the dimension of u(n): 2 × n(n − 1)/2 + 1 × n = n2.

The Lie algebra of Sp(n) consists of n × n antihermitian matrices M over Q:

M ∈ sp(n) ⇒ (M†)i j = −M∗j i (5.31)

The adjoint is taken over the quaternion field. The Lie algebra of U Sp(2n) is ob-tained from the Lie algebra of Sp(n) by replacing each of the n(n − 1)/2 quaternionmatrix elements Mi j (i < j) (M ∈ sp(n)) above the diagonal of M by a 2 × 2 com-plex matrix, and each of the diagonal matrix elements Mii by a complex 2 × 2matrix representing an imaginary quaternion (q0 = 0, qi arbitrary in Eq. (3.4)). Thematrix elements Mi j below the diagonal (i > j) are obtained from the antihermitic-ity condition. The result is a real antihermitian 2n × 2n matrix with the propertysp(n) → usp(2n) ⊂ su(2n). The dimension of usp(2n) is the dimension of sp(n):4 × n(n − 1)/2 + n × 3 = 2n(2n + 1)/2.

5.8 Modular groups

The modular group GL(n; Z) has no Lie algebra because it is not a continuousgroup.

5.9 Basis vectors

In each of these matrix algebras there is usually a clear choice of basis vectors.A useful choice is made by choosing a basis set that is orthogonal with respect tosome inner product on the space of square matrices. In (5.6), (5.7), (5.9), (5.14),(5.18)–(5.19), and (5.24)–(5.26) the infinitesimal generators have been chosen tobe orthogonal with respect to a convenient inner product.

As discussed in Section 4.8, the Hilbert–Schmidt inner product on rectangularmatrices

(X, Y ) = tr X †Y (5.32)

Page 96: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

82 Matrix algebras

is usually very useful. This inner product is positive–definite: (X, X ) = 0 ⇒ X =0. For example, for the algebra so(2, 1) (Eq. (5.18)), if X, X ′ are two 3 × 3 matricesin the algebra

(X ′, X ) = tr X ′†X = 2(+θ ′θ + v′1v1 + v′

2v2) (5.33)

There is a yet more useful inner product that can be defined on matrix Liealgebras. This is an analog of the Cartan–Killing inner product

(X, Y ) = tr XY (5.34)

For so(2, 1) this inner product is

(X ′, X ) = tr X ′ X = 2(−θ ′θ + v′1v1 + v′

2v2) (5.35)

This inner product is not positive-definite. For giving up positive-definiteness wegain information of both an algebraic and a topological nature. At the algebraiclevel, the subspace on which this inner product is identically zero is the largestnilpotent invariant subalgebra (subalgebra of matrices equivalent to upper trian-gular matrices) in the original algebra. The subspace on which the inner productis negative-definite consists of compact group generators, and the subspace onwhich it is positive-definite consists of noncompact generators. When appropriatemeasures are taken (in Chapter 11), the negative-definite subspace closes undercommutation, and so describes a compact Lie group.

The Cartan–Killing inner product is defined in terms of the structure constants ofa Lie algebra. These are incorporated into the regular matrix representation of theLie algebra. The Cartan–Killing inner product (X, Y ) is specifically defined as thetrace of the product of the regular matrix representatives of X and Y . Other innerproducts are conveniently defined when other matrix representations are used. Inmany instances it is very convenient to use the defining matrix representation ofthe Lie algebra: this representation certainly contains no less information than theregular matrix representation. For a large class of Lie algebras (simple Lie algebras)these two different inner products are strictly proportional.

It is remarkable that this metric contains information of both a topological and analgebraic nature. To illustrate the difference between the compact and noncompactcases, we consider 2 × 2 matrices

X =[

0 +1−1 0

](X, X ) = −2 eθ X =

[cos θ sin θ

− sin θ cos θ

]

Y =[

0 +1+1 0

](Y, Y ) = +2 eθ X =

[cosh θ sinh θ

sinh θ cosh θ

] (5.36)

Page 97: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.11 Problems 83

In the compact case, the group element exp(θ X ) periodically returns to the identityas θ increases. Therefore the group can be parameterized by a finite range ofparameter values: −π ≤ θ ≤ +π , with −π and +π identified. On the other hand, inthe noncompact case the group is parameterized by the entire line −∞ < θ < +∞.The underlying manifolds for the two groups are the circle S1 and the line R1.

In the compact case the simplification of parameterizing the group with a boundedsubset of the Lie algebra (−π ≤ θ ≤ +π ) is somewhat offset by the complicationof matching boundary conditions – identifying the group operations parameterizedby −π and +π . In the noncompact case the simplification of not having to worryabout matching boundary conditions is somewhat offset by the fact that it takesthe entire subspace in the Lie algebra, Rk , where k is the number of noncompactgenerators, to parameterize this piece of the group. This piece of the group istopologically identical to Rk , that is, it is Euclidean. These remarks will be clarifiedand elaborated on in Chapter 7.

5.10 Conclusion

In this chapter we have constructed the Lie algebras for all the matrix Lie groupsdefined in Chapter 3. This is done by linearizing the constraints that define theoriginal matrix Lie groups in the neighborhood of the identity. For the generallinear groups which are defined by no constraints, the Lie algebras gl(n; F) arealso defined by no constraints. For the Lie groups defined by linear constraints,linearization is trivial and produces a matrix Lie algebra having structure identicalto that of the parent Lie group. Transition from the Lie group to the Lie algebrareplaces nonlinear constraints by linear conditions defining the Lie algebras ofthe metric-preserving groups (G = In, Ip,q , nonsingular antisymmetric, generalnonsingular) and the unimodular groups. One natural way to choose basis vectorsin these Lie algebras has been described.

5.11 Problems

1. The Lie group U T (1, 1) has Lie algebra of the form

A =[ a b

0 c

]= aXa + bXb + cXc

Show that in this matrix Lie algebra an inner product can be defined by (A, A) =tr(A)2 = a2 + c2.

2. Show that the regular representation of the matrix Lie algebra ut(1, 1) given in Prob-lem 1 is

R(A) =

0 0 −b

0 0 +b

0 0 a − c

Xa

Xc

Xb

Page 98: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

84 Matrix algebras

with the ordering of the basis vectors given on the right. Show that the Cartan–Killinginner product in the regular representation is (A, A) = trR(A)2 = (a − c)2. The innerproduct in the regular representation suggests that the linear combination Xa + Xc

commutes with all operators in the Lie algebra. Is this true?

3. Write down the algebra inclusions gl(1; R) ⊂ gl(1; C) ⊂ gl(1; Q) explicitly in termsof the 2 × 2 complex matrices as defined in (3.3) and (3.4).

4. Construct the table (analogous to (5.8)) giving the commutation relations for thephoton energy operator n = a†a + 1

2 , creation and annihilation operators a† and a,and the identity operator I . Compare with a table for the commutation relations ofthe matrices Xη, Xr , Xl , Xδ defined in (5.9). Show that the two Lie algebras areisomorphic. The photon number operator is n = a†a and the photon energy operatoris E = (a†a + 1

2 )�ω → a†a + 12 for �ω = 1.

5. Cartan decomposition Assume a matrix Lie algebra has a block diagonal structuregiven by

Z =[

A BC D

]=

[A 00 D

]+

[0 BC 0

]

g = h + p

Show that this decomposition satisfies the commutation relations

[h, h] ⊆ h

[h, p] ⊆ p

[p, p] ⊆ h

This means that if X, X ′ ∈ h and Y, Y ′ ∈ p, then [X, X ′] ∈ h, [X, Y ] ∈ p, [Y, Y ′] ∈ h.Conclude that the subspace h is a subalgebra of g. Is p a subalgebra (under whatconditions is p a subalgebra)?

6. Show that an inner product for the Cartan decomposition given in the previous problemis

(Z , Z ) = tr Z2 = tr A2 + tr BC + tr C B + tr D2

If X = [ A 00 D ] ∈ h and Y = [ 0 B

C 0 ] ∈ p, then

(X, X ) = tr (A2 + D2) (Y, Y ) = tr (BC + C B)

Show that X and Y are orthogonal under this inner product: (X, Y ) = 0.

7. The Lie algebra so(p, q) has the structure [ A BBt C ] where the p × p and q × q matrices

A and C satisfy At = −A and Ct = −C . If X = [ A 00 C ] ∈ h and Y = [ 0 B

Bt 0 ] ∈ p, show• (X, Y ) = 0• (X, X ) ≤ 0, (X, X ) = 0 ⇒ X = 0• (Y, Y ) ≥ 0, (Y, Y ) = 0 ⇒ Y = 0

Page 99: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.11 Problems 85

These results are summarized by

(h, h) ≤ 0(h, p) = 0(p, p) ≥ 0

8. The Lie algebra su(p, q) has the structure [ A BB† C ] where the p × p and q × q matrices

A and C satisfy A† = −A, C† = −C , and tr(A + C) = 0. If X = [ A 00 C ] ∈ h and

Y = [ 0 BB† 0 ] ∈ p, then show once again that

(h, h) ≤ 0(h, p) = 0(p, p) ≥ 0

Show that (X, X ) = 0 ⇒ X = 0 and (Y, Y ) = 0 ⇒ Y = 0.

9. The Lie algebra for sl(n; R) has a decomposition in terms of real antisymmetric andtraceless symmetric matrices At = −A and Bt = B with tr B = 0:

sl(n; R) = A + Bg = h + p

Show

[h, h] ⊆ h (h, h) ≤ 0[h, p] ⊆ p and (h, p) = 0[p, p] ⊆ h (p, p) ≥ 0

Show that (A, A) = 0 ⇒ A = 0 and (B, B) = 0 ⇒ B = 0.

10. The Lie algebra for sl(n; C) has a decomposition in terms of traceless antihermitianmatrices A† = −A and traceless hermitian matrices H † = H :

sl(n; C) = antihermitian + hermitiang = h + p

Show

[h, h] ⊆ h (h, h) ≤ 0[h, p] ⊆ p and (h, p) = 0[p, p] ⊆ h (p, p) ≥ 0

Show that (A, A) = 0 ⇒ A = 0 and (H, H ) = 0 ⇒ H = 0.

11. Assume that g is a Lie algebra with a Cartan decomposition g = h + p, with commu-tation relations and inner product properties given by

[h, h] ⊆ h (h, h) ≤ 0[h, p] ⊆ p and (h, p) = 0[p, p] ⊆ h (p, p) ≥ 0

Page 100: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

86 Matrix algebras

Show that if every n × n matrix B in p is multiplied by i and the resulting algebra isdefined by g′ = h + ip = h + p′ then

[h, h] ⊆ h (h, h) ≤ 0[h, p′] ⊆ p′ and (h, p′) = 0[p′, p′] ⊆ h (p′, p′) ≤ 0

In short, noncompact algebras that satisfy a Cartan decomposition can be analyticallycontinued to compact algebras.

12. Extend the Cartan decomposition and analytic continuation arguments to the quater-nion algebra g = sl(n; Q) with respect to the subalgebra h = sl(n; C).

13. A matrix Lie algebra has the form

A =

0 θ3 −θ2 b1 t1−θ3 0 θ1 b2 t2θ2 −θ1 0 b3 t3

µb1 µb2 µb3 0 t4σ t1 σ t2 σ t3 −µσ t4 0

1

2(A, A) = −(θ · θ ) + µ(b · b) + σ (t · t) − µσ t2

4

Show

µ σ Algebra Singular subspace+1 +1 so(3, 2)−1 +1 so(4, 1)−1 −1 so(5)+1 0 Poincare translations tµ0 0 Galilei translations tµ, boosts b

14. Assume that g = A, where A is a Lie algebra of n × n matrices on which the in-ner product is negative-definite: tr A2 ≤ 0, = 0 ⇒ A = 0. Then show that EXP(t A)returns to any neighborhood of the identity In if t becomes large enough. If the eigen-values of A are rationally related (λi = γ ni , ni are integers, γ �= 0 is rational orirrational), EXP(t A) returns periodically to In . What is t0, the minimum period in t?

15. Use the parameterization of so(3) given in Problem 3.14. Show that the differentials(dx, dy, dz) of a point in the neighborhood of (x, y, z) are related to the displacements(δx, δy, δz) in the neighborhood of the identity by

dxdydz

=

m11 0 −y

0 m11 x0 m21 m22

δx

δyδz

Use the values you constructed for the matrix elements mi j to construct explicitly themetric tensor g(x, y, z) and the invariant measure dµ(x, y, z) on the group SO(3)with this parameterization.

Page 101: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

5.11 Problems 87

16. g is a matrix Lie algebra. Show that if the matrix subspaces h and p defined belowexist in the algebra (g ∩ h = h, g ∩ p = p),

h p

g + g∗ g − g∗

g − gt g + gt

g − g† g + g†

then the following commutation relations are satisfied:

[h, h] ⊆ h [h, p] ⊆ p [p, p] ⊆ h

Page 102: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6

Operator algebras

Lie algebras of matrices can be mapped onto Lie algebras of operatorsin a number of different ways. Three useful matrix algebra to operatoralgebra mappings are described in this chapter.

6.1 Boson operator algebras

It is possible to construct useful operator algebras from Lie algebras. An operatorLie algebra can be constructed from a Lie algebra of n × n matrices by introducinga set of n independent boson creation (b†

i ) and annihilation (b j ) operators that obeythe commutation relations

[bi , b†j ] = I δi j (6.1)

with all other commutators (e.g., [bi , b j ], [b†i , b†

j ], [bi , I ], [b†j , I ]) equal to zero.

The operator algebra is constructed from the matrix algebra by associating to eachmatrix A the operator A that is a linear combination of creation and annihilationoperators:

A → A = b†Ab =∑

i

∑j

b†i Ai j b j (6.2)

The matrices and their associated operators have isomorphic commutation relations[A,B

] = [b†

i Ai j b j , b†r Brsbs

]= Ai j Brs

[b†

i b j , b†r bs

]= Ai j Brs

(b†

i δ jr bs − b†rδsi b j

)= b†

i Ai j B jsbs − b†r Brs As j b j

= b†i [A, B]i j b j

= C (6.3)

88

Page 103: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6.2 Fermion operator algebras 89

where [A, B] = C . This argument is invertible. An algebra of operators bilinear inboson creation and annihilation operators for n independent modes has an isomor-phic n × n matrix algebra (or matrix representation)

[A, B] = C ⇔ [A,B] = C A =∑

i j

b†i Ai j b j (6.4)

Remark The 2n + 1 operators bi , b†j , I (1 ≤ i, j ≤ n) span the Heisenberg

algebra.

6.2 Fermion operator algebras

The success of the calculation above does not depend on the boson commutationrelations (6.1). It depends, rather, on the commutation relations of bilinear productsof these operators

[b†i b j , b†

r bs] = b†i bsδ jr − b†

r b jδsi (6.5)

Any set of operators Xi j that satisfies isomorphic commutation relations

[Xi j , Xrs] = Xisδ jr − Xr jδsi (6.6)

can be used in place of the bilinear combinations b†i b j :

A → A =∑

i j

Ai j Xi j (6.7)

Another useful set of operators with this property is obtained from the fermioncreation ( f †i ) and annihilation ( f j ) operators for n independent modes. These oper-ators do not even satisfy commutation relations. Rather, they satisfy anticommu-tation relations {

fi , f †j} = fi f †j + f †j fi = I δi j (6.8)

with all other bilinear anticommutators (e.g., { fi , f j }, { f †i , f †j }) equal to zero. Bi-linear combinations of fermion operators satisfy commutation relations of theform (6.6), for[

f †i f j , f †r fs] = f †i f j f †r fs − f †r fs f †i f j

= f †i (δ jr − f †r f j ) fs − f †r (δis − f †i fs) f j

= f †i fsδ jr − f †r f jδsi (6.9)

Page 104: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

90 Operator algebras

As a result, matrix Lie algebras can be associated with bilinear products of eitherboson or fermion operators:

[A, B] = C ⇔ [A,B] = C A =∑

i j

f †i Ai j f j (6.10)

These two matrix algebra → operator algebra mappings are useful for con-structing particular classes of representations for the unitary group U (n) and itssubgroup SU (n). The mapping to a boson operator algebra greatly simplifies theconstruction of the symmetric representations of U (n). The mapping to a fermionoperator algebra greatly simplifies the construction of the antisymmetric represen-tations of U (n). A closely related mapping allows an elegant construction of thespin representations of the orthogonal groups.

6.3 First order differential operator algebras

Yet another useful set of operators that satisfies the commutation relations (6.6) arethe first order differential operators

Xi j → xi∂ j = xi∂

∂x j(6.11)

Then

[A, B] = C ⇔ [A,B] = C A =∑

i j

xi Ai j∂ j =∑

i j

Ai j Xi j (6.12)

To illustrate the use of this operator combination, we treat the matrix algebra so(3)of the orthogonal group SO(3)

so(3) = 0 θ3 −θ2

−θ3 0 θ1

θ2 −θ1 0

= θ · L (6.13)

The operator algebra is

( x1 x2 x3 )

0 θ3 −θ2

−θ3 0 θ1

θ2 −θ1 0

∂1

∂2

∂3

= θ ·L (6.14)

where L1 = x2∂3 − x3∂2, L2 = x3∂1 − x1∂3, L3 = x1∂2 − x2∂1. The two algebrashave isomorphic commutation relations

[Li , L j ] = −εi jk Lk [Li ,L j ] = −εi jkLk (6.15)

where Li are 3 × 3 matrices and Li are first order differential operators.

Page 105: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6.3 First order differential operator algebras 91

As another example, we treat the Lie algebra for the group E(2) = I SO(2) ofrigid motions (translations and rotations) in the x–y plane, whose matrix algebramay be taken in the form

0 θ 0−θ 0 0t1 t2 0

= θ Lz + ti Ti (6.16)

This describes rotations about an axis perpendicular to the x–y plane through anangle θ and displacements in the x and y directions by t1 and t2. The associatedoperator algebra is

( x1 x2 1 )

0 θ 0

−θ 0 0t1 t2 0

∂1

∂2

1

= θLz + tiTi (6.17)

whereLz = x1∂2 − x2∂1 and Ti = ∂i . The matrix algebra and operator algebra haveisomorphic commutation relations.

Differential operator realizations of Lie algebras come about in a natural way.This is illustrated by two simple examples. The general procedure can easily beinferred from these examples. Both involve the group of affine transformations ofthe real line parameterized by points (a, b) in R2 as follows

(a, b) →[

ea b0 1

](6.18)

Imagine a function defined for every point p in R1. Once a coordinate system Sis chosen a coordinate, x(p), can be introduced and the function can be writtenexplicitly as a function of x

f (p)

↓ ↓ (6.19)

fS[x(p)] = fS′[x ′(p)]

If a new coordinate system S′ is chosen, the value of the function at p remainsunchanged but the new coordinate of p, x ′(p), is different. Therefore the functionsfS and fS′ must be different. We ask: how is fS′ related to fS?

To answer this question, assume x ′(p) and x(p) are related by an infinitesimalgroup transformation [

x ′

1

]=

[1 + da db

0 1

] [x1

](6.20)

Page 106: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

92 Operator algebras

Then

fS′[x ′(p)] = fS[x(x ′(p))] (6.21)

We solve for x in terms of x ′ by inverting the linear relation (6.20)

fS′[x ′(p)] = fS[x ′(1 − da) − db]

= fS[x ′] − da x ′ ∂ fS

∂x ′ − db∂ fS

∂x ′ (6.22)

The infinitesimal generators that transform the function at p are

Xa = −x ′ ∂

∂x ′ Xb = − ∂

∂x ′ (6.23)

These operators have commutation relations that are isomorphic with those of theoriginal matrix group

[Xa, Xb] = Xb ⇔ [Xa,Xb] = Xb (6.24)

As a second example we consider functions G(x, y) defined on the plane R2 thatparameterizes the affine group. By repeating the arguments above

GS′(x ′, y′) = GS(x, y) (6.25)

where (x ′, y′) and (x, y) are related by[x ′ y′

0 1

]=

[1 + da db

0 1

] [x y0 1

](6.26)

Inverting the infinitesimal transformation, we have

GS′(x ′, y′) = GS[x = (1 − da)x ′, y = (1 − d)y′ − db]

= GS(x ′, y′) +{

da

(−x ′ ∂

∂x ′ − y′ ∂

∂y′

)+ db

(− ∂

∂y′

)}GS(x ′, y′)

(6.27)

The two infinitesimal generators are

Xa = −x ′∂/∂x ′ − y′∂/∂y′

Xb = −∂/∂y′ (6.28)

The commutation relations are preserved

[Xa, Xb] = Xb ⇔ [Xx ,Xb] = Xb (6.29)

These two examples serve to demonstrate that a single matrix algebra can havemany different operator realizations.

Page 107: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6.5 Problems 93

Remark In the example above we have adopted the “passive” interpretation ofgroup action. That is, the coordinates of a point changed by virtue of a choice of adifferent coordinate system, but the value of the function did not. Therefore the par-ticular form of the function was required to change. There is another interpretationof the group action – the “active” interpretation. In this interpretation the group op-eration defines a new function at the initial point in accordance with (see Eq. (6.19))

fS′[x(p)] = fS[x ′(p)] (6.30)

Infinitesimal generators for changes in the function under the active interpretationcan be computed. They are exactly the same as those computed for the passiveinterpretation, except for a sign change. This sign difference is encountered inthe theory of rotating bodies as the difference in commutation relations for thegenerators of rotation in a laboratory-fixed frame and a body-fixed frame.

The “active” and “passive” interpretations of group operations are related by theequivalence principle (see Section 14.2).

6.4 Conclusion

Matrix algebra to operator algebra isomorphisms are easily constructed by asso-ciating to each matrix A in a matrix Lie algebra an operator A = ∑

i

∑j Ai j Xi j .

If the operators Xi j obey the simple commutation relations (6.6), the commuta-tion relations of the matrix Lie algebra and the operator algebra are isomorphic:[A, B] = C ⇔ [A,B] = C. Under these conditions, complicated commutators inan operator algebra can be replaced by simpler commutators in the matrix algebra.These results extend to the respective Lie groups. Products of exponentials of op-erators can be replaced by products of exponentials of the corresponding matriceswith a little care: eAeB = eD ⇔ eAeB = eD.

6.5 Problems

1. Bilinear products involving one creation and one annihilation operator for two modesgenerate a four-dimensional Lie algebra with basis vectors a†

i a j , 1 ≤ i, j ≤ 2.a. Show that n = a†

1a1 + a†2a2 commutes with all the operators in this set.

b. If n is chosen as one basis vector in this four-dimensional space, the remainingthree operators can be chosen as a†

1a1 − a†2a2, a†

1a2, and a†2a1. Construct their

commutation relations.c. These calculations simplify considerably under the operator to matrix mapping

a†1a1 + a†

2a2 a†1a1 − a†

2a2 a†1a2 a†

2a1

↓ ↓ ↓ ↓[1 00 1

] [1 00 −1

] [0 10 0

] [0 01 0

]

Page 108: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

94 Operator algebras

d. Show that the three operators 12 (a†

1a1 − a†2a2), a†

1a2, and a†2a1 satisfy commutation

relations isomorphic to the comutation relations of the angular momentum algebraJz, J±. In particular, show

Jz = 12 (a†

1a1 − a†2a2)

Jx = 12 (J+ + J−) = 1

2 (a†1a2 + a†

2a1)

Jy = 12i (J+ − J−) = 1

2i (a†1a2 − a†

2a1)

e. Evaluate J 2 = J 2x + J 2

y + J 2z in terms of the creation and annihilation operators,

and show

J 2 =(

1

2n

) (1

2n + 1

)

2. Schwinger representation of angular momentum Introduce two independentmodes. Assume that the quantum state of mode i (i = 1, 2) is |ni 〉, where ni isthe number of excitations in mode i . Assume also that the creation and annihilationoperators a†

i and ai act on state |ni 〉 in the usual way:

a†i |ni 〉 =

√ni + 1 |ni + 1〉 ai |ni 〉 = √

ni |ni − 1〉Choose as a set of basis vectors the direct product states |n1〉 ⊗ |n2〉 = |n1, n2〉. Define

| jm

〉 = |n1, n2〉 j = 1

2(n1 + n2), m = 1

2(n1 − n2)

a. Identify the lattice sites in Fig. 6.1 with the states |n1, n2〉 = | jm〉, the diagonaloperator 1

2 (a†1a1 − a†

2a2) with the operator Jz , and the shift operators a†1a2, a†

2a1

with J+ and J−.b. Show that the four operators a†

i a j leave invariant the sum n1 + n2.c. J 2| jm〉 = j( j + 1)| jm〉.d. Jz| jm〉 = m| jm〉.e. J+| jm〉 = a†

1a2|n1, n2〉 = √n1 + 1

√n2 |n1 + 1, n2 − 1〉 =

| j, m + 1〉√ j + m + 1√

j − m.f. J−| jm〉 = a†

2a1|n1, n2〉 = √n1

√n2 + 1 |n1 − 1, n2 + 1〉 =

| j, m − 1〉√ j + m√

j − m + 1.g. J±| jm〉 = | j, m ± 1〉√( j ± m + 1)( j ∓ m). Note that J+| j, j〉 = 0, J−| j, − j〉 =

0.h. 〈 j ′m ′|J±| jm〉 = √

( j ′ ± m ′)( j ∓ m) δ j ′ j δm ′,m±1.

3. Basis vectors in the Lie algebra u(3) for the group U (3) have commutation relationsthat are isomorphic to the commutation relations of the nine boson operators a†

i a j ,1 ≤ i, j ≤ 3. Choose a set of basis vectors for a matrix representation of this algebra ofthe form |n1, n2, n3〉 = |n1〉 ⊗ |n2〉 ⊗ |n3〉, where for example bi |ni 〉 = |ni − 1〉√ni ,etc.

Page 109: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6.5 Problems 95

n2

n1

ln1,n2

5

4

3

2

1

0

0 1 2 3 4 5 6

J– = a2 a1

J+ = a1 a2

J–

J+

Figure 6.1. Identification of the angular momentum operators with operatorsfor two boson modes simplifies computation of the angular momentum matrixelements.

a. Show N = ∑3i=1 ni is not changed by the action of any of the nine operators in

this set.b. Show that the dimension, D, of this representation is D = (N + 3 − 1)!/N !(3 −

1)!. This is the number of ways three nonnegative integers can be chosen whosesum is N (Bose–Einstein counting problem). In higher dimensions (n) replace 3 byn. D is also the number of monomials of degree N in the Taylor series expansionof a function f (x1, x2, . . . , xn) of n variables.

c. Compute the matrix elements of all operators b†i b j in this representation:

〈n′1, n′

2, n′3|b†

i b j |n1, n2, n3〉 (6.31)

d. Is there some operator in the Lie algebra that maps to the identity matrix, ID , inthis representation?

〈n′1, n′

2, n′3|O|n1, n2, n3〉 = IDδn′

1,n1δn′2,n2δn′

3,n3 (6.32)

What is O?

4. Repeat the steps of Problem 3, replacing the boson operators b†i b j by Fermion oper-

ators f †i f j . What is now the dimension of this representation?

5. Construct operators d, d† defined formally from the standard creation and annihilationoperators a, a† as follows: [

dd†

]=

[A BC D

] [aa†

]

Page 110: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

96 Operator algebras

a. Show that if the new operators d, d† are to satisfy standard commutation relations[d, d†] = 1 and [d, d] = [d†, d†] = 0, the four matrix elements must satisfy AD −BC = 1.

b. Argue that the commutation relations are invariant under the group Sp(2; R) =SL(2; R).

c. Show that under Sp(2; R), linear combinations of the coordinate and differentialoperators x, ∂ preserve the commutation relations. In particular, show that[

aa†

]= 1√

2

[1 1

−1 1

] [∂

x

]

preserve commutation relations.d. Replace a by (a1, a2, . . . , an) and similarly for a† and their images d, d† under

some linear transformation as given above, with A, B, C, D now n × n matrices.Determine the conditions on these n × n matrices under which the structure of thecommutation relations is preserved. In particular, show

ADt − BCt = In ABt = B At C Dt = DCt

Show that these transformations belong to the Lie group Sp(2n; R).

6. The N -dimensional isotropic harmonic oscillator has hamiltonian

H = �ω

N∑i=1

(a†

i ai + 1

2

)

and eigenstates |n1, n2, . . . , nN 〉.a. Show that the degeneracy of the multiplet containing n quanta, with energy �ω(n +

N2 ) is deg(N , n) = (n + N − 1)!/n!(N − 1)!. This solution to the Bose–Einsteincounting problem is exactly equal to the number of coefficients of degree n in theTaylor series expansion of a function of N variables: f (x1, x2, . . . , xN ).

b. Show that the symmetry group of this hamiltonian has Lie algebra spanned by theN 2 operators a†

i a j . This is isomorphic to the Lie algebrau(N ). Since [H, a†i a j ] = 0,

this algebra is a direct sum of a simple Lie algebra, su(N ), plus the one-dimensionalalgebra spanned by H.

c. If the generators a†i a j that span the invariance algebra are supplemented with the

single creation and annihilation operators a†i and a j , as well as their commutator I ,

the resulting set of operators closes to form an (N + 1)2 dimensional Lie algebrathat is nonsemisimple. This is called the spectrum generating algebra of theisotropic harmonic oscillator. Show that there is a sequence of operations drawnfrom this algebra that transform any state in a multiplet with n excitations to anystate in a multiplet with n′ excitations.

7. The set of matrices R, S, T, U, . . . belong to a Lie algebra of n × n matrices, a† =(a†

1, a†2, . . . , a†

n) is a row vector of creation operators for n boson modes, and a is its

Page 111: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

6.5 Problems 97

adjoint, a column vector of annihilation operators. Define R = a†Ra = a†i Ri j a j , and

similarly for S, T, U, . . ..a. [R, S] = T ⇔ [R, S] = Tb. eReS = eU ⇔ eReS = eU

8. The Rodriguez formula is often used to generate the Hermite polynomials:

Hn(x) = ex2

(− d

dx

)n

e−x2

a. Show [ ddx , e−x2/2] = −xe−x2/2.

b. Use this result to show(− d

dx

)e−x2 = e−x2/2

(x − d

dx

)e−x2/2

(− d

dx

)n

e−x2 = e−x2/2

(x − d

dx

)n

e−x2/2

c. As a result

Hn(x)e−x2/2 = e+x2/2

(− d

dx

)n

e−x2 =(

x − d

dx

)n

e−x2/2

d. Introduce the annihilation operator a = 1√2(x + d

dx ), define the normalized groundstate 〈x |0〉 by a〈x |0〉 = 0. Solve this equation, normalize the solution, and show〈x |0〉 = e−x2/2/

√1√

π .e. Introduce the creation operator a† = 1√

2(x − d

dx ) and show

〈x |n〉 = (√

2a†)n

√2nn!

〈x |0〉 = Hn(x)e−x2/2√2nn!

√π

= ψn(x) (6.33)

where ψn(x) is the nth normalized harmonic oscillator eigenstate 〈x |n〉 =(a†)n√

n!〈x |0〉.

9. Assume a set of n harmonic oscillators interact through an angular momentum term(Li j = a†

i a j − a†j ai ) and a quadrupole interaction (Qi j = a†

i a j + a†j ai ).

a. Show that the hamiltonian for this system is

H =n∑

i=1

�ωi

(a†

i ai + 1

2

)+ i

∑i< j

θi j(a†

i a j − a†j ai

) +∑i≤ j

qi j(a†

i a j + a†j ai

)b. Show that this hamiltonian can be represented by a hermitian matrix. Show that

for i ≤ j the matrix elements are

�i j = �ωiδi j + (q + iθ )i j

with �∗j i = �i j .

Page 112: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

98 Operator algebras

c. Show that an orthogonal transformation can be constructed so that the hamiltoniancan be expressed in terms of n independent oscillators represented by creation andannihilation operators bi = mi j a j : H = ∑n

i=1 �ω′i (b

†i bi + 1

2 ) + constant. Expressthe amplitudes mi j in terms of the eigenvectors of �(H ).

d. Compute the shift in the zero point energy (“constant”).

Page 113: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7

EXPonentiation

Linearization of a Lie group to form a Lie algebra introduces an enormoussimplification in the study of Lie groups. The inverse process, reconstruct-ing the Lie group from the Lie algebra, is carried out by the EXPonentialmap. We return to a more thorough study of the exponential map in thischapter. In particular, we address the three problems raised in Chapter4. Does the EXPonential operation map the Lie algebra back onto theLie group? Are Lie groups with isomorphic Lie algebras themselves iso-morphic? Are there natural ways to parameterize Lie groups? We closethis chapter with a spectrum of applications of the EXPonential mappingin physics. Applications include computing the dynamical evolution ofquantum systems and their thermal expectation values.

7.1 Preliminaries

In Chapter 4 we saw how the linearization and EXPonentiation operations relateLie groups and Lie algebras

Lie groupsln�

EXPLie algebras (7.1)

At that time three questions, and their answers, were briefly raised about theEXPonential mapping. These questions are more thoroughly explored in thischapter.

The three questions, and their answers, are now presented.

Question 1 Does EXP map the Lie algebra onto the entire group?Answer 1 No, but with some effort and insight, Yes.Question 2 Are Lie groups with isomorphic Lie algebras isomorphic?Answer 2 No, but there is a unique Lie group (covering group) and all others with the

same Lie algebra are simply related to this unique simply connected Lie group.

99

Page 114: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

100 EXPonentiation

Question 3 Are all mappings of the Lie algebra onto the Lie group identical?Answer 3 No, but with care they are all analytically related to each other (by Baker–

Campbell–Hausdorff formulas).

Each question is now discussed in more detail.

7.2 The covering problem

Cartan gave a simple example which showed that it is not always possible to map aLie algebra onto the entire Lie group through a single mapping of the form EXP(X ).We consider the Lie group SL(2; R) with Lie algebra sl(2; R):

X =[

a b + cb − c −a

]∈ sl(2; R) (7.2)

For this matrix algebra

tr EXP(X ) ≥ −2 (7.3)

Since SL(2; R) contains group operations of the form[−λ 00 −1/λ

]λ > 1 (7.4)

with trace less than −2, a single exponential cannot map the Lie algebra onto theentire group.

The lower bound (−2) on the trace of the exponential can be seen as follows.Trace is an invariant under similarity transformation, so

tr eX = tr S eX S−1 = tr eSX S−1(7.5)

Now choose S to diagonalize (7.2). Since Tr X = 0, the eigenvalues λ can onlyhave the form ±θ or ±iθ (θ real)

tr eSX S−1 −→{

2 cosh θ ≥ 2 real eigenvalues2 cos θ ≥ −2 imaginary eigenvalues

(7.6)

The problem in attempting to parameterize the Lie group with a single exponen-tial map lies with the compact generators. The compact generators “go around” incircles, while the noncompact generators “go on forever.” Furthermore, the com-pact generators always form a subgroup in the Lie group while the noncompactgenerators do not.

Page 115: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.2 The covering problem 101

To make these cryptic statements less mysterious, we compute EXP(X ), with Xgiven in (7.2), and find

EXP

[a b + c

b − c −a

]

=[

cosh r + a sinh r/r (b + c) sinh r/r(b − c) sinh r/r cosh r − a sinh r/r

]r2 = a2 + b2 − c2 > 0

=[

1 + a b + cb − c 1 − a

]a2 + b2 − c2 = 0

=[

cos r + a sin r/r (b + c) sin r/r(b − c) sin r/r cos r − a sin r/r

]−r2 = a2 + b2 − c2 < 0

(7.7)

The “light cone” structure of the (a, b, c) coordinate space of the Lie algebra isshown in Fig. 7.1. Points inside this cone map onto 2 × 2 rotation matrices in thegroup SO(2). Points outside this cone map onto noncompact group elements. Pointson the cone itself map onto some interesting group operations.

b

a

c

Figure 7.1. “Light cone” for SL(2; R).

Page 116: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

102 EXPonentiation

Many points inside the cone map onto the same operation in the subgroup SO(2).To see this most easily set a = b = 0. Points on the c-axis map onto

(0, 0, c) −→[

cos c sin c− sin c cos c

](7.8)

and therefore points separated by 2πn along the c-axis map onto the same groupoperation in SO(2) ⊂ SL(2; R). The complementary subspace (a, b, 0) maps ontononcompact group operations in SL(2; R)

(a, b, 0) −→[

cosh r + (a/r ) sinh r (b/r ) sinh r(b/r ) sinh r cosh r − (a/r ) sinh r

]r2 = a2 + b2

(7.9)that are not recurrent. In fact, this two-parameter set of group operations has thesame topology as the subspace (a, b, 0) in the Lie algebra. We show this below.

In addition to providing an example that shows that EXP(X ) may not map ontothe group when the group is noncompact, Cartan provided a theorem that a suc-cession of mappings would always do the job. For simple groups (Chapter 9) theproduct of two exponential mappings – one of the compact generators, the other ofthe noncompact generators – will map the algebra onto the group. To separate com-pact and noncompact generators we use the Cartan–Killing inner product (4.43)computed in the defining matrix representation (7.2)

(X, X ) = tr X2 = 2(a2 + b2 − c2) (7.10)

The metric is positive-definite on noncompact generators and negative-definite onnoncompact generators. This decomposition in the Lie algebra leads to[

a bb −a

]+

[0 c

−c 0

]EXP ↓ ↓ ↓ EXP[

z + y xx z − y

[cos c sin c

− sin c cos c

] (7.11)

For simplicity we have set z = cosh r ≥ 1 and (x, y) = (b, a) sinh(r )/r , r2 = a2 +b2. We observe that

z2 − x2 − y2 = 1 (7.12)

which is just the upper sheet of the two-sheeted hyperboloid H 22+, shown in

Fig. 7.2(a). This sheet is topologically equivalent to the space R2, the planethat it covers. For the compact generator only a small range of parameter values−π ≤ c ≤ +π is required to map the subalgebra onto the subgroup SO(2).

Page 117: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.2 The covering problem 103

R3

H 2

z

yx

(a) (b)

Single-sheetedhyperboloidGeodesics

EXP2

2

10

10 2 a 2

22

1

a 3

Straightlines

z

y

x

Figure 7.2. (a) Two-sheeted and (b) single-sheeted hyperboloids. Both are quo-tients (coset spaces) of SL(2; R) by one of its two inequivalent types of subgroups,SO(2) and SO(1, 1).

The connection of SL(2; R) with geometry may be unexpected, but it is notunique to SL(2; R). Moreover, other geometric structures are obtained by expo-nentiating different subspaces of the algebra sl(2; R). For example[

a c−c −a

]+

[0 bb 0

]EXP ↓ ↓ ↓ EXP[

z + y x−x z − y

[cosh b sinh bsinh b cosh b

] (7.13)

In this expression for the coset representatives (recall the definition of cosets, orquotients of a group by a subgroup, given in Chapter 1) the three real parameters(x, y, z) obey

z2 + x2 − y2 = 1 (7.14)

This equation describes the surface of the single-sheeted hyperboloid H 21 , shown

in Fig. 7.2(b). Many other algebraic surfaces can be obtained from Lie algebras inthis way.

We point out that the EXPonential function maps the sum of two subspaces in thealgebra into the product of the associated group operations (cf. (7.11) and (7.13)).We can regard one of the subspaces as the difference between the full space (Liealgebra) and the other subspace (subalgebra). The EXPonential maps the difference

Page 118: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

104 EXPonentiation

of spaces into the quotient of group operations. For example[a bb −a

]=

[a b + c

b − c −a

]−

[0 c

−c 0

]EXP ↓ EXP ↓ ↓ EXP ↓ EXP[

z + y xx z − y

]= SL(2; R) / SO(2)

(7.15)

The “quotient” means that all elements in SL(2; R) that differ only by multiplicationby a 2 × 2 rotation matrix on the right are identified with each other. It is convenientto choose one such group operation to represent this entire set. This group operation(on the left in (7.15)) is called a coset representative. The entire one-dimensionalset parameterized by c, 0 ≤ c < 2π , is the coset. In the theory of Lie groups, cosetsand coset representatives are usually interesting spaces.

From this discussion we conclude that the group SL(2; R) can be viewed in var-ious different ways involving coset decompositions. In the parameterization (7.11)obtained from the coset decomposition SL(2; R)/SO(2), the manifold parameteriz-ing the group is the direct product of the upper sheet of the two-sheeted hyperboloidwith a circle. Since the upper sheet of a two-sheeted hyperboloid is topologically(but not geometrically!) equivalent to R2, the manifold that parameterizes SL(2; R)is the direct product R2 × S1. A different parameterization (7.13) based on the cosetdecomposition [SL(2; R)/SO(1, 1)] × SO(1, 1) (SO(1, 1) � R1) shows that themanifold underlying SL(2; R) is the direct product of the single-sheeted hyper-boloid (equivalent to R1 × S1) with R1. This product is once again R2 × S1.

Since matrix Lie groups are defined by algebraic constraints, so are their sub-groups and quotient spaces. This means that the underlying manifold for eachmatrix Lie group is an algebraic manifold. For example, for subgroups of GL(n; R)the underlying manifold is a subset of RN , N = n2, that is defined by algebraicconstraints. This manifold can be expressed as products of algebraic submanifolds,each parameterizing a subgroup or coset.

We conclude this discussion of the covering problem by stating a theorem due toCartan. It is always possible to map a Lie algebra onto its Lie group with a productof exponential mappings. In fact, if the algebra can be written in the form

algebra = noncompact generators + compact generators

EXP ↓ EXP ↓ ↓ ↓ EXP

group = coset representatives × compact subgroup

(7.16)

then the product of two exponential maps, one of the noncompact generators, theother of the compact generators (which form a subalgebra), maps onto the entire Lie

Page 119: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.3 The isomorphism problem and the covering group 105

group. The algebraic manifold parameterizing the EXPonential of the noncompactgenerators is Rm , for suitable m (m is the number of noncompact generators). Themanifold that parameterizes the EXPonential of the compact generators is compact.

7.3 The isomorphism problem and the covering group

Isomorphic Lie groups have isomorphic Lie algebras, but two Lie groups withisomorphic Lie algebras need not be isomorphic. To illustrate this point, we treatthe groups SO(2, 1) and SU (1, 1) with Lie algebras

so(2, 1) = 0 a3 a2

−a3 0 a1

a2 a1 0

su(1, 1) = i

2

[b3 ib1 + b2

ib1 − b2 −b3

](7.17)

The Lie algebras are isomorphic but the Lie groups are not. The group SO(2, 1) iscovered by the map

0 0 a2

0 0 a1

a2 a1 0

+

0 a3 0

−a3 0 00 0 0

EXP ↓ ↓ ↓ EXP

[SO(2, 1)/SO(2)] × cos a3 sin a3 0

− sin a3 cos a3 00 0 1

(7.18)

The group SU (1, 1) is similarly covered by

i

2

[0 ib1 + b2

ib1 − b2 0

]+ i

2

[b3 00 −b3

]

EXP ↓ ↓ ↓ EXP

[SU (1, 1)/U (1)] ×[

e+ib3/2 00 e−ib3/2

] (7.19)

The cosets SO(2, 1)/SO(2) and SU (1, 1)/U (1) are both isomorphic to R2 and havea 1:1 correspondence. The subgroups SO(2) and U (1) have a 2:1 correspondence.This can be seen by increasing b3 by 2π and noticing that the 2 × 2 unitary matrixin (7.19) goes to its negative: U (b3 + 2π ) = −U (b3). However, increasing a3 by2π does not change the 3 × 3 rotation matrix in (7.18). The 2:1 correspondencecan be seen in a better and simpler way. One can ask: how far along a straight linethrough the origin does one have to go to return to the identity? For the subgroup

Page 120: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

106 EXPonentiation

U (1) ⊂ SU (1, 1) the result is 4π ; for the subgroup SO(2) ⊂ SO(2, 1) the result is2π . Therefore, SU (1, 1) is “twice as large” as SO(2, 1). More formally, there is a2 → 1 homomorphism of SU (1, 1) onto SO(2, 1).

Once again there is a result due to Cartan that is useful for comparing Lie groupsthat have isomorphic Lie algebras. Since the noncompact parts of the Lie algebrasmap to elements of the group with the topology of a Euclidean space, a comparisonof the largest compact subgroups of the two groups is sufficient to determine whetherthe groups are isomorphic.

The most familiar example of nonisomorphic groups with isomorphic Lie alge-bras is the pair SO(3) and SU (2) with algebras

so(3) = 0 a3 −a2

−a3 0 a1

a2 −a1 0

su(2) = i

2

[b3 b1 − ib2

b1 + ib2 −b3

](7.20)

It can be checked that all points in the interior of a sphere of radius√

a21 + a2

2 + a23 ≤

π ) map onto SO(3) provided antipodal points at |a| = π are identified

π (sin θ cos φ, sin θ sin φ, cos θ) ∼ −π (sin θ cos φ, sin θ sin φ, cos θ )

with θ the latitude, and φ the longitude on a sphere. For SU (2) all points within a

sphere of radius 2π (√

b21 + b2

2 + b23 < 2π ) are mapped onto distinct elements of

SU (2) and all points at a radius of 2π are mapped onto −I2. There is an easier wayto verify the 2 → 1 nature of the map SU (2) to SO(3). All straight lines through theorigin of the Lie algebra are equivalent (since the algebra has rank 1, see Chapter8). Therefore, we can compare how a convenient line (z-axis) maps onto the twogroups. This has already been done for the comparison of SU (1, 1) with SO(2, 1).

Another convenient parameterization of SO(3) and SU (2) can be used to showthe 2:1 map. This is analogous to (7.18)

so(3) = 0 0 −a2

0 0 a1

a2 −a1 0

+

0 a3 0

−a3 0 00 0 0

EXP ↓ ↓ EXP ↓ EXP∗ ∗ −x

∗ ∗ yx −y z

×

cos a3 sin a3 0

− sin a3 cos a3 00 0 1

(7.21)

Page 121: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.3 The isomorphism problem and the covering group 107

A similar parameterization for SU (2) gives

su(2) = i

2

[0 b1 − ib2

b1 + ib2 0

]+ i

2

[b3 00 −b3

]

EXP ↓ ↓ EXP ↓ EXP[z′ i(x ′ − iy′)

i(x ′ + iy′) z′

[eib3/2 0

0 e−ib3/2

] (7.22)

The coset representatives SO(3)/SO(2), parameterized by the real numbers(x, y, z) subject to x2 + y2 + z2 = 1, and SU (2)/U (1), parameterized by the realnumbers (x ′, y′, z′) subject to x ′2 + y′2 + z′2 = 1, are in 1:1 correspondence withpoints in the same geometric space – a sphere in this case. As a result, the 2:1 natureof the mapping SU (2) → SO(3) can be seen from the 2:1 nature of the rotationsaround the “3” axis.

Yet another result of Cartan establishes a unique connection between Lie groupsand Lie algebras. There is a unique Lie algebra for every Lie group. For eachLie algebra there may be many inequivalent Lie groups. But there is a unique Liegroup, G, called the universal covering group. This group is simply connected:every loop starting and ending at the identity can be continuously deformed to theidentity. Moveover, every other Lie group with this Lie algebra is either identical tothis simply connected Lie group, or else has the form of a quotient G/D, where Dis a discrete invariant subgroup of G whose elements commute with G: gdi = di gfor di ∈ D and g ∈ G. If G is compact it is useful to determine the largest suchsubgroup, DMAX, of G. Then all compact Lie groups with the same Lie algebra asG are obtained by “dividing” G by all possible subgroups of DMAX, as shown inFig. 7.3.

For simple matrix Lie groups G, computation of the discrete invariant subgroupD is a simple matter. The only discrete group operations di that commute with allg ∈ G are multiples of the identity, by Schur’s lemma

g ∈ G, di ∈ D, G simple, gdi = di g ⇒ di = λIn (7.23)

Two Lie groups with isomorphic Lie algebras are locally isomorphic. If G1 andG2 have the same Lie algebra, G1 = G/D1 and G1 is locally isomorphic with G.By the same argument G2 is locally isomorphic with G, and therefore also withG1. If G is compact, G1 and G2 are also locally isomorphic with G/DMAX, whichis a universal image Lie group.

G1 = G/D1 → G/DMAX ← G/D2 = G2

Page 122: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

108 EXPonentiation

Simply connected Lie Group

SG

Lie Algebra

SG/D1 SG/D2 SG/Dr

Lin

eari

zati

on L

OG

EX

P

g

Figure 7.3. Cartan’s covering theorem. There is a unique correspondence betweenLie algebras g and simply connected Lie groups SG = G. Every other Lie groupwith this Lie algebra is a quotient of the universal covering group by one of thediscrete invariant subgroups Di of G.

Example The maximal discrete invariant subgroup of SU (2) consists of matricesλI2 that obey λ∗λ = 1 and det(λI2) = +1, so that λ = ±1. D is the two-elementsubgroup D = {I2, −I2}. For the locally isomorphic Lie group SO(3), D = λI3

with λ = +1. As a result SU (2)/ {I2, −I2} = SO(3)/I3 = SO(3). For each groupoperation in SO(3) there are two matrices in SU (2) that differ in sign.

Remark The maximal compact subgroups SO(2) of SO(2, 1) and U (1) ofSU (1, 1) are not simply connected. Their simply connected covering groupis R1, the group of translations of the line. The covering group SO(2, 1) =SU (1, 1) has no compact subgroup at all. Its underlying group manifoldis SO(2, 1)/SO(2) × SO(2) = SU (1, 1)/U (1) × U (1) = [SO(2, 1)/SO(2)] ×SO(2) = SU (1, 1)/U (1) × U (1) = R2 × R1. It is the only group we will encounterin this book that is not a matrix group. The covering group SO(2, 1) = SU (1, 1) hasmany discrete invariant subgroups but does not have a maximal discrete invariantsubgroup.

7.4 The parameterization problem and BCH formulas

A Lie algebra can be mapped onto a Lie group in many different ways. Moregenerally, points in the underlying topological space can be identified with groupoperations in an unlimited number of ways. These different parameterizations ofa Lie group can be related to each other by analytic transformations in a way that

Page 123: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.4 The parameterization problem and BCH formulas 109

can often be used to simplify computations. Reparameterization formulas involv-ing products of exponentials of operators are called Baker–Campbell–Hausdorff(BCH) formulas for historical reasons. Once again we illustrate by example ratherthan present a general theory.

As a first example we consider the affine group of transformations of the line,and two different parameterizations of this group. One maps a point (x, y) in theright half-plane R2

+ into the group operator

(x, y) →[

x y0 1

]x > 0 (7.24)

The second maps a point (w, z) in R2 into the group under the EXPonential map

(w, z) = EXP

[w z0 1

]=

[ew (ew − 1)z/w0 1

](7.25)

We ask: is there some mapping of the half-plane R2+ (x > 0, y) into R2 (w, z)

that makes these two group operations, and the group multiplication laws derivedfrom them, equivalent? The transformation between these two parameterizations isobtained by identifying matrix elements:

(x, y) →[

x y0 1

]=

[ew (ew − 1)z/w0 1

]← (w, z) (7.26)

The mapping (“diffeomorphism”) between the half-plane R2+ and the plane R2, or

the coordinates (x, y) and (w, z), is

x = ew

(7.27)

y = (ew − 1)z/w = z

(1 + w

2!+ w2

3!+ · · ·

)

and the inverse transformation is

w = ln x(7.28)

z = y ln(x)/(x − 1) z = 0 for x = 1

These transformations are analytic for x > 0.As a second example we treat the algebra of upper triangular 3 × 3 matrices

0 l δ

0 0 r0 0 0

= l Xl + r Xr + δXδ (7.29)

Page 124: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

110 EXPonentiation

The commutation relations of these three generators are

[Xl, Xr ] = Xδ [Xl, Xδ] = [Xr , Xδ] = 0 (7.30)

The single-mode photon operators a, a†, I obey isomorphic commutation relations

[a, a†] = I [a, I ] = [

a†, I] = 0 (7.31)

The two Lie algebras are isomorphic under

Xl → a

Xr → a† (7.32)

Xδ → I

For many quantum computations it is convenient to relate several different param-eterizations of the Lie group. For example, the following “disentangling” resultsare useful

era†+la+δ I

‖ ‖er ′a†

eδ′ I el ′a = el ′′aeδ′′ I er ′′a†(7.33)

This reparameterization computation can be carried out using 3 × 3 matrices

EXP

0 l δ

0 0 r0 0 0

=

1 l δ + 1

2 lr0 1 r0 0 1

‖ ‖

er ′a†eδ′ I el ′a →

1 l ′ δ′

0 1 r ′

0 0 1

=

1 l ′′ δ′′ + l ′′r ′′

0 1 r ′′

0 0 1

← el ′′aeδ′′ I er ′′a†

(7.34)

We see immediately that l = l ′ = l ′′, r = r ′ = r ′′, δ′ = δ + 12 lr = δ′′ + l ′′r ′′, and

obtain the Heisenberg identity (for δ = 0)

era†e+ 1

2 lr I ela = era†+la = elae− 12 lr I era†

(7.35)

As a third example we treat the four-parameter Lie group of solvable 3 × 3matrices with Lie algebra

0 l δ

0 η r0 0 0

= ηXη + l Xl + r Xr + δXδ (7.36)

Page 125: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.4 The parameterization problem and BCH formulas 111

This Lie algebra is isomorphic with the Lie algebra spanned by the four single-modephoton operators n = a†a, a, a†, I under the identification

Xη → n

Xl → a(7.37)

Xr → a†

Xδ → I

If for some reason EXP(ηa†a + ra† + la) needed to be rewritten in the more con-veniently ordered form EXP(r ′a†)EXP(η′a†a + δ′ I )EXP(l ′a), then the reparame-terization computation could be carried out in the 3 × 3 matrix representation

EXP(ηa†a + ra† + la) = EXP(r ′a†)EXP(η′a†a + δ′ I )EXP(l ′a)

‖ ‖1 (eη − 1)l/η (eη − 1 − η)lr/η2

0 eη (eη − 1)r/η0 0 1

=

1 l ′ δ′

0 eη′r ′

0 0 1

(7.38)

By inspection, we obtain

η′ = η l ′ = (eη − 1)l/ηδ′ = (eη − 1 − η)lr/η2 r ′ = (eη − 1)r/η

(7.39)

If it is necessary to compute the expectation value of EXP(ηa†a + ra† + la) in theground state of the harmonic oscillator, then

〈0|eηa†a+ra†+la|0〉 = 〈0|er ′a†eη′a†a+δ′ I el ′a|0〉 (7.40)

Since el ′a|0〉 = |0〉, 〈0|er ′a† = 〈0| and eη′a†a|0〉 = |0〉, the expectation value is

〈0|eηa†a+ra†+la|0〉 = eδ′ = EXP

((eη − 1 − η)lr

η2

)(7.41)

This result is not easy to derive by other techniques.As a final example we treat the Lie algebra su(2). First, we show how to compute

the matrix element of an arbitrary rotation between “ground state” wavefunctions(| j, − j〉) ⟨

j− j

∣∣∣∣eiθ ·J∣∣∣∣ j− j

⟩(7.42)

Page 126: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

112 EXPonentiation

This expectation would be easy to compute if the exponential were written in a“normally ordered form”⟨

j− j

∣∣∣∣eiθ ·J∣∣∣∣ j− j

⟩=

⟨j

− j

∣∣∣∣eiθ ′+ J+eiθ ′

z Jz eiθ ′− J−

∣∣∣∣ j− j

⟩(7.43)

Since

eiθ ′− J−

∣∣∣∣ j− j

⟩= (I + iθ ′

− J− + · · · )

∣∣∣∣ j− j

⟩=

∣∣∣∣ j− j

⟩(7.44)

with a similar result for J+ acting on the left, we find⟨j

− j

∣∣∣∣eiθ ·J∣∣∣∣ j− j

⟩=

⟨j

− j

∣∣∣∣eiθ ′z Jz

∣∣∣∣ j− j

⟩= e−i jθ ′

z (7.45)

The only problem that remains is to compute θ ′z as a function of θ . To do this

we carry out the operator disentangling calculations in the faithful 2 × 2 matrixrepresentation J → 1

2σ , where σ are the Pauli spin matrices (5.14):

eiθ ·J → EXPi

2

[θz θx − iθy

θx + iθy −θz

]

=[

cos(θ/2) + i(θz/θ ) sin(θ/2) i[(θx − iθy)/θ ] sin(θ/2)i[(θx + iθy)/θ ] sin(θ/2) cos(θ/2) − i(θz/θ ) sin(θ/2)

](7.46)

In a similar way we find

EXP(iθ ′+ J+) EXP(iθ ′

z Jz) EXP(iθ ′− J−)

↓ ↓ ↓[1 iθ ′

+0 1

] [eiθ ′

z/2 00 e−iθ ′

z/2

] [1 0

iθ ′− 1

]

=[

eiθ ′z/2 − θ ′

+θ ′−e−iθ ′

z/2 iθ ′+e−iθ ′

z/2

iθ ′−e−iθ ′

z/2 e−iθ ′z/2

](7.47)

where θ± = θ1 ± iθ2. Comparison of the two matrices gives immediately

e−iθ ′z/2 = cos(θ/2) − i(θz/θ ) sin(θ/2) (7.48)

As a result, we find⟨j

− j

∣∣∣∣eiθ ·J∣∣∣∣ j− j

⟩= e−i jθ ′

z = (e−iθ ′z/2)2 j = [cos(θ/2) − i(θz/θ ) sin(θ/2)]2 j (7.49)

This result is useful in the field of quantum optics but is not easy to compute byother means.

Page 127: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.4 The parameterization problem and BCH formulas 113

To illustrate the use of Baker–Campbell–Hausdorff formulas in another situationwe compute the matrix elements ⟨

jj

∣∣∣∣J k+ J k

∣∣∣∣ jj

⟩(7.50)

To do this we construct a generating function⟨jj

∣∣∣∣eα J+eβ J−

∣∣∣∣ jj

⟩=

∑rs

αrβs

r !s!

⟨jj

∣∣∣∣Jr+ J s

∣∣∣∣ jj

⟩(7.51)

The operator product eα J+eβ J− is written in normally ordered form EXP(β ′ J−)EXP(n′ Jz)EXP(α′ J+) and the parameters α′, β ′, n′ computed. We find⟨

jj

∣∣∣∣eβ ′ J−en′ Jz eα′ J+

∣∣∣∣ jj

⟩= e jn′ = (1 + αβ)2 j (7.52)

By expanding (1 + αβ)2 j and invoking analyticity, we find⟨jj

∣∣∣∣Jr+ J s

∣∣∣∣ jj

⟩= (2 j)!r !

(2 j − r )!δrs (7.53)

Other matrix elements of products of angular momentum operators can be con-structed similarly from appropriate generating functions.

The general computational procedure should now be clear. Given a Lie algebra ofoperators and the associated group operations that are exponentials of the elementsin the Lie algebra, it is possible to carry out all calculations in either the algebra orthe group using a faithful matrix representation of the operator algebra. In general,the smaller the size of the matrices, the easier the computation.

For example, if operators A,B belong to two complementary subspaces in someoperator Lie algebra g then the operator product eAeB can be reparameterized aseB

′eA

′(A′,B′ different operators in the same subspaces as A,B) by

(i) finding a faithful matrix representation of the operator algebra,(ii) identifying the operators A,B with matrices A, B,

(iii) Carrying out the matrix calculations eAeB and eB ′eA′

,(iv) determining the matrices A′, B ′ by comparing matrix elements; and(v) using the isomorphism A′ ↔ A′ B ′ ↔ B′.

This procedure will produce a local analytic reparameterization (A,B) ↔(A′,B′). If the matrix group used to construct this reparameterization is simplyconnected (the covering group) the analytic reparameterization will be global. Oth-erwise, some care must be taken to compare the maximal discrete invariant sub-groups of the operator group and the matrix group. When the operatorsA,B, . . . are

Page 128: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

114 EXPonentiation

related to matrices A, B, . . . by a matrix–operator mapping (see Chapter 6)A ↔ A,the disentangling formulas can be constructed using the matrices A, B, . . . .

7.5 EXPonentials and physics

By the greatest good fortune – or perhaps by the deepest possible connectionsbetween mathematics and physics – the exponential function also plays a mostfundamental role in physics. In fact, it plays two roles: one in dynamics and anotherin equilibrium statics (thermo“dynamics”). More fundamental yet, these two rolesare related by analytic continuation (“Wick rotation”). We describe both roles inthis section, in terms of two examples, one related to fermions, the other related tobosons.

7.5.1 Dynamics

The dynamics of quantum systems is governed by the time-dependent Schrodingerequation:

H |ψ〉 = i�∂

∂t|ψ〉 (7.54)

The state of the system at time t + δt is related to the state at time t by

|ψ(t + δt)〉 =(

I − i

�Hδt

)|ψ(t)〉 = e− i

�Hδt |ψ(t)〉 (7.55)

The exponential is unitary since the hamiltonian operator H is hermitian. The state|ψ(t f )〉 at some final time t f is related to the state at initial time ti by |ψ(t f )〉 =U (t f , ti )|ψ(ti )〉. The finite time unitary operator is built up from small dis-placements

U (t f , ti ) = U (t f , t f − δt) · · · U (ti + 2δt, ti + δt)U (ti + δt, ti )

=∏

U (ti + (n + 1)δt, ti + nδt) = “∫ t f

ti

” U (τ ) dτ

= T∫ t f

ti

e− i�

H (t) dt (7.56)

Care must be taken with the formal integration in this equation, as in general H (t ′)does not commute with H (t), t ′ �= t . It is for this reason that the symbol “T ”precedes the integral: this signifies a time-ordered product. If the hamiltonian isnot explicitly time dependent then the integral in Eq. (7.56) reduces to an everydayRiemann integral.

Page 129: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.5 EXPonentials and physics 115

Expression of the time dependence in terms of a unitary evolution operator isuseful for two very different reasons.

(i) The evolution is decoupled from the initial state.(ii) In special cases it is very simple to construct this unitary evolution operator when it

would be much more difficult to construct the evolution of a specific state.

The second case becomes important when the hamiltonian is a linear superpositionof operators that exist in a Lie algebra. In that case the unitary operator is a groupoperation, and it may be possible to find some shortcuts for its computation. Wegive two examples.

Example 1. A Hamiltonian acts in a 2 j + 1 dimensional space through a set ofthree operators Jz, J± that obey angular momentum commutation relations. Wewish to determine the evolution of some particular state | j, m j 〉. The Hamiltonianis

H = ε(t)Jz + α(t)J+ + α∗(t)J−j→ 1

2−→[ 1

2ε(t) α(t)α∗(t) − 1

2ε(t)

](7.57)

The unitary operator acting in the 2 j + 1 dimensional space is a unitary repre-sentation of some operation in the group SU (2). It is simpler to determine howg(t) ∈ SU (2) evolves, and then construct its unitary representation, than it is to de-termine the time evolution of the (2 j + 1) × (2 j + 1) unitary matrix. Specifically,the equation of motion in the group is

d

dt

[a(t) b(t)

−b∗(t) a∗(t)

]= − i

[ 12ε(t) α(t)α∗(t) − 1

2ε(t)

] [a(t) b(t)

−b∗(t) a∗(t)

](7.58)

After some algebraic manipulation this matrix equation reduces to two equations forthe complex coefficients a(t) and b(t) or three equations for the real coefficients ofthe Pauli spin matrices σ1, σ2, σ3. These are first order equations and can be solvedby standard integration methods (e.g., RK4). The initial conditions are a(ti ) =1, b(ti ) = 0. The final 2 × 2 unitary matrix is determined by a(t f ), b(t f ). This isa group operation in SU (2) that can subsequently be mapped into the (2 j + 1) ×(2 j + 1) unitary irreducible representation of this group. At this point the problemis solved, independent of the initial state |ψ(ti )〉.Example 2. As a second example we treat a hamiltonain that is a linear combinationof the boson number, creation, and annihilation operators (and their commutator):

H = ω(t)a†a + α(t)a† + α∗(t)a + δ(t)I →0 α∗(t) δ(t)

0 ω(t) α(t)0 0 0

(7.59)

Page 130: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

116 EXPonentiation

The boson operators act as a hermitian superposition in an infinite-dimensionalspace with basis vectors |n〉, n = 0, 1, 2, . . . . The matrix on the right is a faith-ful finite-dimensional nonhermitian representation of these operators. The mostgeneral unitary operator that can be constructed from these operators is U =EXP(i[n(t)a†a + r (t)a† + r∗(t)a + d(t)I ]). This exponential is easy to computein the faithful 3 × 3 nonunitary representation. The matrix equation of motionanalogous to Eq. (7.58) is explicitly

d

dt

1 r∗ (ein−1)(in) r∗r

((ein−1−in)

(in)2

)+ id

0 ein r (ein−1)(in)

0 0 1

= − i

0 α∗(t) δ(t)

0 ω(t) α(t)0 0 0

1 r∗ (ein−1)(in) r∗r

((ein−1−in)

(in)2

)+ id

0 ein r (ein−1)(in)

0 0 1

(7.60)

This matrix equation leads to an ugly but manageable set of coupled nonlinear equa-tions in four real variables (n, r, r∗, d) that can be integrated by standard methods.In the case that dω(t)/dt = 0 the equations simplify considerably, and can almostbe solved by inspection.

7.5.2 Equilibrium thermodynamics

In classical and quantum physics expectation values are expressed in terms of adensity operator ρ

〈O〉 = tr ρO (7.61)

In thermodynamic equilibrium the density operator is expressed in terms of thehamiltonian describing the system as ρ = e−β H/Z , where the normalization con-stant, or partition function, is Z = tr e−β H and β = 1/kB T , kB is the Boltzmannconstant and T is the absolute temperature. When H is an element in a finite-dimensional Lie algebra, many simplifications in the computation of thermal ex-pectation values occur. Again, we give two examples.

Example 1. We choose a hamiltonian constructed from angular momentum oper-ators

H = ε Jz + α J+ + α∗ J−j→ 1

2−→[ 1

2ε(t) α(t)α∗(t) − 1

2ε(t)

](7.62)

Page 131: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.5 EXPonentials and physics 117

We would like to be able to compute thermal expectation values of various momentsof the angular momentum operators. The simplest way to go about this is to computegenerating functions for these expectation values. To do this we compute 〈e�〉,where � = λ · J. All symmetric moments can be constructed by taking derivativesof this generating function. We first compute this generating function in the smallestfaithful matrix representation:

e−β H e� →(

I2 cosh(β|H |) − β

[ε/2 α

α∗ −ε/2

]sinh(β|H |)

β|H |)

×(

I2 cosh(|�|) +[

λ3/2 λ

λ∗ −λ3/2

])sinh(|�|)

|�| (7.63)

The trace of this expression is

tr e−β H e� →

2 cosh(β|H |) cosh(|�|) − 2H · �√

H · H√

� · �sinh(β|H |) sinh(|�|)

(7.64)

In these expressions H · � = (H, �) = 12 tr H�, and similarly for |H | = √

(H, H )and |�| = √

(�, �).The trace of this 2 × 2 matrix can be written in another useful way after a

similarity transform that diagonalizes it:

tr e−β H eλ·J = tr

[e+µ(H,�)/2 0

0 e−µ(H,�)/2

]= 2 cosh(µ(H, �)/2) (7.65)

If N two-level atoms are acting incoherently, the trace over the 2N states of allN atoms is the N th power of the trace expressed in (7.65). On the other hand, if allN atoms are acting coherently, there are 2J + 1 states, where N = 2J . The traceover these states is (Arecchi et al., 1972)

χ (H, �, J ) = sinh(J + 12 )µ(H, �)

sinh( 12 )µ(H, �)

(7.66)

where µ(H, �, T ) is determined from Eq. (7.65). The thermodynamic generatingfunction is

〈e�〉 = χ (H, �, J )

χ (H, 0, J )(7.67)

To construct explicit expectation values (e.g., 〈J−〉) it is sufficient to differenti-ate the generating function (e.g., ∂

∂λ∗ 〈e�〉/〈e0〉) and evaluate the result at � = 0.It is even more convenient to differentiate the logarithm and evaluate at � = 0:∂

∂λ∗ log(〈e�〉)|�=0.

Page 132: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

118 EXPonentiation

Example 2. As a second example we treat a harmonic oscillator described by atime-independent hamiltonian of the form (7.68) in thermodynamic equilibrium attemperature T

H = �ωa†a + αa† + α∗a + δ I →0 α∗ δ

0 �ω α

0 0 0

(7.68)

The density operator is ρ = e−β(�ωa†a+αa†+α∗a+δ I )/Z . The generating function foroperator expectation values is χ (H, �, T ) = tr e−β H eλna†a+λa†+λ∗a+d I /Z = 〈e�〉.The trace is taken in the infinite-dimensional Hilbert space with Fock basis|0〉, |1〉, |2〉, . . . . It would be insane to attempt to compute this expectation valuewithout exploiting opportunities allowed by choice of a smaller, more convenientfaithful matrix representation M of the group. The calculation proceeds accordingto the following steps.

(i) Write each of the operators H , � in the 3 × 3 matrix representation M (cf., Eq.(7.59));

(ii) Compute the exponential of each. For example

e−βM(H ) = EXP − β

0 α∗ δ

0 �ω α

0 0 0

=

1 α∗ e−β�ω−1�ω

e−β�ω−1+β�ω

(�ω)2 α∗α − βδ

0 e−β�ω α e−β�ω−1�ω

0 0 1

(7.69)

(iii) Multiply the group operations together:

e−βM(H )eM(�) = 1 Zl ∗

0 ∗ Zr

0 0 1

(iv) Find a similarity transformation, S, that zeroes out Zl and Zr :

M(S)

1 Zl ∗

0 ∗ Zr

0 0 1

M(S−1) =

1 0 B

0 A 00 0 1

(v) Map this group operation to the infinite-dimensional matrix representation acting onthe Fock space

1 0 B0 A 00 0 1

→ eAa†a+B I

Page 133: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.6 Conclusion 119

(vi) Take the trace. Assuming A < 0 the sum converges to

tr eAa†a+B I = eB

1 − eA

(vii) Take the logarithm to find

log(χ (H, �, T )) = B − A − log(e−A − 1)

(viii) These steps can be implemented easily using symbol manipulation codes. The resultis

− A = β�ω − λn

B = e−β�ω − 1 + β�ω

(�ω)2α∗α − βδ + d + eλn − 1 − λn

λ2n

λ∗λ

+ e−β�ω − 1

�ω

eλn − 1

λn

(α∗λ + αλ∗) /

(1 − e−(β�ω−λn )

)+

[e−β�ω

(eλn − 1

λn

)2

λ∗λ + eλn

(e−β�ω − 1

�ω

)2

α∗α

] / [1 − e−(β�ω−λn )

](7.70)

The generating function for only the creation and annihilation operators (λn =d = 0) is considerably simpler.

7.6 Conclusion

The EXPonential mapping from a Lie algebra to a Lie group is generally not onto.It is not in general possible to recover the entire Lie group by taking a singleexponential of the Lie algebra. However, a sequence of exponential mappings fromvarious linear vector subspaces in the Lie algebra can be found that covers theLie group. This sequence of exponential mappings can be used to determine thestructure of the underlying manifold of the Lie group. It also provides a usefulparameterization for the Lie group.

Associated with every Lie algebra g is a unique Lie group G that is simplyconnected. Every matrix group with this Lie algebra is locally isomorphic to thiscovering group. Every Lie group G with Lie algebra g has the structure G/D, whereD is a discrete invariant subgroup of G. If D = Id, G is isomorphic to G, otherwiseit is a homomorphic image of G. For simple matrix groups, D consists of multiplesof the identity matrix, λIn , and is simple to compute. If G1 and G2 have isomorphicLie algebras they are locally isomorphic with the universal covering group and witheach other.

Many different parameterizations of a Lie group are possible. The most use-ful ones typically involve a sequence of exponential mappings of linear vector

Page 134: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

120 EXPonentiation

subspaces of the Lie algebra into the Lie group. These are “linear” in the sense thatthe coordinates parameterizing elements in the Lie group are components of a vectorin a linear vector space (the Lie algebra). Different parameterizations are related byanalytic reparameterization formulas, called Baker–Campbell–Hausdorff formulasfor historical reasons. These BCH formulas can be constructed by finding a faithfulmatrix representation of the Lie algebra, then carrying out the reparameterizationcomputation using products of exponentials of these matrices.

Exponentials play a fundamental role in physics as well as mathematics. Wehave explored two of the most useful applications of the exponential functionin physics. These describe dynamics and statics. The dynamical evolution of aquantum system is governed by a unitary transformation that can be written asa time-ordered exponential. If the hamiltonian is a linear superposition of basisvectors in a finite dimensional Lie algebra many useful computational methodsare available for its simple computation. We have provided two illustrations ofthe methods that are available. If the physical system is in thermodynamic equi-librium, the density operator is also the exponential of the hamiltonian. The two(dynamics and statics) are related by a “Wick rotation”: i t/� ↔ 1/kB T . We haveused the same two physical systems as vehicles to illustrate how the exponentialmapping, and suitable stepping back and forth through large and small unitary ornonunitary but faithful representations, has been used to simplify computation ofpartition functions and generating functions for symmetrized operator expectationvalues.

7.7 Problems

1. Construct the analytic group mapping φ((x1, y1), (x2, y2)) for the parameterization(7.24) of the affine group. Construct the mapping φ((w1, z1), (w2, z2)) for the param-eterization (7.25) of this group.

2. Show that a straight line through the origin of the parameter space (a, b, c) that isinside the light cone a2 + b2 − c2 < 0 (Eq. (7.7)) maps onto the subgroup SO(2) ⊂SL(2; R). Show that if a = b = 0, the basic “repetition period” in the c-direction,cT , in the subgroup is 2π but if a2 + b2 > 0 (

√a2 + b2 = β × c, |β| < 1), the basic

repetition period in the c-direction is increased to 2πγ , where γ = 1/√

1 − β2 andβ2 = (a2 + b2)/c2. Compare this renormalization of periodicity with “time dilation.”

3. Compute the maximal discrete invariant subgroup DMAX of SU (3) and show that itis

{I3, λI3, λ

2 I3}, where λ = e2π i/3. Next, show that SU (3)/DMAX is isomorphic to

the group of real 8 × 8 matrices EXP[Reg(su(3))] (“eight-fold way”).

4. Compute the maximal discrete invariant subgroup for the special unitary groups SU (n)and show that it is the cyclic group of order n generated by ε In , ε = e2π i/n . What realmatrix group is SU (n)/DMAX equivalent to?

Page 135: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.7 Problems 121

5. Show that the covering group SU (1, 1) does not have a maximum discrete invariantsubgroup.

6. It is convenient to introduce the creation and annihilation operators a†, a to study theone-dimensional quantum oscillator. These two operators are defined by

a† = 1√2

(x − d

dx

)a = 1√

2

(x + d

dx

)Computation of the matrix elements of the moments of x in the harmonic oscilla-tor basis, 〈n′|xk |n〉, can be simplified using disentangling theorems. This problemindicates how.a. The function eλx is a generating function for matrix elements of xk . Show that

〈n′|xk |n〉 = dk

dλk〈n′|eλx |n〉λ=0

b. Use the 3 × 3 matrix representation for the photon creation and annihilation oper-ators and their commutator [a, a†] = I to show

eλx = eλ(a†+a)/√

2 = EXP

λ√

2

0 1 0

0 0 10 0 0

=

1 λ/

√2 λ2/4

0 1 λ/√

20 0 1

c. Construct a disentangling theorem that expresses this group operator in the formera†

eδ I ela by constructing the matrix product of these three operators:

era†eδ I ela =

1 0 0

0 1 r0 0 1

1 0 δ

0 1 00 0 1

1 l 0

0 1 00 0 1

=

1 l δ

0 1 r0 0 1

d. By comparing the matrices in b and c, conclude

eλ(a†+a)/√

2 = eλa†/√

2eλ2/4eλa/√

2

e. Use the disentangling theorem in d to compute 〈n′|x4|n〉. In particular, show

〈n′|x4|n〉 = d4

dλ4

∑p,q,r

λp+2q+r

p!q!r !2−(p/2+2q+r/2)〈n′|(a†)par |n〉λ=0

→∑

p+2q+r=4

4!

p!q!r !

〈n′|(a†)p(a)r |n〉2(p/2+q+r/2)

The point of this exercise is that the computation of the matrix elements is simpli-fied because the operators are in normally ordered form (all annihilation operatorsfirst, on the right and all creation operators last, on the left). As a result, the cal-culation reduces to summing a descending series with no more than three nonzeroterms.

7. In order to describe the scattering of X-rays from an atom moving in a harmonicpotential it is necessary to compute a structure factor 〈eikx 〉. The expectation value is

Page 136: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

122 EXPonentiation

thermal: Pn � e−nβ�ω. This expectation value can be written in algebraic form as

〈eikx 〉 = tr eikx e−βH

tr e−βH (7.71)

We concentrate on the numerator, as the denominator is obtained in the limit k → 0.a. Show

tr eikx e−βH =∞∑

n=0

〈n|eikx |n〉 e−nβ�ω =∞∑

n=0

〈n|eikx e−nβ�ω|n〉 (7.72)

b. The trace is invariant under similarity transform (the operator is bounded). Showthat

tr eikx e−β�ωa†a = tr e−β�ωa†aeδ = eδ tr e−β�ωa†a (7.73)

As a result 〈eikx 〉 = eδ .c. Compute δ using 3 × 3 nonunitary matrix multiplications to carry out multi-

plications in the group rather than in an ∞ × ∞ unitary representation of thegroup.

M(S) M(eikx ) M(e−β�ωn) M(S−1) = M(e−β�ω′n) M(eδ) (7.74)

S eikx e−β�ωa†a S−1

↓ ↓ ↓ ↓ 1 α αβ/2

0 1 β

0 0 1

1 ik/

√2 −k2/4

0 1 ik/√

20 0 1

1 0 0

0 e−β�ω 00 0 1

1 −α αβ/2

0 1 −β

0 0 1

(7.75)

Carry out the multiplication of 3 × 3 matrices in this nonunitary representation M .Show that ω′ = ω. Determine α, β, and compute γ . Show

〈eikx 〉 = eδ δ = − 12 k2 coth

(12β�ω

)(7.76)

8. A finite set of operators Xi closes under commutation: [Xi , X j ] = ∑Nk=1 C k

i j Xk .These operators span a finite-dimensional Lie algebra g of Lie group G. Assume thatthis set of operators has two representations R and S with the following properties:• R is hermitian:

(R(ai Xi )

)† = (ai R(Xi )

)† = (ai )∗ R†(Xi ).• S is faithful: S(ai Xi ) = 0 ⇒ ai = 0.We require S to be finite dimensional so that simple matrix computations are possi-ble. We require R to be hermitian to make an immediate connection with quantummechanics.a. It happens frequently that H = R(ai Xi ) describes the physics of some quantum

mechanical system. Show that if H1, H2, . . . , Hr ∈ g span a maximal commutativesubspace, so that [Hi , Hj ] = 0, 1 ≤ i, j ≤ r , then the hermitian operators R(Hi )are mutually commutative and can all be made diagonal simultaneously in thisrepresentation: [R(Hi )]αβ = rα(i)δαβ .

Page 137: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.7 Problems 123

b. Show that [S(Hi ), S(Hj )] = 0, but show by example that the r matrices S(Hi )cannot always be simultaneously diagonal.

c. Show the time evolution of the quantum system is given by the unitary operatorU (t) = R(e− i

�Ht ) = e− i

�R(H)t .

d. Show that the density operator for thermal expectation values is ρ(T ) = e−βH/Z =R(e−βH)/Z = e−β R(H)/Z . What is Z?

e. Show that the unitary time evolution operator U (t) and the hermitian densityoperator ρ(T ) are related by a Wick rotation i t/� ↔ β = 1/kB T .

f. A generating function for thermal expectation values has the form

〈exi Xi 〉 = tr eR(xi Xi )e−βH

tr e−βH → tr R(exi Xi ) R(e−βai Xi )

tr R(e−βai Xi )(7.77)

g. The operator product in the numerator is in the group G = eg or its complex exten-sion. If this operator product can be transformed to “diagonal” form (i.e., expressedin terms of the operators Hi ) the trace can easily be constructed. Show that forxi sufficiently small it is always possible to construct a similarity transformationS = eyk Xk with the property

Sex j X j e−βai Xi S−1 = e−βdi (x,a)Hi (7.78)

h. The thermal expectation value then reduces to

〈exi Xi 〉 = tr R(e−βdi (x,a)Hi )

tr R(e−βdi (0,a)Hi )(7.79)

Since the Hi are diagonal in the representation R, the sums are straightforward.i. Relate the steps in the algorithm described in this problem to the steps followed in

the previous problem for computing the result derived in Eq. (7.76). In particular,identify the operators Xi , the “diagonal” operators Hi , the hermitian representationR (it is invisible), the faithful representation S (it is given explicitly), the generatingfunction exi Xi , and the Wick rotation.

9. Coherent states were first discussed by Schrodinger in 1926. For many purposes itis useful to apply a unitary transformation to the harmonic oscillator ground state.The unitary transformation has the form U (α) = e(αa†−α∗a), where a† and a are theusual photon creation and annihilation operators. This unitary operator, acting on theground state, is relatively simple to compute if it can be disentangled as follows

U (α)|0〉 = e(αa†−α∗a)|0〉 = eβa†eδ I eβ ′a|0〉 (7.80)

This disentangling theorem can be worked out easily in the 3 × 3 nonunitary rep-resentation. (It is the group multiplication property that we are after; unitarity is anadditional structure that is applied to the representation of the group.)

Page 138: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

124 EXPonentiation

a. Show that the left-hand side of Eq. (7.80) simplifies to

EXP

1 −α∗ 0

0 1 α

0 0 1

=

1 −α∗ −α∗α/2

0 1 α

0 0 1

(7.81)

b. Show that the right-hand side of Eq. (7.80) becomes 1 β

′δ

0 1 β

0 0 1

(7.82)

c. Use this result to compute

e(αa†−α∗a)|0〉 = eαa†e−α∗α I/2e−α∗a|0〉 =

∑ (αa†)n

n!|0〉e−α∗α/2 (7.83)

d. Use a further property of the creation operators (this is a representation-dependentproperty, so the calculation has now moved back into the infinite-dimensionalHilbert space and out of the nonunitary 3 × 3 matrix representation), a†|n〉 =|n + 1〉√n + 1 to conclude

|α〉 = U (α)|0〉 = e−α∗α/2∞∑

n=0

αn

√n!

|n〉 (7.84)

e. Compute the inner product 〈β|α〉 and show 〈α|α〉 = 1.f. Show a|α〉 = α|α〉.g. Show 〈α|x |α〉 = (α∗ + α)/

√2.

9. An SU (2) coherent state (also called atomic coherent state) is constructed by theaction of an arbitrary SU (2) group operation on the ground state, or lowest lyingstate, in a 2 j + 1 dimensional invariant space (Arecchi et al., 1972; Gilmore, 1974b):∣∣∣∣ j

θ

⟩= SU (2)

∣∣∣∣ j− j

⟩(7.85)

a. Show that rotations by φ around the z-axis serve only to multiply the fiducial state

by a phase angle: eiφ Jz | j− j 〉 = | j

− j 〉e−i jφ . This simply “renormalizes” the fiducial

state, and is generally not important.b. Rotations about an axis in the x–y plane produce a two-parameter family of

coherent states parameterized by coset representatives in SU (2)/U (1):∣∣∣∣ jθ

⟩= ei(θx Jx +θy Jy )

∣∣∣∣ j− j

⟩i(θx Jx + θy Jy) = i

2

[0 θx − iθy

θx + iθy 0

](7.86)

c. Rewrite ei(θx Jx +θy Jy ) in the form eiα+ J+eiαz Jz eiα− J− and compute the analytic relationbetween the angles θ and the parameters α.

d. Show eiα− J−| j− j 〉 = | j

− j 〉.e. Show eiαz Jz | j

− j 〉 = | j− j 〉e−i jαz .

Page 139: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.7 Problems 125

f. Compute finally

U (α)|0〉 = eiα+ J+

∣∣∣∣ j− j

⟩e−i jαz =

m=+ j∑m=− j

(iα+ J+) j+m

( j + m)!

∣∣∣∣ j− j

⟩e−i jαz (7.87)

g. Show that J−| jθx θy

〉 cannot be proportional to | jθx θy

〉 because the state | j+ j 〉 is not

occupied. This is different from the harmonic oscillator (photon operator) case.The difference arises because SU (2) is compact with finite-dimensional unitaryirreducible representations and the harmonic oscillator group H4 is not compactwith only an infinite-dimensional unitary irreducible representation of interest.

h. Compute the inner product and show

⟨j j

θ ′xθ

′y θxθy

⟩=

[cos

(θ ′

2

)cos

2

)+ ei(φ′−φ) sin

(θ ′

2

)sin

2

)]2 j

(7.88)

where e−iφ = (θx − iθy)/θ , and similarly for θ ′ (cf., Eq. (7.46)).

10. A number of important quantum eigenvalue equations can be expressed in algebraicformat. A toy example is

(E J3 + pJ1 − Z ) |u〉 = 0

Here E is an energy eigenvalue, p is some sort of coupling strength, Z could (andsometimes does) represent a charge, and |u〉 is an eigenfunction. In this toy example,the operators J3 and J1 are assumed to belong to the Lie algebra su(2) and the equationapplies to half-integer spin spaces ((2 j + 1) is even).a. Show that a unitary transformation U transforms this equation to the diagonal form

(E ′ J3 − Z )|v〉 = 0, where E ′ =√

E2 + p2 and |v〉 = U |u〉.b. Show that E = ±

√(Z/m)2 − p2.

c. Compare this spectrum with the unperturbed spectrum (p → 0).d. Under what conditions on j, p, Z are these solutions valid?e. Construct the unitary transformation that diagonalizes the eigenvalue equation,

and show that |u〉 = eiθ J2 | jm 〉. Compute θ for each E .

11. Compute the matrix elements of the rotation matrices in the 2 j + 1 unitary irreduciblerepresentations of SU (2) and show

EXP(iβ Jy)mn = D jmn(β) = P j

mn(z) = (−) j−m

2 j ( j − n)!

[( j − n)!( j + m)!

( j + n)!( j − m)!

]1/2

× (1 + z)−(m+n)/2(1 − z)−(m−n)/2

(d

dz

) j−m [(1 − z) j−n(1 + z) j+n

]where z = cos(β). The Wigner matrix elements D j

mn are related to the Jacobi poly-nomials when j = l, where l is an integer.

Page 140: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

126 EXPonentiation

12. Use the decompositions (7.21) for SO(3) and (7.22) for SU (2) to show the following.a. Geodesics through I2 ∈ SU (2) focus at −I2 and geodesics through I3 ∈ SO(3)

focus at I3. Conclude that SU (2) is a two-fold covering group of SO(3).b. Geodesics through the “north pole” of SU (2)/U (1) (z = 1, x = y = 0) focus at

its “south pole” (z = −1, x = y = 0) and geodesics through the north pole ofSO(3)/SO(2) (z = 1, x = y = 0) focus at its south pole (z = −1, x = y = 0).

c. Conclude that SU (2)/U (1) = S2 = SO(3)/SO(2) and the 2 → 1 nature of thecovering SU (2) ↓ SO(3) is contained in the subgroup of rotations about the z-axis U (1) ↓ SO(2):[

eiθ/2 00 e−iθ/2

]2→1−→

[cos θ sin θ

− sin θ cos θ

]

13. Show that the discrete invariant subgroups of SU (n) are all commutative groups oforder r , with group elements e2π ik/r In , with n/r integer. Compute the foci in SU (n).How are the foci related to the group operations of the form e2π ik/n In?

14. Show that the matrix [ −λ 00 −1/λ

] in SL(2; R) cannot be reached by exponentiatingany element in the Lie algebra if λ > 1. Show that it can be reached by following a“broken geodesic” eAeB . Find matrices A and B that do this. (Hint: do not work toohard.)

15. A simple model has been introduced to describe the interaction of light with matter. Inthis model (Dicke model) N atoms interact with a single mode of the electromagneticfield. Each atom is modeled as a two-level system, with energy separation ε. A singlephoton has energy �ω. The hamiltonian is chosen as

H =N∑

i=1

ε

2σ (i)

z + �ωa†a + λ√N

N∑i=1

σ(i)+ a + σ

(i)− a†

The operator σ (i)z describes the two states of atom i and the operator a†a describes

the number of photons in the field mode. The operator σ( j)+ (σ ( j)

± = 12 (σ ( j)

x ± iσ ( j)y ))

describes transitions of the j th atom from the ground to its excited state. This atomictransition is accompanied by the absorption (annihilation) of a single photon. Theoperator σ

( j)− a†, describes deexcitation of an atom with emission (creation, a†) of a

photon. The strength of interaction of the atom with the electromagnetic field (thedipole moment) is parameterized by λ.a. Assume the atoms are independent and show[

σ (i)z , σ

( j)±

] = ±σ(i)± δi j

(i)+ , σ

( j)−

] = σ (i)z δi j

b. If all the atoms behave cooperatively it is possible to replace∑N

i=112σ (i)

z → Jz ,∑Ni=1 σ

(i)± → J±. Show that the operators Jz, J± satisfy the usual su(2) commuta-

tion relations.c. Assume the atoms “behave classically.” This means that the quantum mechan-

ical operators Jz, J± can be replaced by their c-number expectation values:

Page 141: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

7.7 Problems 127

Jz → 〈Jz(t)〉, J+ → 〈J+(t)〉, J− → 〈J−(t)〉 = 〈J+(t)〉∗. Show that this semiclas-sical hamiltonian

Hfield = ε〈Jz(t)〉 + �ωa†a + λ√N

(〈J+(t)〉a + 〈J−(t)〉a†)

maps the ground state of the field (the state with no photons) into a coherent stateof the electromagnetic field: |α(t)〉 = U (α(t))|0〉 = eαa†−α∗a|0〉. Use the disentan-gling theorems to compute the relation between the coherent state parameter α(t)and the classical driving fields 〈Jz(t)〉 and 〈J+(t)〉 = 〈J−(t)〉∗.

d. Show that if the initial state of the field is not the ground state, but rather a coherentstate |β〉, the state obtained by the action of the classical current is still a coherentstate. How are the parameters β, describing the initial condition, and α, describingthe unitary evolution of the field, related?

e. Suppose now that the atoms are considered quantum mechanically but the field isconsidered classically. Show that this amounts to the substitutions a† → 〈a(t)〉∗,a → 〈a(t)〉, and a†a → 〈a(t)〉∗〈a(t)〉.

f. Show that the resulting semiclassical hamiltonian is

Hatoms = ε Jz + λ√N

(J+〈a(t)〉 + J−〈a(t)〉∗)

Show that under this semiclassical hamiltonian, if the atoms are in their collectiveground state (m = − 1

2 for each atom, or M = −J, J = N/2 for the ensemble ofN atoms) the ground state will evolve into a coherent state of the group SU (2)parameterized by a point in the coset SU (2)/U (1).

g. Show that, under the action of this semiclassical Hamiltonian a coherent state willevolve into a coherent state: |θ (t)〉 = eiθ (t)·J|J, −J 〉, where J = N/2. How are theangles θ (t) related to the classical field variables 〈a(t)〉 and 〈a(t)〉∗?

h. Conclude that there is a duality between the atoms and the field in this model:a classical current will produce a coherent state of the electromagnetic field; aclassical electromagnetic field will produce a coherent atomic state.

i. The semiclassical hamiltonian for the field can be used to construct time-dependentfield expectation values 〈a〉 and 〈a〉∗. Conversely, the semiclassical hamiltonianfor the atoms can be used to construct time-dependent atomic expectation values〈J+〉 = 〈J−〉∗. Construct a self-consistent model by requiring that both sets oftime-dependent quantities are equal.

16. The thermodynamic properties of the Dicke model can be studied in a similar fashion.Assume N identical atoms interacting with a single field mode are in thermodynamicequilibrium at temperature T (β = 1/kB T ).a. Assume 〈σ (i)

+ 〉T has some fixed unknown value, and similarly for the other atomicthermal expectation values. Use these values in the semiclassical approximationfor the field hamiltonian to compute the density operator. Compute the thermalexpectation values for the operators a†, a, a†a.

Page 142: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

128 EXPonentiation

b. Dualize. Assume the field operators have fixed but unknown expectation values.Use these values in the semiclassical approximation for the atomic hamiltonianto compute the density operator. Compute the thermal expectation values for theoperators σz, σ+, σ−.

c. Impose self-consistency. Require that if a set of field thermal expectation valuesproduces specific atomic expectation values, these atomic expectation values pro-duce the same set of field expectation values. This leads to a nonlinear set ofself-consistency equations. These self-consistent equations may have more thanone solution.

d. To lift the self-consistent solution degeneracy, construct the thermal expectationvalue for H. Choose the minimum energy solution. Under what conditions onε, �ω, λ, N is there a nontrivial solution (e.g., 〈J+〉T �= 0)?

e. Show that a thermodynamic phase transition occurs as λ2/ε�ω increases through+1. Is this a first or second order phase transition?

17. The two complex parameters a(t), b(t) in the evolution equation (7.58) can be ex-pressed in terms of their real and imaginary parts. These obey a2

r + a2i + b2

r + b2i = 1

(unitarity condition). This condition simply reflects that the state of the system is givenby a unit quaternion. As numerical integration proceeds, imprecisions may cause theseparameters to depart slightly from the unitarity condition. Devise a self-correctingintegration procedure to correct for this type of error. After N small integration steps,compute the length of the vector (ar , ai , br , bi ) and scale this length back to +1.

18. The thermodynamic generating functions for SU (2) and H4 given by expressions(7.67) and (7.70) simplify considerably if the “diagonal operator” is not included.Simplify (7.67) by taking the limit λ3 → 0. Simplify (7.70) by taking the limit λn → 0and setting d = δ = 0.

19. For many reasons it is less desirable to compute thermal expectation values for sym-metric operator products such as 〈J+ J− + J− J+〉 or 〈aa† + a†a〉 than it is to constructgenerating functions for ordered products of operators such as 〈J+ J−〉 or 〈a†a〉. Showhow to use disentangling theorems to transform the generating functions for symmet-ric operator products in (7.67) and (7.70), or their simplified forms constructed in theprevious problem, into generating functions for ordered products of operators.

Page 143: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8

Structure theory for Lie algebras

In this chapter we discuss the structure of Lie algebras. A typical Liealgebra is a semidirect sum of a semisimple Lie algebra and a solvablesubalgebra that is invariant. By inspection of the regular representation“in suitable form,” we are able to determine the maximal nilpotent andsolvable invariant subalgebras of the Lie algebra and its semisimple part.We show how to use the Cartan–Killing inner product to determine whichsubalgebras in the Lie algebra are nilpotent, solvable, semisimple, andcompact.

8.1 Regular representation

A Lie algebra is defined by its commutation relations. The commutation relationsare completely encapsulated by the structure constants. These are convenientlysummarized in the regular representation

[Z , Xi ] = R(Z ) ji X j (8.1)

Under a change of basis X j = A sj Ys this n × n matrix undergoes a similarity trans-

formation

S(Z ) sr = (A−1) i

r R(Z ) ji A s

j (8.2)

It is very useful to find a basis, or construct a similarity transformation, that bringsthe regular representation of every operator in the Lie algebra simultaneously tosome standard form. The structure of the Lie algebra can be decided by inspectingthis standard form.

8.2 Some standard forms for the regular representation

We summarize in Fig. 8.1 the standard forms that the regular representation canassume. We also provide an example for each.

1. Zero In this case all structure constants vanish and the algebra is commutative.

129

Page 144: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

130 Structure theory for Lie algebras

1 2 3

000

zero nil (n) sol (n)

00

0

0

••

λ1

•••

λ2

λn

λ3

654

0

reducible fully reducible irreducible

0

Figure 8.1. Structure of the regular representation for different types of Lie algebras.

Example The Lie algebra a(p, q) consists of matrices of the form

0 A

0 0

↑p↓↑q↓

(8.3)

This is an n × n (n = p + q) matrix algebra which is N = pq dimensional. Theindependent degrees of freedom are the N independent matrix elements of the p × qmatrix A. The n × n matrices all commute under matrix multiplication. The groupoperation is equivalent to addition of the p × q matrices. The regular representationconsists of N × N matrices, all N of them are zero.

2. nil(n) Strictly upper triangular In this case the Lie algebra is nilpotent.

Example We consider the Lie algebra spanned by the photon operators a, a†, andI = [a, a†] or the isomorphic 3 × 3 matrix algebra (5.11). The regular representa-tion is a 3 × 3 matrix

Reg(la + ra† + δ I ) =0 0 l

0 0 −r0 0 0

a†

aI

(8.4)

3. sol(n) Upper triangular In this case nonzero elements occur on and above thediagonal. The algebra is solvable.

Page 145: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8.2 Some standard forms for the regular representation 131

Example The algebra spanned by the photon number operator n = a†a, creationand annihilation operators a† and a, and their commutator I = [a, a†] is isomorphicto the algebra described by the 3 × 3 matrices (5.9). The regular representation isa 4 × 4 matrix

Reg(ηn + la + ra† + δ I ) =

0 −r l 00 η 0 l0 0 −η −r0 0 0 0

na†

aI

(8.5)

4. ut(p, q) In this case the regular representation is reducible and the Lie algebrais nonsemisimple.

Example We consider the algebra consisting of the six photon operators n =12

{a, a†} = a†a + 1

2 , a†2, a2, a†, a, I = [a, a†]. This is isomorphic to the algebraof six 4 × 4 matrices presented in (5.7). The algebra of 4 × 4 matrices (the “defin-ing” representation) and the regular representation of this algebra are given below:

η(n + 1

2

) + Ra† 2 + La2 + ra† + la + δ I

def =

0 l r −2δ

0 η 2R −r0 −2L −η l0 0 0 0

Reg =

0 −2R 2L −r l 04L 2η 0 2l 0 0

−4R 0 −2η 0 −2r 0η 2L l

−2R −η −r0 0 0

n + 12

a†2

a2

a†

aI

(8.6)

The subspace spanned by the three operators a†, a, I is invariant, as is shown bythe structure of the regular representation.

5. Block diagonal In this case the regular representation is fully reducible and theLie algebra is semisimple.

Example The Lie algebra so(4) has six generators Xi j = −X ji , 1 ≤ i, j ≤ 4 andcommutation relations

[Xi j , Xrs] = Xisδ jr + X jrδis − Xirδ js − X jsδir (8.7)

Page 146: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

132 Structure theory for Lie algebras

The following two linear combinations of generators

Yi = 12 (Xi4 + 1

2εirs Xrs) Xi4 = Yi + Zi

Zi = 12 (Xi4 − 1

2εirs Xrs) Xi j = εi jk(Yk − Zk)(8.8)

obey the commutation relations

[Yi , Y j

] = −εi jkYk[Zi , Z j

] = +εi jk Zk (8.9)[Yi , Z j

] = 0

The 4 × 4 defining matrix representation and the 6 × 6 regular representation havethe structure

X = ∑yi Yi + ∑

z j Z j

def(X ) →

0 +(y3 − z3) −(y2 − z2) +(y1 + z1)−(y3 − z3) 0 +(y1 − z1) +(y2 + z2)+(y2 − z2) −(y1 − z1) 0 +(y3 + z3)−(y1 + z1) −(y2 + z2) −(y3 + z3) 0

Reg(X ) →

0 −y3 +y2

+y3 0 −y1

−y2 +y1 00 +z3 −z2

−z3 0 +z1

+z2 −z1 0

Y1

Y2

Y3

Z1

Z2

Z3

(8.10)

Since the regular representation has a block diagonal structure, the algebra issemisimple. It is not at all obvious that the Lie algebra so(4) is semisimple andcan be written as the direct sum of two simple algebras. This is not true for theother orthogonal Lie algebras, so(n), n > 4. We will have to wait until Chapter 10to be able to see easily that so(4) is semisimple, not simple.

6. Irreducible In this case the regular representation is irreducible and the Liealgebra is simple.

Example The Lie algebras su(n) (n ≥ 2), so(n) (n > 4), and sp(n) (n ≥ 1) areall simple. To be concrete, the Lie algebra for SU (2) has defining and regular

Page 147: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8.3 What these forms mean 133

representations

def = i

2

[a3 a1 − ia2

a1 + ia2 −a3

]Reg =

0 −a3 +a2

+a3 0 −a1

−a2 +a1 0

X1

X2

X3

(8.11)

8.3 What these forms mean

Reducing the regular representation to one of the standard forms described in theprevious section means that the structure constants, and therefore the commutationrelations, have also been reduced to some standard form. We discuss in this sec-tion what each of the standard forms implies about the commutation relations andstructure of the Lie algebra.

1. Commutative case If all the structure constants are zero, then

[Xi , X j ] = 0 (8.12)

for each element in the Lie algebra.

2, 3. Nilpotent and solvable In these cases

[Z , Xi ] = R(Z ) ji X j

R(Z ) ji = 0 unless j > i nilpotent

j ≥ i solvable(8.13)

This means that [Z , Xi ] can be expressed as a linear combination of basis vectorsX j with j ≥ i . This in turn means that the basis vectors Xi , Xi+1, . . . , Xn span asubalgebra for each value of i = 1, 2, . . . , n. Since this subalgebra is mapped ontoitself by every element Z in the original algebra, each subalgebra is an invariantsubalgebra. The result is shown schematically in Fig. 8.2 and is summarized by

V1 spanned by Xn, Xn−1, Xn−2, . . . , X2, X1

∪V2 spanned by Xn, Xn−1, Xn−2, . . . , X2

∪...

......

∪Vn−2 spanned by Xn, Xn−1, Xn−2

∪Vn−1 spanned by Xn, Xn−1

∪Vn spanned by Xn

(8.14)

Each Vi is an invariant subalgebra in Vj , i > j . The original algebra is V1.

Page 148: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

134 Structure theory for Lie algebras

X1

X2

X0 n–1

Xn Vn Vn –1

V2

V1

λ1

λ2

λn

λn –1

Figure 8.2. Structure of nilpotent and solvable algebras.

4. Reducible or nonsemisimple The block diagonal form for the regular repre-sentation requires the commutation relations

∗ ∗

0 ∗

↑V1

↓↑V2

⇒ [Any, V2] ⊂ V2 (8.15)

Since in particular [V2, V2] ⊆ V2, V2 is a subalgebra in the original algebra. Further,since the commutator of anything in the original algebra with V2 is in V2, V2 is aninvariant subalgebra. The complementary subspace V1 is not generally even asubalgebra of the original algebra.

5. Fully reducible or semisimple In this case the block diagonal form for theregular representation requires the commutation relations

∗ 0

0 ∗

↑V1

↓↑V2

⇒[V1, V1] ⊆ V1

[V2, V2] ⊆ V2

[V1, V2] = 0(8.16)

Both V1 and V2 are invariant subalgebras. Moreover, every element in V1 commuteswith every operator in V2. Therefore the two subalgebras V1 and V2 can be studiedseparately and independently.

6. Irreducible or simple In this case every generator X can be written as thecommutator of some pair of operators Y and Z in the Lie algebra:

X = [Y, Z ] (8.17)

Page 149: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8.4 How to make this decomposition 135

It is this ability of an algebra to reproduce itself under commutation that distin-guishes simple and semisimple Lie algebras from solvable and nilpotent algebras.Nonsemisimple algebras are composed of a semisimple subalgebra and a solvableinvariant subalgebra.

8.4 How to make this decomposition

There is a systematic procedure for decomposing a Lie algebra into its semisimplecomponent and its maximal solvable invariant subalgebra. This is a simple two-stepprocedure. In the first step we identify the subspace of the Lie algebra on which theCartan–Killing inner product is identically zero. If there is no such subspace thealgorithm stops here and the algebra is semisimple. If there is a nontrivial subspace,it forms the maximal nilpotent invariant subalgebra of the algebra. This subspaceis “removed” from the algebra, and the commutation relations and Cartan–Killinginner product for the remaining operators are computed. The algorithm stops here,regardless of the outcome. If there is a nontrivial subspace on which the newCartan–Killing inner product is identically zero, the elements in this subspace, to-gether with the previously identified nilpotent invariant subalgebra, span a solvablealgebra. This is the maximal solvable invariant subalgebra in the original Lie alge-bra. The complementary subspace on which the new Cartan–Killing inner productis nonsingular is the semisimple part of the original Lie algebra.

In small, easy-to-digest steps, this two-step algorithm takes the following form.

(i) From the structure constants of the original Lie algebra g form the Cartan–Killing innerproduct.

(ii) Determine the subspace on which this inner product is positive-definite, negative-definite, and zero:

g = (V− + V+) + V0 (8.18)

(iii) If V0 is zero, stop. If not, V0 is the maximal nilpotent invariant subalgebra in g.(iv) Form the difference g′ = g − V0. This is a Lie algebra (under the “mod” operation: set

to zero any part of the commutator that is in V0). Compute the structure constants andCartan–Killing inner product for g′.

(v) Effect another decomposition:

g′ = (V ′− + V ′

+) + V ′0 (8.19)

(vi) The original Lie algebra has the following structure

g = V ′−︸︷︷︸

compact subalgebra

+ V ′+︸︷︷︸

noncompact generators︸ ︷︷ ︸semisimple

+ V ′0︸︷︷︸

abelian

+ V0︸︷︷︸nilpotent︸ ︷︷ ︸

maximum solvable invariant subalgebra︸ ︷︷ ︸nonsemisimple Lie algebra

(8.20)

Page 150: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

136 Structure theory for Lie algebras

This algorithm cannot distinguish semisimple Lie algebras from simple Lie alge-bras. We will develop tools in Chapter 10 that will make this distinction possiblesimply by inspection of the algebra’s (canonical) commutation relations.

8.5 An example

To illustrate this procedure, we compute the structure of the six-dimensional Liealgebra of two photon operators. The regular representation is given in (8.6). Theinner product of a vector with itself is

(X, X ) = −40RL + 10η2 (8.21)

The inner product is identically zero on the subspace V0 spanned by a†, a and I .The three remaining operators have regular representation

η(a†a + 12 ) + Ra†2 + La2 −→

0 −2R 2L

4L 2η 0

−4R 0 −2η

n + 12

a†2

a2

(8.22)

with inner product

(X, X )′ = −32RL + 8η2 (8.23)

In this case V ′0 = 0 and the two photon algebra has the decomposition

g = (n + 12 , a†2, a2)︸ ︷︷ ︸su(1,1)

+ (a†, a, I )︸ ︷︷ ︸nilpotent invariant subalgebra

(8.24)

The Cartan–Killing inner product can be diagonalized by choosing two linear com-binations of the operators a†2 and a2. Then a†2 + a2 is a compact generator, sincethe Cartan–Killing form is negative-definite on it. The other two operators, a†a + 1

2and a†2 − a2, are noncompact.

8.6 Conclusion

An arbitrary Lie algebra is a semidirect sum of a semisimple Lie algebra and asolvable invariant subalgebra. The structure of a Lie algebra can be determined byinspecting its regular representation, once this has been brought to suitable form bya similarity transformation. To facilitate constructing this transformation, we haveshown how to use the Cartan–Killing inner product to determine the linear vectorsubspaces in the Lie algebra that are maximal nilpotent invariant subalgebras, themaximal solvable invariant subalgebra, the semisimple subalgebra, and its maximalcompact subalgebra.

Page 151: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

8.7 Problems 137

8.7 Problems

1. Compute the decomposition (8.20) fora. The photon algebra n, a†, a, I (Eq. (8.5)).b. The algebra so(3, 1).c. The algebra for the Poincare group (Eq. (3.26)).d. The algebra for the Galilei group (Eq. (3.27)).

2. Compute the decomposition (8.20) for Lie algebras spanned by various combina-tions of the boson creation and annihilation operators (a–g below). These satisfy[bi , b†

j ] = I δi j , 1 ≤ i, j ≤ n. Commutators involving bilinear (trilinear, . . .) productsare computed in the usual way.a. bi , b†

j , I .

b. b†i b j .

c. b†i b j , bi , b†

j , I .

d. b†i b†

j , b†i b j + 1

2δi j , bi b j .

e. b†i b†

j , b†i b j + 1

2δi j , bi b j , bi , b†j , I .

f. b, b†b, b†b†b.g. b†, b†b, b†bb.

3. Fermion creation and annihilation operators obey anticommutation relations{ fi , f †j } = δi j , but their bilinear combinations close under the same commutation re-lations as do boson operators. Compute the structure of these fermion algebras:a. f †i f j .b. f †i f †j , f †i f j + 1

2δi j , fi f j .

4. Compute the decomposition (8.20) for Lie algebras spanned by various combinationsof the position (xi ) and momentum (∂ j ) operators for n independent directions:a. xi , ∂ j , I .b. xi∂ j .c. xi∂ j , xi , ∂ j , I .d. xi x j , xi∂ j + 1

2 I δi j , ∂i∂ j .e. xi x j , xi∂ j , ∂i∂ j , xi , ∂ j , I .f. d

dx , x ddx , x2 d

dx .

g. x, x ddx , x d2

dx2 .

5. What is the relation between the Cartan–Killing inner product computed using thedefining matrix representation of a matrix Lie algebra and using the regular matrixrepresentation of the Lie algebra?

6. The Lorentz, Poincare, and Galilei groups in 2 + 1 dimensions (x, y and t) have Liealgebras with matrix structures:

0 θ v1

−θ 0 v2

v1 v2 0

0 θ v1 t1−θ 0 v2 t2v1 v2 0 t30 0 0 0

0 θ v1 t1−θ 0 v2 t20 0 0 t30 0 0 0

Lorentz Poincare Galilei

(8.25)

Page 152: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

138 Structure theory for Lie algebras

a. Compute the matrix infinitesimal generators for each.b. Construct their commutation relations.c. Decompose each Lie algebra into the standard form (8.20).d. For each Lie algebra, express the generators in terms of the operators xi , ∂ j .e. For each Lie algebra, express the generators in terms of the boson operators b†

i , b j ,1 ≤ i, j ≤ 3.

7. In a semisimple Lie algebra the Cartan–Killing metric gi j = C sir C r

js is nonsingular andtherefore the contravariant metric gi j is well defined. Show that the bilinear operatorC2 = gi j Xi X j satisfies [C2, Xk] = 0. If there is one quadratic Casimir operator, it musttherefore be proportional to C2.

8. Show that Ci jk = C ri j grk is a third order antisymmetric tensor: Ci jk = C jki = Cki j =

−Ckji = −C jik = −Cik j . (Hint: use the Jacobi identity.)

9. Determine the structure of the Lie algebra defined by the following operators (cf.,Eq. (16.57)):

Xi j = xi∂ j − x j∂i

Yi = 2t∂

∂xi− xi u

∂u(8.26)

Z = 2t∂

∂t+ xi ∂

∂xi− nu

∂u

Page 153: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9

Structure theory for simple Lie algebras

In this chapter we continue the development begun in the previous chapter.These two chapters focus on determining the structure of a Lie algebra andputting it into some canonical form. In the previous chapter we determinedthe types of subalgebras that every Lie algebra is constructed from. In thischapter we put the commutation relations into a standard form. This canbe done for any Lie algebra. For semisimple Lie algebras this standardform has a very rigid structure whose usefulness is surpassed only by itsbeauty.

9.1 Objectives of this program

In the previous chapter we studied the commutation relations of a Lie algebrathrough its regular representation. This study was carried out using as a tool theCartan–Killing inner product. As far as possible, this was the only method used.In the present chapter we introduce a second powerful tool from the theory oflinear vector spaces. This is the eigenvalue decomposition. This tool is introducedin an attempt to find standard forms for the commutation relations. If a standardform is available then the properties of a Lie algebra, as well as its identification(classification), can be determined at sight.

The eigenoperator decomposition is effected by computing and studying a secularequation determined from the matrix of the regular (or any other matrix) represen-tation of the Lie algebra. To get the most information from this study we seek themaximum number of independent roots of this equation. The decomposition of theLie algebra into eigenoperators according to the roots of the secular equation, andthe properties of these roots, can also be discussed for any Lie algebra. However,for Lie algebras with a nonsingular Cartan–Killing inner product – semisimple andsimple Lie algebras – the properties of the roots are very rigidly prescribed. Thisleads to a very elegant set of canonical commutation relations.

139

Page 154: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

140 Structure theory for simple Lie algebras

In introducing an eigenvalue equation it is necessary to extend the field overwhich the Lie algebra is defined from the real to the complex numbers. Withoutthis extension it is not always possible to find roots of the secular equation. Thisfield extension has the drawback that several different Lie algebras (e.g., su(2) andsu(1, 1)) have the same complex extension and have their different commutationrelations cast into the same canonical form. We return to this question in Chapter 11,where the problem is resolved.

9.2 Eigenoperator decomposition – secular equation

It would be very useful to find vectors Z , X in the Lie algebra that obeyed the“eigenoperator” commutation relations

[Z , X ] = λX (9.1)

It would be even more useful if we could find a set of basis vectors for the Lie algebrawhich all simultaneously obeyed commutation relations of the eigenoperator type.

To determine operators X for which such commutation relations are possible,we write X = ∑N

i=1 ai Xi , where Xi form a basis set. Then[Z ,

∑ai Xi

]= λ

∑a j X j∑ ∑

ai(R(Z ) j

i − λδj

i

)X j = 0 (9.2)

This equation has a nonzero solution for the coefficients ai provided the secularequation

‖ R(Z ) − λIN ‖ = 0 (9.3)

can be solved. This equation can be expanded as a polynomial in λ

N∑j=0

(−λ)N− jφ j (Z ) = 0 (9.4)

where N is the dimension of the Lie algebra and its regular representation. Thecoefficients φ j (Z ) are homogeneous polynomials of degree j in the coefficientszi (Z = ∑

zi Xi ) that describe Z :

φ j (Z ) → φ j (zi ) (9.5)

Example The regular representation for the three-dimensional Lie algebraspanned by the photon creation and annihilation operators and their commutator

Page 155: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.2 Eigenoperator decomposition – secular equation 141

a†, a, I = [a, a†] is

ra† + la + δ Iregular−→

representation

0 l 0

0 0 00 −r 0

a†

Ia

(9.6)

With this ordering of basis vectors the regular representation does not havethe structure indicated in (8.4) and Fig. 8.1 for a nilpotent algebra. The secularequation is

‖ Reg(ra† + la + δ I ) − λI3 ‖ = (−λ)3 = 0 (9.7)

Strictly upper (or lower) triangular matrices have a secular equation of this form.The converse is true. If the secular equation of an N × N matrix is (−λ)N = 0, thena basis can be found in which the matrix has strictly upper (or lower) triangular form.Therefore, there is a permutation transformation of the basis vectors that brings theregular representation of this Lie algebra to strictly upper triangular form, and thealgebra is nilpotent by inspection.

Example For X = ∑ai Xi ∈ su(2) the defining 2 × 2 matrix representation

def(X ) and the regular 3 × 3 matrix representation Reg(X ) are

def(X ) = 1

2

[ia3 i(a1 − ia2)

i(a1 + ia2) −ia3

]

Reg(X ) = 0 −a3 a2

a3 0 −a1

−a2 a1 0

(9.8)

The secular equation for the regular representation is

‖ Reg(X ) − λI3 ‖ = (−λ)3 + (−λ)(+a21 + a2

2 + a23) = 0

= (−λ)(λ2 + φ2(a)) (9.9)

φ2(a) = +a21 + a2

2 + a23

Since φ2(a) ≥ 0, this secular equation cannot be solved over the field of real num-bers. Extension of the field from the real to the complex numbers allows factor-ization to find the three (three is the dimension of su(2)) roots: λ = 0, λ = ±ia,a2 = +a2

1 + a22 + a2

3 .

Page 156: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

142 Structure theory for simple Lie algebras

Example For Y = ∑bi Yi ∈ su(1, 1) the defining 2 × 2 matrix representation

def(Y ) and the regular 3 × 3 matrix representation Reg(Y ) are

def(Y ) = 1

2

[ib3 b1 − ib2

b1 + ib2 −ib3

]

Reg(y) = 0 −b3 −b2

b3 0 b1

−b2 b1 0

(9.10)

The secular equation for the regular representation is

‖ Reg(Y ) − λI3 ‖ = (−λ)3 + (−λ)(−b21 − b2

2 + b23) = 0

= (−λ)(λ2 + φ2(b)) (9.11)

φ2(b) = −b21 − b2

2 + b23

By comparing the secular equations for su(1, 1) and su(2), it is clear that thecoefficients of the respective secular equations are “analytic continuations” of eachother. That is, under rotation of some coordinates from the real to the imaginary axis,(a1, a2, a3) → (ib1, ib2, b3), the coefficient φ2(a) = a2

1 + a22 + a2

3 of the secularequation for su(2) maps to φ2(b) = −b2

1 − b22 + b2

3 for su(1, 1). This same rotationof coordinates maps the Lie algebra su(2) to the Lie algebra su(1, 1).

The secular equation was written down for the regular representation, since itcan always be constructed from the Lie algebra. A secular equation could just aseasily be written down for any matrix representation of the Lie algebra. We are byand large interested in studying matrix Lie algebras, so secular equations can bewritten directly for the defining matrix algebras. There is a great deal of utility inthis approach. First, the matrices in a matrix algebra are almost always smaller –much smaller – than the matrices of its regular representation. Second, a matrixLie algebra contains at least as much information (certainly not less) as its regularrepresentation.

Example The secular equation for the defining 2 × 2 matrix representation ofsu(2) in (9.8) is

‖ def(X ) − λI2 ‖ = λ2 + (12

)2(+a2

1 + a22 + a2

3) = 0 (9.12)

Similarly, the secular equation for the defining 2 × 2 matrix representation ofsu(1, 1) in (9.10) is

‖ def(Y ) − λI2 ‖ = λ2 + (12

)2(−b2

1 − b22 + b2

3) = 0 (9.13)

For each algebra the functional forms of the nonzero coefficient φ2 in the secularequation are the same in the defining and the regular matrix representations.

Page 157: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.4 Invariant operators 143

9.3 Rank

The rank, l, of a Lie algebra is the number of independent coefficients in the secularequation of its regular representation, Reg. Since the number of independent rootsof the secular equation is equal to the number of independent coefficients φ j (zi ),the rank is also the number of independent roots of the secular equation. The rankis always smaller than the dimension of the Lie algebra, since there is always atleast one zero root (put X = Z in (9.1)). For simple Lie algebras of dimension N ,l2 ∼ N , so describing commutation relations in terms of rank rather than dimensioneffects a big simplification.

9.4 Invariant operators

If φ j (zi ) is a coefficient in the secular equation, the operator obtained by the sym-metrized substitution

zi → Xi φ j (zi ) −→ φ j (Xi ) (9.14)

is an invariant operator: it commutes with all elements of the Lie algebra

[φ j (Xi ), Xk] = 0 (9.15)

The number of independent invariant operators (“Casimir invariants”) is at leastequal to the rank of the algebra, and may be as large as the dimension for a com-mutative algebra, where all N operators mutually commute.

Example From the secular equation (9.9) for su(2) we immediately construct asecond order invariant operator that commutes with all operators in su(2)

φ2(a) = +a21 + a2

2 + a23 −→ φ2(X ) = +X2

1 + X22 + X2

3 (9.16)

A similar calculation for su(1, 1) gives

φ2(b) = −b21 − b2

2 + b23 −→ φ2(Y ) = −Y 2

1 − Y 22 + Y 2

3 (9.17)

Notice that the Casimir invariant operator for su(1, 1) is the analytic continuationof the Casimir invariant operator for su(2).

If m is some matrix Lie algebra of n × n matrices, then any operator in m can bewritten as a linear combination of matrices Mi j , with entry +1 at the intersectionof the i th row and j th column and zeroes elsewhere

M :∑

aij M j

i (9.18)

The coefficients of the secular equation for this algebra of n × n matrices are shownin Fig. 9.1.

Page 158: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

144 Structure theory for simple Lie algebras

fj(ars) = 1

j!(n − j )!

r1

r2

rj

tj+1 tj+1

tn

ar1s1

ar2s2

arisi

tn

•••

•••

•••

•••

•••

•••

s1

s2

sj

Figure 9.1. Coefficients in the secular equation are expressed in terms of the fullyantisymmetric Levi–Civita tensor on n symbols.

fj(Xrs) = 1

j!(n − j )!

Xr1s1

Xr2s2

Xrjsj

•••

•••

•••

•••

•••

•••

r1

tj+1

tn tn

tj+1

r2

rd sj

s1

s2

Figure 9.2. Invariant operators φ j (X ) expressed in terms of the fully antisymmetricLevi–Civita tensor on n symbols. The invariant operators are obtained by replacingthe coordinates ar

s by the operators Xrs in the coefficientsφ j of the secular equation.

Here the general element in the Lie algebra is X = ars Xr

s .

In this figure the vertical symbol is the Levi–Civita symbol for n dimensions(e.g., in R3, = εi jk = +1 for (i jk) a cyclic permutation of (123), −1 for a cyclicpermutation of (321), and zero otherwise). Contracted dummy indices are connectedby lines. The invariant operators for the Lie algebra of n × n matrices are shown inFig. 9.2. Contracted dummy indices are connected by lines. The invariance of theseoperators depends only on the commutation relations of the Lie algebra. Therefore

Page 159: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.4 Invariant operators 145

these invariant operators φ j (Xrs) remain invariant when the matrices are replaced

by any set of operators (see Chapter 6) with isomorphic commutation relations.

Example The orthogonal groups O(n) and their subgroups SO(n) have Lie alge-bras that consist of n × n antisymmetric matrices. The secular equation is far easierto compute in the defining representation of n × n antisymmetric matrices than inthe d × d (the dimension of so(n) is d = n(n − 1)/2) regular matrix representation

‖ def(X ) − λIn ‖ =∑

(−λ)n− jφ j (X ) = 0 (9.19)

Further, the secular equation for a matrix and its transpose are equal, but since theLie algebra consists of antisymmetric matrices, def(X )t = −def(X ), and we find

φ j (X ) = φ j (−X ) = (−) jφ j (X ) (9.20)

As a result, the only nonzero coefficients in the secular equation for so(n) are theeven coefficients. Therefore the algebra so(n) has rank [n/2].

Example The second order Casimir invariant operator for so(n) is obtained bysetting j = 2 in Fig. 9.2 for the generators Xi j of SO(n). Since Xi j = −X ji , it ispossible to “rearrange” the contractions between the operators and the two differentantisymmetric tensors, as shown in Fig. 9.3.

As a result, we can write for so(n)

C2[so(n)] =∑

X2i j (9.21)

Similar “rearrangement” arguments can be used to simplify the expressions forhigher order Casimir invariant operators. For example, for so(5) the fourth order

r1

r2

t3

tn tn tn

t3 t3

s2 s1

s1 r1 r2

s2

t3

tn

Xr1s1 Xr1

s1 Xr2s2Xr2

s2

•••

•••

•••

•••

•••

•••

Figure 9.3. If the operators X are antisymmetric, Xrs = −Xs

r , contractions in theexpressions for the Casimir operators can be rearranged as shown.

Page 160: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

146 Structure theory for simple Lie algebras

Casimir operator is

C4[so(5)] =5∑

i=1

v2i (9.22)

where the components of the five-vector v are vm = εi jklm Xi j Xkl , for example

v5 = εi jkl5 Xi j Xkl ∼ X12 X34 − X13 X24 + X14 X23 (9.23)

For so(4) the fourth order Casimir is a perfect square

C4[so(4)] = (εi jkl Xi j Xkl)2 ∼ (X12 X34 − X13 X24 + X14 X23)2 (9.24)

In general, for n even, the nth order Casimir invariant operator for so(n) is a perfectsquare. Its square root, of order n/2, should be taken as an appropriate functionallyindependent Casimir operator. The existence of two second-order Casimir operatorsfor so(4) is another piece of evidence that this algebra is semisimple rather thansimple.

9.5 Regular elements

It is useful to choose elements Z in the Lie algebra (Eq. (9.1)) that maximizethe amount of information that can be extracted from the secular equation. (Atthe opposite extreme, the choice Z = 0 is not clever since all X obey the sameeigenvalue equation [Z , X ] = 0X .)

We do this by choosing a Z for which we:

1. maximize the number of nonzero roots;2. minimize the degeneracy of each nonzero root;3. minimize the degeneracy of the zero root.

Such elements Z in the Lie algebra can always be found. In fact, this is a ‘generic’property. ‘Almost all’ elements Z in the Lie algebra have this property.

As an example of this eigenoperator decomposition we treat again the six-dimensional algebra of two photon operators spanned by n + 1

2 = 12{a, a†},

a†2, a†, I = [a, a†], a, a2. A useful choice for Z is

Z = z1

(n + 1

2

)+ z2 I (9.25)

The secular equations for the 6 × 6 regular representation and the 4 × 4 definingmatrix representations are

regular representation (λ)2(λ + 2z1)(λ − 2z1)(λ + z1)(λ − z1) = 0

defining representation (λ)2(λ + z1)(λ − z1) = 0(9.26)

Page 161: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.6 Semisimple Lie algebras 147

–2 –1 00 0 0 0 0

1 2

a2 a n + a a

z2

z1212I

Figure 9.4. The six operators in the two-photon algebra can be organized accordingto their roots, which are eigenvalues of a secular equation. Two operators have zeroroot.

Each secular equation has only one independent coefficient φ. The nontrivial coef-ficients of the secular equation for the regular representation are

φ2(z1, z2) = −5z21

φ4(z1, z2) = 4z41 = 4 (−φ2(z1, z2)/5)2

(9.27)

For the 4 × 4 matrix representation the one nontrivial coefficient is

φ2(z1, z2) = −z21 (9.28)

This is a rank-one Lie algebra since there is only one functionally independentcoefficient in the secular equation. The roots of the secular equation of the regularrepresentation are ±2z1, ±z1, 0, 0 and the commutation relations can be summa-rized in the ‘root space diagram’ shown in Fig. 9.4.

From this diagram we learn[n + 1

2 , X (k,0)] = k X (k,0)[

I, X (k,0)] = 0X (k,0)

(9.29)

where X (2,0) = a†2, X (1,0) = a†, X (0,0) = n + 12 I, I , X (−1,0) = a, X (−2,0) = a2. We

also see that if k, l ∈ {−2, −1, 0, +1, +2}[X (k,0), X (l,0)] ∼ X (k+l,0) (9.30)

if k + l is in the set {−2, −1, 0, +1, +2} and zero otherwise. If k + l = 0 thecommutator is some linear combination of the two operators that span the subspace(0, 0): n + 1

2 and I .

9.6 Semisimple Lie algebras

For simple and semisimple Lie algebras the Cartan–Killing inner product is non-singular. When this inner product is nonsingular, the decomposition of the algebrainto its subspaces, one for each root of the secular equation, has additional prop-erties. We list these properties here, providing only an occasional proof. A more

Page 162: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

148 Structure theory for simple Lie algebras

complete treatment of this, the most beautiful part of Lie algebra theory, can befound elsewhere (Gilmore, 1974b; Helgason 1978).

9.6.1 Rank

For semisimple Lie algebras the rank l is:

(i) the number of independent coefficients in the secular equation;(ii) the number of independent roots α1, α2, . . . , αl of the secular equation; these l inde-

pendent roots can be collected together as the components of an l-dimensional vector(α1, α2, . . . , αl) in a root space;

(iii) the dimension of the subspace V0 (which is a subalgebra) of the root space;(iv) the number of independent invariant operators (Casimir operators).

9.6.2 Properties of roots

Further, the roots have the following properties.

(i) If α and β are roots with subspaces Vα and Vβ in the Lie algebra, then

[Vα, Vβ] ⊂ Vα+β (9.31)

That is, the commutator of any vector in Vα with any vector in Vβ is a vector in Vα+β .If α + β is not a root, the commutator vanishes.

(ii) The l basis vectors H1, H2, . . . , Hl in the l-dimensional subspace V0 commute:

[Hi , Hj ] = 0 1 ≤ i, j ≤ l (9.32)

(iii) Each subspace Vα (α �= 0) is one dimensional. Therefore each subspace Vα is spannedby an operator Eα that can be labeled by the root α. As a result (i.e., [V0, Vα] ⊂ Vα),each Hi maps Eα into a multiple of itself

[Hi , Eα] = αi Eα

(9.33)[H, Eα] = α Eα

(iv) If α is a root, −α is a root. If α is a root and cα is a root, then |c| = 1. Thus, nonzeroroots occur in pairs of opposite sign. In addition, the only root collinear with 0 and α

is −α.(v) The commutator of Eα and E−α is in V0, so can be expanded as a linear superposition

of the Hi :

[Eα, E−α] = αi Hi (9.34)

(vi) An inner product relating αi and α j by αi = hi jα j can be introduced in this root space

(α, β) = αiβi = α jβ j = αi h

i jβ j (9.35)

Page 163: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.6 Semisimple Lie algebras 149

This inner product is positive-definite. If the lengths of the roots are normalized sothat ∑

α �=0

αiα j = δi j or∑α �=0

α · α = rank = l (9.36)

then hi j = δi j and we can identify αi with αi : αi = αi .(vii) It remains to compute

[Eα, Eβ] →

0 α + β not a rootNα,β Eα+β α + β a rootα · H α + β = 0

Three cases arise, as indicated. The only detail remaining is to determine the coefficientNα,β when α + β is a nonzero root.

9.6.3 Structure constants

To compute these coefficients we first apply the Jacobi identity to the generatorsEα, Eβ, Eγ of three nonzero roots that sum to zero

[[Eα, Eβ], Eγ ] + [[Eβ, Eγ ], Eα] + [[Eγ , Eα], Eβ] = 0 (9.37)

From this we derive the symmetry

when α + β + γ = 0then αNβ,γ + βNγ,α + γ Nα,β = 0and Nβ,γ = Nγ,α = Nα,β

(9.38)

Next we compute a recursion relation involving these coefficients. This is done byembedding β in a chain of roots involving α additively, as shown in Fig. 9.5. In thischain

β − mα β − (m − 1)α · · · β β + α · · · β + nα

are all roots but

β − (m + 1)αβ + (n + 1)α

(9.39)

are not roots. By applying the Jacobi identity to roots α, β + kα, and −α we obtainthe recursion relation

N 2α,β+(k−1)α = N 2

α,β+kα + α · (β + kα) (9.40)

Page 164: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

150 Structure theory for simple Lie algebras

0

0

0 0 0 0 0

b−ma b+a b+nab

b

a

••• •••

Figure 9.5. α chain containing β. This chain is used to compute coefficients Nα,β

in commutators [Eα, Eβ] = Nα,β Eα+β .

This recursion relation satisfies the boundary conditions

N 2−α,β−mα = 0

N 2+α,β+nα = 0

(9.41)

The initial condition Nα,β+nα = 0 leads to

N 2α,β+(k−1)α = (n − k + 1)

(α · β + 1

2(n + k)α · α

)(9.42)

The other boundary condition N 2−α,β−mα = N 2

α,β−(m+1)α = 0 leads to

N 2α,β−(m+1)α = (n + m + 1)

(α · β + 1

2(n − m)α · α

)= 0 (9.43)

9.6.4 Root reflections

From this we extract the following information

(i) N 2α,β+kα = (n − k)(m + k + 1)(α · α)/2 ≥ 0. We use this expression because it shows

clearly how the boundary conditions are imposed. We note that α · β > 0 when m −n > 0 and α · β < 0 when m − n < 0.

(ii) The inner products obey

− n ≤ 2α · β

α · α= −n + m ≤ m (9.44)

where m and n are nonnegative integers.(iii) If β is a root, then

β ′ = β − 2β · α

α · αα (9.45)

is also a root. This root is obtained by reflecting β in the hyperplane orthogonal to α.

All of the rank-two root space diagrams are shown in Fig. 9.6. There the sym-metries of root spaces under reflection and rotation may be seen.

Page 165: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.7 Canonical commutation relations 151

a2 = e1 + √

3/2e2-a1 = - e1 + √

3/2e2

-a2 = - e1 - √3/2e2

-a1 = -√3/2e1 + e2√

3/2e1 + e2

-a1 - a2 = -√3/2e1 + e2

-2a1 - 3a2 = -√3e1

-a1 - 2a2 = -√3/2e1 - e2

-a1 - 3a2 = -√3/2e1 - e2 a1 = √3/2e1 - e2

a1 + a2 = √3/2e2 - e2

a1 + 3a2 =

a1 + 2a2 = √3/2e1 + e2

2a1 + 3a2 = √3e1

-e2 = -a2

e2 = a2

a1 = e1 − √

3/2e2

a1 + a2 = e1-a1 - a2 = -e1

a2 = 2e2

a2 = e2

-a1 = -e1 + e2

-a1 = -e1 + e2

-a1 - a2 = -e1

-a1 = -e1 + e2

-a2 = -e1 - e2

-a1 - 2a2 = -e1 - e2

-a1 - a2 = -e1 - e2

-2a1 - a2 = -2e1

a1 + 2a2 = e1 + e2

a1 + a2 = e1

a1 = e1 - e2

-a2 = -2e2

-a2 = -e2

a2 = e1 + e2

a1 = e1 - e2

a1 = e1 - e2

2a1 + a2 = 2e1

a1 + a2 = e1 + e2

12

32

12

12

32

32

12

12

32

12

12

12

Figure 9.6. Two-dimensional root space diagrams. Top: A2, B2, C2. Bottom: D2, G2.

9.7 Canonical commutation relations

The root space diagram encapsulates in a very convenient way all the structureconstants of a semisimple Lie algebra. The basis vectors are the l (l is the rank)operators H = (H1, H2, . . . , Hl) and the “shift” operators Eα, one correspondingto each nonzero root. The root vector α = (α1, α2, . . . , αl) has l components. Thecommutation relations are

[Hi , Hj ] = 0[H, Eα] = αEα

[Eα, Eβ] = α · H α + β = 0= Nα,β Eα+β α + β �= 0, a root= 0 α + β not a root

(9.46)

Page 166: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

152 Structure theory for simple Lie algebras

a2 = 2e2

-a2 = -2e2

-a1 = -e1+ e2

a1 = e1- e2-a1- a2 = -e1- e2

a1+a2 = e1+ e2

-2a1- a2 = -2e1 2a1+ a2 = 2e1

Figure 9.7. Root space C2. The roots are expressed in terms of a Cartesian(orthogonal) set e1, e2 and a fundamental set α1, α2.

These commutation relations are subject to:

normalization∑α �=0

α · α = rank = l

Jacobi Nα,β = Nβ,γ = Nγ,α α + β + γ = 0

symmetry Nα,β = −N−α,−β = −Nβ,α

(9.47)

As an example of the rigidity of these commutation relations, we write down thecommutation relations described by the rank-two root space C2 shown in Fig. 9.7.If we choose orthogonal vectors e1 and e2 in a two-dimensional Euclidean space,the nonzero roots for C2 are ±2e1, ±2e2, ±e1 ± e2. The 10 basis vectors in the Liealgebra are Hi , i = 1, 2, and Eα, with α the eight nonzero roots. We normalizethese roots by

∑α · α = 2 so that

(ei , e j ) = 1

12δi j (9.48)

Under this normalization condition the commutation relations are given inTable 9.1. All commutators not explicitly shown in this table vanish. For this rank-two algebra two phases may be set arbitrarily. The two commutators at which thephase choices have been made are indicated by * in Table 9.1. Both choices are+1. Other phase choices (−1) give isomorphic commutation relations.

Page 167: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.8 Conclusion 153

Table 9.1. Nonzero commutators for Lie algebraswith root space C2

[H1, H2] = 0[H, E±2e1

] = (±2/√

12, 0)E±2e1[H, E±2e2

] = (0, ±2/√

12)E±2e2[H, E±e1±e2

] = (±/√

12, ±/√

12)E±e1±e2[E+2e1 , E−2e1

] = (2/√

12)H1[E+2e2 , E−2e2

] = (2/√

12)H2[E±e1±e2 , E−(±e1±e2)

] = (1/√

12)(±H1 ± H2)[E+2e1 , E−(e1+e2)

] = ∗(1/√

6)Ee1−e2[E−e1+e2 , E−e1−e2)

] = (1/√

6)E−2e1[E−e1−e2 , E+2e1

] = (1/√

6)E+e1−e2[E−2e1 , Ee1−e2

] = (−1/√

6)E−e1−e2[E+e1−e2 , E+e1+e2

] = (−1/√

6)E+2e1[E+e1+e2 , E−2e2

] = (−1/√

6)E+e1−e2[E+2e2 , E−e1−e2

] = ∗(1/√

6)E−e1+e2[E−e1−e2 , E+e1−e2

] = (1/√

6)E−2e2[E+e1−e2 , E+2e2

] = (1/√

6)E+e1+e2[E−2e2 , E+e1+e2

] = (−1/√

6)E+e1−e2[E+e1+e2 , E−e1+e2

] = (−1/√

6)E+2e2[E−e1+e2 , E−2e2

] = (−1/√

6)E−e1−e2

9.8 Conclusion

The structure constants for a Lie algebra have been reduced to a canonical formby studying the properties of its regular representation. Using the Cartan–Killinginner product it is possible to determine the semisimple part of a Lie algebra andits complement, the maximal solvable invariant subalgebra. An eigenvalue decom-position can be used to put the commutation relations of the semisimple part intoa standard form.

When the algebra is simple or semisimple the commutation relations are elegantlysummarized by a root space diagram. This is a simple geometric structure in aEuclidean space of dimension l, where l is the rank of the Lie algebra. The rank is:

(i) the number of functionally independent coefficients in the secular equation;(ii) the number of independent roots of the secular equation;

(iii) the number of Casimir invariant operators;(iv) the dimension of the root space diagram;(v) the number of mutually commuting operators in the Lie algebra.

We have illustrated how to extract commutation relations from a root space diagramfor C2.

Page 168: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

154 Structure theory for simple Lie algebras

In classifying simple Lie algebras by their root space diagram, we were forcedto extend the field of the Lie algebra from the real to the complex numbers in orderto guarantee that the secular equation had as many roots as basis vectors in the Liealgebra. In doing so, we have introduced a situation in which different algebrashave the same complex extension (e.g., sl(2; R) and so(3) have common complexextension sl(2; C)). Root spaces classify commutation relations of these complexLie algebras. Root spaces also summarize the commutation relations for the variousreal subalgebras of these complex algebras – some roots αi and structure constantswill be imaginary. However, determining the real subalgebras of a complex Liealgebra is a not entirely trivial task to which we return in Chapter 10.

9.9 Problems

1. Construct the regular representation for the two-photon operator algebra: 12 {a†, a},

a†2, a†, I, a, a2. Determine the secular equation for this matrix. Determine the rank

of this Lie algebra.

2. Construct the 4 × 4 defining matrix representation and the 6 × 6 regular matrixrepresentation of the Lie algebra so(4). Construct the secular equation. This equa-tion factors into two independent equations, each with one independent coefficientφ. Both are quadratic. Construct these coefficients. Use these to construct the twoquadratic invariant operators on this semisimple Lie algebra. Show that in the canon-ical basis Xi j = xi∂ j − x j∂i (1 ≤ i < j ≤ 4) these operators are C2 = ∑

i< j X2i j and

C ′2 = X12 X34 − X13 X24 + X14 X23.

3. The Lie algebra su(4) has a 4 × 4 defining matrix representation and a 15 × 15 regularmatrix representation. Show that the secular equation of the regular representationhas just three independent coefficients. Do this by showing that there is a relationbetween the secular equation for the regular representation and the secular equationfor the defining matrix representation. What is this relation? The three independentcoefficients in the secular equation for the defining representation are of degree 2, 3,4. Construct the invariant operators on su(4) of degree 2, 3, and 4.

4. For so(2n + 1) the invariant operators (Casimir operators) are of degree 2, 4, . . . , 2n.This is true also for so(2n), with one difference: the invariant operator of degree 2nis a perfect square. Show that its square root, an invariant operator of degree n, isC ′

n = εi1i2···i2n Xi1i2 Xi3i4 · · · Xi2n−1,i2n . Explicitly write out C ′2 for so(4) and C ′

3 for so(6).Compare your results with Fig. 9.3.

5. In Chapter 11 we will show that su(4) = so(6). Both Lie algebras have invariantoperators of degree 2, 3, 4. Constuct the isomorphism between these Lie algebras andtheir invariant operators.

6. Summarize the commutation relations satisfied by the algebra of photon operators fortwo modes. This algebra is ten dimensional. It contains the four operators a†

i a j + 12δi j

Page 169: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.9 Problems 155

(1 ≤ i, j ≤ 2) and the two pairs of three operators a†i a†

j and ai a j (ai a j = a j ai ).Show that this root space diagram is isomorphic to C2, shown in Fig. 9.7. Theidentification is: a†

i ai + 12 ↔ Hi , a†

i a†j ↔ E+ei +e j (i �= j), a†

i a j ↔ E+ei −e j (i �= j),

ai a j ↔ E−ei −e j (i �= j), a†i a†

i ↔ E+2ei , ai ai ↔ E−2ei .

7. Repeat Problem 6 for the algebra of two fermion operators for two modes. This algebrais six dimensional. Show that the resulting root space diagram is D2 (Fig. 9.6). Whythe difference? (Hint: f †i f †i = 0.)

8. The Lie algebras su(2) and so(3) are isomorphic. In fact, the latter is the reg-ular representation for the former. Choose X, Y ∈ su(2) and compute (X, Y ) =tr [def(X )def(Y )] by taking the trace of the 2 × 2 matrices in su(2) that representX and Y . Now compute the inner product using instead the Lie algebra so(3), that is,the regular matrix representation of su(2): (X, Y ) = tr [Reg(X )Reg(Y )]. Show thatthe two results are proportional. What is the proportionality constant?

9. Choose two vectors X and Y in the Lie algebra su(n). Compute their inner product inthe n × n defining matrix representation and in the (n2 − 1) × (n2 − 1) regular matrixrepresentation. The two inner products are proportional. What is the proportionalityconstant? (Hint: set Y = X and choose a special X , for example X = H1.)

10. Express the Lie algebras spanned by the following ten sets of operators in canonicalform (b boson operators; f fermion operators; 1 ≤ i, j ≤ N ):

b†i b j b†

i b j + 12δi j , b†

i b†j , bi b j b†, b†b, b†bb b, b†b, b†b†b

f †i f j f †i f j + 12δi j , f †i f †j , fi f j x, x∂, x∂2 ∂, x∂, x2∂

xi∂ j x i∂ j + 12δi j , xi x j , ∂i∂ j

11. Compute R = 12

∑α>0 α, half the sum over all positive roots, in each of the simple Lie

algebras. This vector plays a major role in computing the spectrum of the quadraticCasimir operator for each of the irreducible representations of each of the simple Liealgebras. For example, for Bn , Ri = 1

2 (2n + 1) − i and the spectrum is

C2(M) = (M + R) · (M + R) − (R) · (R) = M · M + M · 2R

where M is the highest weight in the representation. For the (2 j + 1) dimensionalrepresentation of so(3), M = j , R = R1 = 1

2 and C2( j) = ( j + 12 )2 − (0 + 1

2 )2 =j( j + 1).

12. The Weyl group of reflections for a simple Lie algebra is generated by reflections inplanes orthogonal to all the nonzero roots.a. Show that the Weyl group for An−1 is of order n!, the Weyl group for Dn is of

order 2n−1n!, and the Weyl groups for Bn and Cn are of order 2nn!.b. Show that the product of the degrees of the functionally independent coefficients

in the secular equation for each of these algebras is equal to the order of the Weylgroup.

Page 170: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

156 Structure theory for simple Lie algebras

c. Show that the product of the degrees of the Casimir operators for each of thesealgebras is equal to the order of the Weyl group.

13. Compute the dimensions of each of the classical Lie algebras as a function of therank, and show

dim (g)

{rank (g)}2 =

ratio n → ∞ algebra

1 + 2n 1 An

2 − 1n 2 Dn

2 + 1n 2 Bn, Cn

14. Multilinear operations can be defined on a matrix Lie algebra by

(A1, A2, . . . , Ar )Reg = tr Reg(A1)Reg(A2) · · · Reg(Ar )

A multilinear operator can be defined similarly in other representations as well: forexample, the defining representation.a. Show

(A1, A2, . . . , Ar )Reg

fr (Reg)= (A1, A2, . . . , Ar ) = (A1, A2, . . . , Ar )�

fr (�)

where � is some irreducible representation of the Lie algebra. This relation definesthe index fr (�).

b. Show

fr (�)

fr (def)= tr (�(A))r

tr (def(A))r= dim(�) Cr (�)

dim(def) Cr (def)

In this expression Cr is the value of the r th Casimir invariant in the representationindicated.

c. For su(2)

f2( j) = 16 {(2 j)(2 j + 1)(2 j + 2)} f2( j = 1

2 )

15. The matrix Lie algebras so(2n), so(2n + 1), sp(2n) have the form∑

i j ai j Mi j , whereMi j is a square matrix with +1 in the i th row and j th column and zeroes elsewhere,M is 2n × 2n for so(2n), sp(2n) and (2n + 1) × (2n + 1) for so(2n + 1), and suitablereality restrictions are imposed on the coefficients ai j .a. What are the conditions on ai j for each matrix Lie algebra?b. Write down the coefficients φr (ai j ) that occur in the secular equation for each of

these matrix Lie algebras.c. Show that all odd coefficients φr (ai j ) vanish for each of these matrix Lie algebras.d. Express the even coefficients in terms of the Levi–Civita skew tensors εi1i2···il

(l = 2n, 2n + 1, 2n).e. Show that the even coefficients are all functionally independent.f. Conclude that each of these three matrix Lie algebras has rank n.g. Show that φ2n(ai j ) is a perfect square for so(2n); write down its square root; show

that it is of degree n.

Page 171: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

9.9 Problems 157

16. Replace the scalar parameters θi in the 3 × 3 regular representation of so(3) or su(2)by the corresponding operators:

M = 0 θ3 −θ2

−θ3 0 θ1

θ2 −θ1 0

−→ M =

0 J3 −J2

−J3 0 J1

J2 −J1 0

a. Show tr M2 = −2θ · θ .b. Show tr M2 = −2J · J.c. Show

[J, tr M2

] = 0.d. Show tr M2n+1 = 0 and tr M2n = (−2)n(J · J)n .

17. Casimir covariants A semisimple Lie algebra has basis vectors Xi that satisfycommutation relations [Xi , X j ] = C k

i j Xk . There are two linear vector spaces, V (1)

and V (2), that carry irreducible representations of this Lie algebra: Xi → �(1)(Xi ) =Yi and Xi → �(2)(Xi ) = Zi . Show that the Casimir covariant gi j Yi Z j commutes with(Y + Z )k (more accurately, with �(1)(Xi ) ⊗ Idim V (2) + Idim V (1) ⊗ �(2)(Xi )).

18. The Cayley–Hamilton theorem guarantees that a polynomial or analytic function ofa square n × n matrix X can be expressed as a finite polynomial in the first n powersof X , starting at X0 = In:

f (X ) = f0 In + f1 X1 + f2 X2 + · · · + fn−1 Xn−1 =j=n−1∑

j=0

f j X j

The challenge is to compute the coefficients f j in this expansion.a. Show that each coefficient f j is a function of the invariants of the matrix X .b. Show that the invariants can variously be chosen as either the independent

eigenvalues λi (X ) or the independent coefficients φi (X ) of the secular equationfor X .

c. Show that the Cayley–Hamilton expansion simplifies considerably if the matrix Xis chosen as generic diagonal. In fact it reduces to

1 λ1 λ21 · · · λn−1

1

1 λ2 λ22 · · · λn−1

2

......

.... . .

...

1 λn λ2n · · · λn−1

n

f0

f1

...

fn−1

=

f (λ1)

f (λ2)

...

f (λn)

The square matrix on the left is a vanderMonde matrix.d. Compute eiφ Jz for the (2 j + 1) dimensional matrix representations of SU (2) by

computing the vanderMonde matrices. Show that for j = 12 , 1, 3

2 , 2 the resulting

Page 172: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

158 Structure theory for simple Lie algebras

matrices are

[1 1

2

1 − 12

] 1 1 1

1 0 01 −1 1

1 32 ( 3

2 )2 ( 32 )3

1 12 ( 1

2 )2 ( 12 )3

1 − 12 (− 1

2 )2 (− 12 )3

1 − 32 (− 3

2 )2 (− 32 )3

and

1 2 4 8 16

1 1 1 1 1

1 0 0 0 0

1 −1 1 −1 1

1 −2 4 −8 16

f0

(iφ)1 f1

(iφ)2 f2

(iφ)3 f3

(iφ)4 f4

=

e2iφ

eiφ

1

e−iφ

e−2iφ

e. Invert each of these van der Monde matrices and determine the functions f j (φ) inthe expansions of eX for X ∈ su(2). In particular, show

Representation

12 1 3

2 2(iφ) j f j 2 3 4 5

f0 cos(φ/2) 1 98 cos( φ

2 ) − 18 cos( 3φ

2 ) 1

(iφ)1 f1 2i sin(φ/2) i sin(φ) 9i4 sin( φ

2 ) − i12 sin( 3φ

2 ) i3 sin(φ) − i

6 sin(2φ)

(iφ)2 f2 cos(φ)−1 − 12 cos( φ

2 ) + 12 cos( 3φ

2 ) − 54 + 1

3 cos(φ)− 112 cos(2φ)

(iφ)3 f3 −i sin( φ

2 ) + i3 sin( 3φ

2 ) − i3 sin(φ) + i

6 sin(2φ)

(iφ)4 f414 − 1

3 cos(φ) + 112 cos(2φ)

f. Recover the two well-known expansions for j = 12 and l = 1:

j = 1

2eX = cos

2

)I2 + sin(φ/2)

φ/2X

l = 1 eX = I3 + sin(φ)

φX + 1 − cos(φ)

φ2X2

g. Show that the (2 j + 1) × (2 j + 1) real antisymmetric matrix X ∈ su(2) and itsinvariant φ are related by (cf. Problem 9.14)

tr X2 = − j( j + 1)(2 j + 1)

3φ2

Page 173: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10

Root spaces and Dynkin diagrams

In the previous chapter the canonical commutation relations for semisim-ple Lie algebras were elegantly expressed in terms of roots. Althoughroots were introduced to simplify the expression of commutation rela-tions, they can be used to classify Lie algebras and to provide a completelist of simple Lie algebras. We achieve both aims in this chapter. However,we use two different methods to accomplish this. We classify Lie alge-bras by specifying their root space diagrams. This is a relatively simplejob using a “building up” approach, adding roots to rank l root space dia-grams to construct rank l + 1 root space diagrams. However, it is not easyto prove the completeness of root space diagrams by this method. Com-pleteness is obtained by introducing Dynkin diagrams. These specify theinner products among a fundamental set of basis roots in the root spacediagram. In this approach completeness is relatively simple to prove,while enumeration of the remaining roots within a root space diagram isless so.

10.1 Properties of roots

In an effort to cast the commutation relations of a semisimple Lie algebra into aneigenvalue-eigenvector format, a secular equation was constructed from the regularrepresentation. The rank of an algebra is, among other things:

(i) the number of independent functions in the secular equation;(ii) the number of independent roots of the secular equation;

(iii) the number of mutually commuting operators in the Lie algebra;(iv) the number of invariant operators that commute with all elements in the Lie algebra

(Casimir operators);(v) the dimension of the positive-definite root space that summarizes the commutation

relations.

159

Page 174: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

160 Root spaces and Dynkin diagrams

In terms of the root space decomposition the commutation relations of the l (= rank)operators Hi and the shift operators Eα are

[Hi , Hj ] = 0[H, Eα] = αEα

[Eα, Eβ] = α · H α + β = 0= Nα,β Eα+β α + β �= 0 a root= 0 α + β not a root

(10.1)

The coefficients Nα,β are defined in terms of the nonnegative integers m, n by

N 2α,β+kα = (n − k)(m + k + 1)(α · α)/2 (10.2)

where β + kα is a root only for k = −m, . . . , +n. The roots are normalized by∑α �=0

α · α = rank = l (10.3)

In deriving the value for the structure constant Nα,β we observed

2(α · β)

α · αis an integer

β ′ = β − 2(α · β)

α · αα is a root

(10.4)

The root β ′ is obtained by reflecting β in the hyperplane orthogonal to α. These twoobservations are all that is required to construct root space diagrams of any rank.

If we write 2(α · β)/(α · α) = n and 2(α · β)/(β · β) = n′, where n and n′ areintegers, then by the Schwarz inequality

0 ≤ cos2(α, β) =(

α · β

α · α

) (α · β

β · β

)= n

2

n′

2≤ 1 (10.5)

These two results severely constrain the possible angles between two roots and theirrelative length. The results are summarized in Table 10.1.

10.2 Root space diagrams

The procedure for constructing root space diagrams in spaces of any dimension(= rank) is simple. Begin with the rank-one root space. It is unique, with nonzerovectors ±e1. To construct rank-two root spaces, add a noncollinear vector to thisroot space in such a way that the constraints exhibited in Table 10.1 are obeyed,and complete the root space by reflections in hyperplanes orthogonal to all roots.Only a small number of rank-two root spaces can be constructed in this way. Theseare A2, B2 = C2, D2 and G2, as shown in Fig. 9.6.

Page 175: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.2 Root space diagrams 161

Table 10.1. Properties of roots in a root space diagram

cos2(α, β) θ (α, β) n = 2α · β

α · αn′ = 2α · β

β · β

α · α

β · β= n′

n

2± π

2±2 ±2 1

3

4

π

2± π

3±3 ±1 3−1

±1 ±3 3+1

2

4

π

2± π

4±2 ±1 2−1

±1 ±2 2+1

1

4

π

2± π

6±1 ±1 1

20 0 –

Rank-three root spaces are constructed from rank-two root spaces by the sameprocess. A noncoplanar vector is added to a rank-two root space diagram subject tothe condition that all the requirements of Table 10.1 are satisfied. The resultant setof roots is completed by reflection in hyperplanes orthogonal to all roots. If any pairof roots in the completed diagram does not satisfy these conditions, the resultingdiagram is not an allowed root space diagram. The allowed rank-three root spacediagrams are shown in Fig. 10.1.

This procedure is inductive. All rank-l root space diagrams are constructed inthis way from rank-(l − 1) root space diagrams. We find by this building-up processthat there are four infinite series of root spaces with the following sets of roots:

Al−1 +ei − e j 1 ≤ i �= j ≤ l l − 1 ≥ 1Dl ±ei ± e j 1 ≤ i �= j ≤ l l > 3Bl ±ei ± e j , ±ei 1 ≤ i �= j ≤ l l > 2Cl ±ei ± e j , ±2ei 1 ≤ i �= j ≤ l l > 1

(10.6)

The subscript on the letter indicates the rank of the root space. It is easily seen thatDl is constructed by adding roots ±(ei + e j ) to Al−1, and Bl, Cl are constructed byadding roots ±ei , ±2ei to Dl . The root spaces Al−1, Dl, Bl, Cl are all inequivalentwith the following exceptions

A1 = B1 = C1

B2 = C2

A3 = D3

(10.7)

The root space D2 is semisimple

D2 = A1 + A1 (10.8)

Page 176: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

162 Root spaces and Dynkin diagrams

−e2 +e4

−e3 +e4

−e1 +e4

−e1+e2

−e1+e3−e2+e3

e1 −e3

e1−e2

e1−e4

e3−e4

e2−e4

e2−e3

A3

B3

D3

C3

−e1+e3

+e2+e3−e2+e3

+e1+e2

+e1−e2

−e1+e2

+e2−e3

+e1−e3

−e1−e3

−e1−e2

−e2−e3

+e1+e3

e3

−e1+e3

−e1+e2

+e1−e2

−e1−e3

+e2−e3

+e1−e3

−e2−e3

−e1−e2

−e1

−e2

−e3

e2

e1

+e2+e3

+e1+e3

−e2+e3

+e1+e2

−e1+e3

+e2+e3

+e2−e3

+e1+e2

−e1+e2

+e1−e3

+e1+e3

−e2+e3

+e1−e2

−e1−e2

−e1−e3−e2−e3

2e3

2e2

2e1

−2e1

−2e3

−2e2

Figure 10.1. Rank-three root space diagrams. Top: A3, D3. Bottom: B3, C3.

In addition to these four unending series there are five exceptional root spaces:

G2 +ei − e j

± [(ei + e j ) − 2ek

]1 ≤ i �= j �= k ≤ 3

F4 ±ei ± e j

±2ei

±e1 ± e2 ± e3 ± e4 1 ≤ i �= j ≤ 4

E6 ±ei ± e j12 (±e1 ± e2 ± e3 ± e4 ± e5) ±︸ ︷︷ ︸

even number of + signs

√3

4 e6 1 ≤ i �= j ≤ 5

Page 177: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.2 Root space diagrams 163

3A1=C1=B1

10B2 C2=

=

=

=

8A2

35A5

21B3

36B4 52F4 36C4

55C555B5

78B6

105B7

B8

105C7

78C6

21C3

48A6

63A7

80A8248E8120D8

D9

n(2n−1) n(n+1) n(2n+1) n(2n+1)

99A9

126E791D7

78E666D6

14G2

24E4

45D5

15D3

28D4

A3

E5

A4

Figure 10.2. Root spaces constructed by the building-up principle. There are fourinfinite series and five exceptional Lie algebras. The root spaces are organized byrank.

E7 ±ei ± e j

12 (±e1 ± e2 ± e3 ± e4 ± e5 ± e6)︸ ︷︷ ︸

even number of + signs

±√

24 e7 1 ≤ i �= j ≤ 6

E8 ±ei ± e j

12 (±e1 ± e2 ± e3 ± e4 ± e5 ± e6 ± e7 ± e8)︸ ︷︷ ︸

even number of + signs

1 ≤ i �= j ≤ 8 (10.9)

The building-up principle is summarized in Fig. 10.2. In this figure all root spacesare shown by rank. Arrows connect pairs related by the building-up principle.

Remark 1. The following classical groups are associated with these root spaces

Al−1 SU (l), SL(l; R), SU (p, q) p + q = lDl SO(2l), SO(p, q) p + q = 2lBl SO(2l + 1), SO(p, q) p + q = 2l + 1Cl Sp(l), Sp(p, q) p + q = l

(10.10)

Page 178: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

164 Root spaces and Dynkin diagrams

-a1 = -e1 + e2

-a2 = -e1 - e2

a2 = +e1 + e2 a2

-a1 a1

-a2a1 = e1 - e2

= +

Figure 10.3. The root space D2 consists of two orthogonal root subspaces. Bothdescribe the rank-one algebra A1.

Several different Lie groups (algebras) are associated with each root space.This comes about because root spaces classify complex Lie algebras. Recall thatextension of the field from real to complex numbers was required to guarantee thatthe secular equation could be solved. Each of the Lie algebras with the same rootspace has the same complex extension, for example, SL(l; C) for Al−1.

Remark 2. The root space D2 consists of two orthogonal sets of roots ±(e1 − e2)and ±(e1 + e2). The decomposition is shown in Fig. 10.3. Orthogonal root spacesdescribe semisimple Lie algebras. Root subspaces that do not have an orthogonaldecomposition describe simple Lie algebras. Complete reducibility of the regu-lar representation corresponds to decomposition of the root space into disjoint(orthogonal) root spaces and of the semisimple Lie algebras to simple invariantsubalgebras.

Remark 3. The root spaces B2 and C2 are equivalent, as is easily seen by ro-tation. The root space B2 describes SO(5) while C2 describes Sp(2) = U (2; Q),which has a four-dimensional matrix representation obtained by replacing eachquaternion by a complex 2 × 2 matrix. Therefore we should expect SO(5) tohave a four-dimensional “spinor” representation based on U (2; Q) in the same waythat SO(3) (B1) has a two-dimensional spinor representation based on U (1; Q) orSU (2) (A1).

Remark 4. In the building-up construction the roots in each root space diagram areexplicitly constructed. What is not immediately obvious is that there are no moresimple root spaces than those listed. How are we sure that there are no more thanfive exceptional root spaces? This question is not easy to resolve in the context ofroot space constructions alone. However, it is easily resolved by another algorithmicprocedure. This procedure yields a beautiful completeness argument. The price wepay is a somewhat greater difficulty in constructing the complete set of roots for

Page 179: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.3 Dynkin diagrams 165

these spaces. However, since they have been constructed above, this poses no severelimitation.

10.3 Dynkin diagrams

A plane through the origin of a root space diagram that does not contain any nonzeroroots divides the roots into two sets, one “positive,” the other negative (cf. Fig. 9.6).Among the positive roots the l nearest to this hyperplane in a rank-l root space arelinearly independent. They can therefore be chosen as a basis set in this space. Theseroots are called fundamental roots, and denoted α1, α2, . . . , αl . Every positive rootcan be expressed in terms of this basis as a linear combination of these fundamentalroots with integer coefficients. The integers are all positive or zero, because everyshift operator defined by a positive root can be written as a multiple commutator ofshift operators with fundamental positive roots. By symmetry, every negative rootis a linear combination of fundamental roots with nonpositive integer coefficients.The fundamental roots for G2 are shown in Fig. 10.4. Fundamental roots for the

G2

31

a1 a2

−e1−e2+2e3

−e1+2e2−e3

−e1+e3

−e2+e3

−e1+e2

e1+e2−2e3

2e1−e2−e3

e1−e2

e2−e3

e1−e3

e2+e3−2e1

e1+e3−2e2

Figure 10.4. Root space for G2. Fundamental roots are α1 = e1 − e2 and α2 =−e1 + 2e2 − e3. All roots are orthogonal to R = e1 + e2 + e3.

Page 180: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

166 Root spaces and Dynkin diagrams

3 1 2 2 1

G2 B3

Figure 10.5. Disconnected Dynkin diagrams describe semisimple Lie algebras.Here the disconnected diagram describes G2 ⊕ B3.

root spaces Al−1, Dl, Bl, Cl are

α1 α2 α3 αl−1 αl

Al−1 e1 − e2 e2 − e3 e3 − e4 · · · el−1 − el

Dl ” ” ” · · · ” el−1 + el

Bl ” ” ” · · · ” el

Cl ” ” ” · · · ” 2el

(10.11)

Inner products among the fundamental roots are summarized conveniently in adiagrammatic form. The inner product between two fundamental roots is negativeor zero

(αi , α j ) = −√ni j/4 (10.12)

where ni j is 0, 1, 2, or 3. Each fundamental root is represented by a dot. Dots iand j are joined by ni j lines. Orthogonal roots are not connected. Such a diagramis called a Dynkin diagram. The Dynkin diagram for the semisimple Lie algebrarepresented by orthogonal root spaces G2 + B3 is shown in Fig. 10.5.

Orthogonal root spaces for semisimple Lie algebras are represented by discon-nected Dynkin diagrams. In these diagrams the relative (squared) lengths of thefundamental roots (3, 1 for G2) are indicated over the root symbol, by an arrowpointing from the shorter to the longer, and by open and solid dots. The conventionsare interchangeable: normally not more than one is adopted. We will use only oneat a time.

Only a very limited number of distinct kinds of Dynkin diagrams can occur. Thelimitations derive from two observations.

Observation 1 The root space is positive-definite.Observation 2 If vi is an orthonormal system of vectors in the root space and u is a unit

vector, then the direction cosines u · vi obey∑(u · vi )

2 ≤ 1 (10.13)

These two observations are now used to list a set of properties that constrain theallowed Dynkin diagrams ever more tightly.

Page 181: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.3 Dynkin diagrams 167

simple chain

Figure 10.6. A simple linear chain can be removed. If the original is an allowedDynkin diagram, the shortened diagram is also an allowed Dynkin diagram. In thiscase the original diagram is not an allowed Dynkin diagram.

u1

u1

up

w1

v1

wr−1

up−1 vq−1

vq v1

X

(B,C,F )

(D,E )

Figure 10.7. General forms of allowed root space diagrams after the process ofcontraction has been performed.

Property 1 There are no loops. A diagram containing a loop has at least as many lines asvertices. With ui = αi/|αi | the inner product( ∑

ui ,∑

u j

)= n + 2

∑ ∑ui · u j ≥ 0 (10.14)

cannot be positive since 2ui · u j ≤ −1 if αi and α j are connected.Property 2 The number of lines connected to any node is less than four. This results from

Observation 2. If vi are connected to u, then∑(u · vi )

2 =∑

ni/4 < 1 (10.15)

Property 3 A simple chain connecting any two dots can be shrunk. An allowed diagram istransformed to an allowed diagram. This allows the construction shown in Fig. 10.6.Since the constructed diagram violates Property 2, so also does the original diagram.

The only possibilities remaining are shown in Fig. 10.7.For the diagrams (B, C, F) with a single double link, the Schwarz inequality

applied to the vectors

u =p∑

i=1

iui v =q∑

j=1

jv j (10.16)

Page 182: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

168 Root spaces and Dynkin diagrams

where ui , v j are unit vectors ui = αi/|αi | and vi = α j/|α j |, can be transformed tothe inequality (

1 + 1

p

) (1 + 1

q

)> 2 (10.17)

This has the following solutions with p ≥ q

p arbitrary, q = 1, Bl, Cl l = p + 1(10.18)

p = 2, q = 2, F4

For the diagrams (D, E) Observation 2 applied to the vectors u, v, and w definedas in Eq. (10.16) leads to the inequality

1

p+ 1

q+ 1

r> 1 (10.19)

This has the following solutions with p ≥ q ≥ r ≥ 2

p q r Root spacep 2 2 Dp+2

3 3 2 E6

4 3 2 E7

5 3 2 E8

(10.20)

The allowed Dynkin diagrams are summarized in Table 10.2. This table providesa complete list of simple root spaces. Each root space was constructed in Section10.2. The complete set of roots in each of the root spaces is listed in that section.

10.4 Conclusion

The canonical commutation relations for a semisimple Lie algebra have been ex-pressed in terms of root space diagrams. These diagrams have been used to classifyall simple root space diagrams of rank l by constructing a complete set of rootsinductively from each root space diagram of rank l − 1. The completeness of thisconstruction is guaranteed by the 1:1 correspondence between the root space di-agrams constructed in Section 10.2 and the allowed connected Dynkin diagramsconstructed in Section 10.3.

10.5 Problems

1. Show that the following three statements for a semisimple Lie algebra are equivalent:a. the Lie algebra has two simple invariant subalgebras;b. the nonzero roots in its root space diagram fall into two mutually orthogonal subsets;

Page 183: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.5 Problems 169

Table 10.2. Allowed root spaces

a1 a2

an

an

an

an−1

a1 a2 an−1

a1 a2

a1 a2

a1 a2 a3 a4

a1 a2 a3 a4 a5

a1 a2 a3 a4 a5 a6

a1 a2 a3 a4 a5 a6 a7

a8

a7

an−1

a1 a2 an−2

an−1

an−1

An

Dn

Bn

Cn

G2

F4

E6

E7

E8

α6

c. its Dynkin diagram is disconnected, with two connected components.Do these statements extend to semisimple Lie algebras with three or more simpleinvariant subalgebras?

2. Show that bilinear combinations of two boson creation and/or annihilation operatorscan be identified with the roots in the ten-dimensional Lie algebra C2 as shown in Fig.10.8(a). Identify H1 and H2.

3. Show that bilinear combinations of two fermion creation and/or annihilation operatorscan be identified with the roots in the six-dimensional Lie algebra D2 as shown in Fig.10.8(b). Identify H1 and H2.

Page 184: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

170 Root spaces and Dynkin diagrams

b2+ b2

+

b1b2+

b1+ b2

+

b2+ b2+ −

b1+ b1

+

b1+ b2

b2b2

(a) (b)

b1b2

b1b1

12

b1+ b1+ −12

f1f †2 f1

† f †2

f1† f2f1

f2

f2† f2+ −2

1

f1† f1+ −

2 1

Figure 10.8. (a) Roots of C2 are identified with products of boson operators.(b) Roots of D2 are identified with products of fermion operators. Note that f †1 f †1 =0, etc.

4. Show that the following identifications are appropriate for the generators of the Liegroup U (l):

Canonical Boson Coordinates and Fermionform operators derivatives operators

Hi b†i bi x i∂i f †i fi

E+ei −e j b†i b j x i∂ j f †i f j

5. Show that the following identifications are appropriate for the eigenoperators for theroot spaces Cl and Dl :

Cl Dl

Canonical Boson Coordinates and Fermion Coordinates andform operators derivatives operators derivatives

Hi b†i bi + 1

2 xi∂i f †i fi + 12 xi∂i + 1

2

E+ei −e j b†i b j x i∂ j f †i f j x i∂ j

E+ei +e j b†i b†

j x i x j f †i f †j x i x j

E−ei −e j bi b j ∂i∂ j fi f j ∂i∂ j

E+2ei b†i b†

i x i x i

E−2ei bi bi ∂i∂i

Page 185: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

10.5 Problems 171

6. Apply the Schwartz inequality to the two vectors in Eq. (10.16) and show that the resultcan be expressed in the form of the inequality given in Eq. (10.17).

7. Use the projection inequality of Eq. (10.13) with the three vectors constucted for theDynkin diagrams of type (D, E) to obtain the inequality of Eq. (10.19).

8. A Lie algebra is spanned by n2 operators of the form a†i a j , with 1 ≤ i, j ≤ n. Show

that the linear vector space for this algebra can be written as the direct sum of twosubspaces: L, Q spanned by the operators

L QLi j = a†

i a j − a†j ai = −L ji Qi j = a†

i a j + a†j ai = +Q ji

For n = 3 the subspaces transform like an angular momentum vector and a quadrupoletensor. Show that the commutation relations are

[L, L] = L[Li j , Lrs

] = +δ jr Lis + δis L jr − δir L js − δ js Lir

[L, Q] = Q[Li j , Qrs

] = +δ jr Qis − δis Q jr − δir Q js + δ js Qir

[Q, Q] = L[Qi j , Qrs

] = +δ jr Lis + δis L jr + δir L js + δ js Lir

The quadrupole tensor, in turn, with six components, can be written as the sum of atraceless tensor Q and a scalar N :

N =3∑

i=1

a†i ai Qi j = Qi j − 2

3Nδi j

The operator N commutes with all operators a†i a j . Interpret these commutation re-

lations in physical terms (scalars, vectors, and traceless quadrupole tensors) and inmathematical terms (commutative invariant subalgebra N , Cartan decomposition of asimple Lie algebra L + Q).

9. Carry out a similar decomposition for any value of n. Show that the only changes inthe discussion of Problem 8 are the dimensions of the spaces L (3 → n(n − 1)/2), Q(6 → n(n + 1)/2), and the definition of N (3 → n).

Page 186: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11

Real forms

Root space diagrams classify all the simple Lie algebras and summarizetheir commutation relations. The Lie algebras so classified exist over thefield of complex numbers. Each simple Lie algebra over C of complexdimension n has a number of inequivalent real subalgebras over R ofreal dimension n. These are obtained by putting reality restrictions onthe coordinates in the complex Lie algebra. The different real forms ofa complex simple Lie algebra are obtained systematically by a simpleeigenvalue decomposition. For the classical (matrix) Lie algebras, threedifferent procedures suffice to construct all real forms. These are: blocksubmatrix decomposition; subfield restriction; and field embedding.

11.1 Preliminaries

In our attempt to find a canonical form for the commutation relations of a real simpleLie algebra with elements Z = r i Xi (r i are real numbers, Xi the generators of theLie group, or basis vectors in the Lie algebra), we were led to an eigenvalue equationof the form

∑ ∑r i [R j

i (Z ) − λδj

i ]X j = 0. This equation cannot be solved ingeneral unless the field is extended from the real to the complex numbers. Allowingthat extension, we were able to find a canonical form for the operators in semisimpleLie algebras. The general operator in such algebras has the form

rank∑i=1

hi Hi +∑α �=0

eα Eα (11.1)

where hi and eα are complex numbers and the “diagonal” and “shift” operatorswere defined in Section 9.7. The commutation relations were classified in terms ofa root space diagram. These diagrams were used to enumerate all the simple Liealgebras over the complex field.

172

Page 187: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.1 Preliminaries 173

We return now to the question of determining the real forms associated with eachof the root space diagrams or, more accurately, the complex Lie algebra associatedwith each root space diagram. We do this by first presenting Cartan’s method ofdecomposing a Lie algebra into two subspaces with very special commutationrelations and orthogonality properties. Three simple decompositions of this typeare applied to the compact matrix Lie algebra to generate all the real forms of theclassical simple Lie algebras An−1, Dn, Bn, Cn . These decompositions are: blocksubmatrix decomposition; subfield restriction; and field embeddings.

Example The noncompact Lie algebras sl(2; R) and su(1, 1) have commutationrelations described by the root space A1. The nonisomorphic Lie algebra su(2) hasthe same root space. To see why, we compute the regular representation of sl(2; R)and su(2) and their secular equations

Algebra Defining representation Regular representation

sl(2; R)1

2

[a1 a2 + a3

a2 − a3 −a1

]−→

0 −a3 −a2

a3 0 a1

−a2 a1 0

−λ[λ2 + (−a2

1 − a22 + a2

3

)] = 0

su(2)i

2

[b3 b1 − ib2

b1 + ib2 −b3

]−→

0 −b3 b2

b3 0 −b1

−b2 b1 0

−λ[λ2 + (

b21 + b2

2 + b23

)] = 0

(11.2)

In the case of sl(2; R) it is possible to find three real roots of the secular equationfor certain choices of the real parameters a1, a2, a3 while in the compact case thisis not possible. If the real parameters (a1, a2, a3) and (b1, b2, b3) are allowed tobecome complex the two Lie algebras become algebras of 2 × 2 complex tracelessmatrices – the Lie algebra for SL(2; C). This relation is shown in Fig. 11.1.

The complex extension Lie algebra has root space A1 describing canonical com-mutation relations for the diagonal and shift operators shown in Fig. 11.1. The mostgeneral element in this Lie algebra is a complex linear combination of the threematrices shown. The algebras sl(2; R) and su(2) have real dimension 3 while theircommon complex extension has complex dimension 3 (real dimension 6). In thefollowing sections we present a systematic way for determining how to restrict thecomplex parameters to real parameters in order to construct all inequivalent real Liealgebras with the same dimension as the complex Lie algebra whose commutationrelations are described by a root space diagram.

Page 188: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

174 Real forms

sl(2;R)

sl(2;C)

su(2) Real Lie algebra

Real Complex

Complex extension Lie algebra

21 0 0

01 21 1 0

−10 21 0 1

00

E−1 E+1H1

Figure 11.1. Lie groups SL(2; R) and SU (2) are related by analytic continuation.The canonical form for the diagonal and shift operators in their Lie algebras is alsoshown.

11.2 Compact and least compact real forms

The Cartan–Killing inner product for the basis vectors Hi , Eα is

11

1. . .

10 11 0

0 11 0

. . .

H1

H2

H3...

Hn

E−α

E−β

...

(11.3)

The inner product can be brought to diagonal form by choosing linear combinationsof basis vectors 1√

2(Eα ± E−α):

11

1. . .

11

−11

−1. . .

H1

H2

H3...

Hn1√2(Eα + E−α)

1√2(Eα − E−α)

1√2(Eβ + E−β)

1√2(Eβ − E−β)

...

(11.4)

Page 189: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.2 Compact and least compact real forms 175

If we restrict the coefficients of Hi , 1√2(Eα + E−α), and 1√

2(Eα − E−α) (all α �= 0)

to be real, then the generators 1√2(Eα − E−α) span the maximal compact subalgebra

(closure under commutation must be verified; this is left an an exercise) while thegenerators Hi and 1√

2(Eα + E−α) span a noncompact subspace.

On the other hand, if we restrict the coefficients of Hi and 1√2(Eα + E−α) to be

imaginary and those of 1√2(Eα − E−α) to be real

ihi Hi + ieα 1√2

(Eα + E−α) + e−α 1√2

(Eα − E−α) (11.5)

then the factors i can be absorbed within the generators. With respect to theseredefined generators the Cartan–Killing inner product is negative-definite and thealgebra constructed is compact

h1

h2

h3

...hn

e+α

e−α

e+β

e−β

...

−1−1

−1. . .

−1−1

−1−1

−1. . .

i H1

i H2

i H3...

i Hn

i 1√2(Eα + E−α)

1√2(Eα − E−α)

i 1√2(Eβ + E−β)

1√2(Eβ − E−β)

...

(11.6)

real Cartan–Killing inner product basis vectors incoefficients Lie algebra

Two real forms of A1, sl(2; R) and su(1, 1), are obtained as follows

H11√2(E+1 + E−1) 1√

2(E+1 − E−1)

(hr + ihi )

[1 00 −1

]+ (ar + iai )

[0 11 0

]+ (br + ibi )

[0 1

−1 0

] (11.7)

(hr , ar , br ) −→[

hr ar + br

ar − br −hr

]sl(2; R)

(ihi , iai , br ) −→ i

[hi ai − ibr

ai + ibr −hi

]su(2)

(11.8)

Here the six parameters hr , hi ; ar , ai ; br , bi are real.

Page 190: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

176 Real forms

It is useful to specify how compact a real form is by specifying its index (n+, n−),where n+ is the dimension of the subspace on which the nonsingular Cartan–Killing inner product is positive-definite and n− is the dimension of the subspace(subalgebra) on which it is negative-definite. These two pieces of information maybe abbreviated to a single integer, the character χ = n+ − n−, to describe a realform. This is the trace of the normalized Cartan–Killing form. Inspection of (11.3)and (11.6) shows that the character is +(rank) of the root space for the real Liealgebra spanned by Hi ,

1√2(Eα + E−α), and 1√

2(Eα − E−α) and is −(dimension) for

the compact real form spanned by real linear combinations of iHi , i 1√2(Eα + E−α),

and 1√2(Eα − E−α). In general, for all real forms the character satisfies the bounds

− dimension ≤ χ = character ≤ +rank (11.9)

11.3 Cartan’s procedure for constructing real forms

Cartan has proposed a simple and elegant procedure for constructing all the realforms of a (complex) simple Lie algebra. This procedure constructs one real formfrom another by “analytic continuation.” It is modeled on Minkowski’s transfor-mation of space-time (x, y, z, ct) with indefinite metric gµ,ν = diag(1, 1, 1, −1) tospace-time with imaginary time (x, y, z, ict) and positive-definite metric gµ,ν =diag(1, 1, 1, 1).

Since the compact real form can always be constructed easily for a simple Liealgebra (see Eq. (11.6)) it is useful to begin with that form. The compact Liealgebra g is divided into two pieces with the following commutation relations andorthogonality properties

g = h + p

[h, h] ⊆ h (h, h) < 0[h, p] ⊆ p (h, p) = 0[p, p] ⊆ h (p, p) < 0

(11.10)

In short, the subspace h is a subalgebra and p is its orthogonal complement. Aconcrete example of this decomposition is

su(2) = u(1) + su(2) − u(1)

i

2

[a3 a1 − ia2

a1 + ia2 −a3

]= i

2

[a3 00 −a3

]+ i

2

[0 a1 − ia2

a1 + ia2 0

](11.11)

The Lie algebra g is mapped into a noncompact Lie algebra g′ by means of“Minkowski’s trick”: p → p′ = ip. The mapping, commutation relations, and

Page 191: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.4 Real forms of simple matrix Lie algebras 177

orthogonality relations are

g = h + p −→ g′ = h + ip = h + p′

[h, h] ⊆ h (h, h) < 0[h, p′] ⊆ p′ (h, p′) = 0[p′, p′] ⊆ h (p′, p′) > 0

(11.12)

In g′, h is the maximal compact subalgebra and p′ consists of all the noncompactgenerators. The character of this algebra is

χ (g′) = dim(p′) − dim(h) = dim(g) − 2dim(h) = 2dim(p′) − dim(g) (11.13)

As a concrete example of this mapping, we have from (11.11)

su(2) → su(1, 1) :i

2

[a3 00 −a3

]− 1

2

[0 a1 − ia2

a1 + ia2 0

](11.14)

The mapping is reversible: noncompact g′ can be mapped back to compact g.A systematic method exists for finding Cartan decompositions (11.12). Assume

T is a linear mapping of the Lie algebra g onto itself that preserves inner products,and that also obeys

T 2 = I (11.15)

(“involutive automorphism”). Then T has two eigenvalues: ±1. Under T , oneeigenspace of T is mapped into itself while the other (its orthogonal complement)is mapped into its negative. The map T splits g into eigenspaces h and p

g = h + p

T (g) = T (h) + T (p)g = (+1)h + (−1)p

(11.16)

The two subspaces are orthogonal

(h, p) = (T 2h, p) = (T h, T p) = (h, −p) = −(h, p) = 0 (11.17)

and satisfy commutation relations (11.12).As a consequence of this result, a search for all real forms of a complex semisim-

ple Lie algebra reduces to a hunt for all metric-preserving mappings T of thecompact real form of that Lie algebra to itself that obey T 2 = I .

11.4 Real forms of simple matrix Lie algebras

All of the real forms of all of the simple classical (matrix) Lie algebras can beconstructed from one of three types of mappings T of matrices into themselves that

Page 192: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

178 Real forms

obey T 2 = I . These three mapping types are derived from block matrix decompo-sition, subfield restriction, and field embeddings. We discuss each in the next threesubsections, indicating the real forms that are produced. In all instances we beginwith the compact Lie algebras.

11.4.1 Block matrix decomposition

In a block matrix decomposition the compact Lie algebras u(n, F) have the form

u(n; F)

[Ap 00 Aq

]+

[0 B

−B† 0

](11.18)

where Ap = −A†p, Aq = −A†

q , and B is an arbitrary p × q matrix. Under theprocedure described in the previous section the off-diagonal block is multiplied byi . This is equivalent to changing the metric Ip+q that is preserved by u(n; F) to themetric Ip,q that is preserved by u(p, q; F), where p + q = n. The factor i can beabsorbed into the p × q off-diagonal blocks, so that the noncompact algebra hasmatrix form

u(p, q; F)

[Ap 00 Aq

]+

[0 B

+B† 0

](11.19)

For the fields F = R, C, Q related to the root spaces (D, B), A, C the real formsare

R so(p, q) D, B

C su(p, q) A (11.20)

Q sp(p, q) C

11.4.2 Subfield restriction

The real numbers form a subset (subfield) of the complex numbers; the complexnumbers form a subset (subfield) of the quaternions. A Lie algebra over the complexnumbers can be divided into two subsets: real matrices and the remainder, imaginarymatrices. Similarly, a matrix algebra over the quaternions can be divided into twosubsets: complex matrices and the remainder

g = h + p −→ g′

su(n) = so(n) + [su(n) − so(n)] −→ sl(n; R)sp(n) = u(n) + [sp(n) − u(n)] −→ sp(2n; R)

(11.21)

Under the Cartan procedure, su(n) is mapped to sl(n; R) and sp(n) is mapped tosp(2n; R).

Page 193: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.4 Real forms of simple matrix Lie algebras 179

We illustrate this for su(2):

su(2) = 1

2

[ia3 ia1 + a2

ia1 − a2 −ia3

]→ 1

2

[0 +a2

−a2 0

]+ i

2

[a3 a1

a1 −a3

]↓ p → ip ↓

sl(2; R) = 1

2

[a3 a1 + a2

a1 − a2 −a3

]← 1

2

[0 +a2

−a2 0

]+ 1

2

[a3 a1

a1 −a3

](11.22)

The transformation from sp(n) = u(n; Q) to sp(2n; R) is somewhat less familiar.To make the mapping more comprehensible, it is useful to recall the mappings ofcomplex numbers into real 2 × 2 matrices and of quaternions into complex 2 × 2matrices (cf. Eqs. (3.3) and (3.4))

α + iβ −→[

α β

−β α

]

α + Iβ + J γ + Kδ −→[

α + iδ iβ + γ

iβ − γ α − iδ

] (11.23)

where α, β, γ and δ are real. With these replacements the Lie algebra of n × ncomplex matrices for u(n) is replaced by a set of 2n × 2n real matrices. We callthese matrices ou(2n), since they form an orthogonal representation of the unitaryalgebra in terms of 2n × 2n matrices. Similarly, the Lie algebra of n × n quaternionmatrices for sp(n) is replaced by a set of 2n × 2n complex matrices usp(2n), theunitary representation of the symplectic algebra of 2n × 2n matrices:

(11.23)

u(n) −→ ou(2n)sp(n) −→ usp(2n)

(11.24)

Since usp(2n) consists of complex matrices, the algebra can be decomposed intothe subalgebra of real matrices, which is ou(2n), and the complementary subspaceof imaginary matrices

sp(n) = u(n) + [sp(n) − u(n)]

↓ ↓ ↓usp(2n) = ou(2n)︸ ︷︷ ︸

real

+ [usp(2n) − ou(2n)]︸ ︷︷ ︸imaginary

↓ ↓ ↓ p → ip

sp(2n; R) = ou(2n)︸ ︷︷ ︸real

+ i [usp(2n) − ou(2n)]︸ ︷︷ ︸real

(11.25)

Page 194: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

180 Real forms

Both sl(n; R) and sp(2n; R) are the least compact real forms associated with theirrespective root spaces An−1 and Cn .

Remark The matrix Lie group Sp(2n; R) leaves invariant a nonsingular antisym-metric metric in R2n . It is possible to choose coordinates p1, q1, p2, q2, . . . , pn, qn

in this space so that the inner product between two vectors v′i Gi jv j is

v′i Gi jv j = (p1, q1, p2, q2, . . . , pn, qn)′

0 1−1 0

0 1−1 0

. . .. . .

0 1−1 0

p1

q1

p2

q2......

pn

qn

=n∑

i=1

(p′i qi − q ′

i pi ) (11.26)

Then symplectic transformations M ∈ Sp(2n; R) leave this metric matrix G invari-ant: Mt G M = G. Symplectic transformations leave invariant the canonical formof the hamiltonian equations of motion in classical mechanics.

11.4.3 Field embeddings

The algebras for the orthogonal and unitary groups of even dimension have thefollowing decompositions:

so(2n) = ou(2n) + [so(2n) − ou(2n)]↓ ↓

ou(2n) + i [so(2n) − ou(2n)] = so∗(2n)

su(2n) = usp(2n) + [su(2n) − usp(2n)]↓ ↓

usp(2n) + i [su(2n) − usp(2n)] = su∗(2n)

(11.27)

Application of the map p → ip produces the real forms so∗(2n) and su∗(2n).

Remark The real forms so∗(2n) of Dn and su∗(2n) of A2n−1 do not occur explicitlyin the list of matrix Lie algebras given in Chapter 5.

Page 195: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.5 Results 181

11.5 Results

We summarize in Table 11.1 the real forms of the simple classical Lie algebras.This table indicates the root space associated with each real form.

Some of the low-dimensional root spaces are equivalent. For example, A1 (wherethe compact real form is su(2)), B1 (so(3)), and C1 (sp(1)) are equivalent, as are B2

(so(5)) and C2 (sp(2)). So also are A3 (su(4)) and D3 (so(6)). As a result, there areequivalences between the real forms of these Lie algebras. These equivalences aresummarized in Table 11.2.

Table 11.1. Real forms of the simple classical Lie algebras

Mapping Real form Root space Condition

Block submatrix so(p, q) Dn p + q = 2nso(p, q) Bn p + q = 2n + 1su(p, q) An−1 p + q = nsp(p, q) Cn p + q = n

Subfield restriction sl(n; R) An−1sp(2n; R) Cn

Field embedding so∗(2n) Dnsu∗(2n) A2n−1

Table 11.2. Equivalence among real forms of the simple classical Lie algebras

A1 ∼ B1 ∼ C1su(2) ∼ so(3) ∼ sp(1) = usp(2)su(1, 1) = sl(2; R) ∼ so(2, 1) ∼ sp(2; R)

D2 = A1 + A1so(4) = so(3) + so(3)so∗(4) ∼ so(3) + so(2, 1)so(3, 1) ∼ sl(2; C)so(2, 2) ∼ so(2, 1) + so(2, 1)

B2 = C2so(5) ∼ sp(2) = usp(4)so(4, 1) ∼ sp(1, 1) = usp(2, 2)so(3, 2) ∼ sp(4; R)

D3 = A3so(6) ∼ su(4)so(5, 1) ∼ su∗(4)so∗(6) ∼ su(3, 1)so(4, 2) ∼ su(2, 2)so(3, 3) ∼ sl(4; R)

Page 196: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

182 Real forms

Table 11.3. Real forms of the exceptional Lie algebras

Maximal compact subgroup

Root space Classrank(character) Root space Dimension

G2 G2(−14) G2 14G2(+2) A1 + A1 6

F4 F4(−52) F4 52F4(−20) B4 36F4(+4) C3 + A1 24

E6 E6(−78) E6 78E6(−26) F4 52E6(−14) D5 + D1 46E6(+2) A5 + A1 38E6(+6) C4 36

E7 E7(−133) E7 133E7(−25) E6 + D1 79E7(−5) D6 + A1 69E7(+7) A7 63

E8 E8(−248) E8 248E8(−24) E7 + A1 136E8(+8) D8 120

For completeness, we list the real forms for the exceptional Lie algebras inTable 11.3. The subscript in parentheses after the rank is the character of the realform.

11.6 Conclusion

Connected root space diagrams summarize the commutation relations of simpleLie algebras over the field of complex numbers. By placing various reality restric-tions on the coefficients of the complex algebra, a spectrum of real subalgebrasis obtained, each of which has the same complex extension. To each root spacethere corresponds a unique real form that is compact. All other real forms areobtained from this compact real form by “analytic continuation.” The analyticcontinuation is carried out by determining all linear mappings T on the compactalgebra g that preserve the inner product and obey T 2 = I . The subspace p of g

that obeys T (p) = −p is analytically continued by p → p′ = ip; the subspace h

of g that obeys T (h) = +h is the maximal compact subalgebra of the noncompactreal form g′: g = h + p → g′ = h + ip′. For the simple classical (matrix) Lie alge-bras three types of mappings T suffice to construct all real forms: block submatrixdecomposition; subfield restriction; and field embedding.

Page 197: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.7 Problems 183

11.7 Problems

1. Four operators a†i a j can be constructed from boson operators for two modes 1 ≤ i ,

j ≤ 2. These operators close under commutation.a. Show that the regular representation of this Lie algebra is

Reg(wa†1a1 + xa†

1a2 + ya†2a1 + za†

2a2) =

0 −x y 0−y w − z 0 yx 0 −w + z −x0 x −y 0

b. Show that the Cartan–Killing inner product is

tr Reg2(wa†

1a1 + xa†1a2 + ya†

2a1 + za†2a2) = 2(w − z)2 + 8xy

c. Set

w = α + β x = γ + δ

z = α − β y = γ − δinner product → 8(β2 + γ 2 − δ2)

wa†1a1 + xa†

1a2 + ya†2a1 + za†

2a2 = α(a†1a1 + a†

2a2)

+ β(a†1a1 − a†

2a2) + γ (a†1a2 + a†

2a1) + δ(a†1a2 − a†

2a1)

Conclude that a†1a1 + a†

2a2 spans the maximum commutative subalgebra, a†1a2 −

a†2a1 spans the maximal compact subalgebra, and the two generators a†

1a1 − a†2a2

and a†1a2 + a†

2a1 are noncompact.d. Identify the simple three-dimensional subalgebra as sl(2; R) or su(1, 1). Show that

the compact real form is obtained by multiplying the two noncompact generatorsby i .

e. Construct a 2 × 2 matrix representation of the three operators that span su(1, 1)using the methods of Chapter 6. Multiply the two noncompact operators by i .Show that the three matrices that result are exactly iσ j , where σ j are the Pauli spinmatrices.

2. The classical matrix groups SO(n) are not simply connected, so they are k → 1images of their universal covering groups SO(n) = Spin(n), for some integer k.Show that the covering groups Spin(n) are classical matrix groups for n = 3, 4, 5, 6,and make these identifications:

Spin(3) = SU (2)Spin(4) = SU (2) ⊗ SU (2)Spin(5) = U Sp(4)Spin(6) = SU (4)

Show that for n > 6 the groups Spin(n) are not equal to any classical matrix Liegroups.

Page 198: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

184 Real forms

3. Spectrum of quadratic Casimira. Use the metric (11.3) for a simple Lie algebra to show that the quadratic Casimir

operator is

C2 =∑

H 2i +

∑Eα E−α + E−α Eα

b. Since the Hi are mutually commuting, in a hermitian/unitary representation theyare simultaneously diagonalizable. Identify basis states in a Hilbert space by theireigenvalues under the operators Hi : |n1, n2, . . . , nr 〉,

Hi |n1, n2, . . . , nr 〉 = ni |n1, n2, . . . , nr 〉c. For the orthogonal groups SO(n), impose suitable reality conditions (i.e., Hj →

i Hj , etc.), choose a Hilbert space containing the state |l, 0, . . . , 0〉 and show thatthe value of C2 on every vector (i.e., apply shift operators Eα until no new statesare created) is

C2|state〉 = −l(l + n − 2)|state〉The − sign indicates that SO(n) is compact. This spectrum reduces to the well-known spectrum −m2 for SO(2) (on eimφ) and −l(l + 1) for SO(3) (on Y l

m(θ, φ)).

4. Master analytic representation for A1 The complex Lie algebra with root spacediagram A1 has two real forms

su(2) J3, J± [J3, J±] = ±J± [J+, J−] = +2J3

su(1, 1) K3, K± [K3, K±] = ±K± [K+, K−] = −2K3

In Problem 2 of Chapter 6 we exploited the isomorphism between the Lie algebrasu(2) and bilinear combinations of creation and annihilation operators for two modesin order to construct matrix elements of the angular momentum operators. These arematrix elements of a hermitian representation of Ji , i = 1, 2, 3. Exponentials of theform E X P(ir k Jk), with rk real, provide unitary representations of the compact Liegroup SU (2). All unitary irreducible representations of SU (2) are finite dimensional(2 j + 1) and are obtained in this way. In this problem we will review the constructionof the UIR (unitary irreducible representations) of the compact group SU (2) and willuse similar methods to construct all the UIR of its analytic continuation, the non-compact Lie group SU (1, 1). Since the algebras are related by analytic continuation,so also are the UIR. We will begin with the analytic hermitian matrix elements forsu(2) and continue to hermitian matrix elements for the analytically continued algebrasu(1, 1).a. Make the identifications

K3 = 12 (a†

1a1 − a†2a2) = J3

K+ = ia†1a2 = i J+

K− = ia†2a1 = i J−

(11.28)

Verify all commutation relations are satisfied.

Page 199: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.7 Problems 185

b. Recall that in both SU (2) and SU (1, 1), rotation about the z-axis by 4π radiansreturns to the same group operation. Show that in any matrix representation ofSU (2) with J3 diagonal, or in any matrix representation of SU (1, 1) with K3

diagonal, the matrix is diagonal with matrix elements eimφδm ′m . Show that thesingle-valuedness condition under φ → φ + 4π requires that m = 1

2 (n1 − n2) isinteger or half-integer. Show that the shift operators J± and K± require that all mvalues in a UIR with J3 or K3 diagonal are either integer or half-integer.

c. Relax the assumption that all indices n1, n2 in the basis states |n1, n2〉 = |n1〉 ⊗|n2〉 must be nonnegative integers. Construct the matrix elements of the operatorsJ3, J±, K3, K± under this relaxed assumption. Show that all commutation relationsare satisfied in the representation afforded by this set of basis states.

d. Show that the matrices for Jx and Jy are also hermitian provided that

〈n1 + 1, n2 − 1|J+|n1, n2〉 = √(n1 + 1)n2

‖ ‖〈n1, n2|J−|n1 + 1, n2 − 1〉∗ = √

(n1 + 1)n2∗

Show that these conditions are satisfied for

n1 ≥ 0 and n2 ≥ 0 or n1 ≤ −1 and n2 ≤ −1

Show that the lattice sites in quadrants I and III of Fig. 11.2, with vertices at (0, 0)(QI) and (−1, −1) (QIII), satisfy these conditions.

e. With the identification | jm

〉 = |n1, n2〉, j = 12 (n1 + n2), m = 1

2 (n1 − n2), show

that⟨j

m ± 1

∣∣∣J±∣∣∣ jm

⟩=

√( j ∓ m)( j ± m + 1) =

√(j + 1

2

)2

−(

m ± 1

2

)2

(11.29)

f. Show that the operators J± act diagonally. In order for all states connected bysuccessive application of these operators to remain in QI or QIII, n1 and n2 mustbe integers so that in the various quadrants the shift operators vanish on the edgesas shown:

Quadrant Operator EdgeI J+ n2 = 0I J− n1 = 0

III J+ n1 = −1III J− n2 = −1

g. Show that the matrix elements for the shift operators K± in su(1, 1) are

⟨j

m ± 1

∣∣∣K±∣∣∣ jm

⟩= i

√( j ∓ m)( j ± m + 1) =

√(m ± 1

2

)2

−(

j + 1

2

)2

(11.30)

Page 200: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

186 Real forms

n2

n1

SU(2)SU(1,1)

SU(2) SU(1,1)

4

3

1

2

X4321−1−2−3

−1

−2

−3

00

X

X

X

X

X

X

I

II III

IV

Figure 11.2. The integer lattice in two dimensions carries representationsof the algebras su(2) and su(1, 1) that exponentiate to unitary irreduciblerepresentations with careful choice of the basis set. All points in this plane aremapped to other points along diagonals of the form n1 + n2 = constant. Thesubspaces of basis vectors for the unitary irreducible representations of SU (2)and SU (1, 1) are separated by a “no man’s land” defined by −1 < n1 < 0and −1 < n2 < 0 (wavy lines). All points labeled x belong to the principalseries of representations of SU (1, 1) with n1 + n2 = − 1

2 + iβ.

h. For su(1, 1) show the hermiticity condition is satisfied for all real numbers exceptthose in QI and QIII.

i. In order to ensure that a set of states |n1 + k, n2 − k〉 (k integer) mapped into eachother by the shift operators K± do not enter QI from QIV, show that the edge(lowest m) state must be |n1, n2 = −1〉 for n1 = 0, 1, 2, . . . . The basis states forthis bounded discrete series of representations are | j

m 〉 = |n1, n2〉 with j + 12 =

0, 12 , 1, 3

2 , . . . or 2 j + 1 = 0, 1, 2, 3, . . . and m = j + 1, j + 2, . . . . This is the

discrete series of representations that is bounded below: D j+.

j. In order to ensure that a set of states |n1 + k, n2 − k〉 (k integer) mapped intoeach other by the shift operators K± do not enter QI from QII, show that the edge(highest m) state must be |n1 = −1, n2〉 for n2 = 0, 1, 2, . . . . The basis states forthis bounded discrete series of representations are | j

m 〉 = |n1, n2〉 with j + 12 =

0, 12 , 1, 3

2 , . . . or 2 j + 1 = 0, 1, 2, 3, . . . and m = − j − 1, − j − 2, . . . . This is

the discrete series of representations that is bounded above: D j−.

Page 201: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

11.7 Problems 187

k. Advance similar arguments to guarantee that states do not enter QIII from QIV(D j

+) or from QII (D j−).

l. Now relax the condition that n1 and n2 are integers. The set of states |n1 + k, n2 −k〉 (k = . . . , −2, −1, 0, +1, +2, . . . ) connected by K± carries a hermitian repre-sentation of su(1, 1) if one of the states falls in the square with corners on thevertices of the four quadrants. If this state is |p, q〉, with −1 ≤ p, q ≤ 0 then thesingle-valuedness condition requires 1

2 (p − q) = integer or half-integer. In the lat-ter case, it is not possible for the matrix element in Eq. (11.30) to be real for allvalues of the U (1) index m. Therefore p − q = 0 and −1 ≤ p = q ≤ 0. The states| j

m 〉 with m integer, j real and − 12 ≤ j + 1

2 ≤ + 12 carry representations D p of the

complementary series of UIR for SU (1, 1).m. By setting j + 1

2 = iβ (β real) the matrix elements in Eq. (11.30) become

⟨j

m ± 1

∣∣∣K±∣∣∣ jm

⟩= i

√( j ∓ m)( j ± m + 1) =

√(m ± 1

2

)2

+ β2 (11.31)

These matrix elements are always positive, for both representations with m integerand those with m half-integer. These states carry UIR belonging to the principalseries of representations of SU (1, 1).

n. The four series of UIR for SU (1, 1) are

principal j + 12 = iβ β real m = 0, ±1, ±2, ...

m = ± 12 , ± 3

2 , ...

complementary j + 12 = p − 1

2 ≤ p ≤ + 12 m = 0, ±1, ±2, ...

discrete, + 2 j + 1 = 0, ±1, ±2, ... m = +| j | + 1, +| j | + 2, ...

discrete, − 2 j + 1 = 0, ±1, ±2, ... m = −| j | − 1, −| j | − 2, ...

Show that states with j ′ < − 12 obtained by reflection through the diagonal contain-

ing the central point in the shaded square with coordinates (n1, n2) = (− 12 , − 1

2 )support representations equivalent to those with index j > − 1

2 . The relation amongindices is j + 1

2 = −( j ′ + 12 ) and the relation among states is | j ′

m ′ 〉 � | jm=m ′ 〉.

o. Show that the following equivalences occur among representations of these fourseries:

principal series j ′ = − 12 − i |β| ↔ j = j ′∗ = − 1

2 + i |β|complementary series − 1

2 ≤ j ′ + 12 ↔ ( j + 1

2 ) = −( j ′ + 12 ) < 0

discrete series, + j ′ + 12 < 0 ↔ j + 1

2 = −( j ′ + 12 )

discrete series, − j ′ + 12 < 0 ↔ j + 1

2 = −( j ′ + 12 )

5. A real simple Lie algebra of rank l and dimension n has basis vectors Hi and Eα .An element X in the Lie algebra is a real linear combination of these generators:X = hi Hi + eα Eα , with hi , eα real. Show that the real subalgebra spanned by the12 (n − l) linear combinations of the form (Eα − E−α)/

√2 is the maximal compact

subalgebra of this simple Lie algebra.

Page 202: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

188 Real forms

6. The noncompact real form sp(p, q) of the symplectic algebra was constructed fromthe compact real form sp(p + q) by “Minkowski’s trick,” or analytic continuation.This procedure is delicate: one must be careful of the complex unit i with quaternions.Show by more careful arguments that the result stated is correct.

Page 203: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12

Riemannian symmetric spaces

In the classification of the real forms of the simple Lie algebras we en-countered subspaces p, ip on which the Cartan–Killing inner product wasnegative-definite (on p) or positive-definite (on ip). In both cases thesesubspaces exponentiate onto algebraic manifolds on which the invariantmetric gi j is definite, either negative or positive. Manifolds with a definitemetric are Riemannian spaces. These spaces are also globally symmet-ric in the sense that every point looks like every other point – becauseeach point in the space EXP(p) or EXP(ip) is the image of the originunder some group operation. We briefly discuss the properties of theseRiemannian globally symmetric spaces in this chapter.

12.1 Brief review

In the discussion of the group SL(2; R) we encountered three symmetric spaces.These were S2 ∼ SU (2)/U (1), which is compact, and its dual H 2

2+ = SL(2; R)/SO(2) = SU (1, 1)/U (1), which is the upper sheet of the two-sheeted hyperboloid.“Between” these two spaces occurs H 2

1 = SL(2; R)/SO(1, 1), which is the single-sheeted hyperboloid. These spaces are shown in Fig. 12.1.

The Cartan–Killing inner product in the linear vector subspace su(2) − u(1) isnegative definite. This is mapped, under the EXPonential function, to the Cartan–Killing metric on the space SU (2)/U (1) ∼ S2, the sphere. On S2 the Cartan–Killingmetric is negative-definite. We may just as well take it as positive-definite. Underthis metric the sphere becomes a Riemannian manifold since there is a metric on itwith which to measure distances.

The Cartan–Killing inner product on su(1, 1) − u(1) � sl(2; R) − so(2) ispositive-definite. It maps to a positive-definite metric on H 2

2+ = SU (1, 1)/SO(2).The upper sheet of the two-sheeted hyperboloid is topologically equivalent to the flatspace R2 but geometrically it is not: it has intrinsic curvature that can be computed,via its Cartan–Killing metric and the curvature tensor derived from it.

189

Page 204: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

190 Riemannian symmetric spaces

R3

S 2

a2

a3

H 2

x y

z

z

x

y

single-sheeted hyperboloid

EXP

Figure 12.1. S2 = SO(3)/SO(2) = SU (2)/U (1), H 22+ = SO(2, 1)/SO(2) =

SU (1, 1)/U (1), H 21 = SO(2, 1)/SO(1, 1) = SL(2; R)/SO(1, 1). The first two

are Riemannian symmetric spaces, the third is a pseudo-Riemannian symmetricspace.

The most interesting of these spaces is the single-sheeted hyperboloid H 21 . It is

obtained by exponentiating su(1, 1) − so(1, 1). The Cartan–Killing inner productin this linear vector space is indefinite. Therefore the Cartan–Killing metric onthe topological space EXP[su(1, 1) − so(1, 1)] = SU (1, 1)/SO(1, 1) is indefinite.The space is a pseudo-Riemannian manifold. In addition it is multiply connected.

12.2 Globally symmetric spaces

The three cases for A1 reviewed in the previous section serve as a model for thedescription of all other Riemannian symmetric spaces. For a compact simple Liealgebra g (i.e., so(n), su(n), sp(n)) the Cartan decompositions have the form (11.10)

g = h + p (p, p) < 0g′ = h + ip (ip, ip) > 0

(12.1)

Page 205: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.3 Rank 191

On the linear vector space p (ip) the Cartan–Killing inner product is negative(positive) definite. On the topological spaces EXP(p) (EXP(ip)) the Cartan–Killingmetric is negative- (positive-) definite also:

G/H = EXP(p) ds2 = gµ,ν dxµdxν < 0G ′/H = EXP(ip) ds2 = gµ,ν dxµdxν > 0

(12.2)

In both cases, the metric is definite and defines a Riemannian space. This space isglobally symmetric. That is, every point “looks like” every other point. This is be-cause they all look like the identity EXP(0), since the identity and its neighborhoodcan be shifted to any other point in the space by multiplication by the appropriategroup operation (for example, by EXP(p) or EXP(ip)).

The space P = G/H = EXP(p) (e.g., S2) is compact. The exponential of astraight line through the origin in p returns periodically to the neighborhood of theidentity. The space P is not topologically equivalent to any Euclidean space, inwhich a straight line (geodesic) through the origin never returns to the origin. Thespace P may be simply connected or multiply connected.

The space P ′ = G ′/H = EXP(ip) (i.e., H 22+) is noncompact. The exponential of

a straight line through the origin in ip (a geodesic through the identity in EXP(ip))simply goes away from this point without ever returning. The space P ′ = EXP(ip) istopologically equivalent to a Euclidean space Rn , where n = dim ip. Geometricallyit is not Euclidean since it has nonzero curvature. This space is simply connected.

The Riemannian spaces P = EXP(p) and P ′ = EXP(ip) are symmetric but notisotropic unless the rank of the space is 1, as it is for S2 and H 2

2+.If g is simple with a Cartan decomposition of the form g = k + p, with standard

commutation relations [k, k] ⊆ k, [k, p] ⊆ p, and [p, p] ⊆ k, the quotient coset P =G/K is a globally symmetric space as every point “looks like” every other point.

12.3 Rank

Rank for a symmetric space can be defined in exactly the same way as rank for a Liegroup or a Lie algebra. This should not be surprising, as a symmetric space consistsof points (coset representatives P = G/H or P ′ = G ′/H ) in the Lie group.

To compute the rank of a symmetric space one starts from the secular equationfor the associated algebra g = h + p

‖ Reg(h + p) − λIn ‖ =n∑

j=0

(−λ)n− jφ j (h, p) (12.3)

and restricts to the subspace p. Calculation of the rank can be carried out inany faithful matrix representation, for example the defining n × n matrix rep-resentation. The secular equations for the spaces SO(p, q)/SO(p) × SO(q),

Page 206: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

192 Riemannian symmetric spaces

SU (p, q)/S[U (p) × U (q)], Sp(p, q)/Sp(p) × Sp(q) are∣∣∣∣∣∣∣∣[

0 BB† 0

]− λIp+q

∣∣∣∣∣∣∣∣ =

n=p+q∑j=0

(−λ)n− jφ j (B, B†) (12.4)

It is easy to check that the function φ j depends on the q × q matrix B†B or thep × p matrix B B†, whichever is smaller. The rank of these spaces is min(p, q).

For Riemannian globally symmetric spaces the rank is (cf. Section 10.1):

(i) the number of independent functions in the secular equation;(ii) the number of independent roots of the secular equation;

(iii) the maximal number of mutually commuting operators in the subspace p or p′;(iv) the number of invariant (Laplace–Beltrami) operators defined over the space P (P ′);(v) the dimension of a positive-definite root space that can be used to define diagrammat-

ically the properties of these spaces (Araki–Satake root diagrams);(vi) the number of distinct, nonisotropic directions;

(vii) the dimension of the largest Euclidean submanifold in P .

We will not elaborate on these points here. We mention briefly that the Laplace–Beltrami operators on P = G/H are the Casimir operators of its parent group G,restricted to the subspace P . The number of nonisotropic directions is determinedby computing the number of distinct eigenvalues of the Cartan–Killing metric onP , or equivalently and more easily, of the Cartan–Killing inner product on p (sameas the metric at the identity). In each of the spaces P there is a Euclidean subspace(submanifold). For S2, any great circle is Euclidean.

12.4 Riemannian symmetric spaces

Table 12.1 lists all the classical noncompact Riemannian symmetric spaces of theform G ′/H , where G ′ is simple and noncompact and H is the maximal compact

Table 12.1. All classical noncompact Riemannian symmetric spaces

Root space Quotient G ′/H Dimension P Rank P

Ap+q−1 SU (p, q)/S [U (p) ⊗ U (q)] 2pq min(p, q)An−1 SL(n; R)/SO(n) 1

2 (n + 2)(n − 1) n − 1A2n−1 SU ∗(2n)/U Sp(2n) (2n + 1)(n − 1) n − 1Bp+q SO(p, q)/SO(p) ⊗ SO(q) pq min(p, q)Dp+q SO(p, q)/SO(p) ⊗ SO(q) pq min(p, q)Dn SO∗(2n)/U (n) n(n − 1) [n/2]C p+q U Sp(2p, 2q)/U Sp(2p) ⊗ U Sp(2q) 4pq min(p, q)Cn Sp(2n; R)/U (n) n(n + 1) n

Page 207: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.5 Metric and measure 193

Table 12.2. All exceptional noncompact Riemannian symmetric spaces

Root space G ′/H Dim G ′ Dim H Dim P Rank P

G2 G2(+2)/(A1 ⊕ A1) 14 6 8 2F4 F4(−20)/B4 52 36 16 1

F4(+4)/(C3 ⊕ A1) 52 24 28 4E6 E6(−26)/F4 78 52 26 2

E6(−14)/(D5 ⊕ D1) 78 46 32 2E6(+2)/(A5 ⊕ A1) 78 38 40 4E6(+6)/C4 78 36 42 6

E7 E7(−25)/(E6 ⊕ D1) 133 79 54 3E7(−5)/(D6 ⊕ A1) 133 69 64 4E7(+7)/A7 133 63 70 7

E8 E8(−24)/(E7 ⊕ A1) 248 136 112 4E8(+8)/D8 248 120 128 8

subgroup in G ′. To each there is a compact real form under G ′/H → G/H . Forexample, SO(p, q)/SO(p) ⊗ SO(q) and SO(p + q)/SO(p) ⊗ SO(q) are dual.These spaces are classical because they involve the classical series of Lie groups:the orthogonal, the unitary, and the symplectic.

Table 12.2 lists all the exceptional noncompact Riemannian symmetric spaces.As before, to each there is a dual compact real form.

12.5 Metric and measure

The metric tensor on the spaces P , P ′ is computed by defining a metric at theidentity and then moving it elsewhere by group multiplication. The metric at theidentity is chosen as the Cartan–Killing inner product on ip, or its negative on p.

If dx(Id) are infinitesimal displacements at the identity that are translated toinfinitesimal displacements dx(p) at point p, then these two sets of infinitesimalsare linearly related by a nonsingular linear transformation (cf. Eq. (4.44))

dxi (Id) = Miµ dxµ(p) (12.5)

The metrics and invariant volume elements are related by (cf. Eqs. (4.47) and (4.49))

ds2 = gi j (Id)dxi (Id)dx j (Id)= gµν(p)dxµ(p)dxν(p)⇒ gµν(p) = gi j (Id)Mi

µM jν

dV = ρ(Id)dx1(Id) ∧ dx2(Id) ∧ · · · ∧ dxn(Id)= ρ(p)dx1(p) ∧ dx2(p) ∧ · · · ∧ dxn(p)⇒ ρ(p) =‖ M(p) ‖ ρ(Id) ∼ √

det g(p)

(12.6)

Page 208: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

194 Riemannian symmetric spaces

The matrix Miµ(p) is not easy to compute in general. For the rank-one spaces

SO(n, 1)/SO(n), SU (n, 1)/U (n), Sp(n, 1)/Sp(n) × Sp(1) defined by

P ′ =[

W XX † Y

]

W 2 = In + X X †

Y 2 = 1 + X †X

X =

x1

x2

...xn

(12.7)

the matrix Miµ(X ) is determined from

dx(X ) = W dx(Id) (12.8)

The matrix Miµ(X ) is given by W −1. Since the Cartan–Killing inner product is In

at the identity, we find

gµν(X ) = W −1 InW −1 = {In + X X †}−1

µν

ρ(X ) = ‖ W ‖−1 = 1/√

1 + X †X = Y −1

(12.9)

12.6 Applications and examples

The coset representatives for the Riemannian symmetric spaces SO(2, 1)/SO(2)and SO(3)/SO(2) are

SO(2, 1)/SO(2) SO(3)/SO(2)

[W X

+Xt Y

] [W X

−Xt Y

]

W 2 = I2 +(

xy

)( x y ) W 2 = I2 −

(xy

)( x y )

Y 2 = I1 + ( x y )

(xy

)Y 2 = I1 − ( x y )

(xy

)(12.10)

From these coset representatives we can compute the metric tensors onthe noncompact hyperboloid H 2

2 = SO(2, 1)/SO(2) and the compact sphere

Page 209: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.6 Applications and examples 195

S2 = SO(3)/SO(2). The metric tensors in the two cases are the 2 × 2 matrices

SO(2, 1)/SO(2) SO(3)/SO(2)

g∗,∗ = W −2 =[

I2 +(

xy

)( x y )

]−1

g∗,∗ = W −2 =[

I2 −(

xy

)( x y )

]−1

g∗,∗ = W +2 =[

1 + x2 +xy+yx 1 + y2

]g∗,∗ = W +2 =

[1 − x2 −xy−yx 1 − y2

](12.11)

The noncompact Riemannian symmetric space H 22 = SO(2, 1)/SO(2) is param-

eterized by the entire x–y plane while its dual compact Riemannian symmet-ric space SO(2 + 1)/SO(2) is parameterized by the interior of the unit circleY 2 = 1 − (x2 + y2) ≥ 0.

Since the (intrinsic) properties of the Riemannian symmetric space are entirelyencoded in its metric tensor, we can begin to compute its important properties, forexample, the curvature tensor. It is first useful to compute the Christoffel symbolsas a way-station on the road to computing the full Riemannian curvature tensor.The Christoffel symbols (not a tensor!), the Riemannian curvature tensor, the Riccitensor, and the curvature scalars are constructed in terms of the metric tensor asfollows:

Christoffel �σµν = 1

2gσα

(∂gµα

∂xν+ ∂gνα

∂xµ− ∂gµν

∂xα

)

Riemann curvature tensor Rµσ,αβ = ∂�

µσβ

∂xα− ∂�µ

σα

∂xβ+ �µ

ρα�ρσβ − �

µρβ�ρ

σα

Ricci tensor Rσβ = Rµσ,µβ

curvature scalar R = gσβ Rσβ (12.12)

In general, computing these objects is not easy. This task is greatly simplifiedin a symmetric space, for all points look the same and we can compute the ten-sors wherever the computation is easiest. This turns out to be at the origin. Weillustrate by carrying out the computations in the neighborhood of the identityfor the compact case, the sphere. Instead of using the pair x , y as coordinates,we use indexed coordinates xi , i = 1, 2, . . . , N , and set N = 2 at the end of thiscomputation.

We first note that it is sufficient to estimate the behavior of the metric tensor inthe neighborhood of the origin (identity in the coset) only up to quadratic terms,

Page 210: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

196 Riemannian symmetric spaces

so that

gi j = W −2 = [IN − X Xt

]−1i j

� [IN + X Xt

]i j

→ δi j + xi x j (12.13)

The inverse (contravariant metric) is gi j � δi j − xi x j , but we will not need thisresult. In the neighborhood of the identity (gi j → δi j )

�σµν → 1

2

(∂gµσ

∂xν+ ∂gνσ

∂xµ− ∂gµν

∂xσ

)

= 1

2

{δνµxσ + δµνxσ − δσµxν

δνσ xµ + δµσ xν − δσνxµ

}

→ δµνxσ (→ 0 at origin)

(12.14)

Computation of the components of the Riemann curvature tensor at the orign iseven simpler. At the origin the components of the Christoffel symbols all vanish, soit is sufficient to retain only the first two terms in the expression for the curvaturetensor. We find

Rµσ,αβ → ∂

∂xα(δσβ xµ) − ∂

∂xβ(δσαxµ) = δσβδ µ

α − δσαδµ

β (12.15)

The contravariant index µ can be lowered with the metric tensor, which is thedelta function at the origin, and the resulting fully covariant metric tensor Rµσ,αβ =δαµδβσ − δασ δβµ exhibits the full spectrum of expected symmetries.

The Ricci tensor is obtained by contraction

Rσβ = Rµσ,µβ = δσβδµµ − δσµδβµ = Nδσβ − δσβ (12.16)

The curvature scalar is obtained from the Ricci tensor by saturating its covariantindices by the contravariant components of the metric tensor, which is simply adelta function at the origin:

R = gσβ Rσβ → δσβ(N − 1)δσβ = N (N − 1) (12.17)

For N = 2 (sphere S2), R = 2.The computation can be carried out just as easily for the noncompact space H 2

2 .The major change occurs in the first step, where the metric in the neighborhood ofthe origin undergoes the change

SO(2 + 1)/SO(2) SO(2, 1)/SO(2)

gi j → δi j + xi x j → gi j → δi j − xi x j(12.18)

The net result is that a negative sign attaches itself at each step in the computation:for example �σ

µν → −δµνxσ . The end result for H 22 is that R = −2.

Page 211: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.7 Pseudo-Riemannian symmetric spaces 197

12.7 Pseudo-Riemannian symmetric spaces

Topological spaces on which a “metric tensor” can be defined that is neitherpositive-definite (ds2 = gµν dxµdxν > 0, equality ⇒ dx = 0) nor negative-definite (ds2 < 0), but which is nonsingular (‖ g ‖�= 0) are called pseudo-Riemannian spaces. Pseudo-Riemannian spaces that are globally symmetric canbe constructed following the procedures described in Sections 12.1 and 12.2. Asthe example of the single-sheeted hyperboloid H 2

1 shows, these spaces are evenmore interesting than the Riemannian globally symmetric spaces.

To make these statements more explicit, assume a Lie algebra g′′ (noncompact)has a decomposition

g′′ = h

′′ + p′′ (12.19)

with commutation relations of the form (11.10)[h

′′, h′′] ⊆ h

′′[h

′′, p′′] ⊆ p

′′ (12.20)[p

′′, p′′] ⊆ h

′′

Then h′′ and p′′ are orthogonal subspaces in g′′ under the Cartan–Killing innerproduct. Assume also that the inner product is indefinite on p′′ (also h′′). Then

P ′′ = EXP(p′′) = G ′′/H ′′ (12.21)

is a pseudo-Riemannian globally symmetric space. The metric on this space isindefinite. The space is curved and typically multiply connected. The space H ′′ =EXP(h′′) is also an interesting pseudo-Riemannian symmetric space.

All of the algebraic properties associated with a Riemannian symmetric spacehold also for pseudo-Riemannian symmetric spaces. That is, rank can be defined,and carries most of the implications listed in Section 12.3.

There is a systematic method for constructing pseudo-Riemannian symmetricspaces. Begin with a compact simple Lie algebra g and suppose T1, T2 are twometric-preserving mappings of the Lie algebra onto itself that obey T 2

1 = I, T 22 = I

(cf. Section 11.3) and T1 �= T2. Define the eigenspaces of g under T1, T2 as g±,±:

T1 g±,∗ = ±g±,∗T2 g∗,± = ±g∗,±

(12.22)

Then T1 can be used to construct a noncompact algebra

g′ = (g+,+ + g+,−) + i(g−,+ + g−,−) (12.23)

Page 212: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

198 Riemannian symmetric spaces

and T2 can be used to split g′ in a different way

g′′ = (g+,+ + ig+,−) + (ig−,+ + g−,−)

= h′′ + p′′(12.24)

The subspaces h′′, p′′ obey commutation relations (12.20). The Cartan-Killing innerproduct is indefinite on both h′′ and p′′ as long as T1 �= T2.

For su(2) the only two mappings are T1 = block diagonal decomposition andT2 = complex conjugation. The eigenspace decomposition is

Operation iσ1 iσ2 iσ3

T1 = block matrix decomposition −1 −1 +1T2 = complex conjugation −1 +1 −1T3 = T1 T2 +1 −1 −1

(12.25)

This gives g+,+ = 0, g+,− = iσ3, g−,+ = iσ2, g−,− = iσ1. Note that each mappingTi has one positive and two negative eigenvalues, and chooses a different generatorfor the maximal compact subalgebra h′ of the noncompact real form g′.

12.8 Conclusion

Globally symmetric spaces have the form P = G/K , where g is a real form ofa simple Lie algebra, g = k + p, with [k, k] ⊆ k, [k, p] ⊆ p, and [p, p] ⊆ k. AllRiemannian globally symmetric spaces are constructed as quotients of a simpleLie group G by a maximal compact subgroup K . More specifically, they are expo-nentials of a subalgebra p of a Lie algebra g for which commutation relations andinner products are given by (11.10). Pseudo-Riemannian globally symmetric spacesare similarly constructed. For these spaces the rank can be defined. This determinesa number of algebraic properties (maximal number of independent mutually com-muting generators and Laplace–Beltrami operators) as well as geometric properties(number of nonisotropic directions, dimension of maximal Euclidean subspaces).Metric and measure are determined on these spaces in an invariant way.

12.9 Problems

1. Show that the invariant polynomials φ j (B, B†) in (12.4) actually depend on the invari-ants of B B† or B†B. These are the eigenvalues of these square, hermitian matrices.Both the p × p and q × q matrix have the same spectrum of nonzero eigenvalues.The remaining (p − q) or (q − p) (whichever is positive) eigenvalues of the largermatrix are zero (singular value decomposition theorem).

Page 213: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.9 Problems 199

2. The second order Laplace–Beltrami operator �2 is constructed from the second orderCasimir invariant C2 by restricting the action of the latter to the Riemannian manifoldG/H = P .a. Show that this operator can be expressed in terms of the Cartan–Killing metric

tensor on P as �2 = gi j (∂i∂ j − � ki j ∂k).

b. Show that there is one Laplace–Beltrami on the sphere S2 and compute it in thestandard parameterization in terms of the coordinates (x, y) in the interior of theunit disk x2 + y2 ≤ 1.

c. Show that there is one Laplace–Beltrami on the two-sheeted hyperboloid H 22 and

compute it in the standard parameterization in terms of the coordinates on the planeR2.

d. Show that these two Laplace–Beltrami operators are dual in some sense. Whatsense?

e. Extend these results to the sphere Sn and its dual, H n , n > 2.

3. Show that the two metric-preserving mappings T1 and T2 that satisfy T 21 = T 2

2 = Igenerate a third, T3 = T1T2 and that T1T2 = T2T1. Show that T3 �= I if T1 �= T2. Showthat these three operators, together with the identity, form a group isomorphic withthe “four-group” (“vierergruppe”) V4. Describe the variety of decompositions of acompact Lie algebra g = g+,+ + g+,− + g−,+ + g−,− that is available by choosingfirst, one of these three involutions, and then a second (there are 3!/1!=6 choices).Discuss dualities.

4. Show that the secular equation for the symmetric space SO(3)/SO(2) can be obtainedfrom (11.2) by setting b3 = 0:

det |Reg(p) − λI3| = −λ[λ2 + (b2

1 + b22)

] = 0

There is one independent function in this secular equation. There is one independentroot. What else can be said about this Riemannian symmetric space?

5. Show that the coefficients φ j (p) in the secular equation for a symmetric space areobtained from the coefficients φ j (h, p) in the secular equation for the parent Liealgebra (Eq. (12.3)) by setting h = 0.

6. The hyperbolic plane H 22 is the Riemannian symmetric space SO(2, 1)/SO(2) ob-

tained by exponentiating a real symmetric matrix in the three-dimensional Lie algebra

EXP

0 t1 t2

t1 0 0t2 0 0

=

x0 x1 x2

x1 ∗ ∗x2 ∗ ∗

x2

0 − x21 − x2

2 = 1

a. Show that the hyperbolic plane is the two-dimensional algebraic manifold definedby the condition x2

0 − x21 − x2

2 = 1 in the Lorentz 3-space with signature (1, 2).b. Show that the invariant metric is induced from the metric −ds2 = dx2

0 − dx21 −

dx22 in this Lorentz 3-space.

Page 214: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

200 Riemannian symmetric spaces

c. Use coordinates x1, x2 to parameterize the points in H 22 , and show

ds2 =

(dx1 dx2

) [1 + x2

2 −x1x2

−x1x2 1 + x21

] (dx1

dx2

)1 + x2

1 + x22

d. Show that the invariant measure is

dµ = dx1dx2√1 + x2

1 + x22

e. Introduce polar coordinates (r, θ ), x1 = r cos(θ ), x2 = r sin(θ ). Show that

ds2 =

(dr dθ

) [ 11+r2 0

0 r2

] (drdθ

)1 + r2

dµ = r dr dθ√1 + r2

f. Determine the action of a group operation in SO(1, 2) on the point (x1, x2) ∈ H 22 .

7. The metric on a pseudo-Riemannian symmetric space is gi j (x).a. Show that the generators of infinitesimal rotations at a point are Xrs = grt x t∂s −

gst x t∂r .b. Show [Xab, �] = 0, where � = Gab;rs Xab Xrs is the Laplace–Beltrami operator

on this space, Gab;rs = tr {def(Xab)def(Xrs)}, and Gab;rs is the inverse of Gab;rs .c. Show that � consists of terms that are both quadratic and linear in the operators

∂r , and that

� = grs∂r∂s − grs� trs ∂t

The function � trs is not a tensor. The components of the Christoffel symbol are

given by

� trs = 1

2gtu (∂s gru + ∂r gsu − ∂u grs)

8. Use radial coordinates (r, φ2, φ3, . . . , φn) on the sphere Sn ⊂ Rn+1.a. Show the invariant volume element is

dV =√

‖ g ‖rn−1 sinn−2 φ2 sinn−3 φ3 · · · sin1 φn−1 sin0 φn

dr ∧ dφ2 ∧ dφ3 ∧ · · · ∧ dφn

b. Show that the second order Laplace–Beltrami operator is

� = 1√‖ g ‖∂µ

√‖ g ‖gµν∂ν where ∂ν = ∂/∂φµ

Page 215: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.9 Problems 201

c. Compare this with the second order Casimir operator for SO(n + 1):

C2 [SO(n + 1)/SO(n)] =n+1∑

1≤r<s

X2r,s(φ)

d. Show that the Laplace–Beltrami operators on a sphere can be written recursively:

�(Sn) = ∂n( f1(φ)∂n) + f2(φ)�(Sn−1)

Compute f1(φ) and f2(φ).

9. A quantum system with n degrees of freedom is described by a hamiltonian thatis a linear superposition of the bilinear products a†

i a j (H = hi j (t)a†i a j , 1 ≤ i, j ≤

n), so that iH is a time-dependent element in the Lie algebra u(n). Assume thesystem is initially in its ground state. Show that it evolves into a coherent state whosetrajectory exists in the rank one symmetric space SU (n)/U (n − 1). Write down thecoherent state parameters explicitly for a two-level system, and relate the coherentstate parameters to the forcing terms in the hamiltonian.

10. Conformal group The inner product on an n-dimensional linear vector space V (n)

is defined by (x, x)m = mi j xi x j . Define coordinates y in an n + 2 dimensional linearvector space W (n+2) as follows

yi = sxi (1 ≤ i ≤ n)yn+1 = syn+2 = s(x, x)m

and define an inner product M in this space by

M =

mi j

0 − 12

− 12 0

a. Show (y, y)M = Mµν yµyν = (sx, sx)m − 22 s[s(x, x)m] = 0.

b. If m is positive definite and Lie group G preserves inner products in V (n), thenG = O(n).

c. Show that the Lie group H that preserves inner products in W (n+2) is O(n + 1, 1).d. If the metric m has signature n1, n2 (n1 + n2 = n), show that G = O(n1, n2) and

H = O(n1 + 1, n2 + 1).e. H is called a conformal group because it preserves angles. Show this.f. Construct the quotient space SO(n1 + 1, n2 + 1)/SO(n1, n2).g. Under a conformal transformation y → y′ and x → x ′. Show x ′i = y′i/y′n+1.h. The Lorentz metric (+1, −1, −1, −1) leaves the four-momentum invariant:

E2 − (pc)2 = (mc2)2

Show that the conformal group on space-time is SO(4, 2).

Page 216: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

202 Riemannian symmetric spaces

i. Show that the infinitesimal generators of the conformal group are

Lµν = xµ∂ν − xν∂µ

Pµ = ∂µ

Kµ = 2xµ(xν∂ν) − (xνxν)∂µ = 2xµ(x, ∂) − (x, x)∂µ

S = xν∂ν

The operators Lµν are the infinitesimal generators of the Lorentz group SO(3, 1)and Pµ generate translations. Taken together Lµν and Pµ generate the Poincaregroup. The operator S generates dilations and the four operators Kµ generateconformal transformations. Above xµ = gµνxν .

j. Show that the additional operators satisfy the commutation relations

[Lµν, Kλ] = gνλKµ − gµλKν

[Lµν, S] = 0[Pµ, Kν] = 2(gµν S − Lµν)

[S, Pµ] = −Pµ [Pµ, Pν] = 0[S, Kµ] = +Kµ [Kµ, Kν] = 0

k. Show that ecµ Kµ (xν) = x ′ν = xν+cν (x,x)1+2(c,x)+(c,c)(x,x) .

l. Show that the conformal group SO(4, 2) is:• the largest group that leaves the free space (no sources) Maxwell equations form

invariant;• the largest group that maps the (bound, scattering, parabolic) states of the hydrogen

atom to themselves.m. Discuss the duality created by Pµ → P ′

µ = xµ and Kµ → K ′µ = 2(x, ∂)∂µ −

(∂, ∂)xµ.

11. The upper half of the complex plane has coordinates z = x + iy. This upper half-plane provides a well studied model for the hyperbolic plane when a suitable metric isplaced on it. The half-plane is mapped onto itself by linear fractional transformations

z → z′ = az + b

cz + d

[a bc d

]∈ SL(2; R), ad − bc = 1

This transformation group is called the projective special linear transformation groupand denoted P SL(2, R).a. Show that M, −M ∈ SL(2; R) generate identical transformations. The group

SL(2; R) is a two-fold covering group of P SL(2, R).b. Show

z′ = ac(x2 + y2) + (ad + bc)x + bd + iy

|cz + d|2

In particular, show that y′ > 0 if y > 0 and y′ = 0 if y = 0. The transformationmaps the upper half-plane onto the upper half-plane and its boundary, the real axis(y = 0), onto itself.

Page 217: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

12.9 Problems 203

c. Show that the metric

ds2 = [dx dy

] [1y 0

0 1y

] [dxdy

]= dz dz

y2

is invariant under these transformations.d. Show dz′ = dz/|cz + d|2e. Show that the invariant measure is dµ = dx dy/y2

f. Show that the distance between two points z1 and z2 is

s(z1, z2) = 2 tanh−1 |z1 − z2||z1 − z2| = log

{ |z1 − z2| + |z1 − z2||z1 − z2| − |z1 − z2|

}12. The unit disk in the complex plane w = x + iy consists of those points that satisfy

ww = x2 + y2 ≤ 1. The unit disk, with a suitable metric, provides a second represen-tation of the hyperbolic plane. The unit disk is mapped onto itself by linear fractionaltransformations

w → w′ = αw + β

βw + α

[α β

β α

]∈ SU (1, 1), αα − ββ = 1

a. Show that M, −M ∈ SU (1, 1) generate identical mappings of the unit disk intoitself.

b. Show that w = eiφ → w′ = eiψ . Compute ψ(φ).c. Show that the metric

ds2 = (dx dy

)

1

(1 − ww)20

01

(1 − ww)2

(

dxdy

)= dw dw

(1 − ww)2

is invariant under this group.d. Show that the invariant volume element is

dµ = dx dy

(1 − ww)2= dw dw

(1 − ww)2

e. Show that the distance between two points w1 and w2 in this unit disk is

s(w1, w2) = tanh−1

{ |w1 − w2||1 − w1w2|

}13. Show that the mapping from z in the upper half-plane to w in the unit disk given by

w = eiφ z − z0

z − z0

is conformal, that is, it preserves angles. Here z0 is any point in the upper half-plane.a. Compute the inverse of this mapping, and show that it maps the interior of the unit

disk unto the upper half of the complex plane and the boundary of the unit diskonto the real axis (boundary of the upper half-plane).

Page 218: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

204 Riemannian symmetric spaces

b. Choose z0 = i and eiφ = i to give the canonical map

w = i z + 1

z + i

c. Show that the matrices that generate the Mobius transformations of the upperhalf-plane and the unit disk are related by

S

[a bc d

]S−1 =

[α β

β α

]S = 1√

2

[1 −i−i 1

]d. Show that this transformation maps the invariant metric and measure on the upper

half-plane onto the invariant metric and measure on the unit disk.

Page 219: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13

Contraction

New Lie groups can be constructed from old by a process called groupcontraction. Contraction involves reparameterization of the Lie group’sparameter space in such a way that the group multiplication properties,or commutation relations in the Lie algebra, remain well defined even ina singular limit. In general, the properties of the original Lie group havewell-defined limits in the contracted Lie group. For example, the parame-ter space for the contracted group is well defined and noncompact. Otherproperties with well-defined limits include: Casimir operators; basis statesof representations; matrix elements of operators; and Baker–Campbell–Hausdorff formulas. Contraction provides limiting relations among thespecial functions of mathematical physics. We describe a particularlysimple class of contractions, the Inonu–Wigner contractions, and treatone example of a contraction not in this class.

13.1 Preliminaries

It is possible to construct new Lie algebras from old by a certain limiting processcalled contraction. In this process a new set of basis vectors Yr is related to theinitial set of basis vectors Xi through a parameter-dependent change of basis: Yr =M i

r (ε)Xi . The structure constants have the transformation properties of a tensor:C t

rs (ε) = M ir (ε)M j

s (ε)C ki j (M(ε)−1) t

k (cf. Eq. (4.22)). As long as the change ofbasis transformation is nonsingular the Lie algebra is unchanged.

If the transformation becomes singular, the structure constants C trs (ε) may

still converge to a well-defined limit. It is often the case that the structure con-stants

C trs (0) = lim

ε→0C t

rs (ε) (13.1)

exist and define a Lie algebra that is different from the original Lie algebra.

205

Page 220: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

206 Contraction

13.2 Inonu–Wigner contractions

If a Lie algebra g has a subalgebra h and a complementary subspace p with com-mutation relations of the form

g = h + p

[h, h] ⊆ h subalgebra[h, p] ⊆ p this is important[p, p] ⊆ h + p this is always true

(13.2)

then the Inonu–Wigner contraction of g → g′ involves the following change ofbasis transformation [

h′

p′

]=

[Idim(h) 0

0 ε Idim(p)

] [h

p

](13.3)

where dim(h) is the dimension of the subalgebra h. The commutation relations ofg′ are well defined for all values of ε, including the singular limit ε → 0:[

h′, h′] = [h, h] ⊆ h[h′, p′] = [h, εp] = ε [h, p] limε→0 εp → p′[p′, p′] = [εp, εp] = ε2 [p, p] limε→0 ε2(h + p) → 0

(13.4)

In the limit ε → 0 the contracted algebra g′ is the semidirect sum of the originalsubalgebra h and the subalgebra p′, where p′ is commutative and

[h, p′] ⊆ p′:

g = h + p p → p′ = εp g′ = h + p′

[h, h] ⊆ h −→ [h, h] ⊆ h

[h, p] ⊆ p[h, p′] ⊆ p′

[p, p] ⊆ h + p[p′, p′] = 0

(13.5)

13.3 Simple examples of Inonu–Wigner contractions

In this section we illustrate several facets of Inonu–Wigner contractions by con-tracting three different orthogonal groups.

13.3.1 The contraction SO(3) → ISO(2)

The infinitesimal generators of the Lie group SO(3) may be chosen as L1 = X23 =x2∂3 − x3∂2 = ε1 jk x j∂k , with L2 and L3 defined by cyclic permutation. The com-mutation relations are

[L1, L2] = −L3

[L2, L3] = −L1 (13.6)

[L3, L1] = −L2

Page 221: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.3 Simple examples of Inonu–Wigner contractions 207

Under contraction with respect to the subalgebra of rotations about the z-axis (in-finitesimal generator L3) the operators L1 and L2 go to[

εL1

εL2

]ε=1/R−→

[(1/R)L1

(1/R)L2

]→

[−P2

+P1

](13.7)

The commutation relations of the contracted algebra, I SO(2) = E(2), are

[L3, P1] = −P2

[L3, P2] = +P1 (13.8)

[P1, P2] = 0

The three operators L3, P1, P2 generate the group of Euclidean motions of the plane,E(2), or inhomogeneous orthogonal transformations in the plane R2, I SO(2). Thisgroup consists of rotations about the z-axis, generated by L3, and displacements ofthe origin in the x- and y-directions, generated by P1 = ∂1 and P2 = ∂2.

To verify this interpretation we can imagine the group SO(3) acting on the spherex2 + y2 + z2 = R2 in the neighborhood of the north pole (0, 0, R), as shown inFig. 13.1. An element in the Lie algebra so(3) can be written in the form

θ1L1 + θ2L2 + θ3L3 −→ (−d2)

(L1

R

)+ (+d1)

(L2

R

)+ θ3L3 (13.9)

In the limit R → ∞ we find

1

RL1 = 1

R(x2∂3 − x3∂2) = 1

R(y∂/∂z − R∂/∂y) → −∂/∂y = −∂2 = −P2

1

RL2 = 1

R(x3∂1 − x1∂3) = 1

R(R∂/∂x − x∂/∂z) → +∂/∂x = +∂1 = +P1

(13.10)

The contracted limits of the operators L1 and L2 in the limit of a sphere of verylarge radius are operators −P2, +P1 describing displacements in the −y and +xdirections. In addition, the parameters θ1, θ2 and d1, d2 are related by

d1 = +Rθ2

d2 = −Rθ1(13.11)

As the radius of the sphere becomes very large, the two angles θ1, θ2 become smallwith the product Rθi (i = 1, 2) approaching a well-defined limit. This correspondsto a rotation through an angle θ2 = d1/R about the y-axis producing a displacementof d1 in the x-direction, and a rotation through an angle θ1 = d2/R about the x-axisproducing a displacement of −d2 in the y-direction.

Page 222: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

208 Contraction

R2

−d2

+d1

x

y

R

qy

qx

qy

qx

Figure 13.1. Rotations on the surface of a sphere of radius R approach displace-ments in the plane as R → ∞.

The Casimir operator for the group SO(3) contracts to an invariant operator asfollows:

C2[SO(3)] = L21 + L2

2 + L33

C2[I SO(2)] = lim(1/R2) C2[SO(3)]

= lim[(L1/R)2 + (L2/R)2 + (L3/R)2]

= (−P2)2 + (+P1)2 + 0 = ∂2

∂y2+ ∂2

∂x2(13.12)

This is just the Laplacian operator on the plane R2.

13.3.2 The contraction SO(4) → ISO(3)

This group is similar to SO(3) and can be treated similarly. The six generators are

Li = εi jk x j∂k

Vi = xi∂4 − x4∂i 1 ≤ i �= j �= k ≤ 3(13.13)

Page 223: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.3 Simple examples of Inonu–Wigner contractions 209

The commutation relations are [Li , L j

] = −εi jk Lk[Li , Vj

] = −εi jk Vk (13.14)[Vi , Vj

] = −εi jk Lk

We contract with respect to the subgroup SO(3) generated by the angular momen-tum operators Li , defining

− Pi = limR→∞

1

RVi = lim

R→∞1

R(xi∂4 − x4∂i ) = −∂i (13.15)

The commutation relations of the contracted algebra are[Li , L j

] = −εi jk Lk[Li , Pj

] = −εi jk Pk (13.16)[Pi , Pj

] = 0

The operators Pi describe displacements in the x-, y-, and z-directions (i = 1, 2, 3).The contracted group is I SO(3), the Euclidean, or inhomogeneous orthogonalgroup, on R3.

As in the case SO(3) → I SO(2), we can contract the second order Casimiroperator of SO(4) to that of I SO(3)

C21 [I SO(3)] = lim

R→∞(1/R2)(L · L + V · V)

= limR→∞

[(L/R) · (L/R) + (V/R) · (V/R)]

= 0 + P · P = ∇2 = ∂2

∂x2+ ∂2

∂y2+ ∂2

∂z2(13.17)

As before, this is no surprise. The contracted operator is the Laplacian on R3. Whatis a surprise is that there is a second nontrivial invariant operator. For SO(4) this is(cf. Eq. (9.24))

C22 [SO(4)] = εi jkl Xi j Xkl → 8L · V (13.18)

The contracted limit of this operator is

C22 [I SO(3)]/8 = lim

R→∞(1/R)(L · V)

= limR→∞

[L · (V/R)] = −L · P (13.19)

The two invariant operators P · P = ∇2 and L · P = −L · ∇ form a complete set ofinvariant operators for the group I SO(3).

Page 224: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

210 Contraction

13.3.3 The contraction SO(4, 1) → ISO(3, 1)

The group I SO(3, 1) consists of proper Lorentz transformations [SO(3, 1)] thatleave invariant the quadratic form

x2 + y2 + z2 − (ct)2 (13.20)

as well as displacements of the origin in the three space-like directions and onetime-like direction. The inhomogeneous Lorentz group, or Poincare group, leavesinvariant space-time intervals

(x − x ′)2 + (y − y′)2 + (z − z′)2 − (ct − ct ′)2 (13.21)

This group can be contracted from either SO(4, 1) or SO(3, 2).We choose as infinitesimal generators for the group SO(4, 1) the operators

Xi j = xi∂ j − x j∂i = εi jk Lk 1 ≤ i, j, k ≤ 3 rotationsBi4 = xi∂4 + x4∂i 1 ≤ i ≤ 3 boostsTi5 = xi∂5 ± x5∂i i = 1, 2, 3 −sign space displacements

i = 4 +sign time displacements

(13.22)

This set of generators is contracted with respect to the subgroup SO(3, 1) generatedby rotations and boosts.

The second order Casimir invariant for SO(4, 1) and its contraction to the secondorder Casimir invariant for the Poincare group are

C2[SO(4, 1)] = L · L − B · B + T · T − T 245

C2[I SO(3, 1)] = 0 − 0 + ∇ · ∇ − 1

c2

∂2

∂t2

However, SO(4, 1) has a second Casimir operator, since it is a real form for therank-two root space B2. This is a fourth-degree operator that is derived by analyticcontinuation from the fourth order Casimir operator of SO(5) (cf. Eqs. (9.22) and(9.23))

C4[SO(5)] = W αWα

W α = εαβγµν Xβγ Xµν

(13.23)

where εαβγµν is the Levi–Civita symbol (antisymmetric tensor) on five symbols,and Wα is similarly defined. The contracted limit of W α is nonzero only if one ofthe four remaining symbols (e.g., ν) is 5:

limR→∞

(1/R)W α → limR→∞

εαβγµ5 Xβγ [(1/R)Xµ5]

= εαβγµ5 Xβγ (∂/∂xµ) (13.24)

Page 225: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.4 The contraction U (2) → H4 211

The four vector εαβγµ5 Xβγ (∂/∂xµ) is fairly complicated. Since W αWα is invariant,it is convenient to compute it for a particle of mass m in a frame in which the particleis at rest

Pµ = (0, 0, 0, mc) (13.25)

In this frame

W α = εαβγµ Xβγ mc = Lαmc (13.26)

Therefore the invariant is

W αWα = (L · L)(P · P) (13.27)

with P · P = ∑Pµ Pµ = −(mc)2.

It should be emphasized that if an operator is an invariant and its spectrum orinterpretation is desired, the operator should be viewed from the coordinate systemwhich most simplifies its determination (principle of maximum laziness).

13.4 The contraction U (2) → H4

In this section we consider a group contraction that is not of Inonu–Wigner type.This is the contraction of the compact unitary group U (2) to the solvable groupH4. This contraction relates the angular momentum operators to the single-modephoton operators. These are the infinitesimal generators of the groups U (2) and H4,respectively. This contraction leads to a number of useful relations that are exploredin successive sections.

13.4.1 Contraction of the algebra

The Lie algebra u(2) is spanned by infinitesimal generators J3, J±, J0 with com-mutation relations

[J3, J±] = ±J±[J+, J−] = 2J3 (13.28)

[J0, J] = 0

The operators h3, h±, h0 are related to J3, J±, J0 by the following change of basis

h+h−h3

h0

=

cc

1 12c2

1

J+J−J3

J0

(13.29)

Page 226: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

212 Contraction

These operators satisfy the following commutation relations

[h3, h±] = ±h±[h+, h−] = 2c2h3 − h0 (13.30)

[h0, h] = 0

In the limit c → 0 the change of basis transformation becomes singular but thecommutation relations (Eq. (13.30)) converge to a well-defined limit satisfied bythe single-mode photon operators

h3

h+h−h0

c→0−→

n + 12 I = 1

2

{a, a†}

a†

aI

(13.31)

13.4.2 Contraction of the Casimir operators

The group U (2) has rank two. Its two Casimir operators are of first and secondorder

C1 = J0

C2 = J 23 + 1

2 (J+ J− + J− J+)(13.32)

Under contraction J0 → h0 but the second Casimir operator has a more interestinglimit

limc→0

c2C2 = limc→0

c2

(h3 − 1

2c2h0

)2

+ 1

2[(cJ+)(cJ−) + (cJ−)(cJ+)]

= limc→0

c2h23 − 1

2(h3h0 + h0h3) + c2

(− 1

2c2h0

)2

+ 1

2[(h+)(h−) + (h−)(h+)] (13.33)

The operator (h0/2c)2 is proportional to the square of the first Casimir operator. Ittherefore commutes with all elements in the Lie algebra. Therefore the remaining setof operators on the right-hand side of (13.34) must also commute with all operatorsin the Lie algebra. In the limit c → 0, (ch3)2 → 0 and the remaining operators goto a well-defined limit

limc→0

c2C2[U (2)] − (h0/2c)2 → C2[H4] = −1

2

[(n + 1

2I

)I + I

(n + 1

2I

)]

+ 1

2(aa† + a†a) (13.34)

Page 227: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.4 The contraction U (2) → H4 213

This is a quadratic operator in the generators n + 12 I, a†, a, and I of H4. The value

of this operator in the standard Fock space spanned by the photon number states|0〉, |1〉, |2〉, . . . is zero. It is the other “invisible invariant” for H4.

13.4.3 Contraction of the parameter space

An arbitrary element in the Lie algebra u(2) and its counterpart in the algebra h4

with basis h3, h±, h0 is (Arecchi et al., 1972; Gilmore, 1974b)

iθµ Jµ = 1

2θe−iφ J+ − 1

2θe+iφ J− + iθ3 J3 + iθ0 J0

= θ

2ce−iφh+ − θ

2ce+iφh− + iθ3h3 + i

(θ0 − θ3

2c2

)h0 (13.35)

In the limit c → 0 the parameter θ must approach zero so that the limits

limc→0

+ θ2c e−iφ → +α

limc→0

− θ2c e+iφ → −α∗ (13.36)

exist. In addition, θ0 should diverge so that θ0 − θ3/2c2 remains well defined.

13.4.4 Contraction of representations

The action of the operators h3 on the angular momentum state |J, M〉 is

h3

∣∣∣∣ JM

⟩=

(J3 + 1

2c2J0

) ∣∣∣∣ JM

⟩=

(M + 1

2c2

)∣∣∣∣ JM

⟩(13.37)

It is useful to measure states from the “lowest” state |J, −J 〉 in the angular mo-mentum multiplet. The state with the quantum number M is the ground state ifM = −J , and the nth state when

n = J + M (13.38)

In order for the action of h3 on |J, M〉 to have a well-defined limit, we insist that

limc→0

(M + 1

2c2

)= lim

c→0

(n − J + 1

2c2

)(13.39)

be well defined. This is the case when we go through a sequence of larger and largerrepresentations J of dimension (2J + 1) as c becomes smaller and smaller. Specif-ically, we require c and J to be related by (Arecchi et al., 1972; Gilmore, 1974b)

limc→0

(−J + 1

2c2

)= 0 implies 2Jc2 = 1 (13.40)

Page 228: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

214 Contraction

In this case

limc → 0

J → ∞h3

∣∣∣∣ JM

⟩= n

∣∣∣∣ ∞n

⟩(13.41)

13.4.5 Contraction of basis states

The basis states |J, M〉 for an angular momentum multiplet are constructed byapplying the angular momentum shift up operator n = J + M times to the groundstate |J, −J 〉. These states are contracted to the harmonic oscillator states asfollows ∣∣∣∣ J

M = −J + n

⟩= (J+)n

[(2J )!n!/(2J − n)!]1/2

∣∣∣∣ J−J

⟩∣∣∣∣ ∞

n

⟩= lim

J→∞(cJ+)n

[(2Jc2)nn!]1/2

∣∣∣∣ ∞0

⟩(13.42)

= (a†)n

√n!

∣∣∣∣ ∞0

13.4.6 Contraction of matrix elements

The matrix elements of the angular momentum operators on the angular momentumbasis states contract readily to the matrix elements of the photon operators on theFock states

a†a

∣∣∣∣ ∞n

⟩= lim

c→0

(J3 + 1

2c2

) ∣∣∣∣ JM

= limc→0

[J + M +

(1

2c2− J

)] ∣∣∣∣ JM =n− J

⟩→ (n + 0)

∣∣∣∣ ∞n

⟩(13.43)

a†∣∣∣∣ ∞

n

⟩= lim

c→0cJ+

∣∣∣∣ JM

= limc→0

∣∣∣∣ JM + 1

⟩ √(J −M)(J + M + 1)c2 → √

n + 1

∣∣∣∣ ∞n+1

⟩(13.44)

a

∣∣∣∣ ∞n

⟩= lim

c→0cJ−

∣∣∣∣ JM

= limc→0

∣∣∣∣ JM − 1

⟩ √(J + M)(J − M + 1)c2 → √

n

∣∣∣∣ ∞n − 1

⟩(13.45)

Page 229: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.4 The contraction U (2) → H4 215

13.4.7 Contraction of BCH formulas

Baker–Campbell–Hausdorff formulas, which can easily be derived for U (2) in itsfaithful 2 × 2 matrix representation, can readily be contracted to BCH formulasfor H4, which can be derived with only a little more difficulty in its faithful 3 × 3matrix representation (cf. Eq. (7.36)). For example, the following BCH formula forU (2)

e(ζ J+−ζ ∗ J−) = eτ J+eln(1+τ ∗τ )J3e−τ ∗ J− ζ

|ζ | tan |ζ | = τ (13.46)

contracts under limc→0 ζ/c → α to the BCH formula for H4

e(αa†−α∗a) = eαa†e− 1

2 α∗α I e−α∗a α = limc→0

ζ/c (13.47)

13.4.8 Contraction of special functions

Special functions that are associated with the group SU (2) include Jacobi polyno-mials, the associated Legendre polynomials and spherical harmonics, and the Leg-endre polynomials. The special functions associated with the “harmonic oscillator”group H4 are the Hermite polynomials and the harmonic oscillator wavefunctions.One might reasonably expect that the Hermite polynomials and harmonic oscillatorwavefunctions are related to the Jacobi or associated Legendre polynomials in somecontraction limit. This is so.

The spherical harmonics Y lm(θ, φ) and associated Legendre polynomials

Plm(cos θ ) are related by (Arecchi et al., 1972; Gilmore, 1974b)

Y lm(θ, φ) = eimφ

√2π

Plm(cos θ ) Y l

−m(θ, φ) = (−)mY l+m(θ, φ)∗ (13.48)

The associated legendre polynomials are defined by

Plm(u) = (−)l+m 1

2l l!

√2l + 1

2

√(l − m)!

(l + m)!(1 − u2)+m/2 dl+m

dul+m(1 − u2)l (13.49)

These polynomials are contracted to harmonic oscillator wavefunctions under u →x/

√l and l + m = n:

limc→0

l−1/4 Plm(u = x/

√l)

= limc→0

(−)n

√(2l)!l1/2

2(2l)l!l!

√1

2nn!(2lc2)n

× [1 − 2c2x2](−1/2c2)/2 dn

dxn[1 − 2c2x2]1/2c2

(13.50)

Page 230: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

216 Contraction

The limit is taken as c → 0, l → ∞, l + m = n, 2lc2 = 1. The limit inside thefirst square root is 1/

√π , that within the second is (2nn!)−1. The result of this

contraction is

limc→0

l−1/4 Plm(u = x/

√l) = 1√

2nn!√

πex2/2

(− d

dx

)n

e−x2 = ψn(x) (13.51)

where ψn(x) is the appropriately normalized harmonic oscillator eigenfunction

ψn(x) = 1√2nn!

√π

Hn(x)e−x2/2 (13.52)

and Hn(x) is the nth Hermite polynomial.Under contraction the orthogonality relations obeyed by the associated Legen-

dre functions go over to the orthogonality relations for the harmonic oscillatoreigenfunctions

δmm ′ =∫ +1

−1Pl

m(u)Plm ′(u)du

→ liml→∞

∫ +√l

−√l

(1

l1/4Pl

m(x/√

l)

) (1

l1/4Pl

m ′(x/√

l)

)d(u

√l)

→∫ +∞

−∞ψn(x)ψn′(x)dx = δnn′ (13.53)

Unfortunately, it is not possible to derive the completeness relations for the har-monic oscillator eigenfunctions from the completeness relations for the Jacobi orassociated Legendre polynomials. However, there is a very simple and beautifulproof of the completeness relations for all special functions associated with compactLie groups. It is due to Wigner and Stone.

13.5 Conclusion

Contraction of groups to form inequivalent groups can be carried out whenever asingular change of basis can be constructed under which the structure constantshave a well-defined limit. Contraction is a particularly useful way to construct non-semisimple Lie groups from simple and semisimple Lie groups. The contractedgroup is always noncompact. Contraction of groups provides many useful relationsbetween the original group and its contracted limit. These involve the commutationrelations in the Lie algebra, the range of values in the parameter spaces that maponto the groups, the Casimir operators, the basis states of representations, opera-tor matrix elements, Baker–Campbell–Hausdorff formulas, and limiting relationsamong special functions. These relations have all been illustrated by example.

Page 231: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.6 Problems 217

13.6 Problems

1. Under the contraction SO(3) → I SO(2) the representations of SO(3) contract torepresentations of I SO(2). Since I SO(2) is a noncompact group it has no faith-ful finite-dimensional unitary representations. We therefore consider the followinglimit

lim a ↓ 0 a J± → P± a2l(l + 1) → p2 finite

l ↑ ∞ J3 → P3

∣∣∣∣ lm

⟩→

∣∣∣∣ pm

⟩(p/a)β = lβ = x finite

a. Compute the matrix elements of the operators P± in the algebra iso(2) and show⟨l

m ′

∣∣∣∣a J±

∣∣∣∣ lm

⟩lim−→

⟨p

m ′

∣∣∣∣P±

∣∣∣∣ pm

a√

(l ∓ m)(l ± m + 1) δm ′,m±1lim−→ p δm ′,m±1

b. Compute the contracted limit of the Jacobi polynomials and show that

lim Plmn (cos(x/ l)) = (−)m−n Jm−n(x)

where Jk(x) is the kth Bessel function (Arecchi et al., 1972; Gilmore, 1974b).c. Contract the spherical harmonics and show that

lim

√2π

lY l

m(β = x/ l) → Jm(x)

d. Contract the Legendre polynomials and show that

lim Pl (cos(β = x/ l)) → J0(x)

e. In the generating function expression

eα J+Y lm(θ, φ) =

∑k≥0

AlkY l

m+k(θ, φ) = Y lm(θ ′, φ′)

compute the coefficients Alk and the arguments θ ′, φ′ explicitly. Contract these

results to construct the classical generating functions for Bessel functions.f. Show that the operator L · L contracts to ∇2 in the plane.g. Show that the Casimir invariant operator for SO(3) becomes the Laplace–Beltrami

operator on S2 = SO(3)/SO(2) when restricted to the sphere surface, and thisoperator contracts to the Bessel equation.

2. Under the contraction u(2) → h4 the representations of the unitary group U (2) con-tract to representations of the noncompact Heisenberg group H4. Since H4 is non-compact it has no faithful finite-dimensional unitary irreducible representations. We

Page 232: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

218 Contraction

therefore contract through a series of representations of U (2) of ever increasingdimensions, as follows:

lim ε → ∞ ε J± → h± 2 jε2 → 1

j → +∞, m → −∞ J3 + 12ε2 → h3

∣∣∣∣ jm

⟩→

∣∣∣∣∞n⟩

j + m = n (finite) θ → π2 − √

2εx

a. Compute the matrix elements⟨j

m ′

∣∣∣∣ε J±

∣∣∣∣ jm

⟩lim−→

⟨∞n′

∣∣∣∣h±

∣∣∣∣∞n⟩

ε√

( j ∓ m)( j ± m + 1) δm ′,m±1lim−→

√n + 1 δn′,n+1√

n δn′,n−1

b. Contract the spherical harmonics and show

l1/4 Pln−l,0

2−

√2εx

)lim−→ ψn(x) = Nn Hn(x)e−x2/2

where ψn(x) is the nth excited state wavefunction for the harmonic oscillator,Hn(x) is the nth Hermite polynomial, and Nn is the usual normalization coefficient,Nn = 1/

√2nn!

√π .

c. Carry out steps c–f of the previous problem. The results are obtained by makingthe following replacements:

Bessel function → harmonic oscillator eigenfunctionBessel equation → Schrodinger equation for harmonic oscilator

3. Contract the Lie algebra su(2) spanned by J3, J± ([J3, J±] = ±J±, [J+, J−] = 2J3)with respect to the subalgebra J−. Use a simple Inonu–Wigner contraction toshow

limε→0 ε(2J3) → P P ′ = ∂x

limε→0 ε(J+) → T T ′ = ∂t

limε→0 (J−) → V V ′ = t∂x

Construct the commutation relations of the contracted operators and show that theoperators on the right (P ′, T ′, V ′) satisfy an isomorphic set of commutation relations.The operators ∂x , ∂t , t∂x generate the Galilean group in one dimension. Concludethat if the Lie algebra a1 is contracted with respect to one of its shift operators theGalilean algebra gal(1) results.

4. Contract SO(n + 1) with respect to the subgroup SO(n) and show how the invariantmetric and measure on the sphere Sn = SO(n + 1)/SO(n) reduce to the familiarmetric and measure on Rn = I SO(n)/SO(n).

Page 233: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

13.6 Problems 219

5. Disentangling formulas can also be contracted.a. Use the defining 2 × 2 matrix representation for su(2) to construct the disentan-

gling theorem

eζ J+−ζ ∗ J− = eτ J+elog(1+τ ∗τ )J3 e−τ ∗ J−

and show τ = (ζ/|ζ |)tan(|ζ |).b. Use a faithful matrix representation of the Lie algebra h4 to construct the disen-

tangling theorem

eαa†−α∗a = eαa†e− 1

2 α∗α I e−α∗a

c. Use the contraction relation Eq. (13.30) for u(2) → h4 to show that the u(2)disentangling theorem contracts to the h4 disentangling theorem in the limitα = limc→0 ζ/c.

6. Thermal expectation values of the operator X are constructed by taking the trace:〈X〉 = tr Xe−βH/tr e−βH, and a generating function for expectation values is 〈eαX 〉 =tr eαX e−βH/tr e−βH. When the operators X andH are elements in a finite dimensionalLie algebra these expectation values can often be computed rather simply.a. AssumeH = ε J3 and X is in the Lie algebra su(2). Show that in the 2 × 2 defining

matrix representation

eθ ·J →[

cosh(θ/2) + (θz/θ ) sinh(θ/2) (θx − iθy)/θ sinh(θ/2)(θx + iθy)/θ sinh(θ/2) cosh(θ/2) − (θz/θ ) sinh(θ/2)

]

e−βH →[

e−βε/2 00 e+βε/2

]

b. Show that the trace of this product is

2 cosh(θ/2) cosh(βε/2) − 2(θz/θ ) sinh(θ/2) sinh(βε/2) (= 2 cosh(ψ/2))

c. Show that in the 2 × 2 matrix representation with j = 12 and 2 j + 1 = 2,

〈eθ ·J 〉 = (sinh ψ/ sinh(ψ/2)) / (sinh βε/ sinh(βε/2))

d. Show that in the (2 j + 1) × (2 j + 1) dimensional representation,

〈eθ ·J 〉 = sinh((2 j + 1)ψ/2)/ sinh(ψ/2)

sinh((2 j + 1)βε/2)/ sinh(βε/2)

e. As j becomes large, show that this ratio simplifies to

〈eθ ·J 〉 j→∞−→ sinh( jψ)/ sinh( jβε)

f. Contract this generating function to the Heisenberg algebra.

Page 234: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

220 Contraction

7. One real form of D3 is the conformal group SO(4, 2).a. Write down the quadratic, cubic, and quartic Casimir operators for SO(4, 2). These

are analytic continuations of C2 = ∑i j X2

i j , C3 = εabcde f Xab Xcd Xef , and C4 =∑i j Y 2

i j , where Yi j = εi jcde f Xcd Xef of the group SO(6).b. Contract SO(4, 2) with respect to the subgroup SO(4) ⊗ SO(2).c. Construct the quadratic, cubic, and quartic Casimir operators of the contracted

group. These are analytic continuations of the contractions of the three operatorsof part a. If we define Ai = limε→0 εXi5 and Bi = limε→0 εXi6, then show thatthe Casimir operators contract to

C2 → A · A + B · B

C3 → εi jkl Xi j Ak Bl

C4 → ∑i j (ε

i jkl Ak Bl)2

In these expressions the indices range from 1 to 4.d. Write down the Laplace–Beltrami operators in the eight-dimensional spaces

SO(4, 2)/ [SO(4) ⊗ SO(2)] and I [SO(4) ⊗ SO(2)] / [SO(4) ⊗ SO(2)].

8. Riemannian symmetric spaces have been classified using the Cartan decompositionof simple Lie algebras:

g = h + p

[h, h] ⊆ h

[h, p] = p

[p, p] ⊆ h

Operators Xi span h and Xα span p.a. Show that the metric on p is

gα,β = C iα,γ C γ

β,i + C γ

α,i C iβ,γ

b. Show that in the contracted limit Yα = limε→0 εXα a metric tensor on p is welldefined by

g(p′)α,β = limε→0

(Yα, Yβ)/ε2 = (Xα, Xβ)

Use the structure constants to show this.c. Show that this metric is unchanged on the contracted space P ′ = G ′/H , as opposed

to the metric on P = G/H , which varies from place to place on the space.

Page 235: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14

Hydrogenic atoms

Many physical systems exhibit symmetry. When a symmetry exists it ispossible to use group theory to simplify both the treatment and the under-standing of the problem. Central two-body forces, such as the gravitationaland Coulomb interactions, give rise to systems exhibiting spherical sym-metry (two particles) or broken spherical symmetry (planetary systems).In this chapter we see how spherical symmetry has been used to probethe details of the hydrogen atom. We find a hierarchy of symmetries andsymmetry groups. At the most obvious level is the geometric symmetrygroup, SO(3), which describes invariance under rotations. At a less ob-vious level is the dynamical symmetry group, SO(4), which accounts forthe degeneracy of the levels in the hydrogen atom with the same princi-pal quantum number. At an even higher level are the spectrum generatinggroups, SO(4, 1) and SO(4, 2), which do not maintain energy degeneracyat all, but rather map any bound (scattering) state of the hydrogen atominto linear combinations of all bound (scattering) states. We begin witha description of the fundamental principles underlying the application ofgroup theory to the study of physical systems. These are the principle ofrelativity (Galileo) and the principle of equivalence (Einstein).

14.1 Introduction

Applications of group theory in physics start with two very important principles.These are Galileo’s principle of relativity (of observers) and Einstein’s principleof equivalence (of states). We show how these principles are used to establishthe standard framework for the application of geometric symmetry groups to thetreatment of quantum mechanical systems that possess some geometric symmetry.For the hydrogen atom the geometric symmetry group is SO(3) and one predictionis that states occur in multiplets with typical angular momentum degeneracy: 2l + 1.This is seen when we solve the Schrodinger and Klein–Gordon equations for thehydrogen atom – more specifically for the spinless electron in the Coulomb potentialof a proton.

221

Page 236: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

222 Hydrogenic atoms

Invariance of a hamiltonian under a group action implies degeneracy of the energyeigenvalues. It is observed that in the nonrelativistic case the energy degeneracy islarger than required by invariance under the rotation group SO(3). If we believethat the greater the symmetry, the greater the degeneracy, we would expect that theHamiltonian is invariant under a larger group than the geometric symmetry groupSO(3). The larger group is called a dynamical symmetry group. This groupis SO(4) for the hydrogen bound states. Its infinitesimal generators include thecomponents of two three-vectors: the angular momentum vector and the Laplace–Runge–Lenz vector.

When the dynamical symmetry is broken, as in the case of the Klein–Gordonequation, the classical orbit is a precessing ellipse and the bound states with a givenprinciple quantum number N are slightly split according to their orbital angularmomentum values l.

This suggests that we could look for even larger groups that do not pretend topreserve (geometric or dynamical) symmetry and do not maintain energy degen-eracy. In fact, they map any bound (scattering) state into linear combinations ofall other bound (scattering) states. Such groups exist. They are called spectrumgenerating groups. For the hydrogen atom the first spectrum generating groupthat was discovered was the deSitter group SO(4, 1). A larger spectrum generat-ing group is the conformal group SO(4, 2). We illustrate how spectrum generatinggroups have been used to construct eigenfunctions and energy eigenvalues. Wealso describe how analytic continuations between two qualitatively different typesof representations of a noncompact group lead to relations between the boundstate spectrum, on the one hand, and the phase shifts of scattering states, on theother.

14.2 Two important principles of physics

There are two principles of fundamental importance that allow group theory to beused in profoundly important ways in physics. These are the principle of relativityand the principle of equivalence. We give a brief statement of both using a variantof Dirac notation.

Principle of relativity (of observers) Two observers, S and S′, describe a physicalstate |ψ〉 in their respective coordinate systems. They describe the state by mathe-matical functions 〈S|ψ〉 and 〈S′|ψ〉. The two observers know the relation betweentheir coordinate systems. The mathematical prescription for transforming func-tions from one coordinate system to the other is 〈S′|S〉. The set of transformationsamong observers forms a group. If observer S′ wants to determine what observerS has seen, he applies the appropriate transformation, 〈S|S′〉, to his mathematical

Page 237: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.3 The wave equations 223

functions 〈S′|ψ〉 to determine how S has described the system:

〈S|ψ〉 = 〈S|S′〉 〈S′|ψ〉 (14.1)

The principle of relativity of observers is a statement that the functions determinedby S′ in this fashion are exactly the functions used by S to describe the state |ψ〉.Principle of equivalence (of states): Two observes S and S′ observe a system, asabove. If

the rest of the universe looks the same

to both S and S′, then S can use the mathematical functions 〈S′|ψ〉 written downby S′ to describe a new physical state |ψ ′〉

〈S|ψ ′〉 = 〈S′|ψ〉 (14.2)

and that state must exist.In this notation, the transformation of a hamiltonian under a group operation (for

example, a rotation in SO(3)) is expressed by 〈S′|H |S′〉 = 〈S′|S〉〈S|H |S〉〈S|S′〉,the invariance under the transformation 〈S′|S〉 is represented by 〈S′|H |S′〉 =〈S|H |S〉, and the existence of a 2pz state in a system with spherical symmetryimplies the existence (by the Principle of Equivalence) of 2px and 2py states, aswell as arbitrary linear combinations of these three states.

14.3 The wave equations

Schrodinger’s derivation of a wave equation for a particle of mass m began withthe relativistic dispersion relation for the free particle: pµ pµ = gµν pµ pν = (mc)2.In terms of the energy E and the three-momentum p this is

E2 − (pc)2 = (mc2)2 (14.3)

Interaction of a particle of charge q with the electromagnetic field is described by theprinciple of minimal electromagnetic coupling: pµ → πµ = pµ − (q/c)Aµ, wherethe four-vector potential A consists of the scalar potential � and the vector potentialA. These obey B = ∇ × A and E = −∇� − (1/c)(∂A/∂t). For an electron q =−e, where e is the charge on the proton, positive by convention. In the Coulombfield established by a proton, � = e/r and A = 0, so that E → E + e2/r . Herer is the proton–electron distance. The Schrodinger prescription for converting adispersion relation to a wave equation is to replace p → (�/ i)∇ and allow theresulting equation to act on a spacial function ψ(x). This prescription results in the

Page 238: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

224 Hydrogenic atoms

following wave equation, the Klein–Gordon equation:{E2 − (mc2)2 + 2E

(e2

r

)+

(e2

r

)2

− (−i�c∇)2

}ψ(x) = 0 (14.4)

This equation exhibits spherical symmetry in the sense that it is unchanged (in-variant) in form under rotations: 〈S′|H |S′〉 = 〈S|H |S〉, where 〈S′|S〉 ∈ SO(3).Schrodinger solved this equation, compared its predictions with the spectral en-ergy measurements on the hydrogen atom, was not convinced his theory was anygood, and buried this approach in his desk drawer.

Sometime later he reviewed this calculation and took its nonrelativistic limit.Since the binding energy is about 13.6 eV and the electron rest energy mc2 is about510 000 eV, it makes sense to write E = mc2 + W , where the principal part of therelativistic energy E is the electron rest energy and the nonrelativistic energy W isa small perturbation of either (� 0.0025%). Under this substitution, and neglectingterms of order (W + e2/r )2/mc2, we obtain the nonrelativistic form of Eq. (14.4),the Schrodinger equation:{

p · p2m

− e2

r− W

}ψ(x) =

{− �2

2m∇2 − e2

r− W

}ψ(x) = 0 (14.5)

Equation (14.4) is now known as the Klein–Gordon equation and its nonrelativisticlimit Eq. (14.5) is known as the Schrodinger equation, although the former wasderived by Schrodinger before he derived his namesake equation.

Remark Schrodinger began his quest for a theory of atomic physics withMaxwell’s equations, in particular, the eikonal form of these equations. It is nosurprise that his theory inherits key characteristics of electromagnetic theory: solu-tions that are amplitudes, the superposition principle for solutions, and interferenceeffects that come about by squaring amplitudes to obtain intensities. Had he startedfrom classical mechanics, there would be no amplitude-intensity relation and theonly superposition principle would have been the superposition of forces or theirpotentials. The elegant but forced relation between Poisson brackets and commuta-tor brackets ([A, B]/ i� = {A, B}) is an attempt to fit quantum mechanics into thestraitjacket of classical mechanics.

14.4 Quantization conditions

The standard approach to solving partial differential equations is to separatevariables. Since the two equations derived above have spherical symmetry, it isuseful to introduce spherical coordinates (r, θ, φ). In this coordinate system the

Page 239: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.4 Quantization conditions 225

Laplacian is

∇2 =(

1

r

∂rr

)2

+ L2(S2)

r2(14.6)

L2(S2) = 1

sin θ

∂θsin θ

∂θ+ 1

sin2 θ

∂2

∂φ2(14.7)

The second order differential operator L2(S2) is the Laplacian on the sphere S2. Itseigenfunctions are the spherical harmonics Y l

m(θ, φ) and its spectrum of eigenvaluesis L2(S2)Y l

m(θ, φ) = −l(l + 1)Y lm(θ, φ). The integers (l, m) satisfy l = 0, 1, 2, . . .

and −l ≤ m ≤ +l. The negative sign and discrete spectrum characteristically indi-cate that S2 is compact.

The partial differential equations (14.4) and (14.5) are reduced to ordinary dif-ferential equations by substituting the ansatz

ψ(r, θ, φ) → 1

rR(r )Y l

m(θ, φ) (14.8)

into these equations, replacing the angular part of the Laplacian by the eigenvalue−l(l + 1), and multiplying by r on the left. This gives the simple second orderordinary differential equation(

d2

dr2+ A

r2+ B

r+ C

)R(r ) = 0 (14.9)

The values of the coefficients A, B, C that are obtained for the Klein–Gordonequation and the Schrodinger equation are as follows:

Equation A B C

Klein–Gordon −l(l + 1) + (e2/�c)2 2Ee2/(�c)2 [E2 − (mc2)2]/(�c)2

Schrodinger −l(l + 1) 2me2/�2 2mW/�2

(14.10)

There is a standard procedure for solving simple ordinary differential equationsof the type presented in Eq. (14.9). This is the Frobenius method. The steps involvedin this method, and the result of each step, are summarized in Table 14.1.

The energy eigenvalues for the bound states of both the relativistic and non-relativistic problems are expressed in terms of the radial quantum number n =0, 1, 2, . . . and the angular momentum quantum number l = 0, 1, 2, . . ., mass mof the electron, or more precisely the reduced mass of the proton–electron pairm−1

red = m−1e + M−1

p , and the fine structure constant (Gabrielse et al., 2006)

α = e2

�c= 1

137.035 999 796(70)= 0.007 297 352 531 3(3 8) (14.11)

Page 240: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

226 Hydrogenic atoms

Table 14.1. Left column lists the steps followed in the Frobenius method forfinding the square-integrable solutions of simple ordinary differential equations,

the right column shows the result of applying the step to Eq. (14.9)

Procedure Result

1 Locate singularities 0, ∞2 Determine analytic behavior r → 0 : R � rγ , γ (γ − 1) + A = 0

at singular points r → ∞ : R � eλr , λ2 + C = 0

3 Keep only L2 solutions γ = 12 +

√( 1

2 )2 − A, λ = −√−C

4 Look for solutions with proper R = rγ eλr f (r )asymptotic behavior

5 Construct differential[(

r D2 + 2γ D) + (2λγ + B + 2λr D)

]f (r ) = 0

equation for f (r )

6 Construct recursion relation f j+1 = − 2λ( j + γ ) + B

j( j + 1) + 2γ ( j + 1)f j

7 Look at asymptotic behavior f � e−2λr if series does not terminate

� e+1λr if series does terminate (λ < 0)

8 Construct quantization condition 2λ(n + γ ) + B = 0 or

n + 1

2+

√( 1

2 )2 − A = B

2√−C

9 Construct explicit solutions E = mc2√1 + (α/N ′)2

, W = −1

2mc2α2 1

N 2

N ′ = n + 12 +

√(l + 1

2 )2 − α2, N = n + l + 1

This is a dimensionless ratio of three physical constants that are fundamental inthree “different” areas of physics: e (electromagnetism), � (quantum mechanics),and c (relativity). It is one of the most precisely measured of the physical constants.The bound state energy eigenvalues are

Klein–Gordon equation Schrodinger equation

E(n, l) = mc2√1 + (α/N ′)2

W (n, l) = −1

2mc2α2 1

N 2

N ′ = n + 1

2+

√(l + 1

2

)2

− α2 N = n + l + 1

(14.12)

Both the nonrelativistic and relativistic energies have been plotted in Fig. 14.1.The nonrelativistic energies for the hydrogen atom appear as the darker lines.The nonrelativistic energy has been normalized by dividing by the hydrogen atom

Page 241: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.5 Geometric symmetry SO(3) 227

40 1 2 3 5 6l, orbital angular momentum

−1.5

−1.25

−1

−0.75

−0.5

−0.25

0

Ene

rgy

/ NR

gro

und

stat

e bi

ndin

g en

ergy

Energy eigenvalues, H atomnonrelativistic (darker), relativistic (lighter, deeper)

Figure 14.1. Spectrum of the hydrogen atom, normalized by the energy of thenonrelativistic ground state. The nonrelativistic spectrum is darker. The relativis-tic spectrum has been computed for Z = 50. These energies are computed byreplacing α → Zα everywhere.

ground state energy |W1| = 12 mc2α2. These normalized energy levels decrease to

zero like 1/N 2, where N = n + l + 1 is the principal quantum number. The ener-gies are displayed as a function of the orbital angular momentum l. The relativisticenergies of the bound states for the proton–electron system converge to the rest en-ergy mc2 as N ′ increases. When this limit is removed these energies (also rescaledby dividing by 1

2 mc2α2) can be plotted on the same graph. At the resolution shown,the two sets of rescaled energies are indistinguishable. To illustrate the difference,we have instead computed and plotted the bound state spectrum for a single elec-tron in a potential with positive charge Z . The energies in this case are obtainedby the substitution α → Zα everywhere. The energies of these bound states havebeen renormalized by subtracting the limit mc2 and dividing by the nonrelativisticenergy for the same ion: 1

2 mc2(Zα)2. The energy difference between the 1s groundstates is pronounced; this difference decreases rapidly as the principal quantumnumber increases.

14.5 Geometric symmetry SO(3)

Symmetry implies degeneracy.To see this, assume gi ∈ G are group operations that leave a hamiltonian H

invariant (unchanged in form)

gi Hg−1i = H or gi H = Hgi (14.13)

Page 242: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

228 Hydrogenic atoms

When G is a group of geometric transformations the physical interpretation of thisequation is as follows. The hamiltonian H has the same form in two coordinatesystems that differ by the group operation gi . Under this condition, if |ψ〉 is aneigenstate of H with eigenvalue E , then gi |ψ〉 is also an eigenstate of H with thesame energy eigenvalue E . The demonstration is straightforward:

H (gi |ψ〉) = (Hgi )|ψ〉 = (gi H )|ψ〉 = gi (H |ψ〉) = gi (E |ψ〉) = E(gi |ψ〉)(14.14)

To illustrate this idea, assume that |ψ〉 = ψ2pz (x). A rotation by π/2 radians aboutthe y-axis maps this state to ψ2px (x) and a rotation by π/2 radians about the x-axismaps this state to −ψ2py (x). By invariance (of the hamiltonian) under the rotationgroup and the principle of equivalence, these new functions describe possible statesof the system, and these states must exist.

The rotation group O(3) leaves the hamiltonian of the hydrogen atom invari-ant in both the nonrelativistic and relativistic cases. In the nonrelativistic case,H = p · p/2m − e2/r . The scalar p · p = −�2∇2 is invariant under rotations, asis also the potential energy term −e2/r . Rotation operators can be expressed interms of the infinitesimal generators of rotations about axis i : εi jk x j∂k . These ge-ometric operators are proportional to the physical angular momentum operatorsLi = (r × p)i = (�/ i)εi jk x j∂k . Finite rotations can be expressed as exponentialsas follows:

R(θ ) = eεi jkθi x j ∂k = eiθ ·L/� (14.15)

The angular momentum operators L = r × p share the same commutation relationsas the infinitesimal generators of rotations r × ∇, up to the proportionality factor�/ i . The commutation relations are[

Li , L j] = i�εi jk Lk (14.16)

It is useful to construct linear combinations of these operators that have canonicalcommutation relations of the type described in Chapter 10. To this end we define theraising (L+) and lowering (L−) operators by L± = Lx ± i L y . The commutationrelations are

[Lz, L±] = ±�L± (14.17)

[L+, L−] = 2�Lz (14.18)

These angular momentum operators are related to the two boson operators asfollows: Lz = � 1

2 (a†1a1 − a†

2a2), L+ = �a†1a2, L− = �a†

2a1. As a result, the angu-lar momentum operators have matrix representations with basis vectors |n1 n2 〉= | j

m 〉, with n1 = 0, 1, 2, . . . , n2 = 0, 1, 2, . . . , n1 + n2 = 2 j , n1 − n2 = 2m,

Page 243: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.5 Geometric symmetry SO(3) 229

− j ≤ m ≤ + j . These basis vectors describe the finite-dimensional irreduciblerepresentations of the covering group SU (2) of SO(3). The subset of represen-tations with j = l (integer) describes representations of SO(3).

To see this we construct a coordinate representation of the angular momentumoperators. In spherical coordinates ((x, y, z) → (r, θ, φ) with x = r sin θ cos φ)these operators are

Lz = �

i

∂φ

L± = �

(± ∂

∂θ+ i

cos θ

sin θ

∂φ

) (14.19)

The functions on R3 that transform under the angular momentum operators can beconstructed from the mixed basis argument:

〈θφ|L−

∣∣∣∣ lm

⟩↓ ↓

〈θφ|L−|θ ′φ′〉 〈θ ′φ′∣∣∣∣ lm

⟩= 〈θφ

∣∣∣∣ l ′

m ′

⟩ ⟨l ′

m ′

∣∣∣∣L−

∣∣∣∣ lm

⟩ (14.20)

As usual, the intermediate arguments (with primes) are dummy arguments thatare summed or integrated over. The symbols in Eq. (14.20) have the followingmeanings.

〈θφ|L−|θ ′φ′〉 Matrix element of the angular momentum shift down operatorin the coordinate representation: �(−∂/∂θ + i(cos θ/sin θ )(∂/∂φ)) δ(cos θ ′ − cos θ )δ(φ′ − φ).⟨

l ′m ′ |L−| l

m

⟩Matrix element of the angular momentum shift down operatorin the algebraic representation: �

√(l ′ − m ′)(l + m)

δl ′l δm ′,m−1.〈θφ| l

m

⟩Matrix element of the similarity transformation between thecoordinate representation and algebraic representation. Alsocalled spherical harmonic: Y l

m(θ, φ).

This relation can be used to show that there are no geometric functions associatedwith values of the quantum number j that are half integral. It can also be used toconstruct the extremal function Y l

−l(θ, φ) by solving the equation L−Y l−l(θ, φ) = 0

in the coordinate representation (Problem 14.12). Finally, the action of the shiftup operators can be used to constuct the remaining functions Y l

m(θ, φ) through therecursion relation involving both the coordinate and the algebraic representations

Page 244: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

230 Hydrogenic atoms

Table 14.2. Spherical harmonics Y lm(θ, φ) for low values of l and m

m l = 0 l = 1 l = 2 l = 3

0√

14π

√3

4πcos θ

√5

16π(3 cos2 θ − 1)

√7

16π(5 cos3 θ − 3 cos θ )

±1 ∓√

38π

sin θ e±iφ ∓√

158π

cos θ sin θ e±iφ ∓√

2164π

sin θ (5 cos2 θ − 1) e±iφ

±2√

1532π

sin2 θ e±2iφ√

10532π

sin2 θ cos θ e±2iφ

±3 ∓√

3564π

sin3 θ e±3iφ

of the shift up operator L+

L+Y lm(θ, φ) = Y l

m+1(θ, φ)√

(l − m)(l + m + 1) (14.21)

The lowest spherical harmonics (l = 0, 1, 2, 3) are collected in Table 14.2.

Remark The spectrum of the Casimir invariant for the rotation group SO(3), ormore specifically the Laplace–Beltrami operator constructed from its infinitesimalgenerators acting on the sphere parameterized by coordinates (θ, φ), is −l(l + 1),l = 0, 1, 2, . . . . The fact that the spectrum is negative means that the space, S2, onwhich these operators act, is compact. By the same token, the spectrum of the squareof the angular momentum operator, L · L, is �2l(l + 1). This means physically thatthe inner product of the angular momentum operator with itself is never negative,and is quantized by integer angular momentum values, measured in units of Planck’sconstant �.

14.6 Dynamical symmetry SO(4)

Symmetry implies degeneracy.The greater the symmetry, the greater the degeneracy.The states of the nonrelativistic hydrogen atom with fixed principal quantum

number N = n + l + 1 are degenerate, with energy EN = − 12 mc2α2 1

N 2 . There are∑l=N−1l=0 (2l + 1) = N 2 states with this energy. This N 2-fold degeneracy is larger

than the 2l + 1-fold degeneracy required by rotational invariance of the hamiltonian.If we believe the converse, that degeneracy implies symmetry, then we might beled to expect that the hydrogen atom exhibits more symmetry than meets the eye.

In fact this symmetry, called a dynamical symmetry (Schiff, 1968), exists and isrelated to a constant of motion that is peculiar to 1/r2 force laws. This constant ofmotion is known as the Laplace–Runge–Lenz vector. It is a constant of unperturbedplanetary motion, for which the force law has the form dp/dt = −K r/r3, where

Page 245: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.6 Dynamical symmetry SO(4) 231

K = G Mm, G is the universal gravitational constant, M and m are the two attractingmasses, and r = x i + y j + zk is the vector from one mass to the other. The timederivative of the vector p × L is

d

dt(p × L) = dp

dt×L + p×dL

dt↓ ↓

= −Krr3

×(r × m r) + 0

= −mKr(r · r) − r(r · r)

r3= mK

d

dt

(rr

)(14.22)

In going from the first line in Eq. (14.22) to the second, we use the fact that L isa constant of motion in any spherically symmetric potential. We also use the forcelaw for a 1/r potential. In going from the second line to the third, we express thecross product r × L in terms of (generally) nonparallel vectors r and r. We alsouse the identity (d/dt)(r/r ) = r/r − (r · r) r/r3. The result is that the Laplace–Runge–Lenz vector M is a constant of motion: dM/dt = 0, where

M = p × Lm

− Krr

(14.23)

In the transition from classical to quantum mechanics the operator obtained fromthe classical operator in Eq. (14.23) is not hermitian. Pauli (1926) symmetrized itproperly, defining the hermitian quantum mechanical operator

M = p × L − L × p2m

− Krr

(14.24)

where the ˆ over the classical symbol indicates a quantum mechanical operator. Wewill dispense with the ˆ over operators, in part to simplify notation, in part to preventuncertainties in interpretation of the operator r.

The hermitian operator M in Eq. (14.24) is a constant of motion, as it commuteswith the nonrelativistic hamiltonian: [H, M] = 0. The six operators Li , M j obeythe following commutation relations[

Li , L j] = i�εi jk Lk[

Li , M j] = i�εi jk Mk (14.25)[

Mi , M j] =

(−2H

m

)i�εi jk Lk

These are the commutation relations for the Lie algebra of the group SO(4) forbound states (E < 0) or SO(3, 1) for excited states (E > 0). The operators L and

Page 246: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

232 Hydrogenic atoms

M also obey

L · M = M · L = 0

M · M = 2H

m

(L · L + �2

) + K 2(14.26)

In order to simplify the discussion to follow, and make this discussion as in-dependent of the principal quantum number N as possible, we renormalize theLaplace–Runge–Lenz vector by a scale factor as follows: M′ = (−m/2H )1/2M.(For E > 0 change − → + and SO(4) → SO(3, 1).) The commutation relationsof these operators are now

[Li , L j ] = i�εi jk Lk

[Li , M ′j ] = i�εi jk M ′

k (14.27)

[M ′i , M ′

j ] = i�εi jk Lk

The Lie algebra so(4) is the direct sum of two Lie algebras of type so(3) (seeFigs. 10.3, 10.8(b)). It is useful to introduce two vector operators A and B asfollows

A = 12 (L + M′)

B = 12 (L − M′)

(14.28)

The operators A and B have angular momentum commutation relations. Further,they mutually commute. Finally, their squares have the same spectrum.

It is useful at this point to introduce the Schwinger representation for the angularmomentum operators A in terms of two independent boson modes: A3 = 1

2 (a†1a1 −

a†2a2), A+ = a†

1a2, A− = a†2a1 (for simplicity, set � → 1). A similar representation

of the angular momentum operators B in terms of two independent boson operatorsb1, b2 and their creation operators is also introduced.

Basis states for a representation of the algebra spanned by the operators A have theform |p1, p2〉, with p1 + p2 = 2 ja constant and p1 − p2 = ma . The 2 ja + 1 basisstates correspond to p1 = 2 ja, p2 = 0; p1 = 2 ja − 1, p2 = 1; etc. For B the basisstates are |q1, q2〉, with q1 + q2 = 2 jb constant and q1 − q2 = mb. The invariantoperators are A · A = ja( ja + 1) and B · B = jb( jb + 1). Since A · A = B · B (cf.Problem 14.15), ja = jb and the set of states related by the shift operators is (2 j +1)2 fold degenerate, where 2 j + 1 = N = n + l + 1.

States with good l and m quantum numbers can be constructed from these statesusing Clebsch-Gordon coefficients:∣∣∣∣ l

m

⟩=

∣∣∣∣ j/2 j/2ma mb

⟩⟨j/2 j/2ma mb

∣∣∣∣ lm

⟩(14.29)

Page 247: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.7 Relation with dynamics in four dimensions 233

The action of the Laplace–Runge–Lenz shift operators on these states, and thespherical harmonics, is determined in a straightforward way. For example, M ′

+ =A+ − B+ = a†

1a2 − b†1b2, so that

M ′+Y l

m = 〈θφ|(∣∣∣∣ j/2 j/2

ma + 1 mb

⟩⟨j/2 j/2ma mb

∣∣∣∣ lm

⟩×

√( j/2 − ma)( j/2 + ma + 1)

−∣∣∣∣ j/2 j/2

ma mb + 1

⟩⟨j/2 j/2ma mb

∣∣∣∣ lm

⟩×

√( j/2 − mb)( j/2 + mb + 1)

)(14.30)

In general, the Laplace–Runge–Lenz operators shift the values of l and m by ±1 or0, while the angular momentum shift operators change only m by ±1. However, forcertain stretched values of the Clebsch–Gordon coefficients, the Laplace–Runge–Lenz vectors act more simply, for example (Burkhardt and Leventhal, 2004)

M ′z

∣∣∣∣N l±l

⟩= D1

∣∣∣∣N l + 1±l

⟩D1 = 1

N

√N 2 − (l + 1)2

2l + 3

M ′±

∣∣∣∣N l±l

⟩= ±D2

∣∣∣∣N l + 1±(l + 1)

⟩D2 = 1

N

√2l + 2

2l + 3

[N 2 − (l + 1)2

] (14.31)

14.7 Relation with dynamics in four dimensions

The operators L and M′ are infinitesimal generators for the orthogonal group SO(4).The relation between motion in the presence of a Coulomb or gravitational potentialand motion in four (mathematical) dimensions was clarified by Fock (1935). Motionof a particle in a 1/r potential is equivalent to motion of a free particle in the sphereS3 ⊂ R4.

It is useful first to establish an orthogonal coordinate system in R3. It is naturalto do this in terms of the constant physical vectors that are available. These includethe vectors L and M. Their cross product W = L × M is orthogonal to both andalso a constant of motion. These classical vectors obey:

L = r × p L · L = L2

M = p × Lm

− Krr

M · M = M2 = 2E

mL2 + K 2

W = pm

L2 − KL × r

rW · W = L2 M2

(14.32)

The particle moves in a plane perpendicular to the angular momentum vector L,since r · L = 0. The momentum vector moves in the same plane, since p · L = 0.

Page 248: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

234 Hydrogenic atoms

While r moves in an ellipse, the momentum vector moves on a circle. For simplicitywe choose the z-axis in the direction of L and the x- and y-axes in the directionsof M and W. In this coordinate system pz = 0, px = p · M/

√M · M and py =

p · W/√

W · W. The two nonzero components of the momentum vector are notindependent, but obey the constraint

p2x +

(py − mM

L

)2

=(

mK

L

)2

(14.33)

This is the equation of a circle in the plane containing the motion. As the particlemoves in the plane of motion on an elliptical orbit with one focus at the source, itsmomentum moves in the same plane on a circular orbit (radius mK/L) with thecenter displaced from the origin by m M/L .

The circle in R3 is lifted to a circle in S3 ⊂ R4 by a projective transformation.We extend coordinates from R3 to R4 as follows:

(x, y, z) ∈ R3 → (w, x, y, z) ∈ R4

(px , py, pz) ∈ R3 → (pw, px , py, pz) ∈ R4 (14.34)

With p0 = √−2E/m, define the unit vector u ∈ S3 ⊂ R4 by the projective trans-formation T :

u T= p · p − p20

p · p + p20

w + 2p0

p · p + p20

p (14.35)

Here w is a unit vector in R4 that is orthogonal to all vectors in the physical space R3.The transformation in Eq. (14.35) is a stereographic projection. It is invertible andpreserves angles (conformal). It is a simple matter to check that u is a unit vector.The circular trajectory in R3 (Eq. (14.33)) lifts to a circle in S3. Reversibly, circles inS3 project down to circles in the physical R3 space under the reverse transformation.

Rotations in SO(4) rigidly rotate the sphere S3 into itself. They rotate circles intocircles, which then project down to circular momentum trajectories in the physicalspace R3:

circle in R3 T−→ circle in S3 SO(4)−→ circle in S3 T −1−→ circle in R3 (14.36)

The subgroup SO(3) of rotations around the w axis acts only on the physical spaceR3. In this subgroup, the subgroup SO(2) of rotations around the L axis leavesL fixed and simply rotates M in the plane of motion. The coset representativesSO(3)/SO(2) act to reorient the plane of motion by rotating the angular momen-tum vector L while keeping the magnitude of M fixed. Rotations in the cosetSO(4)/SO(3) act to change the lengths of both L and M. All group operations inSO(4) keep p0 fixed. In this way the group SO(4) maps states with principal quan-tum number N into (linear combinations of) states with the same principal quantum

Page 249: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.8 DeSitter symmetry SO(4, 1) 235

number N . In short, SO(4) acts on the bound hydrogen atom states through unitaryirreducible representations of dimension N 2 = (n + l + 1)2.

14.8 DeSitter symmetry SO(4, 1)

The dynamical symmetry group SO(4) that rotates bound states to bound statesdoes not change their energy; the dynamical symmetry group SO(3, 1) that rotatesscattering states to scattering states does not change their energy either. It would benice to find a set of transformations that rescales the energy. If such a group couldbe found, it would be possible, for example, to map the 1s ground state into anyother bound state. Such a group exists: it is the deSitter group SO(4, 1) (Malkinand Man’ko, 1965; Ogievetskii and Polubarinov, 1960).

That such a group might exist is strongly suggested by the appearance of thehydrogen atom spectrum, as replotted in Fig. 14.2. In this figure we have multi-plied each energy eigenvalue by −N 3, where N is the principal quantum number.The rescaled energies have been plotted as a function of N (vertically) and orbitalangular momentum quantum number l (horizontally). In this format, the eigen-value spectrum bears a strong resemblance to the spectrum of states that supportsfinite-dimensional representations of su(2) (Fig. 6.1) and the infinite-dimensionalrepresentations of su(1, 1) (Fig. 11.2).

We begin with a group that preserves inner products in some N -dimensionallinear vector space: x′ = Mx, with M a transformation in the group and the innerproduct defined by (x, x)N = xt gx = xi gi j x j . As always, the metric-preservingcondition leads to Mt G M = G.

It is useful to define a new N -vector y as a scaled version of the original vector:y = λx. We introduce two additional coordinates by defining z1 = λ and z2 =λ(x, x)N . With these definitions we find the conformal condition

(y, y)N − z1z2 = (λx, λx)N − λ [λ(x, x)N ] = 0 (14.37)

The conformal condition defines an inner product in the N + 2 dimensional linearvector space that is nondiagonal in the coordinates y, z1, z2 but diagonal in thecoordinates y, yN+1, yN+2, with yN+1 = 1

2 (z1 + z2) and yN+2 = 12 (z1 − z2):

G

− 12

− 12

y

z1

z2

G

−1+1

y

yN+1

yN+2

(14.38)

The conformal condition Eq. (14.37) defines a cone in the enlarged N + 2 dimen-sional space. If the group that preserves the metric G in RN is SO(p, q), the group

Page 250: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

236 Hydrogenic atoms

43210 5 6

l, orbital angular momentum

0

1

2

3

4

5

6N

, pri

ncip

al q

uant

um n

umbe

r

−N3 × Energy of hydrogen atomequal spacing suggests algebra structure

Figure 14.2. Nonrelativistic spectrum of the hydrogen atom, replotted to empha-size the possibility of a dynamical algebra.

that preserves the metric in RN+2 is SO(p + 1, q + 1). We seek to construct aspherical or hyperbolic slice of this cone.

The connection with the Kepler problem is made as follows. The momenta p arelifted to the coordinates on a sphere S3 ⊂ R4 (E < 0) or a two-sheeted hyperboloidH 3 ⊂ R4 (E > 0) by the following projective transformations:

u =12 (p2

0 − p · p)12 (p2

0 + p · p)w + p0p

12 (p2

0 + p · p)E < 0

u =12 (p2

0 + p · p)12 (p2

0 − p · p)w + p0p

12 (p2

0 − p · p)E > 0

(14.39)

For the four-vectors u the metric G that appears in Eq. (14.38) is determined fromthe denominators in Eq. (14.39):

ut Gu = u20 ±

3∑i=1

u2i

+ for E < 0− for E > 0

(14.40)

The algebraic surfaces on which the projective vector u lies is defined by thecondition ut Gu = 1.

Page 251: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.8 DeSitter symmetry SO(4, 1) 237

The connection with the conformal transformations introduced above is as fol-lows. The group that leaves invariant the conformal metric diag(1, ±I3, −1, +1) isSO(5, 1) for E < 0 and SO(2, 4) for E > 0. On the surfaces (sphere, hyperboloid)the condition ut Gu = 1 is satisfied, so that z1 = z2, y4 = λ and y5 = 0 (the sixcoordinates are labeled (y0, y = λu, y4 = 1

2 (z1 + z2), y5 = 12 (z1 − z2)). Transfor-

mations that map the algebraic surface to itself must map y5 = 0 to y5 = 0. It is asimple matter to verify that this is the matrix subgroup of the 6 × 6 matrix groupSO(5, 1) or SO(2, 4) of the form [ M 0

0 1 ], with M a 5 × 5 matrix that preserves themetric diag(1, ±I3, −1) in R5. This is SO(4, 1) for E < 0 and SO(1, 4) for E > 0.

It remains to show that this group maps these algebraic surfaces into themselves.To this end we write the linear transformation in R5 as follows[

λuλ

]′=

[A BC D

] [λuλ

](14.41)

where A is a 4 × 4 matrix, etc. From this we determine

u′ = A(λu) + Bλ

C(λu) + Dλ(14.42)

The inner product of u′ with itself satisfies

(u′)t Gu′ − 1 = (Aλu + Bλ)t G(Aλu + Bλ) − (Cλu + Dλ)t (Cλu + Dλ)

(Cλu + Dλ)t (Cλu + Dλ)(14.43)

By using the relations among the submatrices required by the metric preservingcondition (e.g., At G A − CtC = G, etc.) it is a simple matter to show that thisreduces to

(u′, u′)N − 1 = (u, u)N − 1

(Cu + D)t (Cu + D)(14.44)

In short, the algebraic surface is invariant under this transformation group.

Remark The subgroup SO(4) rigidly rotates the sphere S3 ⊂ R4 into itself whilethe subgroup SO(3, 1) “rigidly rotates” the hyperboloid into itself. In the lattercase this is less intuitive. This means that the coordinates of the hyperboloid aremapped into themselves by a linear transformation in R4. The group SO(4, 1) mapscoordinates in these spaces to themselves through a nonlinear transformation in R4:in this case a simple projective transformation. It is a linear transformation in R5.

The infinitesimal generators of this nonlinear transformation are constructed asfollows (Bander and Itzykson, 1966a, 1966b). For E < 0 introduce a four-vector uas usual (u0 → u4)

u = 2p4(p · p + p24)−1p

u4 = (p · p − p24)(p · p + p2

4)−1 (14.45)

Page 252: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

238 Hydrogenic atoms

Define the four-vector B in terms of the four-vector u and the angular momentumvector L and the scaled (by 1/

√2m|E |) Runge–Lenz vector M′ as follows:

B = M′u4 + L × u − 32 iu = i

2

[u, L2 + M′2

]B4 = M′ · u + 3

2 iu = i2

[u4, L2 + M′2

]The operators Li , M ′

i , and Bµ are the infinitesimal generators of SO(4, 1) as follows,for E < 0.

0 L3 −L2 M1 B1

−L3 0 L1 M2 B2

L2 −L1 0 M3 B3

−M1 −M2 −M3 0 B4

B1 B2 B3 B4 0

++++−

14.9 Conformal symmetry SO(4, 2)

The largest set of transformations that leave the states of the hydrogen atom invari-ant, in some sense, is the conformal group SO(4, 2). Several different ways havebeen developed to prove this point. We review three here.

14.9.1 Schwinger representation

The algebra of the dynamical symmetry group has infinitesimal generators L andM′. Their linear combinations given two sets of vector operators A and B thatmutually commute and have angular momentum commutation relations on boundstates. It is possible to represent these operators using the boson representation.That is, for the operators A we introduce annihilation and creation operators ai , a†

j

for two independent modes, and similarly we introduce operators bi , b†j to describe

B. Basis states on which these operators act have the form |m1, m2; n1, n2〉 where,for example

a†1a2|m1, m2; n1, n2〉 = |m1 + 1, m2 − 1; n1n2〉

√m1 + 1

√m2

b†1b1|m1, m2; n1, n2〉 = |m1, m2; n1n2〉(√n1)2

The orthogonality of L and M leads to the orthogonality of A and B, and this leadsdirectly to the condition ja = jb, where ja = 1

2 (m1 + m2) and jb = 12 (n1 + n2).

From the previous section we know there is a group that maps bound states into(linear combinations of) bound states. We determine an algebra of operators thatperforms the same function on bound states as follows. Operators that change the

Page 253: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.9 Conformal symmetry SO(4, 2) 239

principal quantum number N = 2 ja + 1 = 2 jb + 1 = ( ja + jb) + 1 must changeja = jb. Operators that change ja have the form a†

i or a†i a†

j , but they do not simul-taneously change jb. Only operators that simultaneously add or subtract one exci-tation to the subsystems A and B simultaneously maintain the constraint ja = jb.The largest set of operators bilinear in the boson operators that map hydrogen atombound states to bound states consists of the operators

operators a†i a j b†

i b j a†i b†

j ai b j

subalgebra u(2) u(2)number 4 4 4 4

(14.46)

What is this algebra? Among these 16 operators, the maximal number of mutuallycommuting operators that can be found is four. These are conveniently chosen as thenumber operators for the four boson modes: (H1, H2, H3, H4) = (a†

1a1, a†2a2, b†

1b1,

b†2b2). The remaining twelve operators have eigenoperator commutation relations

with this set:

a†1a2 (+1, −1, 0, 0) a†

1b†1 (+1, 0, +1, 0) a1b1 (−1, 0, −1, 0)

a†2a1 (−1, +1, 0, 0) a†

1b†2 (+1, 0, 0, +1) a1b2 (−1, 0, 0, −1)

b†1b2 (0, 0 + 1, −1) a†

2b†1 (0, +1, +1, 0) a2b1 (0, −1, −1, 0)

b†2b1 (0, 0, −1, +1) a†

2b†2 (0, +1, 0, +1) a2b2 (0, −1, 0, −1)

(14.47)

All these roots have equal length, and inner products among these roots are all ± 12

or 0. The operator

(a†1a1 + a†

2a2) − (b†1b1 + b†

2b2)

commutes with all operators in this set. It is a constant of motion, and in factvanishes on all hydrogen atom bound states. As a result the algebra is the directsum of an abelian invariant subalgebra spanned by this operator, and a rank-threesimple Lie algebra, all of whose roots have equal lengths and are either orthogonalor make angles of π/4 or 3π/4 radians with each other. The algebra is uniquely areal form of A3 = D3.

Which real form? It is possible to form a number of subalgebras of type A1 fromthese operators:

a†1a2 a†

2a112 (a†

1a1 − a†2a2) su(2)

b†1b2 b†

2b112 (b†

1b1 − b†2b2) su(2)

a†i b†

j ai b j12 (a†

i ai + b†j b j + 1) su(1, 1)

Page 254: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

240 Hydrogenic atoms

The first two are compact, the last four are not compact. The maximal compactsubalgebra is spanned by the two compact subalgebras together with the diag-onal operator a†

1a1 + a†2a2 + b†

1b1 + b†2b2. This is the algebra so(4) + so(2). The

fifteen-dimensional Lie algebra that maps bound states to bound states is thereforeso(4, 2) = su(2, 2). This is the conformal algebra.

14.9.2 Dynamical mappings

Although the classical Kepler problem is analytically solvable, analyticity disap-pears under perturbation. In this case classical orbits must be computed numerically.At points of very close approach the velocity of the particles increases greatly, so it isprudent to slow down the integration time step to preserve accuracy. This procedurehas been implemented formally through a canonical transformation (Kustaanheimoand Stiefel, 1965; Stiefel and Scheifele, 1971), and is now widely known as theKustaanheimo–Stiefel transformation. Under this transformation time is stretchedout when the distance R between the interacting particles becomes small. In addi-tion the (relative) coordinates are projected from R3 to a fictitious space R4. Underthis transformation, and a constraint, the Kepler hamiltonian is transformed into afour-dimensional harmonic oscillator hamiltonian.

Coordinates (q1, q2, q3, q4) in the fictitions space R4 are related to coordinates(Q1, Q2, Q3) in the real space by the 4 × 4 transformation

Q1

Q2

Q3

Q4

= MK S

q1

q2

q3

q4

=

q1 −q2 −q3 q4

q2 q1 −q4 −q3

q3 q4 q1 q2

q4 −q3 q2 −q1

q1

q2

q3

q4

(14.48)

The transformation is constructed so that the “fourth” real coordinate Q4 is identi-cally zero. This transformation is invertible provided q2

1 + q22 + q2

3 + q24 �= 0. The

distance R =√

Q21 + Q2

2 + Q23 in R3 and the distance q =

√q2

1 + q22 + q2

3 + q24

in R4 are related by R = q2.The other half of the canonical transformation, involving the momenta in the

real and fictitious spaces, is

(P1, P2, P3, P4)t = 1

2RMK S(p1, p2, p3, p4)t

A constraint condition must be applied to force P4 = 0. This condition is

ζ = −2R P4 = (q1 p4 − q4 p1) + (q3 p2 − q2 p3) = 0 (14.49)

With this constraint we find P2 = P21 + P2

2 + P23 = (1/4R p2) − (ζ 2/4R2) →

(1/4R)(p21 + p2

2 + p23 + p2

4). With these transformations the hamiltonian in the

Page 255: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.9 Conformal symmetry SO(4, 2) 241

real space can be transformed to a hamiltonian in the fictitious space by

P2

2m− e2

R= E

×R−→ R P2

2m− e2 = E R

K S−→ p2

8m− e2 = Eq2 (14.50)

This is the hamiltonian for a four-dimensional harmonic oscillator when E < 0, aseasily seen by rearranging the terms

p2

2m− 4Eq2 = 4e2 (14.51)

The angular momentum operators in the real and fictitious spaces are bilinearproducts of the position and momentum coordinates, as follows:

(Q1, Q2, Q3, Q4)

0 θ3 −θ2 ∗−θ3 0 θ1 ∗θ2 −θ1 0 ∗−∗ −∗ −∗ 0

P1

P2

P3

P4

1

2(q1, q2, q3, q4)

0 θ3 −θ2 θ1

−θ3 0 θ1 θ2

θ2 −θ1 0 θ3

−θ1 −θ2 −θ3 0

p1

p2

p3

p4

(14.52)

Similar expressions can be given for the Runge–Lenz vector. However, theseare quadratic in the position and momentum operators. As a result theymust be expressed in matrix form using 8 × 8 matrices acting on the vector(q1, q2, q3, q4; p1, p2, p3, p4) on the left and its transpose on the right (Sadovskiiand Zhilinskii, 1998).

We now ask: what is the largest group of transformations on the coordinates andmomenta that

(i) is linear,(ii) is canonical, and

(iii) preserves ζ = 0.

We address this question in the usual way. Linear transformations allow us touse matrices. These are 8 × 8 matrices acting on the four coordinates and fourmomenta. Preserving the Poisson brackets requires that the matrices satisfy asymplectic metric-preserving condition: Mt G1 M = G1. Preserving the conditionζ = 0 requires these transformations to satisfy another metric-preserving condition:Mt G2 M = G2.

The matrices Gi have the form

Gi =[

0 Mi

−Mi 0

]

Page 256: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

242 Hydrogenic atoms

where

M1 =

1 0 0 00 1 0 00 0 1 00 0 0 1

M2 =

0 0 0 10 0 −1 00 1 0 0

−1 0 0 0

Mt1 = +M1 Gt

1 = −G1 Mt2 = −M2 Gt

2 = +G2

(14.53)

The metric G1 is antisymmetric and the metric G2 is symmetric, with signature(+4, −4). The group that preserves the antisymmetric metric is Sp(8; R) and thegroup that preserves the symmetric metric is SO(4, 4). The group that satisfies bothmetric-preserving conditions is their intersection:

Sp(8; R) ∩ SO(4, 4) = SU (2, 2) � SO(4, 2) (14.54)

The simplest way to see this result is to perform a canonical transformation fromcoordinates (q, p) to coordinates (s, r ):

[s1

r4

]= 1√

2

[1 1

−1 1

] [q1

p4

] [s2

r3

]= 1√

2

[−1 1−1 −1

] [q2

p3

][

s3

r2

]= 1√

2

[1 1

−1 1

] [q3

p2

] [s4

r1

]= 1√

2

[−1 1−1 −1

] [q4

p1

] (14.55)

Since the new coordinates are already canonical, only the condition ζ = 0 remainsto be satisfied. It is a simple matter to verify that

z1 = 1√2(s1 + is2) z2 = 1√

2(r1 + ir2)

z∗1z1 − z∗

2z2 + z∗3z3 − z∗

4z4 = ζ

z3 = 1√2(s3 + is4) z4 = 1√

2(r3 + ir4)

(14.56)The noncompact group U (2, 2) preserves the constraint Eq. (14.49).

14.9.3 Lie algebra of physical operators

A number of workers have shown that the hamiltonian describing the interaction of acharged particle interacting with an external Coulomb field (V (r ) = −e2/r ) can beexpressed in terms of operators that close under commutation. The Lie algebra thatthese operators span is isomorphic with the Lie algebra of a noncompact orthogonalgroup.

Page 257: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.10 Spin angular momentum 243

Three vector operators and a scalar operator

J = r × p angular momentum

M = 1

2m(p × L − L × p) − K

rr

Laplace–Runge–Lenz vector

A = 1

2m(p × L − L × p) + K

rr

dual vector

A4 = r · p + 3

2

idual scalar

(14.57)

close under commutation to span a Lie algebra that is isomorphic with so(4, 1).Five additional operators can be introduced that extend the algebra to so(4, 2).

These include one vector operator and two additional operators:

�i = rpi

�4 = 12 (rp · p − r )

�5 = 12 (rp · p + r )

(14.58)

The commutation relations that these 15 operators satisfy are summarized by the6 × 6 matrix

0 J3 −J2 M1 A1 �1

−J3 0 J1 M2 A2 �2

J2 −J1 0 M3 A3 �3

−M1 −M2 −M3 0 A4 �4

A1 A2 A3 A4 0 �5

�1 �2 �3 �4 −�5 0

++++−−

The four triplets Ji , Mi , Ai , �i (i = 1, 2, 3) have transformation properties of three-vectors under rotations. The three additional operators A4, �4, �5 close under com-mutation and span a Lie algebra that is isomorphic with so(2, 1).

The Schrodinger and Klein–Gordon hamiltonians for an electron of charge −ein the Coulomb field �(r ) = e/r of a proton can be expressed in terms of operatorsof type A4, �4, and �5. These operators are displayed in Table 14.3, along with thehamiltonians and the algebraic representation of the wave equations.

14.10 Spin angular momentum

The interaction of the electron with the electromagnetic field is properly de-scribed by the Dirac equation. The electromagnetic field (E, B) is described by thefour-vector potential Aµ = (φ, A). The electron has charge q = −e (where e isthe charge on the proton) and spin 1

2 . The Dirac equation HDψ = Eψ is a matrix

Page 258: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

244 Hydrogenic atoms

Table 14.3. Nonrelativistic and relativistic hamiltonians for a spinlessparticle, operator representation of the operators A4, �4, and �5,

expression of the hamiltonians and wave equations in terms of theseoperators, and explicit values of the coefficients in these equations

Hp2

2m− α

r

√p2 + m2 − α

rA4 r · p − i r · p − i

�412 (rp · p − r )

1

2

(rp · p − r − α2

r

)�5

12 (rp · p + r )

1

2

(rp · p + r − α2

r

)� r (HS − W ) r

{(HK G + α

r

)2−

(E + α

r

)2}

A(�5 + �4) + B(�5 − �4) + C A(�5 + �4) + B(�5 − �4) + CA 1/2m 1B −W m2 − E2

C −α −2αE

In the event a magnetic field B is present, the momentum operators p should bereplaced by π = p − q

c A. Under this condition the operators still close undercommutation.

differential equation of first order:

HD = −eφ(r ) + βmc2 + γ · (cp + eA) (14.59)

The 4 × 4 matrices β and γi can be chosen as

β =[

I2 00 −I2

]γi =

[0 σi

σi 0

](14.60)

Here σi are the standard Pauli 2 × 2 spin matrices (cf., Eq. (3.39), Problem 3.1).The fifteen-dimensional Lie algebra for the Dirac equation is spanned by the

operators J, M, A, � as given in Eq. (14.57), and the three operators A4, �4, �5.The latter two are modified to allow a treatment of the Dirac operator along thesame lines as the treatment of the Schrodinger and Klein–Gordon operators givenin Section 14.9.3. We define operators

M4 = r · p − i

�4 = 1

2

{(rp · p − r − α2

r− iαγ · r

r2

)}

�5 = 1

2

{(rp · p + r − α2

r− iαγ · r

r2

)} (14.61)

Page 259: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.11 Spectrum generating group 245

As before, the substitution p → π = p − qc A is in order in the event there is a

nonzero magnetic field B. These operators close under commutation to form anso(2, 1) Lie algebra. These operators also close under commutation with the fourthree-vectors Ji , Mi , Ai , �i defined in Table 14.3. The Dirac hamiltonian is ex-pressed in terms of these generators as follows:

� = r

{(HD + α

r

)2−

(E + α

r

)2}

= A(�5 + �4) + B(�5 − �4) + C (14.62)

where the coefficients A, B, C have exactly the same values as for the Klein–Gordon operator (see Table 14.3). In short, the operators �4, �5 are modified butthe relation among these operators in the algebraic representation of the relativisticwave equations is not.

14.11 Spectrum generating group

The physics of the hydrogenic problem is determined primarily by the radial equa-tion Eq. (14.9). It is possible to determine solutions of this equation using operatorsthat close under commutation. These are the generators of a Lie algebra. The cor-responding group is called a spectrum generating group.

To construct a set of operators that close under commutation, we first simplifythe radial equation by multiplying on the left by r

(r D2 + A

r+ B + Cr

)R(r ) = 0 (14.63)

with D = d/dr . The operators r and D behave under commutation like the bosoncreation and annihilation operators a† and a. In fact, the nonzero commutationrelations are

[r D, r ] = +r[a†a, a†] = +a†

[r D, r D2] = −r D2[a†a, a†aa

] = −a†aa

[r, r D2] = −2r D[a†, a†aa

] = −2a†a

(14.64)

The linear combinations r D2 + r and r D2 − r are compact and noncompact, re-spectively. In order to model the differential operator Eq. (14.63) with a set ofoperators that close under commutation to form a finite-dimensional Lie algebra,

Page 260: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

246 Hydrogenic atoms

we must be careful, as [r D,

1

r

]= −1

r[r D2,

1

r

]= 2

r2− 1

rD

We choose as operators in the Lie algebra so(2, 1) the three differential operators

�5 = 1

2

(r D2 + a

r− r

)�4 = 1

2

(r D2 + a

r+ r

)M4 = r D

(14.65)

The Casimir operator for this algebra is C2 = �25 − �2

4 − M24 = −a. The represen-

tations of this algebra have been described in Problem 11.6.The radial equation Eq. (14.63) is expressed in terms of the three operators as

follows (a → A)

((�5 + �4) + B + C(�4 − �5)) R(r ) = 0 (14.66)

Next, we rotate the generators of the algebra according to

eθ M4

(�5

�4

)e−θ M4 =

[cosh θ − sinh θ

− sinh θ cosh θ

] (�5

�4

)(14.67)

When this similarity transformation is applied to Eq. (14.66) we obtain the followingresult: [(

e−θ − C eθ)�5 + (

e−θ + C eθ)�4 + B

]eθ M4 R(r ) = 0 (14.68)

The rotation angle θ can be chosen to eliminate either the noncompact generator�4 or the compact generator �5, depending on the sign of the parameter C .

14.11.1 Bound states

If C < 0 we can choose e−θ + C eθ = 0, so that the resulting equation becomes(2√−C �5 + B

)u(r ) = 0 (14.69)

where u(r ) = eθ M4 R(r ). If A is the Casimir invariant of this representationof su(1, 1), the discrete spectrum of the compact operator �5 is N = − 1

2 +√( 1

2 )2 − A + 1 + n, n = 0, 1, 2, . . . . This result leads directly to the eigenvalue

Page 261: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.11 Spectrum generating group 247

spectrum for the nonrelativistic and the relativistic hydrogen atom (no spin) ob-tained in Eq. (14.12).

Remark The spectrum generating algebra Eq. (14.65) acts in Hilbert spaces thatcarry unitary irreducible representations of the noncompact group SO(2, 1). Theserepresentations are indexed by an integer l that has an interpretation as angular mo-mentum. The energy spectrum that we have computed has the behavior (in the non-relativistic case) W = − 1

2 mc2α2(1/N 2), where N = l + 1 + k, k = 0, 1, 2, . . . .Here N is the principal quantum number. The result is that this algebra acts tochange the principal quantum number while keeping l constant. Since the threeoperators in the spectrum generating algebra commute with the angular momen-tum operators, the quantum number ml (eigenvalue of Lz) is also invariant underthe action of these operators. The states connected by the operators of this so(2, 1)algebra are |N , lm〉 ↔ |N ± 1, lm〉. The states on which these operators act areorganized in “angular momentum towers.” These states are organized vertically inFig. 14.2.

Remark The angular momentum operators Lz, L± act on multiplets shown asa single horizontal line in Figs. 14.1 and 14.2. The operators Mz, M± associatedwith the Laplace–Runge–Lenz vector act horizontally on the levels shown in thesetwo figures. The operators �z, �± = �4 ± i M4 act vertically on the levels shownin these figures. Since [L, �] = 0, the operators � do not change the m values ofhydrogenic states.

Remark The shift down operator �− annihilates the ground state in a given an-gular momentum tower: �−〈r |N l=N−1

m 〉 = 0. Since the differential operators areknown, this relation can be used, as was the relation L−Y l

m=−l(θ, φ) = 0, to deter-mine the radial wavefunction 〈r |N , l = N − 1〉.

14.11.2 Scattering states

If C > 0 we can choose θ so that e−θ − C eθ = 0. Equation (14.66) reduces to(2√

C �4 + B)

u(r ) = 0 (14.70)

where as before u(r ) = eθ M4 R(r ). Since the generator �4 is noncompact, it hasa continuous spectrum. The energy can be written in terms of the scaling factork � e−θ with E = �2k2/2m. The asymptotic form of the wave function is (Gilmoreet al., 1993; Kais and Kim, 1986)

Rk,l(r ) ∼√

2

πsin

(kr − π

2j + α

k(log(2kr ) + δ( j))

)(14.71)

Page 262: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

248 Hydrogenic atoms

where δ( j) = arg [�( j + 1 − i(α/k)] is part of the scattering phase shift, and the

expression for j is given by j = − 12 +

√( 1

2 )2 − A.

14.11.3 Quantum defect

Multielectron atoms are complicated objects. If one of the electrons is promoted toa high lying level, it is on average far from the nucleus and the core electrons. Somesimplifications can then be made in the description of its excited state spectrum.As the “Rydberg” electron approaches the core, the positive nuclear charge isless completely screened by the core electrons, and the electron is more stronglyattracted than a simple −1/r potential suggests. It is possible to represent thisextra attraction by adding a term of the form −1/r2 to the potential to representpenetration of the core electrons. To this end the potential used in the Schrodingerand Klein–Gordon equations is V (r ) = −e2/r → −e2/r − µl(�2/2m)/r2. Thisperturbation produces a modification in the radial equation. The modification isencapsulated entirely in the change

A → A′ = A + µl (14.72)

This change produces a change in the value of j → j ′ = j + � j , where � j =−µl/(2l + 1) in the nonrelativistic case. This change produces a change in thebound state energy spectrum:

EN=n+l+1 = −mc2α2

2N 2→ − mc2α2

2(N + � j)2(14.73)

The quantum defect � j causes the Rydberg states to be bound more strongly than ina pure hydrogenic atom (without screening). The same change occurs in scatteringstates. There is an additional phase shift due to the stronger attraction in the core.The excess phase shift is

�φ = −π

2� j + α

πarg (�[ j + 1 + � j − i(α/k)] − �[ j + 1 − i(α/k)])

(14.74)

Remark More accurate calculations of bound state spectra and scattering phaseshifts employ more accurate representations of core screening (than −1/r2). Nev-ertheless, the results are the same: a quantum defect in the bound state energiestranslates, through analytic continuation, to a corresponding excess phase shift inthe scattering states (Seaton, 1966a, 1966b).

Page 263: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.12 Conclusion 249

14.12 Conclusion

Group theory entered physics in two distinct ways. On one level the set of trans-formations from one coordinate system (or observer) to another forms a group.Observers are related by the Galilean principle of relativity. On another level,some physical systems exhibit symmetry. This symmetry allows us to predict newstates on the basis of states that are already observed, together with the applica-tion of some symmetry transformation. This is done through Einstein’s principle ofequivalence.

We have exploited these principles to describe the quantum mechanical prop-erties, particularly the energy level structure, of hydrogenic atoms. Initially, weexploited a geometric symmetry, the symmetry of the hamiltonian under rotations.The symmetry group is SO(3) or the disconnected group O(3). This symmetryrequires that states occur in multiplets with angular momentum degeneracy 2l + 1.It is surprising that hydrogenic states have a larger degeneracy than required by therotation group SO(3).

We believe that symmetry implies degeneracy, and the greater the symmetry,the greater the degeneracy. If we also believe that the N 2-fold degeneracy of thehydrogen states with principal quantum number N is due to invariance under somegroup, we are prodded to search for a larger group G ⊃ SO(3) that explains theN 2-fold degeneracy. This dynamical symmetry group is SO(4): its six infinitesimalgenerators include both the angular momentum operators and the components ofthe Laplace–Runge–Lenz vector.

Why stop here? Why not search for a “symmetry” that breaks the degeneracy butmaps any state of the hydrogen atom to linear combinations of all other states? Suchspectrum generating groups include SO(4). The largest such group is the conformalgroup SO(4, 2). Before this group was discovered, the deSitter group SO(4, 1) wasemployed as a spectrum-generating group. A simple noncompact subgroup of thesegroups, isomorphic with SO(2, 1), was used to illustrate explicitly how the gener-ators of a Lie algebra are used to determine eigenstates and energy eigenvalues. Inaddition, representations that describe bound states can be analytically continuedto representations that describe scattering states. This analytic continuation relatesbound state energies to phase shifts of scattering states. In the case that the Coulombpotential is perturbed by core shielding effects, the energy eigenvalue spectrum isoften simply represented by a quantum defect that depends on the angular momen-tum. The phase shift of scattering states with angular momentum l is related to thequantum defect with the same angular momentum.

In applications to the hydrogen atom, the role and scope of group theory inphysics is seen to extend far beyond applications depending on simple geometricsymmetry.

Page 264: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

250 Hydrogenic atoms

14.13 Problems

1. a. Principle of relativity Assume two observers S and S′ are locked in the holdof a boat without windowports, so they cannot perceive the exterior world. Galileanrelativity is founded on two assumptions: (1) it is impossible to determine whether anoninertial frame is at rest or in uniform relative motion with respect to its surround-ings; (2) a body in an inertial frame will move with uniform velocity unless actedon by a force. Special relativity is also founded on two assumptions: (1) the laws ofphysics are the same in all inertial frames; (2) the speed of light is the same in allinertial frames. The first of the Galilean assumptions is implicit in the special theoryof relativity. Show that the existence of the 3deg microwave background radiation isincompatible with the first of Galileo’s assumptions. Does this create a problem forthe Special Theory of Relativity?

b. Equivalence principle Assume two observers S and S′ are locked inside elevatorswithout windows, so they cannot perceive the exterior world. Assume one elevatoris sitting on the surface of the Earth, so that the observer S experiences a gravita-tional force F = mg in the “down” direction. Assume that the other elevator is in“interstellar space” so that external gravitational forces “vanish,” but that his elevatorexperiences an acceleration g in the “up” direction. If the “rest of the universe” “looksthe same” to both observers, argue that you can represent a gravitational field by alocal acceleration. This use of the equivalence principle is one of the foundations ofthe general theory of relativity.

2. In the presence of a uniform magnetic field B show that the vector potential A can betaken as A = 1

2 B × r, so that B = ∇ × A. Derive the Klein–Gordon equation for anelectron in a Coulomb potential and a uniform magnetic field. Take the nonrelativisticlimit of this and derive the Schrodinger equation for an electron in the presence ofthese two fields.

3. Make the ansatz E = mc2 + W in the Klein–Gordon equation and exhibit the termsin this equation that must be neglected in order to recover the nonrelativistic approx-imation, the Schrodinger equation.

4. Introduce spherical coordinates as follows: (r, θ, φ) = (θ3, θ2, θ1) and

z = x3 = θ3 cos θ2

y = x2 = θ3 sin θ2 cos θ1

x = x1 = θ3 sin θ2 sin θ1

Show that L2(S1) = ∂2/∂θ21 . Show that

sin2 θ2 L2(S2) =(

sin θ2∂

∂θ2

)2

+ L2(S1)

Generalize this result to L2(S3) recursively using L2(S2) and (∂/∂ cos θ3)2. Do thismore generally for L2(Sn).

Page 265: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.13 Problems 251

5. This problem carries through the steps indicated in Table 14.1.a. Show that the singular points of Eq. (14.9) occur at r = 0 and r → ∞.b. Show that in the neighborhood of the singular points

r → 0

(d2

dr2+ A

r2+ B

r+ C

)R(r ) →

(d2

dr2+ A

r2

)R(r ) = 0

R(r ) � rγ γ (γ − 1) + A = 0

r → ∞(

d2

dr2+ A

r2+ B

r+ C

)R(r ) →

(d2

dr2+ C

)R(r ) = 0

R(r ) � eλr λ2 + C = 0

Show that γ = 12 ±

√( 1

2 )2 − A and λ = ±√−C .

c. Show that if√

( 12 )2 − A is real, the solution with the positive sign is always square

integrable in the neighborhood of r = 0. Under what conditions is the solutionwith the negative sign square integrable? Show that if C < 0 the solution ±√−Cwith the negative sign is square integrable. What happens if C > 0?

d. Show that a solution of the form R(r ) = rγ eλr f (r ) can be found where the functionf (r ) is a simple polynomial function.

e. Find the equation that the function f (r ) satisfies. Show that it is equivalent to theequation given in Table 14.1.

f. Represent the function f (r ) as an ascending power series: f (r ) = ∑∞j=0 f j r j .

Find the two-term recursion relation satisfied by the coefficients f j . Show that therecursion relation is

[( j + 1) j + 2γ ( j + 1)] f j+1 + (2λγ + 2λ j + B) f j = 0

Use this relation to show

f (r ) =∑j=0

�( j + γ + (B/2λ))

�(γ + (B/2λ))

�(2γ )

�( j + 2γ )

(−2λr ) j

j!

g. If this series does not terminate, show that its asymptotic behavior as r → ∞,determined from the behavior of f j as j → ∞, is f (r ) → e−2λr . Since λ < 0 thissolution is not square integrable.

h. Conclude that the function f (r ) must be a polynomial of finite degree. If thehighest nonzero degree term present is rn , so that fn �= 0 but fn+1 = 0 (⇒ fn+2 =fn+3 = · · · = 0), show that the quantization conditon 2λ(n + γ ) + B = 0 must besatisfied. Show that this leads to the quantization condition in terms of the threeparameters A, B, C that appear in Eq. (14.9):

n + 1

2+

√(1

2

)2

− A = B

2√−C

Page 266: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

252 Hydrogenic atoms

i. Use the values of the parameters A, B, C given in Eq. (14.10) to solve for theenergy eigenvalues of the Klein–Gordon and Schrodinger equations:

E(n, l) = mc2√√√√1 + α2

(n + 12 +

√(l + 1

2 )2 − α2)2

W (n, l) = −1

2mc2α2 1

(n + l + 1)2

Show that the polynomial solution is

f (r ) =n∑

j=0

�(2γ )

�( j + 2γ )

n!

(n − j)! j!(2λr ) j

The radial part of the wavefunction 1r rγ f (r )eλr has exactly n nodes in the open

interval (0, ∞).

6. For a highly ionized atom with Z protons in its nucleus and a single remaining electron,show that the potential is Ze/r and the solutions of the relativistic and nonrelativisticequations are obtained by the replacement α → Zα. How large can Z become beforethe relativistic solution is clearly incorrect? (Hint: set l = 0.)

7. Expand the relativistic energy in ascending powers of the fine structure constant todetermine the relativistic corrections to the nonrelativistic energy. Show that, with

N ′ = n + 12 +

√(l + 1

2

)2 − α2 and N = n + l + 1

E(n, l) = mc2√1 + (

αN ′

)2→ mc2 − mc2 1

2N 2α2 + mc2

(3

8N 4− 1

N 3(2l + 1)

)α4

+ mc2

(− 5

16N 6+ 3

2N 5(2l + 1)− 2N + 3(2l + 1)

2N 4(2l + 1)3

)α6 + mc2

×(

35

128N 8− 15

8N 7(2l + 1)+ 6N + 9(2l + 1)

4N 6(2l + 1)3

− 2N 2 + 3N (2l + 1) + 2(2l + 1)2

N 5(2l + 1)5

)α8 + O(α10)

8. The radial part of the wavefunction dies off like eλr for large r , where λ < 0 for boundstates. The parameter λ−1 has the dimensions of length, and a � 1/|λ| characterizesthe size of a bound state orbit. Show that bound states with quantum numbers (n, l)(N = n + l + 1 is the principal quantum number) have size scales

relativistic a(n, l) =√

(N ′)2 + α2 aB N ′ = n + 12 +

√(l + 1

2

)2 − α2

nonrelativistic a(n, l) = NaB N = n + l + 1

Page 267: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.13 Problems 253

Table 14.4. Some particles that can be used to formhydrogen-like atoms

Particle Rest energy (MeV)

electron e± 0.511mu meson µ± 105.7tau meson τ± 1784.0proton, antiproton p± 938.26deuteron d+ 1875.6tritium t+ 2809.4pi meson π± 139.6sigma meson �± 1385.0cascade meson �− 1533.0omega �− 1672.0

In these expressions aB = �2/me2 = 0.529 × 10−8 cm is the Bohr radius: the char-acteristic size of the hydrogen atom in its ground state. By what percentage do thesizes of the atoms in the (n, l) states differ between the relativistic and nonrelativistictreatments?

9. Many charged particles can form hydrogen-like atoms through their electrostaticinteraction. Compute the energy spectrum for bound states of neutral atoms formedfrom a positively charged particle and a negatively charged particle drawn from thislist of particles in Table 14.4. For each particle the mass is given in terms of theparticle rest energy. Recall that the mass, m, that appears in the expression for thebinding energy W = − 1

2 mc2α2/N 2 is the reduced mass: 1/m = 1/m1 + 1/m2 ofthe two particles.

10. The motion of a classical nonrelativistic particle in a 1/r2 radial force field is a conicsection: an elliptical orbit for bound states (E < 0); hyperbolic for scattering states(E > 0); and parabolic at the separatrix (E = 0). If the radial force field includes aradial 1/r3 perturbation

f = − K

r2+ C

r3

the trajectory has the form (Goldstein, 1950)

r = a(1 − ε2)

1 + ε cos(αθ )

where α = √1 − η, η = C/K a. This can be treated as an ellipse that is slowly

rotating, α � 1. In this case the parameters a and ε have their usual meaningsfor elliptical orbits: a is the semimajor axis and ε is the eccentricity. The ratioη is a measure of the strength of the perturbation to the strength of the Coulombpotential.

Page 268: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

254 Hydrogenic atoms

a. Expand the relativistic energy E =√

(mc2)2 + (pc)2 − K/r to fourth order in pand show E = (mc2) + (p2/2m) − (p2/2m)2/(2mc2) − K/r = mc2 + W .

b. Replace the quartic term −(p2/2m)2/(2mc2) by −(W + K/r )2/(2mc2) and ex-pand. Show that the classical hamiltonian for the motion of the (special) relativisticparticle is

H = mc2 + p2

2m− K ′

r+ C ′

r2

Evaluate K ′ and C ′ and show K ′ = K (1 + W/mc2) and C ′ = −K 2/(2mc2).c. Argue that the classical motion involves a renormalized coupling K → K ′ as well

as a 1/r3 component to the force, with C = 2C ′.d. Show that the advance in the perihelion of the orbit is δθ � η/2 per period.e. Evaluate η for the planet Mercury, for which ε = 0.206 and the period is T = 0.24

year. Show that this amounts to about 7′′ per century. The general relativisticcorrection is larger by a factor of 6, and accounts for the observed advance inMercury’s perihelion of 42′′ per century.

f. The existence of precessing elliptical orbits is due to the “relativistic mass velocity”correction. This can be viewed from two perspectives. (1) Newton’s equations arecorrect and the mass of the particle varies with its state of motion according to m =m0/

√1 − (v/c)2. (2) The mass of a particle is a constant of nature and Newton’s

(nonrelativistic) equations of motion are not correct for relativistic particles, andmust be modified. The author feels the second interperetation is far superior to thefirst.

11. When the attracting potential is central and nearly 1/r , the motion of a bound particleis nearly elliptical. It is useful to describe this motion as if it were elliptical, withthe semimajor axis of the ellipse precessing in the plane of motion. Assume that theforce has the form F(r ) = (−K/r2 + p(r ))r, where p(r ) is a small perturbation. Therate at which the Runge–Lenz vector precesses is

ω = ∂

∂L

(1

T

∫ T

0p(r ) dt

)= ∂

∂L

(m

LT

∮r2 p(r ) dθ

)

with 1/r = (mK/L2)(1 + (M/mK ) cos θ ). Here L is the particle’s orbital angularmomentum and T is its period. If the perturbing term is of the form C/r3 the integralis C × 2π mK

L2 . The perturbations due to special and General Relativity are

special relativity C = K L2

2m2c2ω = π K 2

T L2c2

general relativity C = 6 × K L2

2m2c2ω = 6π K 2

T L2c2

For planetary motion K = G Mm. When M # m, ω is (almost) independent of m.Why? Determine how the relativistic precession ω scales (cf. Problem 16.3) with

Page 269: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.13 Problems 255

planetary distance from the Sun. What is the precession for the Earth? Use ω = 42′′

per century for Mercury and the following distance ratios:

Mercury Venus Earth Mars Jupiter Saturn Uranus Neptune0.39 0.72 1.00 1.52 5.20 9.54 19.18 30.06

12. The action of the angular momentum shift down operator L− on the lowest m-valuespherical harmonic for a given value of l is zero: L−Y l

m=−l(θ, φ) = 0. Use the coor-dinate representation for L− to compute this function.a. Write Y l

m(θ, φ) = Pl−l(θ )e−ilφ and show(

− ∂

∂θ+ i

cos θ

sin θ

∂φ

)Pl

−l(θ )e−ilφ = e−ilφ

(− ∂

∂θ+ l

cos θ

sin θ

∂φ

)Pl

−l(θ )

b. Show Pl−l = (sin θ )l satisfies this equation.

c. This function is not normalized to unity over the sphere. Normalize it by introducinga normalization coefficient Nl and enforcing the condition∫ π

0dθ sin θ

∫ 2π

0dφ|Nl sinl θe−ilφ|2 = 1

d. Show that

Nl =√

1

√(2l + 1)!!

(2l)!!

e. This leads the the simple recursion relation for normalization coefficients for theY l

±l(θ, φ):

Nl =√

2l + 1

2lNl−1

Compare these results with Table 14.2 using initial condition N0 = √1/4π . Com-

pute N3.f. Use the numerical value of the matrix elements 〈 l

m ′ |L+| lm 〉 = √

(l + m ′)(l − m)δm ′,m+1 and the coordinate representation of the shift up operator L+ to constructthe correctly normalized spherical harmonics Y l

m(θ, φ).

13. Use methods similar to those described in Problem 12 to construct the radial wave-functions for hydrogenic atoms with extreme orbital angular momentum quantumnumbers: l = N − 1, where in general the principal quantum number N = n + l + 1.These functions have no nodes in the interval (0, ∞) (since n = 0).

14. Show

d

dt

(rr

)= r(r · r) − r(r · r)

r3= −r × (r × r)

r3

15. r is the position vector from the sun to a planet, or from the proton to the electronin the hydrogen atom, L = r × p is the orbital angular momentum, and M is theLaplace–Runge–Lenz vector.

Page 270: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

256 Hydrogenic atoms

a. M · L = 0.b. M · M = (2L · L/m)(p · p/2m − K/r ) + K 2.c. M · r = L · L/m − Kr .d. M · r = Mr cos θ .e. r = (L · L/mK/1 + (M/K ) cos θ ).f. Compare this result to the standard solution of the trajectory equations for motion

in a 1/r potential to conclude that L2/mK is the semimajor axis of the ellipticalorbit and ε = M/K is the eccentricity of the orbit.

g. Conclude that the Laplace–Runge–Lenz vector is a constant of motion that pointsto the perihelion of the elliptical orbit.

16. Show that A · A = (−1/4�2)(L · L + M′ · M′ + L · M′ + M′ · L). Show that B · Bhas a similar expression. Show that the two expressions are equal sinceL · M = M · L = 0.

17. Show that the inverse of the stereographic projection given in Eq. (14.35) is

pp0

= u1 − u · w

18. Compute px = p · M/M and py = p · W/W . Show p2x + (py − a)2 = r2. Explicitly

compute the displacement vector a (i.e., (0, a)) and the radius r of circular motion.Show that circles in R3 lift to circles in S3 ⊂ R4 under the stereographic projectionof Eq. (14.35). Show that circles in S3 project back down to circles in R3 under theinverse transformation.

19. Show that the number of independent monomials of the form xa ybzc, with a, b, cnonnegative integers and a + b + c = l is N (l, 3) = (l + 3 − 1)/ l!(3 − 1)!. In N -dimensional space show that the number of homogeneous polynomials of degreel in x1, x2, . . . , xN is obtained by replacing 3 → N in this expression. This is theBose–Einstein counting statistic.a. Show that the functions rlY l

m(θ, φ) are homogeneous polynomials in x, y, z ofdegree l.

b. Show that the number of independent spherical harmonics of degree l is the dif-ference between the number of homogeneous polynomials of degree l and l − 2on three variables: dim

{Y l

m

} = N (l, 3) − N (l − 2, 3) = 2l + 1.c. After stereographic transformation into four dimensions, the hydrogen wavefunc-

tions in the momentum representation are spherical harmonics in four variables(Bander and Itzykson, 1966a). Show that the number of spherical harmonics ofdegree n is dim

{Yn

lm

} = N (n, 4) − N (n − 2, 4) = (n + 1)2.d. Construct homogeneous polynomials of degree 0, 1, 2 and the spherical harmonics

associated with these homogeneous polynomials. Take the inverse Fourier trans-form of these spherical harmonics to obtain the hydrogen atom wavefunctionsψ(x)nlm for n = 0, 1, 2; l = 0, . . . , n − 1; and −l ≤ m ≤ +l.

Page 271: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

14.13 Problems 257

e. Show that the recursive relation used to build up a Pascal triangle can be writtenin the symmetric form

(a + b + 1)!

(a + 12 )!(b + 1

2 )!= (a + b)!

(a − 12 )!(b + 1

2 )!+ (a + b)!

(a + 12 )!(b − 1

2 )!

where a and b are half odd integers: 12 , 3

2 , 52 , . . . .

f. Show homogeneous polynomials satisfy the recursion relation: N (l, d) =N (l, d − 1) + N (l − 1, d).

g. Use this result to derive the following recursion relation for the dimensions of thespaces of spherical harmonics on spheres Sn and Sn−1:

dim Y l(Sn) = dim Y l−1(Sn) + dim Y l(Sn−1)

For the case n = 3 this gives (l + 1)2 = l2 + (2l + 1). The initialization for all nis Y0(Sn) = 1 = dim Y0(Sn).

h. dim Y l(Sn) = (l+n−2)!l!(n−1)! (2l + n − 1).

20. D-dimensional Coulomb problem In D-dimensional space the Schrodinger equa-tion for the Kepler problem is Eq. (14.4) in the relativistic case and Eq. (14.5) in thenonrelativistic case. The only difference is that the Laplacian ∇2 is on D coordinatesrather than three. In this case the Laplacian operator is

∇2 =(

1

r D−1/2

∂rr D−1/2

)2

+ L2

r2

The angular part of the Laplacian operator, L2, acts on spherical harmonics on SD−1,Y l(SD−1). These spherical harmonics are eigenfunctions of this (Laplace–Beltrami)operator with eigenvalue − [

(l + α)2 − α2], and α is a quantity that depends on the

Lie algebra of SO(D): it is half the sum over all positive roots of the algebra. Forthe Lie algebras of the orthogonal roots the coefficient of the sum that is important isα = D − 2.a. Show that ψ(x) = (1/r (D−1)/2)Y l(angles) is a clever ansatz that reduces the

Schrodinger equation in D dimensions to the form of Eq. (14.4) in the relativisticcase and Eq. (14.5) in the nonrelativistic case.

b. Show that the only change in Eq. (14.10) is the replacement(l + 1

2

)2

−(

1

2

)2

→(

l + D − 2

2

)2

−(

D − 2

2

)2

in column A.c. Show that the relativistic and nonrelativistic energies shown in Eq. (14.12) change

as follows:

relativistic N ′ → n + 12 +

√(l + 1

2 )2 + l(D − 3) − α2

nonrelativistic N → n + 12 +

√(l + 1

2 )2 + l(D − 3)

Page 272: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

258 Hydrogenic atoms

21. Compute the quantum defect in heavy atoms by using the Klein–Gordon equation anda −1/r2 perturbation. Show that the bound state energy and scattering phase shiftsare given by the substitution l(l + 1) → l(l + 1) − µl . Argue that electrons in the sstate penetrate the core much more deeply (on average) and p-state electrons (thand-state electrons, . . .) so that µ0 # µ1 > · · · .

22. The isotropic harmonic oscillator in n dimensions has hamiltonian

H =n∑

i=1

�ω

(a†

i ai + 1

2

)

a. Show that the Lie algebra of its geometric symmetry group is spanned by theangular momentum operators Li j = a†

i a j − a†j ai = −L ji .

b. Show that the Lie algebra of its dynamical symmetry group is spanned by theangular momentum operators together with the quadrupole tensor operators Qi j =a†

i a j + a†j ai = +Q ji .

c. Show that one spectrum generating algebra includes the operators L and Q as wellas the single boson operators a†

i and a j , as well as their commutator [ai , a†j ] = 1.

Show that this algebra is nonsemisimple and describe its structure.d. Show that another spectrum generating algebra consists of the operators L and Q as

well as the two boson creation operators a†i a†

j and two boson annihilation operatorsai a j . Show that this algebra is simple and describe its structure. Show that thisspectrum generating algebra does not couple all the states that exist: “parity” isan invariant, where “parity” is even or odd according to whether the number ofexcitations in the spectrum is even or odd.

Page 273: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15

Maxwell’s equations

The electromagnetic field E(x, t), B(x, t) is determined by Maxwell’sequations. These equations are linear in the space and time derivatives.In the momentum representation, obtained by taking a Fourier transformof the electric and magnetic fields, Maxwell’s equations impose a set offour linear constraints on the six amplitudes E(k), B(k). Why? At a morefundamental level, the electromagnetic field is described by photons. Foreach photon momentum state there are only two degrees of freedom,the helicity (polarization) states, corresponding to an angular momentum1 aligned either in or opposite to the direction of propagation. Thus, theclassical description of the electromagnetic field is profligate, introducingsix amplitudes for each k when in fact only two are independent. Theremaining four degrees must be absent in any description of a physicallyallowed field. The equations that annihilate these four nonphysical linearcombinations are the equations of Maxwell. We derive these equations,in the absence of sources, by comparing the transformation properties ofthe helicity and classical field states for each four-momentum.

15.1 Introduction

The electromagnetic field has been described in two different ways. Following thenineteenth century approach (pre quantum mechanics), a field is introduced havingappropriate transformation properties. The price one pays is that not every fieldrepresents a physically allowed state: such fields must be annihilated by appropriateequations. Following the twentieth century approach, a Hilbert space is introduced.An arbitrary superposition of states in this space represents a physically allowedfield. The price one pays is that the field so constructed does not have obvioustransformation properties.

In the older approach a field is defined at every point in space time. It is requiredto be “manifestly covariant.” That is, it transforms as a tensor under homogeneous

259

Page 274: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

260 Maxwell’s equations

Table 15.1. Comparison of descriptions of the electromagnetic field

Time period Approach Strengths Weaknesses

Nineteenth century Manifestly Fields have elegant Many fields representcovariant transformation properties nonphysical states

Twentieth century Hilbert All linear superpositions Transformation propertiesspace represent physical states are complicated

Lorentz transformations. This requires there to be a certain number of field compo-nents at every space-time point, or more conveniently, for every allowed momentumvector. In the Hilbert space formulation the number of independent components isjust the allowed number of spin or helicity states. The number of components isnever greater than the number of components required to define the “manifestlycovariant” field; however, it may be less than this number. In this case there arelinear combinations of the components of the manifestly covariant field that cannotrepresent physically allowed states. These linear combinations must be suppressed.It is the function of the field equations to suppress those linear combinations ofcomponents that do not correspond to physical states. These two approaches arecompared in Table 15.1.

Maxwell’s equations fulfill this function. The classical description involves sixfield components for each allowed mementum state. These are the classical electricand magnetic fields, E(x, t) and B(x, t), or their components after Fourier transfor-mation, E(k) and B(k), where k is a four-vector that obeys k · k = k · k − k4k4 = 0.Here k is essentially a three-momentum vector and k4 is essentially an energy.The quantum description involves arbitrary superpositions of two helicity compo-nents for each momentum vector. The helicity states involve an angular momentumaligned along the direction of motion (helicity +1 and right-handed polarization)and opposite to the direction of propagation (helicity −1 and left-handed polar-ization). There are four (6 − 2) linear combinations of classical field componentsthat must be suppressed for each k-vector, and that are annihilated by Maxwell’sequations. We derive these equations by comparing the transformation proper-ties of the basis vectors for the “manifestly covariant” but nonunitary representa-tions of the inhomogeneous Lorentz group with the basis vectors for its unitaryirreducible representations, which are not manifestly covariant. The set of con-straints so derived reduce, for j = 1, to Maxwell’s equations. This derivation iscarried out for free fields (no sources) only. When sources are present the pho-ton four-vector k no longer obeys k · k = 0. In this case the manifestly covariantequations provide a beautiful prescription for describing the coupling to sourceterms.

Page 275: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.2 Review of the inhomogeneous Lorentz group 261

15.2 Review of the inhomogeneous Lorentz group

15.2.1 Homogeneous Lorentz group

The wavefront for a light signal expanding from a source at the origin of coordinatesfor observers S and S′ obeys the equation

x2 + y2 + z2 − (ct)2 = x ′2 + y′2 + z′2 − (ct ′)2 = 0 (15.1)

This requires that the coordinates (x, y, z, ict) and (x, y, z, ict)′ for observers Sand S′ be related by a homogeneous Lorentz transformation

xyz

ict

=

xyz

ict

(15.2)

The 4 × 4 matrix transformations � belong to the Lie group O(3, 1). The infinites-imal generators of a group operation in SO(3, 1) are

� → I4 + ε

0 +θ3 −θ2 ib1

−θ3 0 +θ1 ib2

+θ2 −θ1 0 ib3

−ib1 −ib2 −ib3 0

= I4 + ε (θ · J + b · K) (15.3)

Homogeneous Lorentz transformations leave invariant inner products: k · a = �k ·�a, where k and a are four vectors and � ∈ O(3, 1). The infinitesimal generatorsJ, K satisfy the following commutation relations:

[Ji , Jj

] = −εi jk Jk[Ji , K j

] = −εi jk Kk (15.4)[Ki , K j

] = +εi jk Jk

15.2.2 Inhomogeneous Lorentz group

Intervals are preserved by the inhomogeneous Lorentz group:

(x2 − x1)2 + (y2 − y1)2 + (z2 − z1)2 − (ct2 − ct1)2 = invariant (15.5)

The inhomogeneous Lorentz group consists of homogeneous Lorentz transforma-tions, �, together with displacements of the origin. The general group transforma-

Page 276: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

262 Maxwell’s equations

tion can be written as a 5 × 5 matrix, in terms of the 4-vector a = (x, y, z, ct):

{�, a} =

x� y

zct

0 0 0 0 1

(15.6)

as shown. The group composition law is matrix multiplication. The following resultsare immediate:

{�2, a2} {�1, a1} = {�2�1, a2 + �2a1}

{I, a} {�, 0} = {�, a} = {�, 0} {I, �−1a

} (15.7)

The inhomogeneous Lorentz group is the semidirect product of the homogeneousLorentz group and the commutative invariant subgroup of translations of the originof coordinates in space and time. The infinitesimal generators for this invariantsubgroup are (∂/∂x, ∂/∂y, ∂/∂z, i∂/∂(ct)).

15.3 Subgroups and their representations

The group of inhomogeneous Lorentz transformations has two important sub-groups. These are the subgroup of homogeneous Lorentz transformations {�, 0}and the invariant subgroup of translations {I, a}. Both their representations play arole in the derivation of the relativistically covariant field equations.

15.3.1 Translations {I, a}The translation subgroup {I, a} is abelian (commutative). All of its unitary irre-ducible representations are one dimensional, and in fact

�k({I, a}) = eik·a (15.8)

where k is a four-vector that parameterizes the one-dimensional representations.We may define a basis state for the one-dimensional representation �k of {I, a} as|k〉:

{I, a} |k〉 = |k ′〉〈k ′| {I, a} |k〉 = |k ′〉δ(k ′ − k)�k({I, a}) = |k〉eik·a (15.9)

Physically, k has a natural interpretation as the four-momentum of the photon.

Page 277: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.3 Subgroups and their representations 263

15.3.2 Homogeneous Lorentz transformations

The Lie algebra D2 = A1 + A1 is semisimple: it is the direct sum of two simpleLie algebras of type A1 (see Fig. 10.3). We can construct linear combinations ofthe infinitesimal generators J, K of SO(3, 1) that are mutually commuting and thatsatisfy angular momentum commutation relations. These are

J(1) = 1

2(J − iK)

(15.10)

J(2) = 1

2(J + iK)

These operators satisfy angular momentum commutation relations[J(1)

i , J(1)j

]= −εi jkJ(1)

k[J(2)

i , J(2)j

]= −εi jkJ(2)

k (15.11)[J(1)

i , J(2)j

]= 0

The algebra J(1) has 2 j + 1 dimensional irreducible representations D j while J(2)

has 2 j ′ + 1 dimensional irreducible representations D j ′. Any element in SO(3, 1)

can be expressed in a (2 j + 1)(2 j ′ + 1) dimensional representation D j j ′as follows

EXP(θ · J + b · K) = EXP[(θ + ib) · J(1) + (θ − ib) · J(2)]

= D j[(θ + ib) · J(1)] D j ′ [

(θ − ib) · J(2)] (15.12)

15.3.3 Representations of SO(3, 1)

The Lie algebra so(3, 1) is isomorphic to the Lie algebra for the group of 2 × 2matrices SL(2; C). We have the following two isomorphisms

J = i2σ J = i

K = − 12σ K = + 1

(15.13)

These two isomorphisms give rise to the following two inequivalent sets of repre-sentations

D j0 D0 j

K( j) = iJ( j) K( j) = −iJ( j) (15.14)

Page 278: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

264 Maxwell’s equations

where J( j) are the three (2 j + 1) × (2 j + 1) angular momentum matrices. Thefollowing matrices are associated with these representations

D j0 [θ · J + b · K] = EXP[θ · J( j) + b · (+iJ)( j)] = EXP

[(θ + ib) · J( j)]

D0 j [θ · J + b · K] = EXP[θ · J( j) + b · (−iJ)( j)] = EXP

[(θ − ib) · J( j)]

(15.15)

These representations are complex conjugates of each other. The most generalrepresentation of SO(3, 1) is

D j j ′(θ · J + b · K) = EXP

[(θ + ib) · J( j)] EXP

[(θ − ib) · J( j ′)

]= D j j ′

(�)

(15.16)Basis states for the action of � through the representation D j j ′

(�) can be com-puted

∣∣∣∣ j j ′

µ µ′

⟩=

∣∣∣∣ j j ′

ν ν ′

⟩D j j ′

νν ′;µµ′(�) (15.17)

Under restriction to the subgroup SO(3) ⊂ SO(3, 1) this representation is reduciblein a Clebsch–Gordan series

D j j ′(�)

�↓SO(3)−→ D j [SO(3)] × D j ′[SO(3)] = ∑

j ′′ D j ′′[SO(3)]

| j − j ′| ≤ j ′′ ≤ j + j ′ (15.18)

This representation remains irreducible only if j ′ = 0 or j = 0.

15.4 Representations of the Poincare group

We construct here two kinds of representations for the inhomogeneous Lorentzgroup. These are the manifestly covariant representations and the unitary irreduciblerepresentations.

15.4.1 Manifestly covariant representations

A field Tµν(x) is said to be manifestly covariant (obviously covariant) undertransformations of the homogeneous Lorentz group � ∈ SO(3, 1) if

�Tµν(x) = Tµ′ν ′(x�−1)�µ′µ�ν ′ν (15.19)

That is, the field components obviously form a basis on which the Lorentz transfor-mation acts. The point at which the transformation acts is fixed, but since the coordi-nate system changes, the coordinates of the fixed point are changed by x ′ = x�−1.

Page 279: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.4 Representations of the Poincare group 265

We construct manifestly covariant representations of the inhomogeneous Lorentzgroup by constructing direct products of basis vectors

|k〉 ×∣∣∣∣ j j ′

µ µ′

⟩(15.20)

for the subgroups {I, a} and {�, 0} of the inhomogeneous Lorentz group. We definethe action of the inhomogeneous Lorentz group on these direct product states bydefining the action of the two subgroups, of homogeneous Lorentz transformationsand of translations, on the momentum states |k〉 and the field component states| j j ′µ µ′ 〉 separately.

We define the action of {I, a} on these states by

{I, a} |k〉 = |k〉eik·a

{I, a}∣∣∣∣ j j ′

µ µ′

⟩=

∣∣∣∣ j j ′

µ µ′

⟩(15.21)

The action of {�, 0} on the momentum states follows from

{I, a} [{�, 0} |k〉] = {�, 0} {I, �−1a

} |k〉= [{�, 0} |k〉] eik·�−1a

= [{�, 0} |k〉] ei�k·a = |�k〉ei�k·a (15.22)

The action of {�, 0} on the field component states is

{�, 0}∣∣∣∣ j j ′

µ µ′

⟩=

∣∣∣∣ j j ′

ν ν ′

⟩D j j ′

νν ′;µµ′(�) (15.23)

If the vector space that carries a manifestly covariant representation of the inhomo-geneous Lorentz group has the states

|k〉∣∣∣∣ j j ′

µ µ′

⟩(15.24)

then all states of the form

|�k〉∣∣∣∣ j j ′

ν ν ′

⟩(15.25)

are also present in the underlying vector space.

Page 280: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

266 Maxwell’s equations

The action of the two subgroups on the two types of states is summarized by

|k〉∣∣∣∣ j j ′

µ µ′

⟩{I, a} |k〉eik·a

∣∣∣∣ j j ′

ν ν ′

⟩δνν ′;µµ′

{�, 0} |�k〉∣∣∣∣ j j ′

ν ν ′

⟩D j j ′

νν ′;µµ′(�)

(15.26)

15.4.2 Unitary irreducible representations

Suppose we have a representation of {�, a} that is unitary and irreducible. Un-der restriction to the subgroup {I, a} this reduces to a direct sum of irreducibles�k({I, a}) of {I, a}. The basis states are |k; ξ〉, where k is defined by the action ofthe translation {I, a}

{I, a} |k; ξ〉 = |k; ξ〉eik·a (15.27)

and ξ is a helicity index that distinguishes different states with the same four-momentum. A homogeneous Lorentz transformation maps the state |k; ξ〉 into asubspace of states parameterized by k ′ = �k

{I, a} {�, 0} |k; ξ〉 = {�, 0} {I, �−1a

} |k; ξ〉= {�, 0} |k; ξ〉eik·�−1a

= [{�, 0} |k; ξ〉] ei�k·a (15.28)

As a result

{�, 0} |k; ξ〉 = |�k; ξ ′〉Mξ ′ξ (�) (15.29)

where Mξ ′ξ (�) is a matrix that remains to be determined.This simple calculation shows that if the four-vector k parameterizes a state in

an irreducible representation of the inhomogeneous Lorentz group, then the statesk ′ with

k ′ = �k (15.30)

Page 281: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.4 Representations of the Poincare group 267

are present also. To construct the matrix M(�), we choose one particular four-vectork0 for each of the possible cases

(i) k · k > 0 k0 = (0, 0, 1, 0)(ii) k · k = 0 k �= 0 k0 = (0, 0, 1, +i) (a)

k0 = (0, 0, 1, −i) (b)(iii) k · k < 0 k0 = (0, 0, 0, +i) (a)

k0 = (0, 0, 0, −i) (b)(iv) k · k = 0 k = 0 k0 = (0, 0, 0, 0)

(15.31)

The states (a), (b) are related to each other by the discrete time reversal operator T .The vector k0 is called the little vector.

The effect of a homogeneous Lorentz transformation on the state |k0; ξ〉 is de-termined by writing each � as a product of two group operations

� = Ck Hk0 (15.32)

where

Hk0k0 = k0

(15.33)Ckk0 = k

That is, Hk0 is the stability subgroup of the little vector k0 and Ck is a coset repre-sentative that maps k0 into k:

Ckk0 = k = �k0 (15.34)

The little groups (stability groups) of the little vectors k0 are

(i) SO(2, 1)(ii) I SO(2)

(iii) SO(3)(iv) SO(3, 1)

These are determined as follows.

Case (i) An arbitrary element in the Lie subgroup acting on k0 must leave k0

invariant. Linearizing, an element in the Lie algebra must annihilate k0:

0 +θ3 −θ2 ib1

−θ3 0 +θ1 ib2

+θ2 −θ1 0 ib3

−ib1 −ib2 −ib3 0

0010

=

−θ2

+θ1

0−ib3

=

0000

(15.35)

Page 282: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

268 Maxwell’s equations

The subalgebra leaving k0 fixed is defined by θ1 = θ2 = b3 = 0, θ3, b1, b2 arbi-trary. This is the three-dimensional subgroup SO(2, 1) consisting of generators forrotations about the z-axis and boosts in the x- and y-directions.

Case (ii) Applying the same arguments, we find

0 +θ3 −θ2 ib1

−θ3 0 +θ1 ib2

+θ2 −θ1 0 ib3

−ib1 −ib2 −ib3 0

001i

=

−θ2 − b1

+θ1 − b2

−b3

−ib3

=

0000

(15.36)

The stability subalgebra is defined by

b3 = 0

b2 = +θ1 (15.37)

b1 = −θ2

A general element in this subalgebra is

0 +θ3 −θ2 −iθ2

−θ3 0 +θ1 iθ1

+θ2 −θ1 0 0iθ2 −iθ1 0 0

=

∑i

θi Yi

Y1 = J1 + K2

Y2 = J2 − K1

Y3 = J3

(15.38)

The operators Yi obey the commutation relations

[Y3, Y1] = −Y2

[Y3, Y2] = +Y1 I SO(2) (15.39)

[Y1, Y2] = 0

These are the commutation relations for the group I SO(2), the group of inhomoge-neous motions of the Euclidean plane R2. Acting on the time-reversed little vector(0, 0, 1, −i) = T (0, 0, 1, +i) the infinitesimal generators are Y1 = J1 − K2, Y2 =J2 + K1, Y3 = J3.

Case (iii) Proceeding as above

0 +θ3 −θ2 ib1

−θ3 0 +θ1 ib2

+θ2 −θ1 0 ib3

−ib1 −ib2 −ib3 0

000i

=

−b1

−b2

−b3

0

=

0000

(15.40)

The subalgebra defined by b = 0 is spanned by the angular momentum operatorsJ. It is su(2).

Page 283: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.4 Representations of the Poincare group 269

Case (iv) This is the simplest case:

0 +θ3 −θ2 ib1

−θ3 0 +θ1 ib2

+θ2 −θ1 0 ib3

−ib1 −ib2 −ib3 0

0000

=

0000

(15.41)

The little group of this vector is the entire homogeneous Lorentz group SO(3, 1).The action of the little group on the subspace of states |k0; ξ〉 is

Hk0 |k0; ξ〉 = |Hk0k0; ξ ′〉Dξ ′ξ (Hk0 )

= |k0; ξ ′〉Dξ ′ξ (Hk0 ) (15.42)

The original representation of the inhomogeneous Lorentz group is unitary andirreducible if and only if the representation Dξ ′ξ (Hk0 ) of the little group is unitaryand irreducible.

The cases (i)–(iv) are discussed here.

Case (i) The unitary irreducible representations of the noncompact group SO(2, 1)were described in Problem 5 of Chapter 11. Since k · k > 0 describes negative massparticles, we will not need to discuss these representations here.

Case (ii) See below.

Case (iii) The unitary irreducible representations for the group SU (2), which is thelittle group for a massive particle at rest, were described in Problem 2 of Chapter 6.They are described by an integer or half-integer: j = 0, 1

2 , 1, 32 , . . . . The angular

momentum j is a property of each massive particle.

Case (iv) The unitary irreducible representations of SO(3, 1) are known but notinteresting for the present discussion.

We consider the case of zero mass particles in more detail here. The unitaryirreducible representations of I SO(2) are constructed following the prescriptionwe are using to study the unitary irreducible representations of the inhomogeneousLorentz group – the method of the little group. Since I SO(2) has a two-dimensionaltranslation invariant subgroup, basis states in a unitary irreducible representationcan be labeled by a vector κ = (κ1, κ2) in a two-dimensional Euclidean space,κ ∈ R2, κ · κ ≥ 0. If a state |κ〉 is in one such representation, so are all states|κ ′〉 for which κ ′ · κ ′ = κ · κ . That is, κ ′ = (κ ′

1, κ′2) is related to κ = (κ1, κ2) by a

rotation: κ ′ = R(θ )κ . The invariant length κ · κ parameterizes the representation.As before, two cases occur (cf., Cases (i) or (iii) and Case (iv) above):

(i) κ · κ > 0 little group = Identity(ii) κ · κ = 0 little group = I SO(2)

(15.43)

Page 284: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

270 Maxwell’s equations

The first case presents us with two problems. First, κ2 is a continuous quantumnumber, and there are no known particles with a continuous spin index. Second, ifκ2 > 0 there must be an infinite number of states with this same continuous index,for each four-momentum value. Therefore we require κ = 0. This leaves us with thefollowing physically allowable representations of the little group (Y1 → 0, Y2 → 0)

EXP(θ3Y3 + θ1Y1 + θ2Y2) = eiξθ3 (15.44)

where ξ is an integer or half-integer.The coset representatives Ck permute the four-vector subspaces:

Ck |k0; ξ〉 = |k; ξ〉 (15.45)

The action of an arbitrary element of the inhomogeneous Lorentz group on anystate in this Hilbert space is

{�, a} |k; ξ〉 = {�, 0} {I, �−1a

} |k; ξ〉= {�, 0} |k; ξ〉eik·�−1a

= {�, 0} Ck |k0; ξ〉ei�k·a

= {�Ck, 0} |k0; ξ〉ei�k·a

= {Ck ′ Hk0, 0} |k0; ξ〉ei�k·a

= |k ′; ξ〉eiξ�ei�k·a (15.46)

where

C−1k ′ �Ck = Hk0 = EXP(�J3 + θ1Y1 + θ2Y2) −→ eiξ� (15.47)

15.5 Transformation properties

The Hilbert space that carries a unitary irreducible representation of a masslessparticle with helicity ξ contains all states of the form

|k; ξ〉 k = �k0

k0 = (0, 0, 1, ±i)(15.48)

The vector space that carries a manifestly covariant representation of a masslessparticle with transformation indices ( j, j ′) contains all states of the form

|k〉∣∣∣∣ j j ′

µ µ′

⟩k = �k0

k0 = (0, 0, 1, ±i)(15.49)

To compare these two ways of describing a massless particle we compare transfor-mation properties of their states.

Page 285: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.5 Transformation properties 271

A. {Hk0, 0} on |k0; ξ〉

{Hk0, 0} |k0; ξ〉 = |k0; ξ〉eiξ� (15.50)

where Hk0 = EXP(�J3 + θ1Y1 + θ2Y2).

B. {Hk0, 0} on |k0〉| j j ′µ µ′ 〉 The little group maps k0 to k0 but acts in a nontrivial way

on the spin states

{Hk0, 0} |k0〉∣∣∣∣ j j ′

µ µ′

⟩= |k0〉

∣∣∣∣ j j ′

ν ν ′

⟩D j j ′

νν ′;µµ′(Hk0 ) (15.51)

The direct product representation D j j ′has the following form

D j0(Hk0 ) = EXP(θ3 J ( j)

3 + θ1(J ( j)

1 + i J ( j)2

) + θ2(J ( j)

2 − i J ( j)1

))= EXP

(θ3 J ( j)

3 + (θ1 − iθ2

)(J ( j)

1 + i J ( j)2

))

=

ei jθ3 ∗ ∗ ∗ ∗ei( j−1)θ3 ∗ ∗ ∗

. . . ∗ ∗. . . ∗

e−i jθ3

(15.52)

D0 j ′(Hk0 ) = EXP

(θ3 J ( j ′)

3 + θ1(J ( j ′)

1 − i J ( j ′)2

) + θ2(J ( j ′)

2 + i J ( j ′)1

))= EXP

(θ3 J ( j ′)

3 + (θ1 + iθ2

)(J ( j ′)

1 − i J ( j ′)2

))

=

ei j ′θ3

∗ ei( j ′−1)θ3

∗ ∗ . . .

∗ ∗ ∗ . . .∗ ∗ ∗ ∗ e−i j ′θ3

(15.53)

By comparing Eq. (15.50) with Eq. (15.52) and Eq. (15.53) we reach the followingconclusions.

The state |k0〉| j 0j 0 〉 transforms identically to |k0; ξ〉 if ξ > 0 and j = +ξ .

The state |k0〉| 0 j ′0 − j ′ 〉 transforms identically to |k0; ξ〉 if ξ < 0 and j ′ = −ξ .

Page 286: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

272 Maxwell’s equations

If |ψ〉 is any physical state, it can be expanded in terms of either the helicitybasis states |k; ξ〉 or the direct product states |k〉| j j ′

µ µ′ 〉:|ψ〉 =

∑k,ξ

|k; ξ〉〈k; ξ |ψ〉

|ψ〉 =∑k,µµ′

|k〉∣∣∣∣ j j ′

µ µ′

⟩⟨k;

j j ′

µ µ′

∣∣∣∣ψ⟩

The amplitudes of the projection of |ψ〉 onto the basis states are 〈k; ξ |ψ〉 in the firstcase and 〈k; j j ′

µ µ′ |ψ〉 in the second. In both cases the sum extends over all k vectorsfor which �k · �k = 0, k �= 0. In the first case the sum extends over the appropriatehelicity states ξ (ξ = ±1 for photons). In the second case the sum extends over theappropriate values of µ, µ′ : − j ≤ µ ≤ + j, − j ′ ≤ µ′ ≤ + j ′.

We discuss the positive helicity state ξ = j > 0 first. The amplitude 〈k0; j |ψ〉of the state |k0; j〉 in any physical state |ψ〉 may be arbitrary. This is simply theamplitude of the massless particle of helicity j in the state |ψ〉. The amplitude〈k0; j 0

j 0 |ψ〉 in the same physical state |ψ〉 is the same. The amplitudes of thestates 〈k0; j 0

m 0 |ψ〉, m �= j , must all vanish. These states are all superfluous –allowed in the manifestly covariant representation but not present in the Hilbertspace that carries the unitary irreducible representation. A simple linear way toenforce this condition on the superfluous amplitudes is to require{

J ( j)3 k0

3 − jk04 I2 j+1

} ⟨k0;

j 0m 0

|ψ⟩

= 0 (15.54)

The matrix within the bracket {·} is diagonal, with the coefficient ( j − j)k03 = 0

multiplying the allowed amplitude 〈k0; j 0j 0 |ψ〉 and nonzero coefficients (m − j)k0

3

multiplying the amplitudes 〈k0; j 0m 0 |ψ〉. Since (m − j)k0

3 �= 0, the amplitudes thatare absent in the description of a physical state (m �= j) must vanish.

For the negative helicity states ξ = − j we have by a completely similar argument{J ( j)

3 k03 + jk0

4 I2 j+1

} ⟨k0;

0 j ′

0 m ′

∣∣∣∣ψ⟩

= 0 (15.55)

C. Other k-vector subspaces The coset operator Ck maps the state |k0; ξ〉 into thestate

Ck |k0; ξ〉 = |k; ξ〉 (15.56)

and the subspace |k0〉| j j ′µ µ′ 〉 into the subspace |k〉| j j ′

ν ν ′ 〉 through the following non-trivial similarity transformation

Ck |k0〉| j j ′

µ µ′ 〉 = |k〉| j j ′

ν ν ′ 〉D j j ′νν ′;µµ′(Ck) (15.57)

Page 287: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.6 Maxwell’s equations 273

The condition on the amplitude 〈k; j j ′µ µ′ |ψ〉 in the subspace |k〉 is related to the

conditions (15.54) and (15.55) in the subspace |k0〉 by a similarity transformation

M j j ′(k0)

⟨k0;

j j ′

µ µ′

∣∣∣∣ψ⟩

= 0

(15.58)

Ck M j j ′(k0)C−1

k

⟨k;

j j ′

µ µ′

∣∣∣∣ψ⟩

= 0

For the positive helicity state ξ = j the matrix M j j ′(k0) = M j0(k0) is given in

(15.54). The coset representative may be taken as the product of a boost in thez-direction,

Bz(k)(0, 0, 1, i) = (0, 0, k, ik) (15.59)

followed by a rotation

R(k)(0, 0, k, ik) = (k1, k2, k3, ik4) k21 + k2

2 + k23 = k2

4 = k2 (15.60)

For j = 1 the similarity transformation becomes

R(k)Bz(k4){

J ( j)3 − j I2 j+1

}B−1

z (k4)R−1(k)

= {J · k − 1k4 I3}⟨k;

1 0µ 0

∣∣∣∣ψ⟩

= 0 (15.61)

as the linear constraint that must be satisfied in the subspace |k〉| 1 0µ 0 〉. The negative

helicity states satisfy the constraint

{J · k + 1k4 I3}⟨k;

0 10 µ′

∣∣∣∣ψ⟩

= 0 (15.62)

15.6 Maxwell’s equations

The constraint equation is conveniently expressed in the coordinate rather than themomentum representation by inverting the original Fourier transform that broughtus from the coordinate to the momentum representation

〈k|x〉{

J·1i∇ + 1

1

i

∂(ict)I3

}〈x |k〉

⟨k;

1 0m 0

∣∣∣∣ψ⟩

= 0 (15.63)

If we define complex fields 〈x |k〉〈k; j 0m 0 |ψ〉 by ψ jm(x), ( j = 1, m = +1, 0, −1

or x, y, z or 1, 2, 3) then this equation simplifies to a differential equation. In the

Page 288: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

274 Maxwell’s equations

standard representation for the angular momentum operators for j = 1 we find

− i

c

∂t+∂3 −∂2

−∂3 − i

c

∂t+∂1

+∂2 −∂1 − i

c

∂t

B1 + i E1

B2 + i E2

B3 + i E3

= 0 (15.64)

− i

c

∂t(B + i E)1 + ∂3(B + i E)2 − ∂2(B + i E)3 = 0

−∂3(B + i E)1 − i

c

∂t(B + i E)2 + ∂1(B + i E)3 = 0 (15.65)

+∂2(B + i E)1 − ∂1(B + i E)2 − i

c

∂t(B + i E)3 = 0

These three equations are summarized as a vector equation by

− i

c

∂t(B + iE) − ∇×(B + iE) = 0 (15.66)

By taking the real and imaginary part of this complex equation we find

Re + 1

c

∂E∂t

− ∇×B = 0(15.67)

Im − 1

c

∂B∂t

− ∇×E = 0

These are Maxwell’s equations for positive helicity +1 massless particles (photons):

∇×B − 1

c

∂E∂t

= 0(15.68)

∇×E + 1

c

∂B∂t

= 0

The equations for negative helicity states are derived from the complex conjugaterepresentation D01 and are

+ i

c

∂t+∂3 −∂2

−∂3 + i

c

∂t+∂1

+∂2 −∂1 + i

c

∂t

B1 − i E1

B2 − i E2

B3 − i E3

= 0 (15.69)

It is easily verified that the resulting equations are identical to Eq. (15.68).

Page 289: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.8 Problems 275

15.7 Conclusion

In some sense, Maxwell’s equations were a historical accident. Had the dis-covery of quantum mechanics preceeded the unification of electricity and mag-netism, Maxwell’s equations might not have loomed so large in the history ofphysics.

In the quantum description of the electromagnetic field, photons are the fun-damental building blocks. Photons are described by a four-vector k that obeysk · k = 0 in free space, and a helicity index indicating a projection of an angularmomentum ±1 along the direction of propagation of the photon. Every physicalstate is described by a superposition of the photon basis states, and every superpo-sition describes a possible physical state. In this description of the electromagneticfield in free space no constraint equations are necessary.

The nineteenth century description of the electromagnetic field proceeds alongsomewhat different lines. A multicomponent field (E, B) is introduced at eachpoint in space-time. The components of the field transform in a very elegant wayunder homogeneous Lorentz transformations (as a tensor). If the field is Fouriertransformed from the coordinate to the momentum representation, then each four-momentum has six components associated with it. These are the components ofa second order antisymmetric tensor. Since the quantum description has only twoindependent components associated with each four-momentum, there are four di-mensions worth of linear combinations of the classical field components that donot describe physically allowed states, for each four-momentum. Some mechanismmust be derived for annihilating these superpositions. This mechanism is the setof equations discovered by Maxwell. In this sense, Maxwell’s equations are anexpression of our ignorance.

It is ironic that the first truly powerful applications of group theory were to thesolutions of equations. We now understand that group theory, by pointing to theappropriate Hilbert space for the electromagnetic field, allows us to relate phys-ical states to arbitrary superpositions of basis states. Since no superpositions areforbidden, no equations are necessary.

15.8 Problems

1. So, where are the divergence equations? In the special frame with little vectork0 = (0, 0, 1, i) the only nonvanishing component of the field, 〈k; j=1 0

m 0 |ψ〉, is thecomponent with m = +1 (cf., Eq. (15.54)). The coordinates are −(vx + ivy). Thevector v = (vx , vy, 0) represented by this coordinate is orthogonal to the spacial partof the little vector k0 = (0, 0, 1): k0·v = 0. Under boosts Bz and rotations, the non-vanishing component of the boosted field is orthogonal to the spacial part of the kvector: k · v(k) = 0. Backtransforming from the Fourier to the spacial representation,

Page 290: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

276 Maxwell’s equations

show that

k · v(k) = 0FT−1

−→ ∇ · (B + iE) = 0

Taking the real and imaginary parts of this equation give the source-free divergenceequations ∇ · E = 0 and ∇ · B = 0. Show this.

2. When sources are present the Maxwell equations are modified in a way that is mostclearly expressed in the “manifestly covariant representation.” If particle j at x( j) haselectric charge e j and magnetic charge m j , the electric and magnetic charge densitiesand current densities are defined as follows.

Electric MagneticChargedensity

ρe(x, t) =∑

j

e j x j (t) ρm(x, t) =∑

j

m j x j (t)

Currentdensity

Je(x, t) =∑

j

e jdx j (t)

dtJm(x, t) =

∑j

m jdx j (t)

dt

Conservationlaw

∇ · Je(x, t) + ∂ρe(x, t)

∂t= 0 ∇ · Jm(x, t) + ∂ρm(x, t)

∂t= 0

The conservation equations enforce the conditions of charge conservation (both elec-tric and magnetic, separately).

In order to extend Maxwell’s equations to include sources, the source free (ho-mogeneous) equations (15.66) must be coupled to the source terms in such a waythat the symmetry properties on the left (the fields) match the symmetry propertiesof the sources. Thus, the right-hand side must include only vector terms, and theseterms must have appropriate transformation properties under the discrete operationsT, P, T P . The result is unique up to scale factor:(

∇× + i

c

∂t

)(B + iE) = 1

i

c(Jm + iJe) (15.70)

The factor 4π is the surface area of the unit sphere in R3, and the factor 1/c on theright is determined by the system of units used (Gaussian).a. Show that Maxwell’s equations with sources are

∇ × B − 1

c

∂E∂t

= +4π

cJe

∇ × E + 1

c

∂B∂t

= −4π

cJm

b. Show that the Maxwell equations with sources are invariant under the simultaneoustransformation

B + iE → B′ + iE′ = eiφ(B + iE)Jm + iJe → J

′m + iJ

′e = eiφ(Jm + iJe)

Page 291: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.8 Problems 277

In particular, show that for φ = π/2 this is the dual transformation (B, E) →(E, −B).

c. Take the divergence of both sides of Eq. (15.70). Use the vector identity div curl(∗) = 0, for ∗ = anyvector. Show

i

c

∂t{∇ · (B + iE) − 4π (ρm + iρe)} = 0

d. By taking real and imaginary parts and integrating over time, find the following:

∇ · B(x, t) = 4πρm(x, t) + Cm(x)∇ · E(x, t) = 4πρe(x, t) + Ce(x)

e. Two “constants of integration” appear in these equations. They are functions ofspace but not of time. If these “constant functions of position” are zero the Maxwelldivergence equations result. Provide arguments to show that these constants shouldbe zero. These should take the form of investigating what the field looks like whenall particles head towards “infinity.”

Remark So far magnetic charges (monopoles) have not been observed, despitebeing predicted by supersymmetric theories and searched for actively by experimen-talists. This means that the first divergence equation is ∇ · B = 0.

3. In order to describe gravitational waves in free space it is possible to use the rep-resentation D j j ′+ j ′ j (�), with j − j ′ = ±2. In the case with ( j, j ′) = (2, 0) a curlequation is introduced to suppress four nonphysical complex amplitudes. Show thatthe gravitational wave equations in free space are

− 2i

c

∂t(Gm + iGe) − ∇×(Gm + iGe) = 0 (15.71)

The real and imaginary parts of this complex equation are

Re + 2

c

∂Ge

∂t− ∇×Gm = 0

(15.72)

Im − 2

c

∂Gm

∂t− ∇×Ge = 0

The fields Ge and Gm are called the gravitoelectric and gravitomagnetic fields. Thesefields can be treated in Cartesian coordinates as real symmetric 3 × 3 traceless matri-ces and in spherical coordinates as five-component rank-two spherical tensors. In thelatter case the curl operator is J·∇, where J is the 5 × 5 angular momentum operator:

J·∇ =

+2∂0

√4∂+ 0 0 0√

4∂− +1∂0

√6∂+ 0 0

0√

6∂− 0∂0

√6∂+ 0

0 0√

6∂− −1∂0

√4∂+

0 0 0√

6∂− −2∂0

Page 292: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

278 Maxwell’s equations

In Cartesian coordinates the curl operator is slightly more complicated. The Maxwell-like equations for the gravitoelectric and gravitomagnetic field are

0 ∂y −∂x 2∂z 0

−∂y 0 ∂z −∂x −√3∂x

∂x −∂z 0 −∂y

√3∂y

−2∂z ∂x ∂y 0 00

√3∂x −√

3∂y 0 0

F1

F2

F3

F4

F5

+ 2

c

∂t

G1

G2

G3

G4

G5

= 0

0 ∂y −∂x 2∂z 0−∂y 0 ∂z −∂x −√

3∂x

∂x −∂z 0 −∂y

√3∂y

−2∂z ∂x ∂y 0 00

√3∂x −√

3∂y 0 0

G1

G2

G3

G4

G5

− 2

c

∂t

F1

F2

F3

F4

F5

= 0

The relation between the five components of the rank-two spherical tensor and thenine matrix elements of a second order Cartesian tensor are (Ramos and Gilmore,2006)

Fi j = F11 F12 F13

F21 F22 F23

F31 F32 F33

=

F4 − 1√3

F5 F1 F3

F1 −F4 − 1√3

F5 F2

F3 F2 + 2√3

F5

The matrix components obey Fi j = Fji ,∑

i Fii = 0, and ∂ i Fi j = 0. The gravitoelec-tric and gravitomagnetic tensors have the same discrete symmetries as the electricand magnetic fields.

4. Follow the outline of Problem 2 to show the following.a. The gravitoelectric and gravitomagnetic fields satisfy divergence conditions in free

space. Write them down.b. In the presence of source terms (stationary and moving masses) the homogeneous

equations are “dressed” with source terms on the right-hand side. In Cartesian coor-dinates the source term for the gravitoelectric field is Ui j = ∑

k mk(xk(t)xk(t))i j ,and the form of the rank-two tensor is determined from the expression at theconclusion of Problem 3. What is the gravitational analog of the magneticmonopole?

c. The coupled equations are invariant under a gauge transformation of the firstkind of both the gravitoelectric and gravitomagnetic fields and the current terms:Gm + iGe → eiφ(Gm + iGe) and Jm + iJe → eiφ(Jm + iJe). Show this.

d. What are the divergence equations in the presence of moving matter?

5. Construct the source-free field equations for gravitons for the D j j (�) representation,with j = 1. Show that there are seven constraints that correspond to (J, M) with(J, M) = (0, 0), (1, 0), (1, ±1), (2, 0), (2 ± 1). What are these equations in the stan-

Page 293: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.8 Problems 279

dard differential representation? How are source terms (moving masses) coupled tothese equations?

6. Observed redshifts are extremely important in interpreting the history of our universe.There appear to be four sources for redshifts (so far):

(i) Doppler shift;(ii) gravitational redshift;

(iii) universal expansion redshift;(iv) Mach redshift.The Doppler shift has been recognized since 1842. Radiation from a source is red-shifted if the source and observer are moving away from each other, blueshifted ifthey are moving towards each other. The gravitational redshift is a consequence ofthe conservation of energy. As a photon climbs out of a gravitational potential it losesenergy and its frequency is redshifted. The universal expansion redshift is a conse-quence of the expansion of the universe. Two points (e.g., a source and an observer)that are at rest with respect to the the COBE background radiation (the “aether”) moveapart due to the expansion of the universe. If a wave with N wavelengths connects thetwo (distance Nλ), as time goes on and the distance increases the wavelength mustalso increase to Nλ′. This redshift source is sometimes confused with the Dopplershift because the two points appear to be moving apart due to the expansion of theuniverse. The fourth redshift source is controversial. Mach proposed that the inertia(mass) of a particle depends on the distribution of mass in the universe. Field theoryrequires that this information is transmitted by the fields set up by charges (electric,magnetic (if they exist), and masses). In fact, the exchange of virtual gravitons pro-vides information about the distribution of mass in the universe within our horizonand should contribute to the mass (inertia) of a particle in the same way that exchangeof virtual photons contributes to the energy (mass) changes in the Lamb effect.a. Assume that the energy density in the universe has the form ρ(x, t) = ρ(t) (time

dependent only). Assume that since recombination (∼300 kY after the Big Bang)the horizon of the accessible universe has been uniformly expanding. Assume thatthe mass of the electron comes from two sources: interactions with electromag-netic radiation and interaction with graviational radiation. Compute how the masschanges with time.

b. Estimate the mass dependence of the electron–proton mass ratio me(t)/Mp(t).c. If the electron mass is increasing in time because of the expansion of the horizon

with time, then the electron was less massive in the past. Radiation emitted fromthe hydrogen atom has frequency ν = 1

2 (mc2/�) × |(1/n21 − 1/n2

2)| where n1 andn2 are the principal quantum numbers of the two states involved in the transitionand m is the reduced mass of the electron–proton system. Show that Hα photonsemitted from hydrogen at rest with the COBE background are redshifted becauseof the universal expansion and because the electron was less massive in the past.Disentangle these two effects and argue that the Mach shift aliases the universalexpansion redshift.

Page 294: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

280 Maxwell’s equations

7. The locally flat metric of space-time and the metric representing a certain type ofgravitational field are given by the matrices

gflat =

c2

−1−1

−1

ggrav =

c2

(1 + 2�(x)

c2

)−1

−1−1

Here �(x) is the local Newtonian gravitational field. Find a locally linear coordinatetransformation S that brings the curved metric to flat form: St ggravS = gflat. InterpretS in terms of a locally free-falling coordinate transformation.

8. Gauss’ law on the sphere S2 Gauss’ law in R3 states∮E·dS =

∫4πρ dV

The integral on the left is over the surface bounding the volume V over which theintegral on the right extends, E is the electric field and ρ is the charge density. For acharge q at the origin of a sphere of radius a, ρ(x) = qδ(x), The E field is sphericallysymmetric, and Gauss’ Law reduces to

4πa2|E(a)| = 4πq

From this, and symmetry, we deduce the Coulomb/gravitational force law:

E(a) = q

a2

a|a|

By completely similar arguments Gauss’ Law in the plane R2 gives |E(a)| = q/|a|.Assume a Gauss law (

∮E·dS = ∫

2πρd A) holds on the sphere S2. Place a chargeq on the north pole of a sphere of radius R (see Fig. 15.1).a. An observation point subtends an angle θ when measured from the center of the

sphere S2 (c.f., Fig. 15.1). Show that its distance a from the north pole is a = Rθ

R

q

q

a

Figure 15.1. A charge q is placed on the north pole of a sphere of radius R.

Page 295: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.8 Problems 281

and the circumference of a circle of latitude through this point is 2π R sin θ . Usethis information to deduce

|E| = q

R sin θ= q

R sin(a/R)

Conclude that the field is stronger than the q/a form it would have in a plane.b. Show that this effective strengthening is due to the relative compression of the E

field lines (compared to the planar case) due to the positive curvature of the sphere.c. Rewrite this result as

|E| = q

R sin(a/R)= q(a)

aq(a) = q

(a/R

sin(a/R)

)where a (a = Rθ ) is the distance from the charge to the observation point.

d. If the observer thinks (s)he is in a flat space, conclude (s)he will think the effectivecharge depends on the distance from the observation point. In particular, if a = ct ,the further back in time the observer looks, the stronger (s)he will think the chargeis.

9. Gauss’ law on rank-one homogeneous spaces The invariant metric and mea-sure on the three Riemannian symmetric spaces H n = SO(n, 1)/ SO(n), Rn =I SO(n)/SO(n), and Sn = SO(n + 1)/SO(n) are

ds2 = dr2

1 − kr2+ r2

n∑j=2

(sin θ2 sin θ3 · · · sin θ j−1dθ j )2

where k = (−1, 0, +1) for H n, Rn, Sn and radial coordinates are used:

x1 = r cos θ2

x2 = r sin θ2 cos θ3...

xn−1 = r sin θ2 sin θ3 · · · sin θn−1 cos θn

xn = r sin θ2 sin θ3 · · · sin θn−1 sin θn

a. Derive the metric for H n, Sn from Eq. (12.9) and the coordinate transformationabove.

b. Assume a Gauss Law of the form∮E·dS =

∫�ρ(x)dV

Compute �, the surface area of the unit sphere Sn−1 ⊂ H n, Rn, or Sn . (Hint: use∫e−x2

dx = √π , carry the n-fold integral out in Cartesian and radial coordinates,

and show � = 2πn/2/�(n/2).)c. Carry out the integral for a charge q at the origin to show

|E|an−1 = q

Page 296: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

282 Maxwell’s equations

d. Show that the distance d from the origin to the sphere of radius a is

d(a) =∫ a

0

dr√1 − kr2

−→

sinh−1a k = −1a k = 0sin−1a k = +1

e. Express the electric field strength as

|E| = q(d)

dn−1q(d) = q ×

(d/R

sinh(d/R)

)n−1

1(d/R

sin(d/R)

)n−1

Here R is some characteristic size scale for the spaces H n, Sn .f. Show that in the two curved spaces the observed charge is renormalized upward in

Sn , downward in H n , with lookback time. Give a physical interpretation involvingcompression or rarefaction of field lines. How does this renormalization dependon R, c, t?

10. The special theory of relativity is based on two assumptions that have been raised tothe status of axioms:1. The speed of light is the same in all inertial frames.2. Physical laws have the same form in all inertial frames.The second axiom has been rephrased in the spirit of thermodynamics: “It is impos-sible, by any experiment, to determine the absolute motion of an inertial frame ofreference.” This form is motivated by the failure of the Michelson–Morley experi-ment to detect the motion of the Earth through the “aether.” In this form the secondaxiom is false: This has been shown by measurements of the microwave backgroundradiation, which contains a nonzero dipole moment. This shows that the Solar Systemof galaxies is moving through the microwave background at a speed of ∼370 km/s inthe direction with galactic coordinates (l, b) = (263◦, 48◦).a. What effect does the ability to determine an absolute frame of reference have on

the special theory of relativity?b. Assume the temperature distribution of the microwave background is T (θ, φ; t) =∑

l,m Alm(t)Y l

m(θ, φ). How do you use this information to determine a frame thatis: not translating? not rotating?

c. Since an absolute rest frame (nontranslating, non rotating) is defined by ther-modynamic measurements, argue that this special reference frame is statisticallydetermined.

d. Show that the determination of this special frame of reference is uncertain due tothe uncertainty relations of statistical mechanics: �U �(1/T ) ≥ k in the entropyrepresentation (Gilmore, 1985).

e. If thermodynamic background fields of spin 12 (neutrinos) and spin 2 (gravitons)

also exist, show that they also can be used to determine special rest frames. Argue

Page 297: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

15.8 Problems 283

why, or why not, the special frames defined by j = 12 , 1, 2 are the same. What

happens if they are different?f. Assume (for simplicity) that there is only one massive object in the universe and

that it moves through the microwave background radiation with a velocity v(t).Show that its velocity decays to zero according to v(t) � v(t0)e−(t−t0)/τ becauseit is moving through a viscous medium. Estimate τ and present your answer inthe form τ/Tp, where Tp is the present age of the universe (Tp � 13.7 BY). Tocarry out this estimate you may assume the massive object is a black body –in fact, assume it is a black hole with mass M , radius R at temperature TB H .Use the standard relations for a neutral nonrotating black hole R = 2G M/c2,TB H = �c3/8πkG M . You can assume that the mass M is sufficiently large thatthe temperature TB H can be neglected (set to zero). Assume that the absorption(geometric) cross section for radiation on a black hole is γπ R2, where γ = 33/22.Note that the problem of slowing down in a viscous medium was discused byEinstein in another of the papers from his “annus mirabilis,” the precursor of thefluctuation–dissipation theorem.

Page 298: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16

Lie groups and differential equations

Lie group theory was initially developed to facilitate the solution of dif-ferential equations. In this guise its many powerful tools and results arenot extensively known in the physics community. This chapter is designedas an antidote to this anemia. Lie’s methods are an extension of Galois’methods for algebraic equations to the study of differential equations.The extension is in the spirit of Galois’ work: the technical details are notsimilar. The principle observation – Lie’s great insight – is that the simpleconstant that can by added to any indefinite integral of dy/dx = g(x) is infact an element of a continuous symmetry group – the group that maps so-lutions of the differential equation into other solutions. This observationwas used – exploited – by Lie to develop an algorithm for determiningwhen a differential equation had an invariance group. If such a group ex-ists, then a first order ordinary differential equation can be integrated byquadratures, or the order of a higher order ordinary differential equationcan be reduced.

Galois inspired Lie. If the discrete invariance group of an algebraic equation couldbe exploited to generate algorithms to solve the algebraic equation “by radicals,”might it be possible that the continuous invariance group of a differential equationcould be exploited to solve the differential equation “by quadratures”? Lie showedemphatically in 1874 that the answer is YES!, and work has hardly slowed downin the field that he pioneered from that time to the present.

But what is the group that leaves the solutions of a differential equation invariant –or maps solutions into solutions? It turns out to be none other than the trivial constantthat can be added to any indefinite integral. The additive constant is an element ina translation group.

We outline Lie’s methods for first order ordinary differential equations. First,we study the simplest first order equation in one independent variable x and onedependent variable y: dy/dx = g(x). This is treated in Section 16.1. In that sectionwe set up the general formulation in terms of a constraint equation dy/dx = p and

284

Page 299: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.1 The simplest case 285

a surface equation F(x, y, p) = 0. The special forms of the surface and constraintequations are exploited to write down the solution by quadratures.

Lie’s methods are presented in Section 16.2 in a number of simple, easy to di-gest steps. Taken altogether, these provide an algorithm for determining whetheran ordinary differential equation possesses a symmetry and, if so, what that sym-metry is. Transformation to a set of canonical variables R, S, T is algorithmic. Thecanonical variable R(x, y) is the new independent variable (like x), S(x, y) is thenew dependent variable (like y), and T (x, y, p) is the new constraint between S andR (like dy/dx). In this new coordinate system the surface and constraint equationsassume the desired forms F(R, −, T ) = 0 and d S/d R = f (R, −, T ). The systemhas been reduced to quadratures, and integration follows immediately.

Despite the simplicity of the algorithm, it is not easy to understand these stepswithout a roadmap. Such is provided in Section 16.3, where a simple example isdiscussed in detail.

Lie’s methods extend in many different directions. Several of these are indicatedin Section 16.4.

16.1 The simplest case

The simplest first order ordinary differential equation to deal with has the form

dy

dx= g(x) (16.1)

Here x is the independent variable and y is the dependent variable. The solution ofthis equation is (almost) trivially

y = G(x) =∫

g(x) dx (+ additive constant) = G(x) + c (16.2)

If we write the solution in the form y − G(x) = 0, then the surface y + c − G(x) =0 is also a solution of the original equation (16.1). There is a one-parameter groupof displacements that maps one solution into another. These displacements can berepresented by the Taylor series displacement operator ec∂/∂y , for

ec∂/∂y[y − G(x) = 0] = y + c − G(x) = 0 (16.3)

In short, the “trivial” additive constant is in fact a one-parameter group of transla-tions that maps solutions (16.2) of (16.1) into other solutions of the original simpleequation (16.1). This translation group plays the same role for first order ordinarydifferential equations that the symmetric group Sn plays for nth degree algebraicequations.

Page 300: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

286 Lie groups and differential equations

For convenience, we express the derivative dy/dx as a coordinate p. The firstorder differential equation (16.1) can be written in the form F(x, y, p) = 0, whereF(x, y, p) = p − g(x) for the particular case at hand. There are two relationsamong the three variables x, y, p. They are given by the surface equation andthe constraint equation:

surface equation F(x, y, p) = 0constraint equation p = dy/dx when F(x, y, p) = 0

(16.4)

It is useful to express the action of the three partial derivatives ∂/∂x, ∂/∂y, ∂/∂pon the surface F(x, y, p) defining the ordinary differential equation. It is alsouseful to express the action of the generator of infinitesimal displacements thatmaps solutions of this equation into other solutions of this equation, on the threecoordinates. These two relations are summarized as follows:

∂x∂

∂y∂

∂p

[p − g(x)] =

0∗

∂y

x

yp

=

0

10

(16.5)

These two equations will be generalized to the determining equation for the in-finitesimal generator of the invariance group and the determining equations for thecanonical coordinates.

16.2 First order equations

In this section we will summarize Lie’s approach to the study of differential equa-tions (Blumen and Cole, 1969; Estabrook and Wahlquist, 1975; Wahlquist andEstabrook, 1976). We do this for equations of first order (dn y/dxn , n = 1) andfirst degree (depends on pm = (dy/dx)m , m = 1). The results are independent ofdegree.

If the equation that defines the first order ordinary differential equation,F(x, y, p) = 0, is not of the form p − g(x), so that ∂

∂y F(x, y, p) �= 0, then wecan attempt to find the following.

(i) A one-parameter group that leaves F(x, y, p) = 0 unchanged.(ii) A new “canonical” coordinate system (R, S, T ). In this coordinate system R = R(x, y)

is the independent variable, S = S(x, y) is the dependent variable, and T = T (x, y, p)is the new constraint variable. In this canonical coordinate system the surface equationF(x, y, p) = 0 is not a function of the new dependent variable: F(R, −, T ) = 0.

Page 301: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.2 First order equations 287

In this new coordinate system the source term for the constraint equation is alsoindependent of the dependent variable: d S/d R = f (R, −, T ).

16.2.1 One-parameter group

We search for a one-parameter group of transformations that leaves the surfaceequation invariant by changing variables in the (x, y) plane according to

x → x(ε) = x + εξ (x, y) + O(ε2) x(ε = 0) = xy → y(ε) = y + εη(x, y) + O(ε2) y(ε = 0) = yp → p(ε) = p + εζ (x, y, p) + O(ε2) p(ε = 0) = p

(16.6)

In the simplest case Eq. (16.1), this one-parameter group is x → x and y → y + ε,so that ξ = 0, η = 1, and ζ = 0.

16.2.2 First prolongation

The function ζ (x, y, p) is not independent of the functions ξ (x, y) and η(x, y). Theformer is related to the latter pair by the first prolongation formula. Specifically,

p = d y

d x= d y/dx

dx/dx= p + ε(ηx + ηy p)

1 + ε(ξx + ξy p)−→ p + ε[ηx + (ηy − ξx )p − ξy p2]

(16.7)to first order in ε, where ηx = ∂η/∂x , etc. As a result

ζ (x, y, p) = η(1)(x, y, y(1)) = ηx + (ηy − ξx )p − ξy p2 (16.8)

16.2.3 Determining equation

The surface equation must be unchanged under the one-parameter group of trans-formations, so that

F(x, y, p) = 0 → F (x(ε), y(ε), p(ε))ε small−→ F(x + εξ, y + εη, p + εζ )

= F(x, y, p) + ε

∂x+ η

∂y+ ζ

∂p

)F(x, y, p) + h.o.t. (16.9)

These are the leading two terms in the Taylor series expansion

F (x(ε), y(ε), p(ε)) = eεX F(x, y, p) = 0 (16.10)

Page 302: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

288 Lie groups and differential equations

where the generator of infinitesimal displacements for the one-parameter group thatleaves the surface equation invariant is

X = ξ∂

∂x+ η

∂y+ ζ

∂p(16.11)

The first two terms in Eq. (16.9) and (16.10) are

F(x, y, p) = 0 and X F(x, y, p) = 0 (16.12)

These are called the determining equations. The determining equations (16.12)are generalizations of equations (16.5).

Specifically, these equations are used to determine the functions ξ (x, y), η(x, y),and ζ (x, y, p) that define the infinitesimal generator X . These functions are deter-mined by an algorithm based on linear algebra. There are recent versions depend-ing on sophisticated methods of algebraic topology. These methods are elegantimprovements of a conceptually simple brute strength procedure that we summa-rize briefly. The surface equation F(x, y, p) = 0 is solved for p as a function ofx and y: p = p(x, y). This expression is substituted into the determining equationX F(x, y, p(x, y)) = 0, so that this equation depends only on two independent vari-ables x and y. The generators of the infinitesimal displacements, ξ (x, y) and η(x, y),are represented by Laurent expansions, or Taylor series expansions if convergentsolutions are sought:

ξ (x, y) =∑i, j

ξi j xi y j 0 ≤ i, j, i + j ≤ dξ (16.13)

and similarly for η. These representations are truncated at finite degrees dξ , dη. Thedetermining equation X F = 0 is expanded into the form

∑Ci j xi y j = 0. Each

coefficient Ci j must vanish separately, by standard linear independence arguments.This gives a set of simultaneous linear equations in the expansion amplitudesξi j , ηi j . In general, there are more equations than unknowns. Since the equationsare homogeneous, there are no nontrivial solutions if the rank of this system is equalto the number of unknowns. The number of independent solutions (up to an overallscaling factor) is equal to the corank of this system of equations. This is not largerthan one for first order equations but may exceed one for second and higher orderequations. This algorithm is effective when ξ (x, y) and η(x, y) are polynomials offinite degree.

16.2.4 New coordinates

If an infinitesimal generator X can be constructed from the determining equa-tions, then it is possible to determine a new system of coordinates R, S, T which

Page 303: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.2 First order equations 289

“straightens out” the surface equation. This is done by solving the determiningequations for canonical coordinates. These are a set of partial differential equationsthat are analogous to the equations on the right-hand side of Eq. (16.5). For con-venience, we summarize the determining equations for the infinitesimal generatorand for the canonical coordinates, analogs of the two equations in Eq. (16.5), asfollows:

X F = 0 X

R(x, y)

S(x, y)T (x, y, p)

=

0

10

(16.14)

The three linear partial differential equations on the right determine the new canon-ical coordinates: the independent variable R(x, y), the dependent variable S(x, y),and the new constraint T (x, y, p) between R and S.

16.2.4.1 Dependent coordinate

The dependent coordinate S is determined from the differential equationX (x, y, p)S(x, y) = 1. We require S to be independent of p, so the conditiondefining S reduces to(

ξ (x, y)∂

∂x+ η(x, y)

∂y

)S(x, y) = 1 (16.15)

The solution is not unique: any function of x and y that is annihilated by X can beadded to the solution. Further, it is not important that X S = +1: we could just aswell choose a solution satisfying X S = −1 or, for that matter, X S = k �= 0, wherek is some constant.

16.2.4.2 Invariant coordinates: independent variable

The two invariant coordinates R and T are unchanged under the one-parametertransformation group. These functions obey X R = 0 and XT = 0, which are ex-plicitly (

ξ (x, y)∂

∂x+ η(x, y)

∂y

)R(x, y) = 0(

ξ (x, y)∂

∂x+ η(x, y)

∂y+ ζ (x, y)

∂p

)T (x, y, p) = 0

(16.16)

The solutions are most simply found by the method of characteristics. They obeythe differential relations

dx

ξ (x, y)= dy

η(x, y)= dp

ζ (x, y, p)(16.17)

The first equation is used to construct R(x, y).

Page 304: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

290 Lie groups and differential equations

16.2.4.3 Invariant coordinates: constraint variable

The second equation in (16.17) is used to construct T (x, y, p). It is often possibleto construct T so that it is a function of p to the first power. When this is possible,it is the preferred form of the nonunique expression for the invariant cordinate T .

16.2.5 Surface and constraint equations

In the new coordinate system there is a constraint equation:

d S

d R= d S(x, y)

d R(x, y)= d S/dx

d R/dx= Sx + Sy p

Rx + Ry p(16.18)

This derivative is independent of the parameter ε of the one-parameter group.Therefore it must be independent of the coordinate S, and depend only on theinvariant coordinates R and T . In this new coordinate system the surface andconstraint equations are

surface equation F(R, −, T ) = 0constraint equation d S/d R = f (R, −, T )

(16.19)

These are directly analogous to Eq. (16.1) and dy/dx = p in Section 16.1.

16.2.6 Solution in new coordinates

To integrate the transformed equation, the surface equation is used to determine Tas a function of R: T = T (R). This expression is used in the constraint equation,which can then “easily” be integrated to give

S =∫

f (R, −, T (R)) d R + c (16.20)

The additive parameter c is the image of the parameter ε of the one-parametergroup of transformations that leaves the original surface equation F(x, y, p) = 0invariant.

16.2.7 Solution in original coordinates

The inverse relation x = x(R, S), y = y(R, S) is used to express the solutionEq. (16.20) of the transformed equation in terms of the original coordinates.

16.3 An example

The algorithm developed in Section 16.2 is, for all practical purposes, impossibleto understand without illustrating its workings by a particular example. To illustrate

Page 305: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.3 An example 291

20

10

0

−10

−2

−1

0

1

21

0.80.6

0.40.2

Figure 16.1. The first order ordinary differential equation xp + y − xy2 = 0. Herep (vertical) is plotted over the (x, y) plane for 0.1 ≤ x ≤ 1.1 and −2 ≤ y ≤ +2.The shape of the surface depends on both coordinates x and y.

the algorithm, we use it to integrate the equation

F(x, y, p) = xp + y − xy2 = 0 (16.21)

Before setting out on this path, we first attempt the following scaling trans-formation y → αy and x → βx . Under this transformation the equation trans-forms to α(xp + y − (αβ)xy2) = 0. The equation is invariant provided αβ = 1.The one-parameter group that leaves the surface constraint F(x, y, p) = 0 invari-ant is x → λx , y → λ−1 y, p → λ−2 p. Since there is a one-parameter invariancegroup for this differential equation, Lie’s methods are guaranteed to work. In fact,it is possible to construct the infinitesimal generator X (x, y, p) from this groupdirectly.

The surface p = y2 − y/x is shown in Fig. 16.1. The value of p clearly dependson both coordinates x and y. The purpose of the change of variables is to find anew coordinate system in which the surface is independent of the new dependentvariable S(x, y).

The determining equation Eq. (16.14) is

ξ (p − y2) + η(1 − 2xy) + [ηx + (ηy − ξx )p − ξy p2]x = 0 (16.22)

Page 306: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

292 Lie groups and differential equations

The functions ξ (x, y) and η(x, y) describing the generators of infinitesimal dis-placements are determined following the algorithm outlined in Section 16.2.3. First,we use the surface equation F(x, y, p) = 0 to find an expression for p: p(x, y) =−y/x + y2. This is substituted into the determining equation X F(x, y, p) = 0 toprovide a functional relation between x and y:

ξ(− y

x

)+ η(1 − 2xy) + ηx x + (ηy − ξx )(xy2 − y) − ξy

(xy4 − 2y3 + y2

x

)= 0

(16.23)

We first attempt zeroth degree expressions for ξ and η: ξ = ξ00, η = η00. When theseare substituted into Eq. (16.23) we obtain three equations for the two unknowns. Thecoefficients of the monomials y/x , 1, and xy depend on the unknown parametersξ00, η00 as follows:

monomial ξ00 η00

y/xx0 y0 = 1

xy

−1 0

0 10 −2

[

ξ00

η00

]=

[00

](16.24)

This system of three simultaneous linear equations in two unknowns has rank two,therefore no nontrivial solutions.

We therefore increase the degree of ξ (x, y) and η(x, y) to one and repeat theprocess. The relation Eq. (16.23) between x and y is now

(ξ00 + ξ10x + ξ01 y)(− y

x

)+ (η00 + η10x + η01 y)(1 − 2xy) + η10x

+ (η01 − ξ10)(xy2 − y) − ξ01

(xy4 − 2y3 + y2

x

)= 0 (16.25)

This results in the following set of ten equations for six unknowns:

monomial ξ00 ξ10 ξ01 η00 η10 η01

y2/xy/x1xy

xyx2 yxy2

xy3

xy4

0 0 −2 0 0 0−1 0 0 0 0 0

0 0 0 +1 0 00 0 0 0 +2 00 0 0 0 0 00 0 0 −2 0 00 0 0 0 −2 00 −1 0 0 0 −10 0 −2 0 0 00 0 +1 0 0 0

ξ00

ξ10

ξ01

η00

η10

η01

=

00000

(16.26)

Page 307: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.3 An example 293

This set of equations has rank five, so there is one nontrivial solution. From the firstfour equations we determine ξ01 = ξ00 = η00 = η10 = 0, and from the coefficientof xy2 we learn −ξ10 − η01 = 0 so that, up to some overall scaling factor, we cantake ξ (x, y) = x and η(x, y) = −y. Since we have found one nontrivial solutionfor an infinitesimal generator of a one-parameter group of a first order equation, wecan stop searching for additional solutions to the determining equation (for secondorder equations there may be additional solutions).

With this solution ξ (x, y) = x and η(x, y) = −y the prolongation formula Eq.(16.8) gives ζ = −2p, so that the generator of infinitesimal displacements is

X = x∂

∂x− y

∂y− 2p

∂p(16.27)

The infinitesimal generator is now used to determine the new set of coordinates.We first determine the dependent coordinate S(x, y) by attempting to solve(

x∂

∂x− y

∂y

)S(x, y) = 1 (16.28)

It is useful first to seek a solution S(x, y) depending only on the single variabley. Such a solution can be found if the equation −yd S(y)/dy = 1 can be solved.The solution, up to an additive constant, is − ln(y). We will adopt this solution,neglecting the negative sign: S(x, y) = ln(y).

The invariant coordinates are determined using the method of characteristics:

dx

x= dy

−y= dp

−2p(16.29)

The first equation for the new independent variable simplifies to ydx = −xdy ord(xy) = 0, from which we conclude that R(x, y) = xy is an invariant coordinatethat obeys Eq. (16.14). The invariant coordinate involving p is determined by setting−dp/2p equal to either of the other two differentials. We set it equal to dx/x toavoid having the second invariant coordinate dependent on y. The equation isdx/x = −dp/2p and the solution is (1/x)d(x2 p) = 0, so that T (x, y, p) = x2 p.The forward and backward transformations between the two coordinate systemsare

RST

=

xy

ln(y)x2 p

x

yp

=

Re−S

eS

T e2S/R2

(16.30)

In the new coordinate system the surface equation transforms to

F(x, y, p) = xp + y − xy2 = 0 −→ eS

[T

R+ 1 − R

]= 0 (16.31)

Page 308: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

294 Lie groups and differential equations

12

10

8

6

4

2

0

0

1

2 43

21

0−1

−2−3−2

−1

Figure 16.2. The surface xp + y − xy2 = 0 transforms to the surface T/R + 1 −R = 0 in canonical coordinates. Here T (vertical) is plotted over the (R, S) planefor −3 ≤ R ≤ +4 and −2 ≤ S ≤ +2. The function is a simple ruled surface,independent of S.

The expression within the brackets is the transformed surface equation. It is inde-pendent of S. This surface T = T (R, S) is plotted in Fig. 16.2. It has the desiredform: a ruled surface whose shape (height) is independent of the dependent variableS. Such a surface is sometimes called a “cylinder.”

The new constraint equation is

d S

d R= d(ln y)

d(xy)= p/y

y + xp= T eS/R2

eS + (Re−S)(T e2S/R2)(16.32)

The surface and constraint equations are

surface equation T/R + 1 − R = 0constraint equation d S/d R = (T/R)/(T + R)

(16.33)

The surface equation is solved for T as a function of R: T (R) = R2 − R. Thisexpression is substituted into the constraint equation to give a first order differentialequation in quadratures:

d S

d R= 1

R− 1

R2=⇒ S = ln(R) + 1

R+ c (16.34)

The parameter c is the parameter of the translation group that leaves invariant thetransformed equation.

Page 309: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.4 Additional insights 295

The inverse transformation, Eq. (16.30), from (R, S) to (x, y) is finally used torewrite the solution in terms of the original set of variables:

y = −1

x(c + ln x)(16.35)

Remarks The operator x d/dx is the infinitesimal generator for scaling transfor-mations, since eλxd/dx x = eλx . As a result, the infinitesimal generator X has thefollowing effect on the coordinates (x, y, p):

EXP

{x

∂x− y

∂y− 2p

∂p

}) x

yp

=

eλx

e−λye−2λ p

(16.36)

From this scaling behavior, it is easy to see that ln(y) is linear in the Lie translationgroup parameter: ln(e−λy) = ln(y) − λ. The invariant operators come right out ofthe scaling transformations: xy and x2 p are unchanged by the scaling transforma-tion. None of these operators is unique. The operator ln(xy2) is linear and x3 ypis invariant. We have just chosen the most convenient (simplest) solutions to theequations defining the new coordinates.

16.4 Additional insights

Lie’s theory of infinitesimal transformation groups has been extended in manydifferent directions, all of which are powerful and beautiful. It is barely possible toscratch the surface here. Instead, we content ourselves by indicating some of thedirections in which it can be extended. These directions are simple consequencesof the analyses presented in the previous two sections.

16.4.1 Other equations, same symmetry

Many differential equations can share the same invariance group. The most gen-eral first order ordinary differential equation invariant under the scaling groupEq. (16.36) has the form F(R, −, T ) = 0 or more simply F(xy, x2 p) = 0. Themost general first order equation of first degree with this symmetry has the formx2 p = h(xy) or dy/dx = x−2h(xy). For the equation studied in Section 16.3,h(z) = −z + z2. For the Riccati equation dy/dx + y2 − 2/x2 = 0, h(z) = z2 − 2.

16.4.2 Higher degree equations

These methods work equally well with first order equations of higher degree.For example, the first order, second degree equation y′2 + y4 − x−4 = 0 hascanonical form R4 + T 2 = 1. The original equation has two solution branches

Page 310: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

296 Lie groups and differential equations

p = ±√

x−4 − y4, corresponding to the two solution branches in the canonicalcoordinate system T = ±√

1 − R4.

16.4.3 Other symmetries

The methods described in Section 16.2 and illustrated by example in Section 16.3apply to any first order ordinary differential equation with a one-parameter group.Table 16.1 provides a list of symmetries that may be encountered for ordinarydifferential equations. For each symmetry the functions ξ (x, y) and η(x, y) are tab-ulated, as well as the first prolongation ζ (x, y, p) = η(1)(x, y, p). We also presentthe canonical coordinates (R, S, T ). Since the constraint equation d S/d R dependsonly on the change of variables, it also can be tabulated, and has been. The simplestcase, Eq. (16.1), is present in the first line of this table. The equation studied inSection 16.3 is present in the eighth line of this table.

The Lie symmetries leaving the equation invariant can be determined from thistable in one of two ways. We can use the generator of infinitesimal displacements tocompute them, as in Eq. (16.36). Or we can look at the transformations effected byS → S′ = S + c, R′ = R. In the latter case we find ln(y) → ln(y) + c = ln(ec y) =ln(y(c)) and since xy = x(c)y(c), the transformation is x(c) = e−cx and y(c) =e+c y.

16.4.4 Second order equations

Second order equations can be studied by simple extensions of the methods usedto study first order equations. The infinitesimal generator for displacements nowinvolves derivatives with respect to y(2) and is given by

X = ξ∂

∂x+ η

∂y+ η(1) ∂

∂y(1)+ η(2) ∂

∂y(2)(16.37)

The second prolongation can be determined from the first in a straightforwardcomputation

d2 y

d x2= d

dx

(d y

d x

)= d

dx(p + εη(1)) = D(1)(p + εη(1))

D(0)(x + εξ )= y(2) + εD(1)η(1)

1 + εD(0)ξ

= y(2) + ε(D(1)η(1) − y(2) D(0)ξ

)(16.38)

As a result,

η(2)(x, y, y(1), y(2)) = D(1)η(1) − y(2) D(0)ξ (16.39)

Page 311: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.4 Additional insights 297

Table 16.1. Infinitesimal generators ξ, η, ζ , canonical coordinates R, S, T , andconstraint equation d S/d R for some Lie symmetries

Infinitesimal generators Canonical coordinates Constraint

ξ (x, y) η(x, y) ζ (x, y, p) R(x, y) S(x, y) T (x, y, p) d S/d R

0 1 0 x y p T1 0 0 y x p 1/T

1/a −1/b 0 ax + by bx − ay p (b − aT )/(a + bT )x 0 −p y ln x xp 1/T0 y p x ln y p/y T

x/a y/b (1/b − 1/a)p yb/xa b ln y p/x (a/b−1) (bT/R)/(bT − a R(1/b))x y 0 y/x ln y p (T/R)/(T − R)x −y −2p xy ln y x2 p (T/R)/(T + R)

2x y −p y2/x ln y yp (T/R)/(2T − R)x 2y p y/x2 ln y p/x (T/R)/(T − 2R)y 0 −p2 y x/y x − y/p −T/R2

0 x 1 x y/x xp − y T/R2

−y x 1 + p2√

x2 + y2 tan−1(y/x) (y − xp)/(x + yp) −T/R1 y/x (px − y)/x2 y/x x (xp − y)/x2 1/Ta x 1 x2 − 2ay x/a x − ap 1/(2aT )a y p x − a ln y x/a p/y (1/a)/(1 − aT )x b −p ey/xb y/b ey ∗ pb (bR)−1/

[1 − b(R/T )(1/b)]y b −p2 y2 − 2bx y/b y − b/p 1/(2bT )0 e f (x) f ′e f x y/e f p − y f ′ T/e f (R)

x2 xy y − xp y/x 1/x xp − y 1/Txy y2 yp − xp2 y/x 1/y y/p − x 1/(T R2)xy 0 −yp − xp2 y (ln x)/y y/(xp) − ln x T/R2

0 xy y + xp x (ln y)/x xp/y − ln y T/R2

g(y) 0 −g′ p2 y x/g 1/p − xg′/g T/g(R)0 f (x) f ′ x y/ f f p − f ′ y T/ f 2(R)

f (x) 0 − f ′ p y F (F ′ f = 1) p f 1/T0 g(y) g′ p x G (G ′g = 1) p/g T

xk+1 kxk y xk (k2 y/x − p) y/xk 1/xk xp − ky −k/Tkxyk yk+1 yk (p − k2xp2/y) x/yk 1/yk y/p − kx −k/T

where

D(n) = ∂

∂x+ dy

dx

∂y+ dy(1)

dx

∂y(1)+ · · · + y(n+1) ∂

∂y(n)(16.40)

It is explicitly

η(2) = ηxx + (2ηxy − ξxx )y′ + (ηyy − 2ξxy)y′2 − ξyy y′3

+ (ηy − 2ξx − 3ξyy′)y′′ (16.41)

The determining equations are

F(x, y, y(1), y(2)) = 0 X (x, y, y(1), y(2))F(x, y, y(1), y(2)) = 0 (16.42)

Page 312: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

298 Lie groups and differential equations

Symmetries are found by following the algorithm described in Section 16.2.3 andillustrated in Section 16.3.

16.4.5 Reduction of order

If a higher order equation has a known one-parameter symmetry group, the order ofthe equation can be reduced by one. We illustrate as usual by example. The generalcase can easily be inferred from the example.

Suppose a second order equation F(x, y, y′, y′′) = 0 is invariant under the scal-ing group (16.36). Then the dependent coordinate is S = ln y and the surface equa-tion can be expressed in terms of three invariant coordinates as F(R, −, T, U ) = 0.Here as before R depends only on x and y, T = T (x, y, y′), andU = U (x, y, y′, y′′)is another invariant coordinate. How does one construct such an invariant coordi-nate? It is simple to see that the derivative dT/d R is invariant under the group. Notonly is it invariant, but it is of first degree in the second order term y(2), for

dT

d R= dT/dx

d R/dx= Tx + Ty y(1) + Ty(1) y(2)

Rx + Ry y(1)(16.43)

For the scaling group the new invariant coordinate is

dT

d R= 2xy′ + x2 y′′

y + xy′ (16.44)

and the most general second order equation invariant under this group is

G

(R, −, T,

dT

d R

)= 0 (16.45)

This is a first order equation in the invariant coordinate T . The result is that wehave used a one-parameter symmetry group to reduce the order of a second orderequation by one. If an additional symmetry can be identified, the equation can bereduced to quadratures a second time (i.e., completely integrated).

The most general second order equation invariant under the group of scalingtransformations Eq. (16.36) that is of first degree in y′′ is

dT

d R= x2 y′′ + 2xy′

y + xy′ = g(xy, x2 y′) = g(R, T ) (16.46)

This is a first order equation in T . Certain forms of the function g may admit anotherLie symmetry. If such a symmetry can be found, the order of the equation can againbe reduced by one.

Page 313: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.4 Additional insights 299

16.4.6 Higher order equations

These ideas can be extended to higher order equations. We begin with an nth orderequation F(x, y, . . . , y(n)) = 0. As usual, we seek an infinitesimal generator

X = ξ∂

∂x+ η(0) ∂

∂y(0)+ η(1) ∂

∂y(1)+ · · · + η(n) ∂

∂y(n)= ξ

∂x+

n∑j=0

η( j) ∂

∂y( j)

(16.47)The functions in the prolongation formulas are determined following the proceduredemonstrated in Eq. (16.38). They are recursively related:

η(0)(x, y) = η(x, y)

η(1)(x, y, y(1)) = D(0)η(0) − y(1) D(0)ξ

η(2)(x, y, y(1), y(2)) = D(1)η(1) − y(2) D(0)ξ

η(3)(x, y, y(1), y(2), y(3)) = D(2)η(2) − y(3) D(0)ξ

......

... (16.48)

The operator X is used as described in Section 16.2 to compute the functions ξ (x, y)and η(x, y). There will be as many linearly independent infinitesimal generatorsas the corank of the set of simultaneous linear equations for the Taylor seriescoefficients of these functions.

If one or more generators can be constructed, a dependent coordinate S can becomputed by solving Eq. (16.15). The remaining invariant coordinates are obtainedfrom the equations

dx

ξ= dy

η(0)= dy(1)

η(1)= · · · = dy(n)

η(n)(16.49)

In fact, only the first two invariant coordinates R(x, y) and T (x, y, y(1)) need becomputed. The remaining invariant coordinates are dT ( j)/d R( j), j = 0 (for T ) andj = 1, 2, . . . , n − 1. Each of these latter is of first degree in y( j+1). As a result,the existence of a Lie symmetry can be used to reduce an nth order equation to an(n − 1)st order equation.

16.4.7 Partial differential equations: Laplace’s equation

Lie’s methods can be extended to partial differential equations. We illustrate asmall part of the theory by treating Laplace’s equation in this subsection and theheat equation in the following.

In n dimensions, Laplace’s equation with a source term is

∇2u(x1, x2, . . . , xn) = δ(x) (16.50)

Page 314: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

300 Lie groups and differential equations

This equation is clearly invariant under rotations, so that the infinitesimal gen-erators of rotations are Lie symmetries. The equation is also invariant underscaling transformations xi → λxi , u → αu. Under the scaling transformationδ(x) → δ(λx) = λ−nδ(x), so that

∇2u = δ(x) −→ α

λ2∇2u = λ−nδ(x) (16.51)

The equation is invariant provided α = λ2−n . The infinitesimal generators of sym-metries for this equation therefore consist of generators of rotations and scaletransformations (Blumen and Cole, 1969):

Xi j = xi∂ j − x j∂i

Z = xi∂i + (2 − n)u ∂∂u

(16.52)

A new independent coordinate R = R(x, u) satisfies X R = 0, where X is any linearcombination of the generators in Eq. (16.52). A solution is R ∼ u|x |n−2. As a result,u ∼ |x |2−n = k|x |2−n . The constant of proportionality can be computed using thedivergence theorem. Both sides of Eq. (16.50) are integrated over the interior of aunit sphere in Rn . The volume integral on the right is +1. The volume integral onthe left is transformed into a surface integral using the divergence theorem:∫

Vk∇2|x |2−ndV =

∫S=∂V

k(2 − n)n· dS|x |n−1

= (2 − n)kV (Sn) = 1 (16.53)

Here V (Sn−1) = 2πn/2/�( n2 ) is the surface area of a unit sphere in Rn . As a result,

the solution of Laplace’s equation in Rn (n �= 2) with unit source term at the originis

u(x) = k

|x |n−2k = −1

(n − 2)V (Sn)(16.54)

16.4.8 Partial differential equations: heat equation

The heat equation on Rn for u(x, t) with source term

ut − ∇2u = δ(x, t) (16.55)

is treated similarly (Olver, 1993). It is invariant under rotations, so the operatorsXi j are Lie symmetries. Under the scaling transformation u → αu, t → βt , andxi → λxi the equation transforms as follows:

ut − ∇2u = δ(x, t) −→ α

βut − α

λ2∇2u = 1

λnβδ(x, t) (16.56)

Invariance under the scaling transformations places the following two constraints onthe three scaling variables (since there is only one equation): αλn = 1 and β/λ2 = 1.

Page 315: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.4 Additional insights 301

From these relations it is possible to construct n + 1 additional Lie symmetries, sothat the entire set is

Xi j = xi∂ j − x j∂i

Yi = 2t∂

∂xi− xi u

∂u(16.57)

Z = 2t∂

∂t+ xi ∂

∂xi− nu

∂u

An invariant coordinate depending on the xi , t and u is R = utn/2e|x |2/4t , fromwhich we obtain as before

u = kt−n/2e−|x |2/4t k =(

1

2√

π

)n

(16.58)

16.4.9 Closing remarks

Galois resolved the problem of determining whether an algebraic equation couldbe solved by radicals, and if so how, between 1829 and 1832. His manuscriptswere lost, rejected, or filed for posterity. His accomplishments were unrecognizedat his death in 1832. They were rescued from oblivion, the black hole of Frenchindifference to its greatest mathematician, by Cauchy in 1843.

Lie’s discoveries began in 1874. He realized that the hodgepodge of seeminglydifferent techniques for solving differential equations that existed at that time (andstill does) were almost all special manifestations of one single principle – theinvariance of solutions of ordinary differential equations under a continuous group.Lie was luckier than Galois when it came to recognition during his lifetime.

There are several problems in the implementation of Lie’s algorithms that haveeither been lightly addressed or passed over in our discussion.

1. Under what conditions is it possible to solve the determining equations for the surface?That is, when is it possible – or impossible – to solve the linear partial differentialequations for ξ (x, y) and η(x, y)?

2. Under what conditions is it possible to solve the determining equations for the canonicalvariables?

3. Under what conditions is it possible to solve the canonical surface equation F(R, −, T ) =0 for T as a function of R? When it is possible, what is the algorithm for accomplishingthis?

4. Under what conditions is it possible to integrate a function of a single variable:∫f (R,-, T (R))d R?

The final question was resolved for algebraic functions by Risch in (1969). Heexploited the tools of Galois theory in a heavy way to provide an algorithm for

Page 316: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

302 Lie groups and differential equations

determining when an algebraic function can be integrated in closed form, and deter-mining the integral when the answer to the first question is positive. We summarizethe dates of these accomplishments here:

1830 Galois solve algebraic equations1874 Lie solve differential equations1969 Risch integrate in closed form? – solve determining equations for ξ, η

? – solve determining equations for R, S, T? – solve F(R, −, T ) = 0 for R.

It is clear that additional algorithms are possible and desirable.

16.5 Conclusion

Lie set out to extend Galois’ treatment of algebraic equations to the field of ordinarydifferential equations. Galois observed that an algebraic equation has a symmetrygroup: a set of operations that maps solutions into solutions. If the symmetry grouphas certain properties, these properties can be used to generate an algorithm forsolving the equation.

It was Lie’s genius to see that the “trivial” additive constant that occurs in the so-lution of a differential equation that has been reduced to quadratures is in fact a groupoperation. The symmetry group in this simplest case is simply the one-parametergroup of translations. Armed with this observation, he developed algorithmic meth-ods to attack ordinary differential equations by searching for their symmetry groups.Lie in fact studied local groups of transformations. The even more beautiful studyof global Lie groups was a later development.

In Section 16.2 we presented Lie’s algorithm for solving first order ordinarydifferential equations in a number of simple steps. These involve the following.

(i) Introduce a set of point transformations in the x–y plane. These are defined by thefunctions ξ (x, y) and η(x, y).

(ii) Construct the first prolongation ζ (x, y, p) = η(1)(x, y, y(1)) from the functions defin-ing the local change of variables.

(iii) Introduce the operator X = ξ∂/∂x + η∂/∂y + ζ∂/∂p. This describes a Taylor seriesexpansion of the surface equation F(x, y, p) = 0 that defines the first order ordinarydifferential equation.

(iv) Solve the determining equation X F = 0 when F = 0 for the functions ξ (x, y) andη(x, y).

(v) Solve the determining equations X R = 0, X S = 1, XT = 0 for the canonical coor-dinates. These are the coordinates in which the surface is a “cylinder” The surfaceequation is independent of the new dependent variable: F → F(R, −, T ) = 0.

Page 317: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.6 Problems 303

(vi) Construct the constraint equation d S/d R = f (R, −, T ) in this new coordinate sys-tem.

(vii) Solve the surface equation for T as a function of R: T = T (R).(viii) Solve the constraint equation for S: S = ∫

f (R, −, T (R)) + c.(ix) Backsubstitute the original coordinates for the new coordinates, x = x(R, S) and

y = y(R, S), to obtain the solution of the original equation.

The steps in this algorithm have been illustrated by working out a simple examplein Section 16.3.

These methods extend in any number of ways. We have indicated a number ofuseful directions by example in Section 16.4.

16.6 Problems

1. Show that invariance under a one-parameter group of transformations can also beexpressed in the form

dn

dεnF[x(ε), y(ε), p(ε)]|ε=0 = 0 n = 0, 1, 2, . . . (16.59)

Show that the first two terms n = 0, 1 are exactly the determining equations(16.12).

2. Construct the invariance group for each of the transformations presented in Table16.1.

3. Mechanical similarity The classical Newtonian equation of motion for a particle ofmass m in the presence of a potential V (x) is

md2xdt2

= −∇V (x)

Assume that under a scaling transformation, the mass scales with a factor α (i.e.,m → αm), x → βx, t → γ t . Assume also that the potential is homogeneous of degreek: V (βx) → βk V (x) (Landau and Lifshitz, 1960). Under this scaling transformationshow that the equation of motion transforms to

α1β1γ −2md2xdt2

= −βk−1∇V (x)

a. Show that the scaled equation is identical to the original provided α1β2−k

γ −2 = 1.b. Setα = 1. Show that trajectories are invariant under the scaling transformation with

γ 2 = β2−k . Show that in the cases k = −1, k = 0, k, = +1, k = +2 the following

Page 318: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

304 Lie groups and differential equations

scaling results hold:

k Potential type Transformation−1 Coulomb γ 2 = β3

0 no force γ 2 = β2

+1 local gravitational potential γ 2 = β1

+2 harmonic oscillator γ 2 = β0

The first line is a statement of Kepler’s third law: for closed planetary orbits, thesquare of the period (γ 2) is proportional to the cube of the semiaxis (β3). If R′

and T ′ are the semiaxis and period of planet P ′ and R and T are the semiaxis andperiod of planet P , and the two planets P and P ′ have geometrically similar orbits,β3 → (R′/R)3 = (T ′/T )2 ← γ 2. The second line is a statement of the integral ofNewton’s second law in the absence of forces in an inertial frame: the distancetraveled (β) is proportional to the time elapsed (γ ). The third line is a statement thatin a local gravitational potential of the form V = mgz, the distance fallen increaseslike the square of the time elapsed. The fourth line is a statement of Hooke’s law:in harmonic motion the period (γ ) is independent of the size of the orbit.

c. Fix γ = 1 and construct a table relating the mass and orbital scale under the fourforces described in the table above.

d. Fix β = 1 and show that the period scales like√

M for all homogeneous potentials.Reconcile this result with the well-known result that the period of a planet isindependent of its mass in lowest order.

e. If the motion is bounded for all times, show

2〈T 〉 = 〈x·∇V (x)〉 = 〈kV (x)〉

where T is the kinetic energy. This is the virial theorem for homogenoeous poten-tials.

f. Show that the kinetic energy scales like αβ2γ −2 = βk (use a). Since the potentialenergy scales the same way, the total energy has this scaling property.

4. Assume that the dynamics of a system are derivable from an action principle. Forexample, the Euler–Lagrange equations are derived from the variation of an action:δ∫L(x, x)dx = 0. Show that if a scaling transformation leaves the Lagrangian invari-

ant up to an overall scaling factor, the trajectories will scale under this transformation.

5. The heat equation in one dimension is

∂2u

∂x2= ∂u

∂t

Show that the following six differential operators vi are infinitesimal generators ofthe invariance group of this equation. Show that eεvi f (x, t) has the action shown for

Page 319: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.6 Problems 305

each of the six generators (Olver, 1993):

vi Infinitesial eεvi f (x, t) =v1 ∂x f (x − ε, t)v2 ∂t f (x, t − ε)v3 u∂u eε f (x, t)v4 x∂x + 2t∂t f (e−εx, e−2ε t)v5 2t∂x − xu∂u e−εx+ε2t f (x − 2εt, t)v6 4xt∂x + 4t2∂t − (x2 + 2t)u∂u λe−ελ2x2

f (λ2x, λ2t)where λ2 = 1/(1 + 4εt)

6. The two-dimensional wave equation is

∂2u

∂x2+ ∂2u

∂y2= ∂2u

∂t2

Show that the following vector fields map solutions into solutions:

displacements Pi ∂x , ∂y, ∂t

rotations Lz x∂y − y∂x

boosts Bi x∂t + t∂x , y∂t + t∂y

dilations Di x∂x + y∂y + t∂t , u∂u

inversions

ix

iy

it

=

x2 − y2 + t2 2xy 2xt −xu

2yx −x2 + y2 + t2 2yt −yu2t x 2t y x2 + y2 + t2 −tu

∂x

∂y

∂t

∂u

Show that D2 = u∂u commutes with all remaining generators. Construct the commu-tation relations of the remaining ten generators, and show they satisfy the commu-tation relations of the conformal group in 2+1 dimensions. Show that this group isSO(2 + 1, 1 + 1) = SO(3, 2).

7. Construct the invariance group for the wave equation in 3 + 1 dimensions. Thisis the Maxwell equation without sources in space-time. There are 16 infinitesimalgenerators. Show that 15 satisfy the commutation relations for the conformal groupSO(3 + 1, 1 + 1) = SO(4, 2) (Bateman, 1910). The extra generator commutes withall the rest, and is u∂u .

8. The heat equation in one dimension is uxx − ut = 0. The infinitesimal generator ofsymmetries for this equation is X = ξ i ∂

∂xi + η ∂∂u + · · · = ξ 1 ∂

∂x + ξ 2 ∂∂t + η ∂

∂u + · · · .Show that (Stewart, 1989)

ξ 1 = a1 + a2x + a3t + a4xt

ξ 2 = 2a2t + a4t2 + a5

η = − 12 a3xu − a4( 1

2 t + 14 x2)u + a6u + h(x, t)

Page 320: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

306 Lie groups and differential equations

Here h(x, t) is any function that satisfies the homogeneous heat equation. Constructthe infinitesimal generators corresponding to the arbitrary real coordinates ai andcompute their commutation relations. What is the structure of this Lie algebra?

9. Show that the scalar operator

S = t2 ∂

∂t+ tx · ∇ − 1

4(x · x + 2nt) u

∂u

is also a Lie symmetry of Eq. (16.55) with source term.

10. Noether’s theorem for physicists Many dynamical problems can be expressed in anaction principle format:

I =∫ t2

t1

L(t, x, x)dt δ I = 0

Specifically, the action I is stationary on a physically allowed trajectory. The firstvariation leads to the Euler–Lagrange equations

d

dt

(∂L

∂ xi

)− ∂L

∂xi= 0

Under a one-parameter family of change of variables (t → t ′ = T (t, x, ε) = t +εξ (t, x), xi → x ′

i = Xi (t, x, ε) = xi + εηi (t, x)) the action integral transforms to

I =∫ t

′2

t′1

L(t ′, x ′, x ′)dt ′ =∫ t2

t1

L(t ′, x ′, x ′)dt ′

dtdt

where dt ′/dt = ∂T/∂t + (∂T/∂xi )dxi/dt . Show that if you differentiate the actionintegral with respect to ε, then set ε = 0 the result is∫ t2

t1

(ξ∂L

∂t+ ηi

∂L

∂xi+ η

(1)i

∂L

∂ xi+ dξ

dtL

)dt = 0

Show that by standard arguments the integrand must itself be zero. Show that alongan allowed trajectory the vanishing of the integrand can be expressed in the form

d

dt

[ξ L + (ηi − ξ xi )Lxi

] = 0

The expression within the square brackets is a constant of the motion. Applythis theorem to a Lagrangian that is invariant under space displacements, timedisplacements, and rotations around a space axis to construct the following conservedquantities:

Symmetry Conserved quantitySpace displacements momentumTime displacements energySpace-time displacements four-momentumRotations angular momentum

Page 321: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

16.6 Problems 307

11. Noether’s theorem, more general We present a more general form of Noether’stheorem than is presented above. This form is very powerful and sufficient for mostphysical applications. It is not the most general form of Noether’s theorem. Supposethe dynamics of a system is derivable from an action integral of the form L[u] =∫L(x, u)dx , x ∈ R p, u ∈ Rq , and suppose the infinitesimal generators that leave the

dynamics invariant have the form

v =p∑

i=1

ξ i (x, u)∂

∂xi+

q∑α=1

φα(x, u)∂

∂uα

Show that the components Pi defined by

Pi = ξ iL +q∑

α=1

φα(x, u)∂L∂uα

i

−q∑

α=1

p∑j=1

ξ j uαj

∂L∂uα

i

satisfy a conservation law of the form

∇ P = div P = ∂ Pi

∂xi= 0

12. Representation theory G is a compact Lie group with invariant measure dρ(g)and volume Vol(G) = ∫

dρ(g), �λµν(g) are the irreducible representations of G con-

structed by reduction of tensor products (Wigner–Stone theorem), and φ(g), ψ(g)are functions defined on the group manifold. The orthogonality and completenessrelations are ∫

dim λ

Vol(G)�λ′∗

µ′ν ′ (g)�λµν(g)dρ(g) = δλ′λδµ′µδν ′ν∑

λ

∑µ

∑ν

dim λ

Vol(G)�λ∗

µν(g′)�λµν(g) = δ(g′, g)

Introduce Dirac notation for these matrix elements:⟨g

∣∣∣∣ λ

µν

⟩=

√dim λ

Vol(G)�λ

µν(g)

⟨λ

µν

∣∣∣∣g⟩

=√

dim λ

Vol(G)�λ∗

µν(g)

(a) a. Write the orthogonality and completeness relations in Dirac notation and show:∫dρ(g)

⟨λ′

µ′ν ′

∣∣∣∣g⟩ ⟨

g

∣∣∣∣ λ

µν

⟩=

⟨λ′

µ′ν ′

∣∣∣∣ λ

µν

⟩∑

λ

∑µ

∑ν

⟨g′

∣∣∣∣ λ

µν

⟩ ⟨λ

µν

∣∣∣∣g⟩

= ⟨g′|g⟩

b. Show that the orthogonality and completeness relations can be expressed in theform of “resolutions of the identity” in appropriate spaces:

|g〉〈g| = ∫ |g〉dρ(g)〈g| = I in group space∣∣∣∣ λ

µν

⟩⟨λ

µν

∣∣∣∣ =∑

λ

∑µ

∑ν

∣∣∣∣ λ

µν

⟩⟨λ

µν

∣∣∣∣ = I in representation space

Page 322: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

308 Lie groups and differential equations

c. Carry out a Fourier decomposition on the functions ψ(g) = 〈g|ψ〉 and 〈 λµν |ψ〉 =∫

dρ(g)〈 λµν |g〉〈g|ψ〉 (and similarly for φ(g) = 〈g|φ〉) using the Dirac represen-

tation. Write down the Parseval equality for the inner product∫

φ∗(g)ψ(g)dρ(g)expressed in terms of the discrete and continuous basis vectors in this Hilbertspace.

Page 323: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Bibliography

F. T. Arecchi, E. Courtens, R. Gilmore, and H. Thomas (1972), Atomic coherentstates in quantum optics, Phys. Rev. A 6, 2211–2237.

G. A. Baker, Jr. (1958), Degeneracy of the n-dimensional isotropic harmonic oscillator,Phys. Rev. 103, 1119–1120.

M. Bander and C. Itzykson (1966a), Group theory and the hydrogen atom (I), Rev.Mod. Phys. 38, 330–345.

M. Bander and C. Itzykson (1966b), Group theory and the hydrogen atom (II), Rev.Mod. Phys. 38, 346–358.

V. Bargman and E. P. Wigner (1948), Group theoretical discussion of relativistic waveequations, Proc. Nati. Acad. Sci. (US) 34, 211–223.

A. O. Barut and G. L. Fronsdal (1971), SO(4, 2)-formulation of symmetry-breakingin relativistic Kepler problems with or without magnetic charges, J. Math. Phys. 12,841–846.

A. O. Barut and R. Raczka (1977), Theory of Group Representations and Applications,Warsaw: PWN Polish Scientific Publications.

A. O. Barut and W. Rasmussen (1971), Non-relativistic and relativistic Coulombamplitude as the matrix element of a rotation in SO(4, 2), Phys. Rev. D 3,956–959.

H. Bateman (1910), The transformation of the electrodynamical equations, Proc. LondonMath. Soc. 8, 223–264.

O. Bely (1966), Quantum defect theory III. Electron scattering by He+, Proc. Phys. Soc.88, 833–842.

G. Berendt, E. Weimar, and R. Gilmore (1975), Harmonic oscillator Green’s functionfrom a BCH formula, J. Math. Phys. 16, 1231–1233.

H. A. Bethe and E. E. Salpeter (1957), Quantum Mechanics of One- and Two-ElectronAtoms, Berlin: Springer-Verlag.

L. C. Biedenharn (1962), Invariant operators of the Casimir type, Phys. Lett. 3, 69–70.G. W. Blumen and G. D. Cole (1969), Symmetries and Differential Equations, New

York: Springer-Verlag.C. E. Burkhardt and J. J. Leventhal (2004), Lenz vector operators on spherical

hydrogen atom eigenfunctions, Am. J. Phys. 72, 1013–1016.H. D. Doebner and O. Melsheimer (1967), On a class of generalized group

contractions, Nuovo Cimento A 49, 306–311.L. Dresner (1999), Applications of Lie’s Theory of Ordinary and Partial Differential

Equations, Bristol: IOP Publishing.

309

Page 324: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

310 Bibliography

F. Estabrook and H. Wahlquist (1975), Prolongation structures of nonlinear evolutionequations. J. Math. Phys. 16, 1–7.

V. A. Fock (1935), Zur theorie des Wasserstoffatoms, Z. Phys. 98, 145–154.L. L. Foldy and S. A. Wouthuysen (1950), On the Dirac theory of spin 1/2 particles

and its nonrelativistic limit, Phys. Rev. 78, 29–36.C. Fronsdal (1965), Infinite multiplets and the hydrogen atom, Phys. Rev. 156,

1665–1677.T. Fulton, F. Rohrlich, and L. Witten (1962), Conformal invariance in physics, Rev.

Mod. Phys. 34, 442–457.G. Gabrielse, D. Hanneke, T. Kinoshita, M. Nio, and B. Odom (2006), New

determination of the fine structure constant from the electron g value and QED,Phys. Rev. Lett. 97, 030802.

I. M. Gel’fand and M. L. Tsetlein (1950), Matrix elements for the unitary groups,Dokl. Akad. Nauk SSSR 71, 825–828.

I. M. Gel’fand and M. L. Tsetlein (1950), Matrix elements for the orthogonal groups,Dokl. Akad. Nauk SSSR 71, 1017–1020.

R. Gilmore (1970), Construction of weight spaces for irreducible representations ofAn; Dn, Bn, Cn , J. Math. Phys. 11, 513–523.

R. Gilmore (1970), Spin representations of the orthogonal groups, J. Math. Phys. 11,1853–1854.

R. Gilmore (1970), Spectrum of Casimir invariants for the simple classical Lie groups, J.Math. Phys. 11, 1855–1856.

R. Gilmore (1970), Diagrammatic technique for constructing matrix elements, J. Math.Phys. 11, 3420–3427.

R. Gilmore (1974a), Lie Groups, Lie Algebras, and Some of Their Applications, NewYork: Wiley, 1974; republished New York: Dover.

R. Gilmore (1974b), Baker–Campbell–Hausdorff formulas, J. Math. Phys. 15,2090–2092.

R. Gilmore (1977), Structural stability of the phase transition in Dicke-like models, J.Math. Phys. 18, 17–22.

R. Gilmore (1985), Uncertainty relations of statistical mechanics, Phys. Rev. A 31,3237–3239.

R. Gilmore (2004), Elementary Quantum Mechanics in One Dimension, Baltimore, MD:Johns Hopkins University Press.

R. Gilmore (2006), Lie groups: general theory. In: J.-P. Francoise, G. Naber, and S. T.Tsu, eds., Encyclopedia of Mathematical Physics, Amsterdam: Elsevier, pp.286–304.

R. Gilmore and C. M. Bowden (1976a), Coupled order-parameter treatment of theDicke hamiltonian, Phys. Rev. A 13, 1898–1907.

R. Gilmore and C. M. Bowden (1976b), Bifurcation properties of the Dickehamiltonian, J. Math. Phys. 17, 1617–1625.

R. Gilmore and J. M. Yuan (1987), Group theoretical approach to semiclassicaldynamics: single mode case, J. Chem. Phys. 86, 130–139.

R. Gilmore and J. M. Yuan (1989), Group theoretical approach to semiclassicaldynamics: multimode case, J. Chem. Phys. 91, 917–923.

R. Gilmore, H. G. Solari, and S. K. Kim (1993), Algebraic description of the quantumdefect, Found. Phys. 23, 873–879.

R. J. Glauber (1963), Coherent and incoherent states of the radiation field, Phys. Rev.131, 2766–2788.

H. Goldstein (1950), Classical Mechanics, Reading, MA: Addison-Wesley.

Page 325: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Bibliography 311

S. Helgason (1962), Differential Geometry and Symmetric Spaces, New York: AcademicPress.

S. Helgason (1978), Differential Geometry, Lie Groups, and Symmetric Spaces, NewYork: Academic Press.

L. K. Hua (1963), Harmonic Analysis of Functions of Several Complex Variables in theClassical Domains, Translations of Mathematical Monographs, Vol. 6, Providence,RI: American Mathematical Society.

E. Inonu and E. P. Wigner (1953), On the contraction of groups and theirrepresentations, Proc. Natl. Acad. Sci. (US) 39, 391–402.

S. Kais and S. K. Kim (1986), Unstable bound states of the Dirac equation by analgebraic approach, Phys. Lett. A 114, 47–50.

P. Kustaanheimo and E. Stiefel (1965), Perturbation theory of Kepler motion based onspinor regularization, J. Reine Angew. Math. 218, 204.

L. D. Landau and E. M. Lifshitz (1960), Mechanics, Reading, MA: Addison-Wesley.S. Lang (1984), Algebra, Reading, MA: Addison-Wesley.I. A. Malkin and V. I. Man’ko (1965), Symmetry of the hydrogen atom, Sov. Phys.

JETP Lett. 2, 146–148.H. V. McIntosh, Symmetry and the hydrogen atom, http://delta.cs.cinvestav.

mx/˜mcintosh/comun/symm/symm.html.W. Miller, Jr. (1968), On Lie Algebras and Some Special Functions of Mathematical

Physics, Memoirs of the American Mathematical Society, vil 50, Providence, RI:American Mathematical Society.

V. I. Ogievetskii and I. V. Polubarinov (1960), Wave equations with zero and nonzerorest masses, Sov. Phys, JETP 10, 335–338.

P. Olver (1993), Applications of Lie Groups to Differential Equations, 2nd edn., NewYork: Springer.

W. Pauli (1926), On the hydrogen spectrum from the standpoint of the new quantummechanics, Z. Phys. 36, 336–363. English translation in: B. L. van der Waerden,ed., Sources of Quantum Mechanics, New York: Dover, 1967, pp. 387–415.

J. Ramos and R. Gilmore (2006), Derivation of the source-free Maxwell andgravitational radiation equations by group theoretical means, Int. J. Mod. Phys.15(4), 505–519.

R. H. Risch (1969), The problem of integration in finite terms, Trans. Am. Math. Soc. 139,167–189.

D. A. Sadovskii and B. I. Zhilinskii (1998), Tuning the hydrogen atom in crossed fieldsbetween the Zeeman and Stark limits, Phys. Rev. A 57, 2867–2884.

E. J. Saletan (1961), Contraction of Lie groups, J. Math. Phys. 2, 1–21.L. I. Schiff (1968), Quantum Mechanics, 3rd edn., New York: McGraw Hill.J. Schwinger (1965), On angular momentum. In L. C. Biedenharn and H. van Dam,

eds., Quantum Theory of Angular Momentum, New York: Academic Press, pp.229–279.

M. J. Seaton (1966a), Quantum defect theory I. General formulation, Proc. Phys. Soc.88, 801–814.

M. J. Seaton (1966b), Quantum defect theory II. Illustration on one-channel andtwo-channel problems, Proc. Phys. Soc. 88, 815–832.

H. Stephani and M. MacCallum (1989), Differential Equations, Their Solutions UsingSymmetery, Cambridge: Cambridge University Press.

I. Stewart (1989), Galois Theory, London: Chapman and Hall.E. L. Stiefel and G. Scheifele (1971), Linear and Regular Celestial Mechanics,

Berlin: Springer-Verlag.

Page 326: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

312 Bibliography

J. D. Talman (1968), Special Functions: A Group Theoretic Approach (Based onLectures by Eugene P. Wigner), New York: Benjamin.

N. Ja Vilenkin (1968), Special Functions and the Theory of Group Representations,Translations of Mathematical Monographs, vol. 22, Providence, RI: AmericanMathematical Society.

H. Wahlquist and F. Estabrook (1976), Prolongation structures of nonlinear evolutionequations. II, J. Math. Phys. 17, 1293–1297.

S. Weinberg (1964), Feynman rules for any spin. II. Massless particles, Phys. Rev. B 134,882–896.

G. H. Weiss and A. A. Maradudin (1962), The Baker–Campbell formula and a problemin crystal physics, J. Math. Phys. 3, 771–777.

H. Weyl (1946), The Classical Groups, Princeton, NJ: Princeton University Press.E. P. Wigner (1939), On unitary representations of the inhomogeneous Lorentz group

Ann. Math., 40, 149–204.E. P. Wigner (1954), Conservation laws in classical and quantum physics, Progr. Theor.

Phys. 11, 437–440.E. P. Wigner (1957), Relativistic invariance and quantum phenomena Rev. Mod. Phys. 29,

255–268.E. P. Wigner (1959), Group Theory and its Application to the Quantum Mechanics of

Atomic Spectra, New York: Academic. Press.R. M. Wilcox (1967), Exponential operators and parameter differentiation in quantum

physics, J. Math. Phys. 8, 962–982.D. P. Zhelobenko (1962), The classical groups, spectral analysis of their finite

dimensional representations, Russ. Math. Surveys 17, 1–92.

Page 327: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Index

A(p q), 39, 48A1, 161A2, 151, 160A3, 46, 161, 162An , 46, 49, 161, 166B1, 161B2, 151, 160, 161, 164B3, 162Bn , 161, 166, 168C1, 161C2, 151, 160, 161, 164C3, 162Cn , 161, 166, 168D2, 151, 160, 162D3, 161, 162Dn , 161, 166, 168E(2), 91, 207E(3), 42E6, 162, 168E7, 162, 168E8, 162, 168F(n), 45, 49F4, 162, 168GL(1; Q), 40, 47GL(2; C), 47GL(2; R), 43GL(2; Z), 45, 49GL(3; Z), 45GL(n; C), 47GL(n; F), 34, 36, 74GL(n; Q), 47GL(n; R), 47, 104GL(n; Z), 44, 45, 49, 81G2, 151, 160, 162, 165H T (p, q), 37, 48H2

1 , 103, 189H2

2 , 102, 189, 191H4, 211I SO(2), 91, 206, 207, 268I SO(2), little group, 267I SO(3), 208, 209Nil(n), 38, 48

O(3 1), 261O(3), 40, 78O(3; Z), 46O(n), 40, 43, 145O(n; G), 41O(n; Z), 45, 49O(p, q), 43OU (2n), 43Pn , 45SL(2; C), 43SL(2; R), 26, 28, 29, 30, 41, 43, 56, 58, 62, 100, 102,

189SL(n; R), 30SL(n; C), 43, 47SL(n; Q), 43SL(n; R), 43, 47, 164SL(n; Z), 45SO(2, 1), 105SO(2, 1), little group, 267SO(2, 1)/SO(2), 106SO(2), 48, 164SO(2n), 164SO(2n + 1), 164SO(3, 1), 263SO(3, 1), little group, 267, 269SO(3, 2), 210SO(3), 49, 90, 106SO(3), little group, 267SO(3)/SO(2), 107SO(4, 1), 210SO(5), 164SO(n), 43, 145SO(p, q), 43, 164SU (1; Q), 40, 48SU (1, 1), 38, 43, 48, 105SU (1, 1)/U (1), 106SU (2), 48, 106SU (2)/U (1), 107SU (n), 43, 90, 164SU (p, q), 43, 164S2, 189, 191S3, 5, 46

313

Page 328: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

314 Index

Sn , 45, 49Sol(n), 38, 48Sp(1), 40Sp(2; R), 41Sp(2n; R), 41Sp(n), 40, 164Sp(n; C), 41Sp(n; G), 41Sp(n; R, 41Sp(p, q), 164U (1, 1), 43U (2), 40, 78U (2), contraction of, 211U (2; Q), 164U (n), 40, 43, 90U (n), representations of, 90U (n; G), 41U (p, q), 43U Sp(2n), 44U T (1, 1), 83U T (p, q , r ), 37, 48U T (p, q), 48V4, 15Z, integers, 44

abelian group, 39active interpretation, of group action, 93aether, 282affine transformations, 37algebraic constraints, 29algebraic equations, 3algebraic manifold, 29, 104algebras, contraction of, 211alternating group, 5, 46amplitudes, external, 54

internal, 54analytic, continuation, 40, 86, 142, 143, 176

reparameterization, 113angular momentum, matrix elements, 217

operators, 258states, 213

annihilation operators, 84, 88bosons, 88fermions, 89two photon, 77

anticommutation relations, 89anticommutator, 89anticommute, 47antihermitian matrices, 78antipodal points, 106Araki–Satake root diagram, 192associativity, 4, 24, 25Automorphism, involutive, 177auxiliary equation, 11

for cubic, 14, 20for quartic, 15, 18

Baker–Campbell–Hausdorff formulas, 108basis, 61basis functions, 9

basis states, contraction of, 214BCH formulas, 108

contraction of, 215Bessel functions, 217bilinear constraints, 39block diagonal, 64block matrix decomposition, 178Bohr radius, 253Boltzmann constant, 116boost, 31Bose–Einstein counting problem, 95, 96Bose–Einstein statistic, 256boson operator algebras, 88boson operators, 88bounded, 27building up principle, 159building up process, 161

c-number, 127canonical commutation relations, 151, 159, 172canonical coordinates, 286, 302Cartan, covering theorem, 107

decomposition, 84Cartan–Killing form, 65Cartan–Killing inner product, 65, 82, 102, 139, 147Casimir covariants, 157Casimir invariants, 143, 148Casimir operators, 153, 159, 192, 201, 217

contraction of, 207, 212higher order, 146

Cauchy, 301Cayley–Hamilton theorem, 58, 157character table, 9

of S2, 10of S3, 12of S4, 16

character, of real form, 175characteristics, method of, 289Christoffel symbol, 200classical functions, 2classical problems, double a cube, 2

square a circle, 2trisect an angle, 2

Clebsch–Gordan series, 264closed, 27closure, 4, 24, 25Columbus, 25Commutation, 59commutation relations, 89

C2, 153commutative, 3, 133commutative group, 39commutator, 59

in algebra, 59in group, 59

commuting operators, 192compact, 26

and metric, 65compass, 22complementary series, of representations, 187

Page 329: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Index 315

completeness relations, 307special functions, 216

complex extension, 164complex numbers, 34, 35conformal condition, 235conformal group, 201, 305conformal map, 203conjugate subgroups, 6connectivity matrix, 54conservation, of momentum, 51constraint equation, 285, 294, 303constraints, 35constructable numbers, 22contraction, 205Contraction, of U (2), 211

of algebras, 211of basis states, 214of BCH formulas, 215of Casimir operators, 212of Dynkin diagram, 167of groups, 205of matrix elements, 214of parameter space, 213of representations, 213of special functions, 215

coordinate representation, 273coordinate, dependent, 289

independent, 289coset, 8, 103, 104Coset representative, 104, 267cover, open, 25covering group, 105, 107

SO(2, 1)/SO(2), 108SU (1, 1)/U (1), 108universal, 107

covering problem, 100creation operators, 84, 88

bosons, 88fermions, 89two photon, 77

crossing symmetry, 52cubic equation, 1, 11, 22

Galois group, 12cylinder, 294, 302

defining matrix representation, 131degeneracy, and symmetry, 230dependent coordinate, 289DeSitter symmetry, 235determining equation, 286, 287, 302Dicke model, 126diffeomorphism, 109differential equations, 284

and Lie groups, 284differential operators, first order, 90dimension, 61

of manifold, 26of root space, 153

direct product group, 8discrete invariant subgroup, 107

discrete series, of representations, 187discriminant, 11dispersion relation, 223double the cube, 22dynamical symmetry, 230Dynkin diagram, 159, 165, 166

contraction of, 167

eigenoperator, commutation relations, 140decomposition, 139

electromagnetic field, 259embedded groups, 43entropy representation, 282equation, constraint, 285, 294

determining, 286, 287surface, 285, 294

equilibrium, thermodynamic, 116equivalence principle, 93, 223, 250Euclidean, group, 42

motions, 207submanifold, 192transformations, 79

EXP, 57EXPonential, 55, 58

operation, 59EXPonentiation, 99

factor group, 8faithful, 7

representation, 5, 122fermion operator algebras, 89fermion operators, 89Fibonacci number, 45, 49Fibonacci-type series, 49field, 259

equations, 262theory, 3

fine structure constant, 225first order equations, 286first prolongation, 287fluctuation–dissipation theorem, 283Fock space, 213four-group, 15Frobenius method, 225fully reducible, 63, 134fundamental roots, 166

Galilei group, 42, 80, 86Galiliean transformation, 48Galois, 1, 284, 301Galois group, 4, 21

for quartic, 15Galois theory, 3Galois’ theorem, 9general linear, algebras, 74

groups, 36generating function, 217geometric symmetry, 227globally symmetric spaces, 190gravitons, 283

Page 330: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

316 Index

group theory, 3group, “infinite”, 1

abelian, 6axioms, 3, 24commutative, 6composition function, 28composition map, 28elements, 24generators, 6inversion map, 28multiplication, 3, 5, 24operations, 3, 24

group-subgroup chain, 12, 15group-subgroup diagram, 7Groups, intersections of, 80

Hamilton’s equations, 39, 41, 180harmonic oscillator wavefunctions, 215harmonic oscillator, isotropic, 96heat equation, 300Heisenberg, algebra, 89Heisenberg, commutation relations, 77

group, 38identity, 110

helicity, of photon, 259state, 259

Hermite polynomials, 97, 215higher order equations, 299Hilbert–Schmidt inner product, 64homogeneous Lorentz group, 261homogeneous Lorentz transformation,

263homogeneous polynomials, 140, 256homomorphic image, 7homomorphism, 7Hooke’s law, 304hyperbolic plane, 202hyperboloid, 27, 29

single-sheeted, 102, 103, 189two-sheeted, 102, 189

identity, 4, 24, 25Inonu–Wigner contraction, 205, 206indefinite metric, 40, 197independent coordinate, 289independent functions, 192independent roots, 192index, of real form, 175inertial frame, 282infinitesimal generator, 286, 295inhomogeneous Lorentz group, 210, 261inner product, 61, 64integrability condition, 61interpretations of group action, active, 93

passive, 93intersections, of groups, 43invariance algebra, 96invariant, measure, 66, 193

metric, 66, 193operators, 143, 148, 159subalgebra, 134

subgroup, 6, 8subspace, 36

inverse, 4, 24, 25image, 7

inversion mapping, 30involutive automorphism, 177irreducible, 63, 134

representations, 10isomorphism, 7

problem, 105isotropic, 191

Jacobi identity, 59, 60, 149Jacobi polynomials, 215, 217

Kepler’s third law, 304Klein four-group, 15Klein group, 15Klein–Gordon equation, 224Kustaanheimo–Stiefel transformation, 240

Laplace equation, 299Laplace–Beltrami operators, 192, 200Laplace–Runge–Lenz vector, 230Laplacian operators, 208laziness, principle of maximum, 211Legendre polynomials, 215, 217Levi–Civita skew tensor, 156Levi–Civita symbol, 143Lie, 1, 284, 307Lie algebra, a(p, q)

a(p, q), 77, 129gl(n; F), 74, 83ht(p, q), 75nil(n), 77, 130ou(2n), 179o(n; G), 79o(p, q), 78sl(2; R), 100, 102, 154, 173sl(n), 80sl(n; C), 80, 85, 86sl(n; Q), 80, 86sl(n; R), 80, 85, 178, 180sol(n), 77, 130so(2, 1), 78so(2n), 180so(3, 1), 79so(3, 2), 86so(3), 86, 90, 154so(4, 1), 86so(4), 132so(5), 86, 146so(n), 132, 145, 178so(p, q), 84, 178so∗(2n), 180sp(2n; R), 178, 179, 180sp(G; C), 79sp(G; R), 79sp(n), 132, 178sp(n; G), 79sp(p, q), 78, 178

Page 331: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Index 317

su(1, 1), 140, 141, 143, 173su(2), 111, 140, 141, 143, 173su(2n), 180su(n), 80, 132, 178su(p, q), 85, 178su∗(2n), 180usp(2n), 179ut(1, 1), 83ut(p, q , r ), 76ut(p, q), 75, 131u(n), 80u(n; F), 178u(n; G), 79u(p, q), 78u(p, q; F), 178sl(2; C), 154

Lie algebra, sl(2; R), 62Lie algebras, 55, 56Lie algebras, properties of, 59Lie groups, 2, 21, 28

and differential equations, 284global properties, 57local properties, 57

Lie symmetries, 296, 300light cone, 101limit points, 27linear constraints, 36little group, 267local groups, 302local Lie groups, 302loops, none in Dynkin diagrams, 167Lorentz group, 31, 40, 79, 260

homogeneous, 261in a plane, 78inhomogeneous, 261

Lorentz transformations, 31, 42, 210, 259homogeneous, 263

lowering operators, 228

Manifestly covariant, 259representations, 264

manifold, 25, 55matrix elements, 2Matrix elements

angular momentum, 217contraction of, 214

matrix groups, 29Matrix groups, 34matrix inversion, 29matrix multiplication, 5, 29matrix representations, 2, 5, 7Maxwell’s Equations, 259, 260, 305measure, 66, 193

invariant, 66, 193mechanical similarity, 303method of characteristics, 289metric, 66, 193metric preserving groups, antisymmetric, 79metric-preserving groups, antisymmetric metric, 41

compact, 39, 78general metric, 41

noncompact, 40, 78singular, 79

metric preserving groups, antisymmetric, 79metric tensor, 193, 197metric, invariant, 66, 193Michelson–Morely experiment, 282microwave background radiation, 282minimal electromagnetic coupling, 223Minkowski, transformation, 176

trick, 177modular groups, 44, 81momentum conservation, 51momentum representation, 273multilinear constraints, 42, 80multiplication table, 9multiply connected, 197Mutually commuting operators, 153, 159

network, 54network topology, 54neutrinos, 283nilpotent, 65, 130, 133

algebras, 77, 141groups, 38

Noether’s theorem, 307noncompact, 26nonsemisimple, 63, 134

group, 2normally ordered, 112

one-parameter group, 287operator algebras, 88operators, momentum, 38

position, 38order, normal, 112

of a group, 8orthogonal groups, 40, 78orthogonality relations, 307

special functions, 216

parameter space, contraction of, 213parameterization problem, 108Parseval inequality, 308partial differential equations, 299partition function, 116Pascal triangle, 257Passive interpretation, of group action, 93Pauli spin matrices, 31, 78Periodic table, Mendelyeev, 50permutation, group

group, 4matrix, 4, 5representation, 45transformation, 141

phase shift, 248photon, 259, 275

number states, 213operators, 38, 77, 84, 110, 130, 136, 140, 146,

211Poincare plane, 202Poincare group, 42, 80, 86, 210

Page 332: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

318 Index

point transformations, 302polarization, 259

and inner products, 69polynomial equation, 4principal series, 187

of representations, 187principle of equivalence, 223, 250principle of relativity, 223, 250problems, of antiquity, 22projective transformation, 234prolongations, first

first, 287, 302higher order, 299second, 296

pseudo–Riemannian symmetric space, 190, 197

quadratic constraints, 39quadratic equation, 1, 10

Galois group, 10quadratic resolvent, 20quadrature, 2, 284quadrupole tensor operators, 258quantum number, principle, 50quartic equation, 1, 15quaternions, 34, 35, 47quintic equation, 1, 17

Galois group, 2quotient, 8, 103, 104quotient, space, 8

radial quantum number, 225radicals, 1, 284, 301raising operators, 228rank, 143, 148, 153

for symmetric space, 192real form, 172

character of, 175classical algebras, 181classical equivalences, 181compact, 174exceptional algebras, 182index of, 175least compact, 174

real numbers, 34, 35recursion relation, root chain, 149reducible, 63, 134reduction of order, 298regular elements, 146regular representation, 62, 129, 139relativity, principle of, 250, 223reparameterization, local, 113representation, 4

contraction of, 213coordinate, 273faithful, 122irreducible, 187manifestly covariant, 264momentum, 273reducible, 187unitary, 187unitary irreducible, 262, 264, 266

representations, of SU (2), 187of SU (1, 1), 187

resolvent equation, 13Riccati equation, 295Riemannian globally symmetric space, 192Riemannian space, 191Riemannian symmetric space, 189, 190Risch, 302Rodriguez formula, 97root chain, 150

recursion relation, 149root reflections, 150root space, 148, 159

decomposition, 160diagram, 147, 151, 153, 159, 160, 172

roots, 148, 153of secular equation, 159properties of, 159

ruler, 22Rydberg electron, 248

scaling transformation, 291, 295, 300scattering matrix, 52scattering phase shift, 248Schrodinger equation, 52, 223, 224Schrodinger prescription, 224Schur’s Lemma, 107Schwarz inequality, 160, 167Schwinger representation, 94, 232, 238second order equations, 296second prolongation, 296secular equation, 58, 139, 140, 148, 159, 192

independent coefficients, 148, 153independent functions, 159roots of, 159

self-conjugate, 6semidirect sum, 206semisimple, 63, 134

group, 2Lie algebras, 147

sheets, 49similarity transformations, 62simple, 63, 134

group, 2simply connected, 107single-sheeted hyperboloid, 102, 103solution surface, cylinder, 294solvable, 133

algebras, 77group, 2, 38

space-time, 176coordinates, 31

special functions, 215completeness relations, 216contraction of, 215orthogonality relations, 216

special linear groups, 43, 80special relativity, 282spectrum generating, algebra, 96, 258

group, 245speed of light, c, 282

Page 333: the-eye.eu...LIE GROUPS, PHYSICS, AND GEOMETRY An Introduction for Physicists, Engineers and Chemists Describing many of the most important aspects of Lie …

Index 319

spherical harmonics, 215, 217, 225spin groups, and SO(n), 183spin states, 40, 259spinor, of SO(3), 164

of SO(5), 164splitting map, 177splitting transformation, 177square the circle, 22squeezed states, 38stability subgroup, of a vector, 267structure constants, 61, 151, 153, 160Structure factor, 122structure theory, for lie algebras, 129

for simple lie algebras, 139subalgebra, 65

invariant, 134subfield restriction, 178subgroup, 5

invariant, 6normal, 6

surface equation, 285, 294, 302symmetric, group

group, 4matrix, 27polynomials, 9spaces, 189

symmetry, and degeneracy, 230crossing, 52

symplectic group, 40, 78symplectic transformations, 180

tensor, 259thermal expectation values, 116Thomas precession, 31time-ordered product, 114time-reversal operator, 267topological space, 25topology, 25

transfer matrix, 51, 52transformation, scaling, 295translation group, 39trisect an angle, 23Tschirnhaus transformation, 11, 20

for cubic, 13for quartic, 18

Tschirnhaus transformation, for quartic, 15two-photon algebra, 77, 146two-sheeted hyperboloid, 102

uncertainty relations, of statistical mechanics, 282unimodular groups, 43unit disk, 203unit sphere, 25unitary groups, 40, 78, 90unitary irreducible representations, 262, 264, 266unitary representation, 38Universal covering group, 107upper half-plane, 202upper triangular, 130

algebras, 75and photon operators, 109groups, 36

Van der Monde matrix, 158variables, dependent, 285

independent, 285velocity addition law, 31vierergruppe, 15, 199viscous medium, 283

wave equation, 224Weyl group, 156

of reflections, 155Weyl symmetry, 150Wick rotation, 114Wigner–Stone theorem, 216, 307


Recommended