Polarization Ray Tracing
Item Type text; Electronic Dissertation
Authors Yun, Garam
Publisher The University of Arizona.
Rights Copyright © is held by the author. Digital access to this materialis made possible by the University Libraries, University of Arizona.Further transmission, reproduction or presentation (such aspublic display or performance) of protected items is prohibitedexcept with permission of the author.
Download date 14/06/2018 17:32:14
Link to Item http://hdl.handle.net/10150/202979
POLARIZATION RAY TRACING
by
Garam Yun
Copyright © Garam Yun 2011
A Dissertation Submitted to the Faculty of the
COLLEGE OF OPTICAL SCIENCES
In Partial Fulfillment of the Requirements
For the Degree of
DOCTOR OF PHILOSOPHY
In the Graduate College
THE UNIVERSITY OF ARIZONA
2011
2
THE UNIVERSITY OF ARIZONA
GRADUATE COLLEGE
As members of the Dissertation Committee, we certify that we have read the dissertation
prepared by Garam Yun
entitled Polarization Ray Tracing
and recommend that it be accepted as fulfilling the dissertation requirement for the
Degree of
Doctor of Philosophy
_______________________________________________________Date: Nov. 21, 2011
Russell A. Chipman
_______________________________________________________Date: Nov. 21, 2011
Arthur Gmitro
_______________________________________________________Date: Nov. 21, 2011
J. Scott Tyo
Final approval and acceptance of this dissertation is contingent upon the candidate’s
submission of the final copies of the dissertation to the Graduate College.
I hereby certify that I have read this dissertation prepared under my direction and
recommend that it be accepted as fulfilling the dissertation requirement.
_______________________________________________________Date: Nov. 21, 2011
Dissertation Director: Russell A. Chipman
3
STATEMENT BY AUTHOR
This dissertation has been submitted in partial fulfillment of requirements for an
advanced degree at the University of Arizona and is deposited in the University Library
to be made available to borrowers under rules of the Library.
Brief quotations from this dissertation are allowable without special permission, provided
that accurate acknowledgment of source is made. Requests for permission for extended
quotation from or reproduction of this manuscript in whole or in part may be granted by
the copyright holder.
SIGNED: Garam Yun
4
ACKNOWLEDGEMENTS
I would like to thank my parents who have encouraged me to come to the States and
pursue my intellectual thirst in science. Without their support, I would not have been here.
I would like to thank my sister for being my best friend and a supporter. Our phone
conversations were always my source of energy and laughter.
I would like to thank my husband, Stefano Young, for being the first person I met
in Tucson, for planning all the great trips we did, for being open to learn my culture and
family, and for encouraging me to find my ultimate goals in life.
I would like to thank my advisor, Professor Russell Chipman who has challenged
me with most interesting polarization problems. Without him, I would not have been
able to complete this dissertation. Professor Chipman has continuously given me
opportunities and pushed me to be a better researcher, a better public speaker, and a
better leader.
I would like to thank Professor Jaisoon Kim, for planting the seed, the idea of
coming to the college of optical sciences for graduate school and learning Optics, in my
senior year in college.
I would like to thank Dr. Art Gmitro and Dr. J. Scott Tyo for serving on my
committee. I would like to thank Dr. Greg Smith for sharing great ideas in programming
and skills in problem solving during many of our long discussions. I would like to thank
Dr. Stephan McClain and Dr. Karlton Crabtree who provided helpful discussions and
ideas. I would like to thank Hannah Noble, Stacey Sueoka, Anna-Britt Mahler, Paula
Smith, and Tiffany Lam for their friendship; you made coming to work more rewarding.
5
DEDICATION
To my parents, 윤철수 and 이상옥, and my husband, Stefano with all my heart and love.
6
TABLE OF CONTENTS
LIST OF FIGURES ............................................................................................................................. 10
LIST OF TABLES ............................................................................................................................... 16
ABSTRACT ……… ........................................................................................................................... 17
CHAPTER 1 HISTORY OF POLARIZATION RAY TRACING ................................................................ 20
1.1 Polarization Ray Tracing in two-dimensions .................................................................. 20
1.2 Polarization Ray Tracing in three-dimensions ............................................................... 23
CHAPTER 2 DEFINITION OF POLARIZATION RAY TRACING MATRIX ............................................. 25
2.1 Definition of Polarization Ray Tracing Matrix, P ............................................................ 25
2.1.1 Polarization States as Three-element Electric Field Vectors ................................. 29
2.2 Formalism of Polarization Ray Tracing Matrix using Orthogonal Transformation ........ 32
2.2.1 Retarder Polarization Ray Tracing Matrix Examples .............................................. 35
2.3 Conclusion ...................................................................................................................... 37
CHAPTER 3 CALCULATION OF DIATTENUATION ........................................................................... 38
3.1 Diattenuation Calculation using Singular Value Decomposition ................................... 38
3.2 Example – A Hollow Corner Cube .................................................................................. 41
3.3 Conclusion ...................................................................................................................... 47
CHAPTER 4 CALCULATION OF RETARDANCE ................................................................................ 48
4.1 Introduction ................................................................................................................... 48
4.2 Purpose of the Proper Retardance Calculation ............................................................. 50
4.3 Geometrical Transformations ........................................................................................ 51
4.3.1 Local Coordinate Rotation: Polarimeter Viewpoint ............................................... 51
4.3.2 Parallel Transport of Vectors ................................................................................. 55
4.3.3 Parallel Transport Ray Tracing Matrix, Q ............................................................... 57
4.4 Proper Retardance Calculations..................................................................................... 62
4.4.1 Definition of the Proper Retardance ...................................................................... 62
4.4.2 Separating Local Coordinate Transformation from P ............................................ 62
4.4.3 The Proper Retardance Algorithm for P, Method 1 ............................................... 63
4.4.4 The Proper Retardance Algorithm for P, Method 2 ............................................... 65
7
TABLE OF CONTENTS - Continued
4.4.5 Retardance Range .................................................................................................. 66
4.5 Examples ........................................................................................................................ 67
4.5.1 Ideal Reflection at Normal Incidence ..................................................................... 67
4.5.2 Brewster’s Angle Analysis ...................................................................................... 70
4.5.3 An Aluminum Coated Three-fold Mirror System ................................................... 76
4.6 Conclusion ...................................................................................................................... 81
CHAPTER 5 POLARIZATION ABERRATION ..................................................................................... 83
5.1 Jones Pupil ..................................................................................................................... 83
5.2 Polarization Aberration Function ................................................................................... 89
5.2.1 Coherent Beam Combination ................................................................................ 95
5.3 Conclusion ...................................................................................................................... 96
CHAPTER 6 A SKEW ABERRATION ................................................................................................. 97
6.1 Definition ....................................................................................................................... 97
6.2 Skew Aberration Algorithm............................................................................................ 98
6.3 Example ........................................................................................................................ 102
6.3.1 Skew Aberration at the Exit Pupil ........................................................................ 103
6.3.2 Skew Aberration’s Effect on Point Spread Function and Modulation Transfer
Function ............................................................................................................................. 106
6.4 Statistics – Code V Patent Library ................................................................................ 109
6.5 Skew Aberration in Paraxial Ray Trace ........................................................................ 110
6.6 Conclusion .................................................................................................................... 114
CHAPTER 7 UNDERSTANDING APPARENT RETARDANCE DISCONTINUITIES .............................. 115
7.1 Retardance Calculation for Jones Matrices ................................................................. 116
7.2 Retarder Space ............................................................................................................. 119
7.3 Trajectories of Jones Retarder Matrices as the Polarization State Analyzer Rotates .. 122
7.4 Phase Unwrapping for Homogeneous Retarder Systems using Dispersion Model ..... 125
7.4.1 Dispersion Model ................................................................................................. 125
7.4.2 Phase Unwrapping of the Homogeneous Retarder System ................................ 127
8
TABLE OF CONTENTS - Continued
7.5 Discontinuity in Phase Unwrapped Retardance Values for Compound Retarder Systems
of Arbitrary Alignment ............................................................................................................. 137
7.6 Conclusion .................................................................................................................... 163
CHAPTER 8 COHERENCE MATRIX AND POLARIZATION RAY TRACING TENSOR .......................... 165
8.1 Introduction ................................................................................................................. 165
8.2 The Coherence Matrix ................................................................................................. 166
8.3 Projection of the Coherence Matrix onto Arbitrary Planes ......................................... 169
8.4 A Definition of Polarization Ray Tracing Tensor .......................................................... 170
8.5 A Polarization Ray Tracing Tensor for a Non-depolarizing Ray Intercept .................... 174
8.5.1 A Polarization Ray Tracing Tensor from Surface Amplitude Coefficients ............ 175
8.5.2 A Polarization Ray Tracing Tensor from the three-by-three Polarization Ray
Tracing Matrix P ................................................................................................................... 181
8.5.3 Example Polarization Ray Tracing Tensor Calculation ......................................... 183
8.6 A Polarization Ray Tracing Tensor for a Ray Intercept with Scattering ....................... 185
8.6.1 Example Polarization Ray Tracing Tensor Calculation ......................................... 188
8.7 Example Polarization Ray Tracing Tensor and Combination of Tensors ...................... 190
8.8 Conclusion .................................................................................................................... 206
CHAPTER 9 THREE-DIMENSIONAL (3D) STOKES PARAMETERS .................................................. 207
9.1 Definition of 3D Stokes Parameters ............................................................................. 207
9.2 Example Incoherent Additions of 3D Stokes Parameters ............................................ 210
9.3 2D Stokes Parameters to 3D Stokes Parameters ......................................................... 217
9.4 3D Mueller Matrix ........................................................................................................ 217
9.4.1 Example 3D Mueller Calculation .......................................................................... 226
9.5 Conclusion .................................................................................................................... 230
CHAPTER 10 CONCLUSIONS AND FUTURE WORK ................................................................ 231
10.1 Summary ...................................................................................................................... 231
10.2 Future Work ................................................................................................................. 235
10.3 Conclusion .................................................................................................................... 237
APPENDIX A USA PATENT 2,896,506 LENS PARAMETERS............................................................ 239
9
TABLE OF CONTENTS - Continued
APPENDIX B DEPOLARIZATION INDEX OF A POLARIZATION RAY TRACING TENSOR ................... 241
APPENDIX C WATER DROPLET SIZES FOR THE CLOUD EXAMPLE ................................................ 243
REFERENCES ................................................................................................................................. 244
10
LIST OF FIGURES
Figure 1.1 (a) Longitude and latitudes lines as the local x and y bases. (b) Singularity at
the North Pole. .................................................................................................................. 22
Figure 2.1 The polarization states before (Eq-1) and after (Eq) the qth
optical interface are
related by the matrix Pq. .................................................................................................... 26
Figure 2.2 The major axis orientation θ measured from 1v to a . The axis vector is
shown in red arrow............................................................................................................ 31
Figure 3.1 Top view of a hollow aluminum coated corner cube. Three surfaces of the
corner cube are perpendicular to each other. .................................................................... 42
Figure 3.2 Top and side views of a corner cube. Propagation vectors are shown in solid
black, local s coordinate vectors in solid red, and local p coordinate vectors in dashed
blue. ................................................................................................................................... 43
Figure 3.3 (a) The exiting state from the corner cube with the maximum intensity and
corresponding incident state. (b) The state with the minimum intensity and
corresponding incident state. All represented in local coordinates where 2D polarization
vectors are defined. Each propagation vector comes out of the page. ............................. 46
Figure 3.4 (a) The exiting state from the corner cube with the maximum intensity and
corresponding incident state. (b) The state with the minimum intensity and
corresponding incident state. All states are represented in global coordinates (x-y plane)
looking into the corner cube; propagation vectors are anti-parallel for the incident and
exiting electric field vectors. ............................................................................................. 47
Figure 4.1 A polarimeter measuring a sample retarder with the polarization state analyzer
(PSA) (a) aligned with the polarization state generator (PSG) and (b) rotated to an
arbitrary orientation. By rotating the PSA, the exiting local coordinates for the Jones
matrix are also rotated. The measured retardance of the sample now includes a “circular
retardance” component of 2 as well as the proper retardance. ...................................... 50
Figure 4.2 Measured retardance values as a function of rotation angle θ for circular
retarders with retardance between zero and one wave of retardance are shown in different
colors. Retardance of a half wave circular retarder is independent of the PSA orientation.
........................................................................................................................................... 53
Figure 4.3 Measured retardance values as a function of rotation angle θ for linear
retarders with retardance between zero and one wave of retardance are shown in different
colors. Retardance of a half wave linear retarder is independent of the PSA orientation.
........................................................................................................................................... 54
11
LIST OF FIGURES - Continued
Figure 4.4 (a) The evolution of a local coordinate pair ˆ ˆ{ , }A Ax y (green) through a system
of three fold-mirrors. The exiting local coordinates (dashed red) undergo a 90° rotation
from the initial local coordinates (solid green). (b) A three fold-mirror system. When a
collimated beam enters the system along the z-axis the beam exits along the z-axis. ...... 56
Figure 4.5 Incident, reflected, and transmitted local coordinates calculated from parallel
transport matrices. ,0 ,0 0ˆˆ ˆ{ , , }L Lx y k are the right handed incident local coordinates,
, ,1 , ,1 ,1ˆˆ ˆ{ , , }L r L r rx y k are the left handed reflected local coordinates, and , ,1 , ,1 ,1
ˆˆ ˆ{ , , }L t L t tx y k are
the right handed transmitted local coordinates. ................................................................ 59
Figure 4.6 A two mirror system. The red solid lines show the s vector at the first mirror
and its geometric transformation along each ray segment using Q. The blue dashed lines
show the p vector in object space and its geometric transformations. .............................. 60
Figure 4.7 An ideal reflection at normal incidence with the incident and exiting right-
handed local coordinates, ,0 ,0 ,1 ,1ˆ ˆ ˆ ˆ{ , } and { , }L L L Lx y x y . In this particular choice of local
coordinates, the ˆLx vector was flipped after the reflection. .............................................. 69
Figure 4.8 Fresnel external (top) and internal (bottom) reflection coefficients for s-
polarization (dashed) and p- polarization (solid) are shown as the angle of incidence
changes. ............................................................................................................................. 71
Figure 4.9 The relative phase shift for external (top) and internal (bottom) reflection as a
function of the angle of incidence. .................................................................................... 72
Figure 4.10 The incident s-polarized states (dotted) and reflected (solid) polarization
states as angles of incidence change from normal incidence to glazing incidence. Red
arrow indicates the reflected electric field at Brewster’s angle. ....................................... 74
Figure 4.11 The incident p-polarized states (dotted) and corresponding reflected
polarization states (solid) as the angle of incidence changes. Red arrow indicates the
reflected electric field vector at the Brewster’s angle. At the Brewster’s angle, Fresnel
reflection coefficient for p-polarization is zero. ............................................................... 75
Figure 4.12 Incident electric field vectors at 45º between s and p-polarizations (dotted)
and corresponding reflected (solid) electric field vectors are plotted as the angle of
incidence changes. Red lines are for the incident and reflected pair at the Brewster’s
angle; the top view shows that at Brewster’s angle, the reflected light (solid red line) is s-
polarized. ........................................................................................................................... 76
Figure 4.13 An aluminum coated three fold-mirror system. ............................................ 77
12
LIST OF FIGURES - Continued
Figure 4.14 Local coordinate transformation using Qq's for the three fold-mirror system.
Incident ,0
ˆLx state (solid red) exits as ˆ-x polarized and incident
,0ˆ
Ly state (dashed blue)
exits as y polarized after three reflections due to the geometric transformation. ............ 79
Figure 5.1 A field vector on the image plane and the exit pupil vector. ........................... 83
Figure 5.2 Optical system layout of USA patent 2896506 from Code V library has seven
lens elements. The system is defined with three field angles. .......................................... 85
Figure 5.3 Wavefront aberration maps at the exit pupil using Code V are plotted for the
on-axis field. For on-axis field, 0.3 waves of spherical aberration is the dominant
aberration. ......................................................................................................................... 86
Figure 5.4 Amplitude part of Jones Pupil at the edge of field of USA patent 2896506
shows variation across the pupil as well as off-diagonal components, which often appear
for systems with high NA. ................................................................................................ 87
Figure 5.5 Jones pupil local x and y coordinates. ............................................................. 88
Figure 5.6 Wavefront aberration maps at the exit pupil using Code V are plotted for a
point source at the edge of the field demonstrating three waves of astigmatism. ............ 89
Figure 5.7 Amplitude part of a polarization aberration function (,Total sP ) at the exit pupil
of USA patent 2896506. ................................................................................................... 93
Figure 5.8 Phase part of a polarization aberration function ( ) at the exit pupil of
USA patent 2896506. ........................................................................................................ 94
Figure 6.1 (a) A double pole grid of reference vectors on a unit sphere viewed along the
chief ray’s propagation vector and (b) oblique view. ..................................................... 100
Figure 6.2 Optical system layout of USA patent 2896506 from Code V library has seven
lenses. The system is defined with three field angles. .................................................... 103
Figure 6.3 (a) Skew aberration of a ray grid with 32º field angle is evaluated at the exit
pupil of the patent 2896506. The maximum skew aberration is +7.01º and the minimum
is -7.01º (ray A). Both extreme occur from skew rays at the edge of the pupil. (b)
Horizontal cross section (indicated in orange dashed line in part (a)) of the skew
aberration exit pupil map has zero skew aberration for the center ray, which is the chief
ray. .................................................................................................................................. 104
Figure 6.4 Skew ray A's skew aberration contribution from each lens surface sums to -
7.01º through the system. ................................................................................................ 105
Figure 6.5 A point spread matrix (PSM) of the example system calculated from a discrete
Fourier transform of the parallel transport matrix of the system at the exit pupil. The
elements are elliptical due to three waves of astigmatism. ............................................. 107
Figure 6.6 An optical transfer matrix (OTM) of the example calculated by a discrete
Fourier transform of the PSM. ........................................................................................ 108
,Total sP
13
LIST OF FIGURES - Continued
Figure 6.7 Histogram of the maximum skew aberration evaluated for 2383 non-reflecting
optical systems in Code V’s library of patented lenses. ................................................. 109
Figure 6.8 Paraxial skew ray at point A's skew aberration contribution from each lens
surface sums to -2.00º through the system. ..................................................................... 113
Figure 7.1 A range of total retardance δ calculated from Eq. (7.1.6). ............................ 119
Figure 7.2 A retarder space with δH, δ45, and δR as axes where δ is the distance from the
origin to a point which has a magnitude of retardance δ. ............................................... 120
Figure 7.3 Two groups of Mueller matrices (A and B) with the same retardance modulo
to 2π are shown in the retarder space. Each point in the groups is 2π away from each
other and shares the same fast and slow axes. ................................................................ 122
Figure 7.4 Trajectories of horizontal fast axis linear retarders in the retarder space as the
PSA rotates by θ from zero to 2π. Each trajectory repeats twice for 0≤ θ ≤ 2π. ........... 124
Figure 7.5 (a) The principal retardance of the first retarder and its fast axis orientation as
a function of wavelength. (b) The principal retardance of the second retarder and its fast
axis orientation with respect to the horizontal axis as a function of wavelength. .......... 128
Figure 7.6 Retardance plots as a function of wavelength for two horizontal fast axis linear
retarders (HLR) with different thicknesses (green and blue) and a combination of two
HLRs with a shared horizontal fast axis (red). The principal retardance has the horizontal
fast axis for downward sloping regions and has the vertical fast axis for upward sloping
regions. ............................................................................................................................ 129
Figure 7.7 The principal retardance of the combination of two HLRs and the system’s
fast axis orientation with respect to the horizontal axis as a function of wavelength. .... 130
Figure 7.8 Each segment of the principal retardance has a mode number q to apply phase
unwrapping algorithm. Starting from the right side of the graph, blue segments have odd
mode numbers and the red segments have even mode numbers. ................................... 131
Figure 7.9 Principal retardance vector trajectories are shown in the retarder space as the
wavelength changes. Each figure corresponds to a different mode number starting from
the longest wavelength (mode 1) to the shortest wavelength. ........................................ 134
Figure 7.10 The principal retardance trajectory in the retarder space as the wavelength of
the ray changes. Discontinuity occurs on a sphere of radius π. ..................................... 134
Figure 7.11 (a) Principal retardance trajectory of a two-aligned HLR system in the
retarder space as the wavelength gets shorter. (b) Retardance trajectory of the same
system after the phase unwrapping. ................................................................................ 136
Figure 7.12 Phase unwrapped retardance as a function of wavelength for two HLRs
(green and blue) and a system of two-aligned HLR (red). ............................................. 137
14
LIST OF FIGURES - Continued
Figure 7.13 Total retardance for a system with two half-wave linear retarders is plotted as
the fast axis orientation (θ) of the second retarder changes with respect to the first
retarder’s fast axis orientation. ........................................................................................ 139
Figure 7.14 The principal retardance of TotalJ , a system of two HRLs misaligned by π/16,
is plotted as a function of wavelength. Green dotted circles in the top figure indicate the
area where the principal retardance changes its slope without going down to zero. ...... 143
Figure 7.15 Principal retardance vector trajectories are plotted in the retarder space as the
wavelength reduces. Each figure corresponds to a segment of the trajectory from the
longer wavelength to the shorter wavelength as the retardance vector approaches the π
sphere. ............................................................................................................................. 145
Figure 7.16 The top view of the principal retardance vector trajectories in the retarder
space as the wavelength reduces. .................................................................................... 146
Figure 7.17 Phase unwrapped retardance plotted as a function of wavelength for the
aligned (red) and forced phase unwrapping for a misaligned (blue) two-HLR system. . 147
Figure 7.18 Phase unwrapped Major (top) and Minor (bottom) plotted as a function of
wavelength. The phase unwrapped Major is the linear addition of
1 and 2 of the each
linear retarder. ................................................................................................................. 148
Figure 7.19 The principal retardance plots as a function of wavelength for two HLRs
(green and blue) and a system of two misaligned HLRs (red). Two HLRs have different
retardances and the angle between two fast axes is θ. .................................................... 151
Figure 7.20 A system of two HLRs misaligned by θ. Four output beams exiting the
system with four different optical path lengths are shown offset for clarity. ................. 152
Figure 7.21 The principal retardance as a function of wavelength for the system with two
HLRs with the fast axes misaligned by π/4. The principal retardance has minima other
than zero. ......................................................................................................................... 156
Figure 7.22 Phase unwrapped retardance of the first (green) and second (blue) retarders
with the fast axis orientation (fast ) of the compound system. Xs mark wavelengths
where individual plates have integer waves of retardance, and don’t contribute to the axis
of the retarder. ................................................................................................................. 157
Figure 7.23 Green plot shows the retardance of the HLR, δ1(λ) as a function of
wavelength. Blue plot is the retardance of the 45° fast axis linear retarder, δ2(λ). Red
plot shows the retardance of the system of the two HLRs with π/4 misalignment between
two fast axes orientation, δTotal(λ). .................................................................................. 158
Figure 7.24 Phase unwrapped Minor plotted as a function of wavelength. .................... 159
15
LIST OF FIGURES - Continued
Figure 7.25 A principal retardance trajectory of the system with two misaligned HLRs at
45° in the retarder space as the wavelength reduces. When the trajectory reaches the
boundary of π, the trajectory moves to the opposite point on the π sphere and the fast axis
changes to the orthogonal state. ...................................................................................... 162
Figure 7.26 Phase unwrapped retardance of the compound system as the wavelength
changes with the corresponding points in Figure 7.25. .................................................. 163
Figure 8.1 A triplet followed by a lens barrel. A collimated grid of rays propagates
through the triplet and scatters off from the lens barrel before it reaches the detector plane
on the right. ..................................................................................................................... 173
Figure 8.2 A ray propagating along the z axis reflects from an Aluminum coated surface.
......................................................................................................................................... 184
Figure 8.3 A volume of water droplets in the air which scatters the incident collimated
beam of light. The incident beam of light is plotted in dark red arrows and some of the
individual scattering ray paths are shown in different colors. ........................................ 191
Figure 8.4 s (red) and p (blue) polarization reflection coefficients as a function of
scattering angle calculated from the MiePlot program. .................................................. 193
Figure 8.5 A nine-by-nine grid of the detector with its surface normal along the x-axis is
shown. Each false color corresponds to the summation of the polarization ray tracing
tensors along the x-axis over the same color mapped water droplets. ............................ 194
Figure 8.6 3D degree of polarization is calculated for each exiting coherence matrix at the
detector. The x-axis indicates the pixel number. 3D DOP shows that the exiting light is
mostly unpolarized. ......................................................................................................... 200
Figure 8.7 2D degree of polarization of each 2D Stokes vector is calculated at each pixel
on the detector. The values indicate that the exiting light is mostly unpolarized. ......... 201
Figure 8.8 S0 components of the exiting 2D Stokes vectors at each pixel. ..................... 202
Figure 8.9 Diattenation of the scattered light calculated from the Mie Plot program. For
positive values s polarization has the greater scattering amplitudes than p polarization.203
Figure 8.10 S2 components of the exiting 2D Stokes vectors at each pixel.................... 204
Figure 8.11 Exiting light polarization vectors on the detector plane are shown in red
whereas linearly polarized light at 45º are shown in dashed blue. ................................. 205
Figure 9.1 Three plane wavefronts propagating along the x, y, and z axis are meeting at
the origin. ........................................................................................................................ 211
16
LIST OF TABLES
Table 2.1 Polarization ray tracing matrix for a horizontal fast axis linear quarter wave
retarder without beam deviation for three different propagation directions, along z-axis, y-
axis, and x-axis. The Jones matrices are specified in a symmetric phase convention
where the fast axis polarization state is advanced by an eighth of a wave and the slow
axis is delayed by an eighth of a wave. ............................................................................. 35
Table 3.1 Propagation vectors, local coordinate basis vectors, surface normal vectors and
polarization ray tracing matrices associated with a ray path through aluminum coated
hollow corner cube. ........................................................................................................... 44
Table 3.2 The maximum intensity of output and associated incident electric field, the
minimum intensity of transmitted electric field and associated incident electric field, and
the diattenuation from the ray through the corner cube system. ....................................... 45
Table 4.1 Propagation vectors, Pq’s, and Qq’s for a ray propagating through the aluminum
coated three fold-mirror system ........................................................................................ 77
Table 8.1 A polarization ray tracing tensor in global coordinates as a function of
amplitude coefficients in local coordinates. Each shows a three-by-three matrix
component of the tensor. ................................................................................................. 180
Table 9.1 3D Stokes vectors from an incoherent addition of three electric field vectors
with different polarization states, measured on xz-plane are shown. All the amplitudes are
set to 1.0 for the simplicity. ............................................................................................ 213
Table 9.2 A nine-by-nine rotation matrix RotM( elements are shown. The RotM(
matrix rotates to . ............................................................................................... 225
17
ABSTRACT
A three-by-three polarization ray tracing matrix method is developed to calculate the
polarization transformations associated with ray paths through optical systems. The
relationship between the three-by-three polarization ray tracing matrix P method and the
Jones calculus is shown in Chapter 2. The diattenuation, polarization dependent
transmittance, is calculated via a singular value decomposition of the P matrix and
presented in Chapter 3.
In Chapter 4 the concept of retardance is critically analyzed for ray paths through
optical systems. Algorithms are presented to separate the effects of retardance from
geometric transformations. The parallel transport of vectors is associated with non-
polarizing propagation through an optical system. A parallel transport matrix Q
establishes a proper relationship between sets of local coordinates along the ray path, a
sequence of ray segments. The proper retardance is calculated by removing this
geometric transformation from the three-by-three polarization ray trace matrix.
Polarization aberration is wavelength and spatial dependent polarization change
that occurs as wavefronts propagate through an optical system. Diattenuation and
retardance of interfaces and anisotropic elements are common sources of polarization
aberrations. Two representations of polarization aberration using the Jones pupil and a
polarization ray tracing matrix pupil, are presented in Chapter 5.
In Chapter 6 a new class of aberration, skew aberration is defined, as a
component of polarization aberration. Skew aberration is an intrinsic rotation of
18
polarization states due to the geometric transformation of local coordinates; skew
aberration occurs independent of coatings and interface polarization. Skew aberration in
a radially symmetric system primarily has the form of a tilt plus circular retardance coma
aberration. Skew aberration causes an undesired polarization distribution in the exit pupil.
A principal retardance is often defined within ( , ]- range. In Chapter 7 an
algorithm which calculates the principal retardance, horizontal retardance component, 45°
retardance component, and circular retardance component for given retarder Jones
matrices is presented. A concept of retarder space is introduced to understand apparent
discontinuities in phase unwrapped retardance. Dispersion properties of retarders for
polychromatic light is used to phase unwrap the principal retardance. Homogeneous and
inhomogeneous compound retarder systems are analyzed and examples of multi-order
retardance are calculated for thick birefringent plates.
Mathematical description of the polarization properties of light and incoherent
addition of light is presented in Chapter 8, using a coherence matrix. A three-by-three-by-
three-by-three polarization ray tracing tensor method is defined in order to ray trace
incoherent light through optical systems with depolarizing surfaces. The polarization ray
tracing tensor relates the incident light’s three-by-three coherence matrix to the exiting
light’s three-by-three coherence matrix. This tensor method is applicable to illumination
systems and polarized stray light calculations where rays at an imaging surface pixel have
optical path lengths which vary over many wavelengths.
19
In Chapter 9 3D Stokes parameters are defined by expanding the coherence
matrix with Gell-Mann matrices as a basis. The definition of nine-by-nine 3D Mueller
matrix is presented. The 3D Mueller matrix relates the incident 3D Stokes parameters to
the exiting 3D Stokes parameters. Both the polarization ray tracing tensor and 3D
Mueller matrix are defined in global coordinates.
In Chapter 10 a summary of my work and future work are presented followed by
a conclusion.
20
CHAPTER 1 HISTORY OF POLARIZATION RAY TRACING
1.1 Polarization Ray Tracing in two-dimensions
The objective of polarization ray tracing is to calculate the evolution of the polarization
state through an optical system and to determine the polarization properties, such as
diattenuation and retardance, associated with ray paths through the system, and assess the
impact of polarization aberration on image formation. By tracing many rays, the
polarization aberrations associated with an optical system can be assessed, and the
behavior of a particular optical and coating design compared with the optical system’s
polarization specifications.
One of the most common descriptions of polarization state used in polarization
analysis is the Jones vector
x
y
E
E
J (1.1.1)
where Ex and Ey are complex amplitudes1. The Jones vector specifically refers to a
monochromatic plane wave, describing the electric field and the polarization ellipse with
respect to an x-y coordinate system in the transverse plane. If the plane wave is not
propagating along the z-axis, then the x-y coordinates are referred to as “local coordinates”
associated with a particular transverse plane. Most optical design software use two-by-
two Jones matrices to describe the optical elements since, unlike Mueller matrices, Jones
matrices preserve the phase information. Jones matrices describe polarization effects
21
such as the polarization elements used to transform between polarization states1. For
paraxial beams or beams with small numerical aperture where the wavefront is not very
curved and the z-component of the field is small, Jones matrix pupil functions2, the Jones
matrix as a function of exit pupil coordinates, are widely used. However, to use Jones
vectors and matrices in optical design for the ray tracing of highly curved beams, local
coordinate systems are required for each ray, and each of its ray segments, to define the
direction of the Jones vector’s x- and y- components in space, and these local coordinate
systems lead to complications due to the intrinsic singularities of local coordinates. For
example, to define a Jones matrix between an incident spherical wave and an exiting
wave two pairs of local coordinates are necessary and this pair of local coordinates is
different for each of the wavefront’s rays.
Jones vectors and matrices are readily adapted to sequences of ray segments with
beam deviation by carrying along a local coordinate system. One algorithm the author
and others have used for arbitrary propagation direction is to use lines of longitudes and
latitudes to establish a local x and y-basis for Jones vectors, as shown in Figure 1.1 (a).
Consider the unit propagation vector k puncturing a unit sphere demarcated with latitude
and longitude arcs; the sphere’s poles might be located at {0, 0, ±1}. The latitude
specifies ˆLocalx and longitude specifies ˆ
Localy for any ray, although there is an obvious
problem for propagation along the z-axis.
This leads to the following problem. Using any latitude/longitude algorithm for
specifying Jones vector local coordinates results in two singularities at the north and
22
south poles where the local coordinates are undefined. (Figure 1.1 (b)) Near the poles, the
local coordinates vary rapidly with small change in propagation direction. The
workaround that polarization ray tracing computer programs use is to trap propagation
close to the poles (z in this case) and handle the associated quantities as a special case via
separate rules. In the experience of the author, working in Jones vector local coordinates
leads to a cascade of minor complications, both in handling rays near the singularities and
in describing high numerical aperture beams.
Figure 1.1 (a) Longitude and latitudes lines as the local x and y bases. (b)
Singularity at the North Pole.
Such issues are intrinsic to any choice of local coordinates. According to the
Winding Number Theorem3, it is impossible to define a continuous and differentiable
vector field constrained to lie on the surface of a sphere over the entire sphere without at
least two zeros in the field; a set of latitude vectors or conversely a set of longitude
vectors provide two examples, where the zeros occur at the poles. All local coordinate
choices have such singularities.
23
1.2 Polarization Ray Tracing in three-dimensions
This problem of singularities in local coordinates can be avoided and systematized by
generalizing a two-by-two Jones matrix into a three-by-three polarization ray tracing
matrix to handle arbitrary propagation directions. Polarization ray tracing matrix
methods using Jones matrices have been in use in optical design for at least twenty years.
Knowlden used a polarization ray tracing technique to analyze the instrumental
polarization effects caused by coatings on several nonplanar optical surfaces4
.
Polarization ray tracing was used to integrate thin-film design and crystal optics into
optical design5. Waluschka merged traditional lens design and analysis computer codes
and thin-film codes6 in order to construct a polarization ray trace algorithm. Polarization
ray tracing, which calculates the Jones matrix associated with an arbitrary ray path
through an optical system, was introduced to calculate the polarization aberration
function7. Polarization ray tracing of “thick” thin-film optical coatings (thicker than
several 's ) was used to calculate the phase shift caused by the physical thickness of the
film8.
From Chapter 2 to Chapter 6, except in Chapter 7, polarization effects at each ray
intercept are described by a three-by-three polarization ray tracing matrix, P.
Polarization effects are propagated along ray paths through optical systems by matrix
multiplication of the P matrices for each ray intercept. In image space the P matrix can
then be used to determine a full three element electric field vector. Three-dimensional
24
polarization ray tracing methods have been mentioned in several manuscripts9,10,11,12
and
one of the methods is contained in a Code V macro. Three-dimensional polarization ray
tracing algorithms in [13] and [14] are the basis of the polarization ray tracing code,
“Polaris-M”15,16
, developed and in use at our Polarization Laboratory at the University of
Arizona.
Chapter 2 defines the three-by-three polarization ray tracing matrix P and Chapter
3 and Chapter 4 provide diattenuation and retardance algorithms using the P matrix.
Chapter 5 discusses polarization aberration analysis using the P matrix and Chapter 6
defines a skew aberration, which is a component of polarization aberration. Materials in
Chapter 2 to Chapter 5 are based on reference [13] and [14] and Chapter 6 is based on
reference [17]. Chapter 7 addresses a discontinuity in retardance values that occurs in the
simulation and measurement of compound retarders, and provides a phase unwrapping
method using Jones matrices. No systematic development of this method has yet been
presented in the literature. Chapter 8 introduces a coherence matrix and defines a
polarization ray tracing tensor, which provides a polarization ray tracing method through
depolarizing optical systems. Chapter 9 further extends the coherence matrix to three-
dimensional (3D) Stokes parameters and defines a three-dimensional Mueller matrix.
Chapter 10 summarizes the previous chapters and states conclusions.
25
CHAPTER 2 DEFINITION OF POLARIZATION RAY TRACING
MATRIX
In this section, the polarization ray tracing matrix is defined. It is shown that the
relationships between all pairs of incident and exiting electric field vectors do not provide
enough constraints to uniquely specify all nine elements of the polarization ray tracing
matrix, so an additional constraint involving operation on the propagation vector is added
to the matrix definition.
2.1 Definition of Polarization Ray Tracing Matrix, P
The polarization ray tracing matrix P characterizes the change in a three-element electric
field vector due to interaction with an optical element, a sequence of optical elements, or
an entire optical system. Consider the evolution of the polarization state of a ray through
an optical system with N interfaces labeled by index q. Further, if all the materials are
isotropic, then polarization changes will occur only at interfaces. (This restriction will be
removed later.) At each interface q, the propagation vectors -1ˆ
qk and ˆqk may be different
due to reflection or refraction. -1 and q qE E before and after the interaction are related by
the polarization ray tracing matrix for the qth
ray intercept, as shown in Figure 2.1,
,
, -1
,
x q
q y q q q
z q
E
E
E
E P E . (2.1.1)
26
Note qP is associated with specific incident and exiting propagation vectors, -1
ˆqk and ˆ
qk .
Figure 2.1 The polarization states before (Eq-1) and after (Eq) the qth
optical interface
are related by the matrix Pq.
A ray interacting with a series of optical elements is represented by cascading the
Pq matrices for each ray intercept yielding a net polarization ray tracing matrix PTotal
which represents the entire ray path,
1
-1 2 1
,-1
Total N N q q
q N
P P P P P P P . (2.1.2)
The optical path lengths between the optical system entrance and exit pupils are
summed to calculate a ray’s effect to the wavefront aberration (Section 5.2). The
polarization ray tracing matrix describes the polarization dependent transmission and
polarization dependent corrections to the optical path length.
If the optical system includes anisotropic or birefringent media, the propagation
portions cannot be modeled as identity matrices, but will take the form of retarder
27
matrices for birefringent media and/or diattenuation matrices for dichroic media. The
propagation effect from ray interface q to q+1 is denoted as +1,q qA . With the inclusion of
polarization ray tracing matrices for anisotropic materials, Eq. (2.1.2) for the polarization
ray tracing matrix for a ray through an optical system becomes
1
, -1 -1 3,2 2 2,1 1 +1,
-1,-1
Total N N N N N q q q
q N
P P A P A P A P P A P . (2.1.3)
This formulation works well for stress birefringent and weakly anisotropic materials. In
strongly birefringent materials like calcite and rutile, birefraction between the two modes
(ordinary and extraordinary) causes ray doubling and two separated rays will continue to
the exit port of the optical system. In this case, each of the rays refracting into a
birefringent material has a separate polarization ray tracing matrix in the form of a
polarizer. This polarizer matrix selects the incident state that couples into the specified
mode. Further comments on ray tracing in anisotropic materials are beyond the scope of
the present section.
Pq as defined in Eq. (2.1.1) is under-constrained. Eq. (2.1.1) is equivalent to the
Jones matrix equation,
1
x q
q q q
y q
E
E
w J w,
-
,
(2.1.4)
except that Eq. (2.1.1) is formulated in global coordinates. In Eq. (2.1.4), Jones vectors,
wq-1 and wq, are constrained to the transverse plane; so only two linearly independent
polarization states are required to form a basis for all possible wq-1, for example wa and
28
wb. In Eq. (2.1.1) the transformation of all polarization states can be described as linear
combinations of the transformations of wa and wb
, .a q a b q b E P E E P E (2.1.5)
The relationship in Eq. (2.1.5) yields six equations, one for each row, but qP has
nine elements. So Eq. (2.1.1) does not fully constrain qP . In order to uniquely define
qP ,
an additional set of three constraints is applied,
-1ˆ ˆ
q q qP k k (2.1.6)
The choice of is arbitrary, but only two values, either 0 or 1, allow qP to be repeatedly
cascaded and maintain the value of . Both choices of describe the same
polarization effects at the ray intercept with minor differences as described below.
With = 0, qP is always singular and so 1
q
P
never exists. One of the singular
values of qP will always be zero, as will one of the eigenvalues. With = 1, only ideal
polarizers have singular matrices. Thus = 1 was elected for this manuscript. This gives
qP the additional property that
-1ˆ ˆ
q q qP k k . (2.1.7)
By adding this relationship between the incident and exiting propagation vectors, qP is
now uniquely defined by nine constraints, Eq.(2.1.1) and (2.1.7).
29
2.1.1 Polarization States as Three-element Electric Field Vectors
For a plane wave propagating in an arbitrary direction k , the electric field , te r is
ˆ -
, Re ,i k t
t e
k r
e r E (2.1.8)
where 2 n
k
, n is the refractive index, and E has units of volts/meter. E is
perpendicular to the propagation vector in isotropic and linear media so
ˆ 0. E k (2.1.9)
For linearly polarized light, the electric field magnitude goes to zero twice per
period i.e., all three elements of the time-dependent polarization vector in Eq. (2.1.8)
become zero simultaneously. The condition for linearly polarized light is that the phases
of all three elements of E are equal or 180° out of phase.
From Eq. (2.1.8), r0, 0 Ret e r E E . The electric field at / 2t is
- / 2
i0, / 2 Re Imit e e r E E E . (2.1.10)
If the electric field is circularly polarized, the electric field vector at time zero and the
electric field vector at quarter of the full cycle are perpendicular to each other and have
the same magnitude. Thus, E describes circularly polarized light if
0 and | | | |,r i r i E E E E (2.1.11)
since the electric field for circularly polarized light should have the same amplitude at
any given time. Otherwise the light is in an elliptical state.
30
In this presentation we adopt the convention that when a circularly polarized
electric field vector rotates clockwise at a fixed observation plane (for example, 0r )
looking into the beam, the electric field is right circularly polarized. If the vector rotates
counterclockwise the electric field is left circularly polarized.
For a unit amplitude circularly polarized electric field vector with a propagation
vector along the z-axis, the cross product of electric field vectors at 0 and / 2t t is
{0, 0, -1},
ˆ0,0 0, / 2 -e e z . (2.1.12)
In general, the cross product of the electric field vector at time t = 0 and
/ 2t is anti-parallel to the propagation vector for right circularly polarized light, and
parallel for left circularly polarized light. The same relation holds for elliptically
polarized light. Thus the handedness of the elliptically polarized or circularly polarized
light is determined by the sign of
ˆ0,0 0, / 2 } {e e k . (2.1.13)
If Eq. (2.1.13) is positive, the electric field is rotating counter clockwise (i.e. left circular)
and if Eq. (2.1.13) is negative, the electric field is rotating clockwise (i.e. right circular).
For elliptically polarized light, the major axis orientation can be calculated by
calculating the electric field vector defined in local coordinates in a plane perpendicular
to k . For a given k , two normalized real-valued vectors ( 1 2ˆ ˆ and v v ) that are orthogonal
31
to each other and orthogonal to k can be calculated. 1 2
ˆ ˆ and v v can be any normalized
three-element vectors which satisfy
1 2 1 2ˆ ˆˆ ˆ ˆ ˆ, , and v v k v k v . (2.1.14)
Then any electric field vector propagating along k can be written as a
superposition of 1 2
ˆ ˆ and v v ,
ˆ ˆ- -
1 1 2 2ˆ ˆ ˆ ˆ, Re{ } Re{[( ) ( ) ] }
i k t i k tt e e
k r k re r E E v v E v v
1 2
ˆ -
1 1 2 2ˆ ˆRe{( ) }v v
i k ti i
v vE e E e e
k r
v v , (2.1.15)
where viE are real amplitudes along ˆ
iv . The major axis orientation is
2
1
arctan( )v
v
E
E , (2.2.1)
where is measured from 1v to
2v as shown in Figure 2.2.
Figure 2.2 The major axis orientation θ measured from 1v to a . The axis vector is
shown in red arrow.
32
The major axis vector a in global coordinates is
1 2ˆ ˆcos( ) sin( ) a v v . (2.2.2)
2.2 Formalism of Polarization Ray Tracing Matrix using Orthogonal
Transformation
Ray tracing calculations using the polarization ray tracing calculus involve frequent
transformations between the global coordinates of the polarization ray tracing matrix and
local coordinates where the physics of polarization elements, anisotropic materials, thin
film interfaces, anisotropic materials, diffraction gratings, reflection, refraction, and other
phenomena are formulated. Orthogonal transformations between different coordinate
systems, such as s-p coordinates, are straightforward and ubiquitous. This section
explains the coordinate transformation notation.
Orthogonal matrices, also known as real unitary matrices, describe rotations of
orthogonal coordinate systems. In our case, orthogonal matrices transform between a
local coordinate basis selected for a calculation at an interface and the global coordinate
basis and vice versa. A separate pair of basis vectors is needed before and after the
interface due to the change of ray direction.
For reflection and refraction from surfaces, the s and p-polarization states along
with the propagation vector form a natural basis, the ˆˆ ˆ{ , , }s p k basis. For simple and
33
isotropic media, ˆ ˆ and s p are defined as being perpendicular and parallel to the plane of
incidence and thus are the eigenpolarizations for the Fresnel equations. The surface local
coordinates -1ˆˆ ˆ{ , , }q q qs p k before and ˆˆ ˆ{ , , }q q qs p k after the q
th surface are,
-1
-1
-1
ˆ ˆˆ ˆˆ ˆ ˆ ˆ ˆ ˆ ˆ, , and , .
ˆ ˆ| |
q q
q q q q q q q q q
q q
k ks p k s s s p k s
k k
(2.2.3)
The ˆqs vector is the same before and after the surface; only the ˆ
qp vector changes. The
orthogonal matrices are
, , , -1 , , ,
, , , , -1 , , , ,
, , , -1 , , ,
ˆ ˆˆ ˆ ˆ ˆ '
ˆ ˆˆ ˆ ˆ ˆ, ' .
ˆ ˆˆ ˆ ˆ ˆ '
x q x q x q x q x q x q
In q y q y q y q out q y q y q y q
z q z q z q z q z q z q
s p k s p k
s p k s p k
s p k s p k
O O (2.2.4)
1
,in q
O operates on -1qE in global coordinates and calculates the -1
ˆˆ ˆ{ , , }q q qs p k basis
components for the incident light,
, -1
, -1
0
s q
p q
E
E
, which is a projection of -1qE onto the incident
local coordinates. ,out qO rotates the surface local vectors ˆˆ ˆ{ , , }q q qs p k back to the global
coordinates ˆ ˆ ˆ{ , , }x y z .
Reflection and refraction at dielectric, metal, and multilayer coated interfaces are
described in terms of ˆ ˆ{ , }s p components. qP for a refraction or reflection can be derived
by using ,t qJ and ,r qJ , which are defined in local s and p basis, and Eq. (2.2.4)
34
, , , ,
, , , , , ,
0 0 0 0
0 0 and 0 0 ,
0 0 1 0 0 1
s t q s r q
t q p t q r q p r qJ J
(2.2.5)
Subscript t indicates refraction, r indicates reflection, s for s-polarization and p for p-
polarization. , , , ,,s t q p t q are s and p-amplitude transmission coefficients and
, , , ,,s r q p r q
are reflection coefficients. For an uncoated interface between two isotropic media, the
coefficients are calculated from the Fresnel equations. For coated interfaces, the
coefficients are calculated from multi-layer coating calculations18,19,20
. The polarization
ray tracing matrix for refraction or reflection is
1
, ,q out q q in q
P O J O . (2.2.6)
The Jones matrices for gratings, holograms, sub-wavelength gratings, and other
non-isotropic interfaces have off-diagonal elements. The qP matrix for a ray intercept is
11 12
1
, , 21 22
0
where 0
0 0 1
q out q q in q q
j j
j j
P O J O J , (2.2.7)
where qJ represents the Jones matrix for the interaction in a local coordinate system.
, , and in q out qO O transform the local coordinate basis vectors associated with qJ into the
global coordinates. Different diffraction orders would naturally have different qJ .
For interactions that don’t change the ray direction, for example sheet polarization
elements, the surface local coordinates are arbitrarily chosen to be perpendicular to the
propagation vector and Eq. (2.2.7) becomes
35
1
, ,q in q q in q
P O J O . (2.2.8)
2.2.1 Retarder Polarization Ray Tracing Matrix Examples
The matrix P is a polarization ray tracing matrix for a single ray since orthogonal
transformation matrices as in Eq. (2.2.4) are different for each ray (unless the beam of
light is collimated and all surfaces are plane surfaces). The P matrix in Eq. (2.2.7) is not
only dependent on the corresponding Jones matrix but also on the propagation vector.
qJ -1
ˆqk ˆ
qk Pq matrix
- / 4
/ 4
0
0
i
i
e
e
0
0
1
0
0
1
- / 4
/ 4
0 0
0 0
0 0 1
i
i
e
e
- / 4
/ 4
0
0
i
i
e
e
0
1
0
0
1
0
- / 4
/ 4
0 0
0 1 0
0 0
i
i
e
e
- / 4
/ 4
0
0
i
i
e
e
1
0
0
1
0
0
- / 4
/ 4
1 0 0
0 0
0 0
i
i
e
e
Table 2.1 Polarization ray tracing matrix for a horizontal fast axis linear quarter
wave retarder without beam deviation for three different propagation directions,
along z-axis, y-axis, and x-axis. The Jones matrices are specified in a symmetric
phase convention where the fast axis polarization state is advanced by an eighth of a
wave and the slow axis is delayed by an eighth of a wave.
Because of the many subtleties involved in transforming between Jones and
polarization ray tracing matrices, several example P matrices are provided. Since P is
unique for each ray, P can be different for the same optical element (i.e. Jones matrix)
depending on the ray’s propagation direction. Table 2.1 shows P matrices for a quarter-
36
wave retarder with different propagation vectors at normal incidence. The Jones matrix
for a quarter-wave retarder oriented along x- and y-axes is - / 4
/ 4
0
0
i
i
e
e
. The
corresponding P matrix for a ray propagating along the z-axis is different from P for a ray
propagating along the y-axis or x-axis.
Note that Pq for a ray propagating along the z-axis is same as the Jones matrix
padded with zeros and a single one. But in general, P matrices are different from Jones
matrices. Pq’s in Table 2.1 relate the phase to the corresponding component of the
electric field in global coordinates.
For a more complicated example, the P matrix for a quarter wave linear retarder
with a fast axis along {1,0,0} , surface normal {0,sin ,cos } and propagation vector
parallel to the surface normal vector will be
- / 4
/ 4 2 2 / 4
/ 4 2 / 4 2
0 0
0 cos + sin cos sin (1- ) .
0 cos sin (1- ) cos + sin
i
i i
i i
e
e e
e e
(2.2.9)
Since -1ˆ
qk is parallel to the surface normal vector of the retarder
ˆ {0,sin ,cos }q k . Eq (2.2.9) advances eigenpolarization {1,0,0} by / 4 and retards
eigenpolarization {0,cos , -sin } by / 4 .
37
2.3 Conclusion
These three-by-three polarization ray tracing matrices perform ray tracing in global
coordinates, which provide an easy basis to interpret polarization properties for most
systems. The apparent rapid variation of polarization states and properties around local
coordinate singularities is avoided. It remains straightforward to convert results into
other local coordinate bases.
Formalism for polarization ray tracing using three-by-three matrices has been
demonstrated. The relationship to the Jones calculus has been shown. Algorithms for
reflection, refraction, and polarization elements are summarized with specific examples.
38
CHAPTER 3 CALCULATION OF DIATTENUATION
3.1 Diattenuation Calculation using Singular Value Decomposition
One objective of a polarization ray trace is to understand the polarization properties
associated with the polarization changes induced by the optical system. Analyzing the
polarization properties of Jones matrices is well established in the literature21, 22, 23
. This
section derives an algorithm for the diattenuation associated with P. Here, it is assumed
the ray begins and ends in air or vacuum with refractive index of one; these results are
readily generalized to other object and image space refractive indices.
The diattenuation, D , depends on the difference of the maximum Imax and
minimum Imin intensity transmittances considered over all incident polarization states as
max min
max min
and 0 1+
I I
I I
D D . (3.1.1)
An ideal polarizer has diattenuation equal to one; one incident polarization state is
completely discarded.
Eigenvectors of the P matrix do not generally represent polarization states
because in general light rays enter and exit in different directions24
. The diattenuation of
the P matrix can be calculated by using the singular value decomposition (SVD)25, 26
.
* * *, ,1 ,2 ,0 ,0 ,0
† * * *
, ,1 ,2 1 ,1 ,1 ,1
* * *
2 ,2 ,2 ,2, ,1 ,2
ˆ ˆ ˆ ˆ1 0 0
ˆ 0 0
ˆ 0 0
x Q x x x y z
y Q y y x y z
x y zz Q z z
k u u k k k
k u u v v v
v v vk u u
P U DV , (3.1.2)
39
which decomposes P into unitary matrices and U V , and a diagonal matrix D. The
diagonal elements of D are the non-negative real singular values i
27
1 2 0 (3.1.3)
and † indicates Hermitian adjoint
†V
*( )T V . (3.1.4)
The diagonal elements of D are the singular values of P. Since P was
constructed such that -1ˆ ˆ
q qPk k (Eq. (2.1.7)), one of P’s singular values is always one,
and the associated column of V is 0k . Eq. (3.1.2) places the one in the first column.
The other two columns of V , 1 2andv v , are two special polarization vectors in the
incident transverse plane that generate the maximum and minimum transmitted flux.
Similarly, the columns of U are the exiting propagation vector ˆQk and two orthogonal
polarization vectors 1 2andu u in the exiting transverse plane.
The relationship between P, its singular values and these special polarization
vectors, is
1 1 1 2 2 2 0ˆ ˆ, , and .Q P v u P v u Pk k (3.1.5)
1 2andv v are in general the only two orthogonal incident polarization states that remain
orthogonal when they emerge from P as 1 2andu u . So these orthogonal states form a
canonical basis for incident and exiting polarization states.
40
An arbitrary normalized incident polarization state E can be expressed as a linear
combination of 1 2andv v as
1 2, E v v (3.1.6)
where and are complex, 2 2| | | | 1 .
The transmitted electric field vector after P is P E . Therefore the flux of the
transmitted electric field is
2| |TransI PE † †E P PE (3.1.7)
Using Eq. (3.1.5) and (3.1.7), it can be shown that the flux of the transmitted
electric field is
2 2 2 2 2 2 2 2
1 2 1 2 2| | | | | | ( )TransI . (3.1.8)
Since both 2 2 2 2
1 2 2| | ( ) and are positive since 1 2 by construction, the
maximum intensity transmittance occurs when the incident state is v1, and the minimum
intensity occurs when the incident state is v2 i.e.
2 2
max 1
2 2
min 2
if | | 1
if | | 0Trans
II
I
, (3.1.9)
for any polarization ray tracing matrix P . Thus the diattenuation of P is
2 2
1 2
2 2
1 2
D , (3.1.10)
41
and 1 max 2 min and v v v v are the incident polarization states for which P gives the
maximum and minimum transmittance. 1P v and
2P v are the corresponding exiting
polarization states.
For the case when the beam is undeviated ( 0ˆ ˆ
Qk k ) the two unitary matrices, U
and V , have the same eigenstates; thus the diattenuation can be calculated from the
eigenvalues of the P matrix as well as from Eq. (3.1.10).
3.2 Example – A Hollow Corner Cube
A hollow aluminum coated corner cube provides an example of an inhomogeneous
polarization component, an element in which the diattenuation and retardance are not
aligned. Corner cubes are commonly used as retroreflectors, and their polarization
properties have been studied in several manuscripts28, 29, 30
.
As shown in Figure 3.1, the hollow corner cube consists of three mutually
perpendicular aluminum surfaces. A refractive index of 0.77 + 6.06i is assumed for
aluminum at 500nm. There are six different ray paths for a collimated beam of light that
enters the corner cube31
. Figure 3.1 shows one of the ray paths with a set of propagation
vectors (black arrows). The incident and the exiting propagation vectors are anti-parallel.
42
Figure 3.1 Top view of a hollow aluminum coated corner cube. Three surfaces of
the corner cube are perpendicular to each other.
Our example ray is incident along the symmetry axis, 0
ˆ {0,0,-1}k . Figure 3.2
shows the corner cube with the propagation vectors in black, s-local coordinate vectors in
solid red and p-local coordinate vectors in dashed blue. The figure shows how the local
coordinate bases change as the ray propagates through the corner cube.
43
Figure 3.2 Top and side views of a corner cube. Propagation vectors are shown in
solid black, local s coordinate vectors in solid red, and local p coordinate vectors in
dashed blue.
The reflecting surface configuration is specified in Table 3.1 by the surface
normal vectors, Pq’s, and the various vectors for each ray intercept. Pq is calculated using
Eq.(2.2.4), (2.2.6), and the Fresnel equations for aluminum.
44
1ˆ
qk ˆqk ˆ
qp ˆqp ˆ
qs Surface
Normal
qP
1 0
0
-1
-2 2
3
0
-1
3
-1
0
0
-1
3
0
2 2
3
0
-1
0
2
3
0
-1
3
0.26 0.16 0 0.94
0 -0.96 - 0.18 0
-0.75- 0.46 0 0.33
i
i
i
2 -2 2
3
0
-1
3
- 2
3
2
3
1
3
-1
6
3
2
2
3
5
6
1
2 3
2
3
-1
2 3
1
2
2
3
-1
6
-1
2
-1
3
0.25- 0.08 0.44 0.33 0.7 0.24
-0.95- 0.05 -0.04 0.08 0.23 0.14
-0.15 0.72 0.27 -0.57 - 0.01
i i i
i i i
i i
3 - 2
3
2
3
1
3
0
0
1
1
6
-1
2 3
2 2
3
1
2
- 3
2
0
- 3
2
-1
2
0
-1
6
1
2
-1
3
-0.65- 0.09 -0.53- 0.15 0.37 0.23
-0.53- 0.15 -0.04 0.08 -0.65- 0.4
-0.47 0.82 0.33
i i i
i i i
Table 3.1 Propagation vectors, local coordinate basis vectors, surface normal
vectors and polarization ray tracing matrices associated with a ray path through
aluminum coated hollow corner cube.
The net polarization ray tracing matrix Pcc (cc for corner cube) for this ray path is
calculated by cascading the three P matrices in Table 3.1,
3 2 1
0.39 + 0.78 0.01+ 0.02 0
-0.02 0.40 + 0.78 0 .
0 0 -1
cc
i i
i i
P P P P (3.2.1)
The singular value decomposition of ccP gives
45
0 0.63+ 0.15 0.74 0.17
0 0.37 - 0.66 -0.32 0.57 ,
1 0 0
1 0 0
0 0.88 0 ,
0 0 0.87
0 0.43- 0.49 0.47 - 0.6
0 -0.41- 0.64 0.38 0.52 .
-1 0 0
cc
cc
cc
i i
i i
i i
i i
U
D
V
(3.2.2)
As shown in Eq. (3.1.2), and cc ccV U have the incident and exiting propagation
vectors as their first columns. Table 3.2 lists the maximum and minimum intensity
transmittances assuming the incident electric field’s intensity is one and the diattenuation
of the corner cube is calculated from the singular values of the ccP matrix.
maxI 1v
minI 2v D
0.774 -0.85 1.30
-2.15 -2.15
0.65 0.65
0.76 0.76
0 0
i i
i i
e e
e e
0.757 -0.91
0.94 -0.91 1.85
0.76 0.76
0.65 0.65
0 0
i
i i i
e
e e e
0.014
Table 3.2 The maximum intensity of output and associated incident electric field, the
minimum intensity of transmitted electric field and associated incident electric field,
and the diattenuation from the ray through the corner cube system.
Figure 3.3 shows the polarization states associated with the maximum and the
minimum intensity transmittances; the last two columns of and cc ccV U represent two
incident polarization states ( 1 2,v v ) and exiting states ( 1 2,u u ) with the maximum and the
minimum intensity transmittances. 1 2 1 2, and ,v v u u are elliptically polarized, so this path
46
through the corner cube acts as a weak elliptical diattenuator with a diattenuation of
0.014. 1 2 and v v are the only pair of two orthogonal incident polarization states which
remain orthogonal upon exit. Incident polarization states ( ) are defined in the
incident local coordinates and exiting polarization states ( ) are defined in the
exiting local coordinates.
(a) (b)
Figure 3.3 (a) The exiting state from the corner cube with the maximum intensity
and corresponding incident state. (b) The state with the minimum intensity and
corresponding incident state. All represented in local coordinates where 2D
polarization vectors are defined. Each propagation vector comes out of the page.
Using local coordinate systems in describing polarization vectors with opposite
propagation directions complicates the discussion of the polarization state and its
transformation. In global coordinates, as shown in Figure 3.4, the direction of rotation of
the electric field is in the same direction for the corresponding incident and exiting states;
1 2,v v
1 2,u u
47
1 1 and v u have the same direction of rotation and 2 2 and v u have the same direction of
rotation due to the anti-parallel propagation vectors, 0 3
ˆ ˆ and k k .
(a) (b)
Figure 3.4 (a) The exiting state from the corner cube with the maximum intensity
and corresponding incident state. (b) The state with the minimum intensity and
corresponding incident state. All states are represented in global coordinates (x-y
plane) looking into the corner cube; propagation vectors are anti-parallel for the
incident and exiting electric field vectors.
3.3 Conclusion
The calculation of the diattenuation is achieved via the singular value decomposition.
Unitary matrices provide two canonical polarization states that are orthogonal to each
other and related by the singular values. The incident propagation vector and the exiting
propagation vector are related by a singular value of unity due to Eq. (2.1.7). The method
was illustrated on a hollow aluminum corner cube.
48
CHAPTER 4 CALCULATION OF RETARDANCE
4.1 Introduction
When describing rays propagating through optical systems, the effects of coordinate
system changes on refraction can masquerade as circular retardance; this is shown in
Section 4.3.1. Similarly, coordinate system changes on reflection can masquerade as a
half wave of linear retardance. This section’s objectives are: (1) explore the local
coordinate transformation associated with parallel transport of transverse vectors along
ray paths through optical systems, and (2) present an algorithm for the calculation of
proper retardance in polarization ray tracing using the three-by-three polarization ray-
tracing calculus13
. This algorithm separates the part of the polarization ray-tracing matrix
that describes proper retardance from the part that describes non-polarizing rotations.
Examples highlight the associated subtleties.
The term “retardance” refers to a physical property by which optical path length
accumulation depends on the incident polarization state. The classic retarder is a
crystalline waveplate that divides a beam into two modes having two distinct polarization
states and optical path lengths 1, 32, 33
. The retardance, measured in radians, is the phase
difference that accrues corresponding to that optical path difference, a difference in
transit time. For more complex optical systems the optical path length can be multi-
valued (such as for sequences of crystals) or even undefined (such as for thin-film coated
surfaces). The Jones matrix has two eigenpolarizations. When the eigenpolarizations are
49
orthogonal, the retardance is calculated to be the difference in the phases of the
associated eigenvalues. For the case of inhomogeneous elements, which have non-
orthogonal eigenpolarizations, the element is expressed as a product of a pure retarder
with a pure diattenuator, and the retardance is well-defined to be that of the pure retarder,
as described by Lu21
. Often skew rays through optical systems are slightly
inhomogeneous.
The action of a retarder on polarization states can be depicted by the rotation of
the Poincaré sphere by the ideal retarder; the rotation angle is the retardance and the
Poincaré rotation axis identifies the fast and slow axes34
. Knowledge of the distribution
of an optical system’s retardance provides a partial description of the polarization
dependence of the exiting wavefronts35
.
There are complications in extending the concept of retardance to three-
dimensional polarization ray trace matrices. Since the entering and exiting rays need not
be collinear, the eigenpolarizations of the polarization ray tracing matrix may not
represent actual electric field states. This can be solved by dropping back into a local
coordinate system, but then the calculated retardance will depend on the local coordinate
system selected. Section 4.4 presents well-defined retardance calculation algorithms for
the three-by-three polarization ray tracing matrix, which require using the entire set of
propagation vectors associated with all ray segments.
50
4.2 Purpose of the Proper Retardance Calculation
The polarization dependent phase change associated with a ray path through an optical
system has two components: (1) the proper retardance; the phase retardation (optical
path difference) arising from physical processes, such as propagation through birefringent
materials or reflection or refraction from a surface, and (2) a geometric transformation
due to the local coordinate selection used for determining the phase. Figure 4.1 is a
graphical representation of a Jones matrix polarimeter performing a calibration run in air.
By rotating the polarization state analyzer (PSA) by θ, the exiting local coordinates for
the Jones matrix also rotate by θ. Therefore, the polarimeter measures this empty
compartment as a circular retarder with retardance of 2θ. Rotating the exiting local
coordinates does not introduce an optical path difference between right and left circularly
polarized light. Here the “proper” retardance is zero. The retardance value measured by
polarimeters depends on the relative choice of incident and exiting local coordinates.
Figure 4.1 A polarimeter measuring a sample retarder with the polarization state
analyzer (PSA) (a) aligned with the polarization state generator (PSG) and (b)
rotated to an arbitrary orientation. By rotating the PSA, the exiting local
coordinates for the Jones matrix are also rotated. The measured retardance of the
51
sample now includes a “circular retardance” component of 2 as well as the proper
retardance.
Our goal is to develop a retardance calculation algorithm which separates the
geometric transformation, an “optical activity-like” geometric rotation and/or inversion,
from the proper retardance. A parallel transport matrix, Q, described in Section 4.3.3,
identifies canonical pairs of local coordinate systems for general sequences of ray paths,
thus characterizing the geometric transformation. Q is the tool which separates
retardance from geometric transformation.
4.3 Geometrical Transformations
The local coordinates which are necessary to specify Jones vectors propagating in
arbitrary directions may be rotated and/or inverted between object and image space10
in
ways which are not associated with any retardance. This section provides a definition of
a parallel transport matrix and describes how it keeps track of geometric transformations
of the local coordinates.
4.3.1 Local Coordinate Rotation: Polarimeter Viewpoint
Jones matrices are defined with respect to local coordinates in the transverse plane; one
set is associated with the incident Jones vector and another with the exiting Jones vector.
Retardance of a Jones matrix is calculated from eigenvalues of the matrix
1 2arg( ) arg( ), (4.3.1)
52
where 1 1 1J w w and
2 2 2J w w36
.
First, consider a Jones matrix polarimeter measurement of an empty compartment,
which has the identity Jones matrix. If the exiting local coordinates are rotated by an
angle θ with respect to the incident local coordinates by rotating the PSA by θ, the
measured Jones matrix is a rotation matrix instead of the identity matrix,
cos -sin 1 0 cos -sin
( ) ( )sin cos 0 1 sin cos
J R I , (4.3.2)
with eigenvalues
1 2exp( ), exp(- )i i (4.3.3)
and right and left circularly polarized polarizations
1 2
1 1, .
-i iw w
(4.3.4)
Note the similarity of ( )J to the form of a circular retarder. Unless the exiting
local coordinate orientation is parallel to the incident local coordinate orientation, a non-
polarizing element, which should be described by the identity matrix, appears to have a
“circular retardance” of
1 2arg( ) arg( ) 2 (4.3.5)
where is the local coordinate rotation. This is an example of the geometric
transformation, a result of the choice of local coordinates in the description of the Jones
matrix.
53
Now consider a retarder Jones matrix measured by this polarimeter with rotated
PSA. Eq. (4.3.6) shows the measured Jones matrix, ( )J from this polarimeter
11 12
21 22
cos sin( )
sin cos
j j
j j
J . (4.3.6)
Retardance calculated from Eq. (4.3.1) and (4.3.6) changes as the rotation angle θ
changes. For circular retarders with retardance of δ, the measured retardance from the
polarimeter as a function of the PSA angle is
2arg(cos( ) | sin( ) |)2 2
Measured i
. (4.3.7)
Figure 4.2 is the plot of Eq. (4.3.7) for different retardance ( ) of the circular
retarder in [0, 2 ] in different colors. For a half wave circular retarder ( ),
retardance is independent of the PSA orientation and remains as a half wave of retardance
as shown as a blue line in the figure.
Figure 4.2 Measured retardance values as a function of rotation angle θ for circular
retarders with retardance between zero and one wave of retardance are shown in
54
different colors. Retardance of a half wave circular retarder is independent of the
PSA orientation.
For a linear retarder with retardance of , retardance as a function of the PSA
angle is
21-(cos cos )22Arctan( )
cos cos2
Measured
. (4.3.8)
Figure 4.3 shows the plot of Eq. (4.3.8) different retardance ( ) of the linear retarder in
[0, 2 ] in different colors. Again, retardance of half wave linear retarders is independent
of the PSA orientation.
Figure 4.3 Measured retardance values as a function of rotation angle θ for linear
retarders with retardance between zero and one wave of retardance are shown in
different colors. Retardance of a half wave linear retarder is independent of the
PSA orientation.
55
4.3.2 Parallel Transport of Vectors
We now explore this geometric transformation for skew ray paths through optical
systems using parallel transport of local coordinate vectors. Parallel transport of a vector
over a sphere is the process of moving the vector along a series of great circle arcs such
that the angle between the vector and each arc is constant. At a vertex where the path
transitions from the first to the second arc (and so on), the angle at the vertex is
maintained along the second arc, and so on for the entire series of arcs.
Consider tracing a single skew ray through an optical system with N interfaces.
First we consider a ray where the incident propagation vector ˆincidentk (first ray segment)
is parallel to the exiting propagation vector ˆexitk (last ray segment), but the skew ray has
changed its propagation direction ˆqk many times while traversing the optical system.
This set of ˆqk can be represented as points on the unit sphere connected by great circle
arcs. Since ˆ ˆincident exitk k , the arcs form a closed spherical polygon. Let a set of
orthogonal local coordinate vectors be defined in the transverse plane for the first ray
segment. When this set of local coordinates is carried through the system by parallel
transport, it rotates by an angle in radians equal to the spherical polygon’s solid angle37
.
This rotation is equivalent to the Pancharatnam phase38,39
or the Berry phase40
.
This rotation is shown in Figure 4.4 (a), which depicts the parallel transport of an
arbitrarily selected pair of incident local basis vectors ˆ ˆ{ , }A Ax y (solid green) through an
56
optical system where the propagation vector changes its direction from point
A B C D . Only one of the pair of basis vectors is labeled in Figure 4.4. The
exiting local basis vectors ˆ ˆ{ , }D Dx y (dashed red) are rotated from ˆ ˆ{ , }A Ax y by / 2
radian, equivalent to the solid angle of / 2 steradians of the associated spherical
triangle.
Figure 4.4 (a) The evolution of a local coordinate pair ˆ ˆ{ , }A Ax y (green) through a
system of three fold-mirrors. The exiting local coordinates (dashed red) undergo a
90° rotation from the initial local coordinates (solid green). (b) A three fold-mirror
system. When a collimated beam enters the system along the z-axis the beam exits
along the z-axis.
A simple example of parallel transport and the Pancharatnam phase is the three
fold-mirror system shown in Figure 4.4 (b). Three mirrors are aligned so that the angle of
incidence for a collimated beam at each mirror is 45°. Let each reflection be an ideal
non-polarizing reflection so that the incident polarization ellipse enters and exits the
57
optical system with the same ellipticity. The incident propagation vector 0k is {0,0,1}
and after three reflections, the exiting propagation vector 3k is also {0,0,1} where
1ˆ {1,0,0}k and
2ˆ {0,1,0}k . Thus for this system, propagation vectors are mapped to
points A, B, and C on a unit sphere. One might naively select the same local coordinates
for the incident space and exiting space since 0 3
ˆ ˆk k for this system. However, this ray
path has a geometric transformation of a 90° rotation, the solid angle subtended by the
spherical triangle ABC. Thus if the initial local coordinates and the exiting local
coordinates for a Jones matrix are chosen to be parallel to each other, the system appears
to have “circular retardance”.
Associated with this rotation of the local coordinates, incident x-polarized light
(solid green) in Figure 4.4 (b) exits the system as y-polarized light (dashed red), even
though in this model the mirrors are non-polarizing. If this non-polarizing system is
measured by a polarimeter with parallel polarization state generator and polarization state
analyzer, the polarimeter will measure 180° of circular retardance and inversion resulting
from the odd number of reflections, the retardance arising purely from the local
coordinate transformation. To measure the proper retardance, the analyzer should be
rotated 90° from the generator.
4.3.3 Parallel Transport Ray Tracing Matrix, Q
The parallel transport matrix qQ at qth
ray intercept is defined as a real unitary three-by-
three ray tracing matrix, calculated by assuming that each ray intercept is non-polarizing.
58
qQ for refraction is a rotation about -1ˆ ˆ
q qk k (the s vector) by the angle the ray is
deviated which is equivalent to sliding basis vectors around a unit sphere via parallel
transport of vectors. qQ for refraction rotates all incident polarization ellipses transverse
to -1ˆ
qk to the same ellipses transverse to ˆqk .
qQ for reflection is an inversion with
respect to the reflecting surface which is normal to -1ˆ ˆ
q qk k . Other interactions such as
diffraction and scattering can use either reflection or refraction qQ matrices depending
on the geometry of the system; if both -1ˆ ˆ and q qk k are in the same medium, the reflection
algorithm is used and if the two propagation vectors are in different media, the refraction
algorithm is used. If an incident electric field vector is aligned with a basis vector, a
sequence of qQ ’s performs the vector parallel transport into the corresponding exiting
basis vector. Other than this function, qQ has no polarization effects.
The cumulative parallel transport ray tracing matrix for a ray through a system
with N interfaces is
1
-1 2 1
,-1
Total q N N q
q N
Q Q Q Q Q Q Q . (4.3.9)
Depending on the number of reflections in the system, , ,ˆˆ ˆ{ , , }L q L q qx y k can form
either a right-handed coordinate system or a left-handed coordinate system. Figure 4.5
shows the incident ( ,0 ,0ˆ ˆ{ , }L Lx y ), reflected ( , ,1 , ,1
ˆ ˆ{ , }L r L rx y ), and transmitted ( , ,1 , ,1ˆ ˆ{ , }L t L tx y )
local coordinate pairs calculated from 1Q where the incident local coordinates are
59
0 1,0 ,0 0 ,0
0 1
ˆ ˆˆˆ ˆ ˆ, .
ˆ ˆ| |L L L
k kx y k x
k k (4.3.10)
Figure 4.5 Incident, reflected, and transmitted local coordinates calculated from
parallel transport matrices. ,0 ,0 0ˆˆ ˆ{ , , }L Lx y k are the right handed incident local
coordinates, , ,1 , ,1 ,1ˆˆ ˆ{ , , }L r L r rx y k are the left handed reflected local coordinates, and
, ,1 , ,1 ,1ˆˆ ˆ{ , , }L t L t tx y k are the right handed transmitted local coordinates.
Note that ,0 ,0 0ˆˆ ˆ{ , , }L Lx y k and , ,1 , ,1 ,1
ˆˆ ˆ{ , , }L t L t tx y k form a right-handed set while
, ,1 , ,1 ,1ˆˆ ˆ{ , , }L r L r rx y k form a left-handed set. Subscript r stands for reflection and t for
transmission. For reflection and transmission ,0ˆ
Lx is the s-basis vector and ,0ˆ
Ly is the p-
basis vector.
60
For birefringent interfaces, grating diffraction, and other similar cases, ,0
ˆLx and
,0ˆ
Ly do not generally correspond to s and p. Figure 4.6 shows the s and p vectors in
object space and their geometric transformation along each ray segment using a set of
qQ .
Figure 4.6 A two mirror system. The red solid lines show the s vector at the first
mirror and its geometric transformation along each ray segment using Q. The blue
dashed lines show the p vector in object space and its geometric transformations.
The explicit formula for refraction qQ is
2 2
2 2
2
2 2
(1 ) [(1 ) ] [(1 ) ]1
[(1 ) ] (1 ) [(1 ) ]
[(1 ) ] [(1 ) ] (1 )
q
A B B D F B D G D B C H
F B D L A B B F C B F HA
D B C L C B F G A B B C
Q (4.3.11)
where
-1 -1 -1
2 2 2 2 2 2 2 2 2
2 2 2 2 2 2
ˆ ˆ ˆ ˆ ˆ ˆ Norm[ ], , { , , } ,
1 1 1, , .
q q q q q qA B C D F
A B A F A B A C A B A DG H L
C F C D D F
k k k k k k
(4.3.12)
61
qQ for reflection is
2
, -1 ,
2
, -1 ,
2
, -1 ,
2( )1
2( )1
2( )1
x q x q
y q y q
q
z q z q
k k B C
A A A
k kB D
A A A
k kC D
A A A
Q
(4.3.13)
where
-1 , -1 , , -1 ,
, -1 , , -1 , , -1 , , -1 ,
ˆ ˆNorm[ ], 2( )( ),
2( )( ), 2( )( ).
q q x q x q y q y q
x q x q z q z q y q y q z q z q
A B k k k k
C k k k k D k k k k
k k
(4.3.14)
1
Total
Q reverses the geometric transformation contained within
TotalP . Therefore,
for an optical system with N interfaces,
1
,0 , , ,0
1
,0 , , ,0
1
0 0
ˆ ˆ ˆ ˆ
ˆ ˆ ˆ ˆ
ˆ ˆ ˆ ˆ
Total L L N Total L N L
Total L L N Total L N L
Total N Total N
Q x x Q x x
Q y y Q y y
Q k k Q k k
(4.3.15)
,0 ,0 0ˆˆ ˆwhere { , , }L Lx y k are the incident local coordinates and , ,
ˆˆ ˆ{ , , }L N L N Nx y k are the
parallel local coordinates in the exit space transformed by TotalQ . The vectors ,0
ˆLx and
,0ˆ
Ly here are assumed to be an arbitrary pair of orthogonal vectors in the transverse plane
of the first ray segment calculated by Eq. (4.3.10).
62
4.4 Proper Retardance Calculations
4.4.1 Definition of the Proper Retardance
The “proper retardance” or just “retardance” is the accumulation of polarization
dependent optical path difference from physical processes associated with the ray path.
Retardance is generated by mechanisms which cause a polarization dependent phase
change, such as s and p phase differences in reflection or refraction, propagation through
a waveplate, birefringent material, or diffraction grating. Retardance is invariant with
respect to the selection of local or global coordinates.
The following sections present retardance algorithms for ray paths through optical
systems that are represented by P matrices; this algorithm calculates retardance which
does not contain any geometric transformation.
4.4.2 Separating Local Coordinate Transformation from P
A method to keep track of the local coordinate transformation for a ray path through an
optical system is presented in this section.
Using the parallel transport matrix allows the retardance to be uniquely defined
despite the fact the incident and exiting propagation vectors are different. T o t a lQ (Eq.
(4.3.9)) provides a well-defined relationship between local coordinates in the two
transverse planes; if an arbitrary polarization state represented by an electric field vector
is specified along the first ray segment, the corresponding electric field vector for a non-
polarizing system is determined along the exiting ray segment. 1
Total
Q reverses the
63
geometric transformation so that the coordinate system in exit space is reverted back to
the initial coordinate system. The operation
1
Total Total Total
Μ Q P (4.4.1)
yields TotalΜ , a polarization ray tracing matrix with the exiting electric field vectors
rotated about 0
ˆ ˆNk k and/or reflected in the surface so the incident and exiting
transverse planes are parallel, and both are orthogonal to 0k . Note that
TotalΜ is not a
Mueller matrix but a three-by-three matrix. Because calculation of Q requires knowledge
of all ray segments, the proper retardance cannot be separated for an unknown “black box”
system.
4.4.3 The Proper Retardance Algorithm for P, Method 1
Calculating proper retardance consists of applying the polar decomposition to TotalΜ and
computing the eigenvalues of the unitary retarder matrix. Two methods may be used.
The polar decomposition can be applied directly to the three-by-three TotalΜ , or the two-
by-two Jones matrix can be retrieved from TotalΜ and formulas from ref[24] applied.
Here we follow the first approach. We present the second approach in the next section.
The polar decomposition of TotalΜ yields a unitary matrix and a nonnegative
definite Hermitian matrix
, , , ,Total Total R Total D Total D Total R Μ Μ Μ Μ Μ , (4.4.2)
64
where ,Total RΜ is a retarder (unitary) matrix and
, , and Total D Total DΜ Μ are diattenuator
(nonnegative definite Hermitian) matrices. The retardance of TotalΜ is the retardance of
,Total RΜ . ,Total RΜ has three eigenvectors
1 2 0
ˆ, ,v v k , (4.4.3)
and three associated eigenvalues
1 2 3, , . (4.4.4)
One of the eigenvalues, 3 1 , relates the incident propagation vector
0k to the rotated
exiting propagation vector.
The retardance ( ) is calculated from the two eigenvalues associated with the
transverse plane, 1 2 and , as the difference in their phases,
2 1arg( ) arg( ) (4.4.5)
assuming 2 1arg( ) arg( ) . The fast axis orientation is along the eigenpolarization 1v
which is defined in the object space of the system.
When TotalP is homogeneous, applying the polar decomposition to
TotalΜ is
unnecessary; TotalΜ and ,Total RΜ have the same eigenvalues and eigenpolarizations.
Thus, Eq. (4.4.5) gives the retardance of TotalP where 1 2 and are eigenvalues of TotalΜ .
65
4.4.4 The Proper Retardance Algorithm for P, Method 2
In this section the second approach is presented; a two-by-two Jones matrix is retrieved
from an inhomogeneous TotalΜ , and retardance is calculated from the Jones matrix. The
first step of retrieving a two-by-two Jones matrix is rotating TotalΜ ,
†
0
0
0 0 1
R Total
JS UΜ U . (4.4.6)
U is a unitary three-by-three rotation matrix which rotates all vectors by 1
0ˆ ˆcos ( ) k z
counterclockwise about the 0
ˆ ˆk z axis so that 0k is rotated to z . For 0
ˆ ˆ ˆˆ { , , }x y zk k kk U
is
2 2
2 2
cos + (cos - ) - sin1
(cos -1) + cos - sin
sin sin cos
x y x y x
x y x y y
x y
k k k k H k
k k k k H kH
H k H k H
, (4.4.7)
where 2 2= +x yH k k .
The upper two-by-two submatrix of RS serves as a Jones matrix J in the
following equation for the retardance of the TotalP matrix,
†
1
†
det| tr tr |
| det |2cos ( )
2 tr( ) 2 | det |
JJ J
J
J J J. (4.4.8)
66
The unitary matrix (retarder) of polar decomposed J (RJ in ref [24]) has two
eigenpolarizations1 2{ , }w w . The fast axis orientation of the retarder is along the
eigenpolarization which has the smaller eigenvalue argument since the author follows the
decreasing phase convention. These eigenpolarizations can be written as three-element
electric field vectors, which provide a canonical basis set in the incident space,
† †
1 1 2 2 0ˆ, , v U w v U w k , (4.4.9)
where 1 ,1 ,1{ , ,0}x yw w w and
2 ,2 ,2{ , ,0}x yw w w .
In exit space, the canonical basis set is
† †
1 1 1 2 2 2ˆ, , N
v Q v QU w v Q v QU w k . (4.4.10)
4.4.5 Retardance Range
Optical path difference (OPD) and retardance may assume any value between 0 and
infinity. However, just as with the Jones calculus and Mueller calculus, the retardance
algorithms in the previous section return a retardance of less than a wave. This is similar
to the phase of the electric field which is usually represented modulo 2π while the optical
path length can assume any value. Further discussion of methods to extend the
retardance calculation beyond 2π by phase unwrapping or other methods is beyond the
scope of this section41,42
and will be discussed in Chapter 7.
67
4.5 Examples
In this section simple examples are presented to elucidate retardance algorithms. All the
examples have homogeneous polarization ray tracing matrices thus eigenvalues of TotalΜ
were used directly.
4.5.1 Ideal Reflection at Normal Incidence
Consider ideal (100%) reflection from a mirror at normal incidence. Since the mirror
itself is non-polarizing, the retardance should be zero. The P matrix of this system with
the incident propagation vector along the z-axis, is
-1 0 0
0 -1 0
0 0 -1
P
. (4.5.1)
P demonstrates that x and y electric fields reflect without a differential phase change, but
the propagation vector direction flips from z to -z. The upper diagonal elements have -1
due to a π phase shift upon external reflection for x and y component of the electric field
vector. When right circularly polarized light enters,
-1 0 0 1 1
0 -1 0 - -
0 0 -1 0 0
ii e i
(4.5.2)
the same electric field vector exits, but since the propagation vector changed to {0,0,-1},
this light is left circularly polarized. Similarly when linearly polarized light enters,
68
-1 0 0 cos cos
0 -1 0 sin sin
0 0 -1 0 0
ie
(4.5.3)
the same electric field vector exits. The electric field is oscillating in the same global
plane, but in the Jones matrix local coordinates the incident angle θ is mapped into -θ. In
Eq (4.5.3) no relative phase changes have been introduced between x and y component of
the electric field.
This is very different from the standard Jones matrix for reflection43,44
, Jf =
-1 0
0 1
. Jf appears to include a relative π phase shift between x and y polarization
components. In Jf this phase shift serves two purposes: (1) it reflects right circularly
polarized light into left circularly polarized and vice versa, and (2) it changes the
orientation of incident linearly polarized light from θ to -θ, which is appropriate when
maintaining right-handed ˆˆ ˆ{ , , }L Lx y k coordinates after reflection.
In order to keep all the Jones matrix local coordinates right-handed (as shown in
Figure 4.7), the Jones matrix for reflection has to contain -1 in one of the diagonal
elements. Thus the Jones reflection matrices have the form of half wave linear retarders.
The minus sign does not indicate a physical half wave linear retardance; it indicates a
local coordinate system change.
69
Figure 4.7 An ideal reflection at normal incidence with the incident and exiting
right-handed local coordinates, ,0 ,0 ,1 ,1ˆ ˆ ˆ ˆ{ , } and { , }L L L Lx y x y . In this particular choice
of local coordinates, the ˆLx vector was flipped after the reflection.
The Q matrix for this system reveals the local coordinate transformation clearly,
1 0 0
0 1 0
0 0 -1
Q
. (4.5.4)
The minus sign is now associated with the propagation vector, where it properly belongs.
Using Eq. (4.4.5) and (4.5.4), zero retardance is calculated from an ideal retro-reflection.
70
4.5.2 Brewster’s Angle Analysis
This section extends the normal incidence reflection analysis to reflections at any angles
of incidence. Unlike reflection from normal incidence, when the p-polarized light
reflects from a non-absorbing dielectric material, a physical phase change occurs as angle
of incidence approaches Brewster’s angle; the p Fresnel reflection coefficient is zero at
Brewster’s angle. If unpolarized light is incident on a surface at the Brewster’s angle, the
reflected light is rendered linearly polarized with the electric vector transverse to the
plane of incidence; it is s-polarized. The transmitted light is partially polarized while the
reflected beam is completely polarized. Since the p Fresnel reflection coefficient changes
its sign before and after Brewster’s angle (B ), there will be π phase shift upon reflection
for the incident angle larger thanB . This change in sign of the coefficient is not
discontinuous since the p-polarization element has very small amplitude reflection
coefficient near B and becomes zero at
B .
Figure 4.8 shows Fresnel s and p reflection coefficients for external and internal
reflections as a function of angle of incidence for an air-glass (n=1.5) interface33
. The
Fresnel coefficients are defined in the incident and exiting local coordinates which are
both right-handed; at normal incidence s and p reflection coefficients have the opposite
sign due to this choice of local coordinates.
71
Figure 4.8 Fresnel external (top) and internal (bottom) reflection coefficients for s-
polarization (dashed) and p- polarization (solid) are shown as the angle of incidence
changes.
Figure 4.9 shows relative phase shifts in reflection coefficients for external and
internal reflections as a function of angle of incidence for an air-glass (n=1.5) interface.
B is the Brewster’s angle and C is the critical angle. Again this result is based on the
right-handed incident and exiting local coordinate choice.
0.5 1.0 1.5
Angle of
Incidence
1.0
0.8
0.6
0.4
0.2
0.2
Reflection
Coefficient
External Reflection
0.5 1.0 1.5
Angle of
Incidence0.2
0.2
0.4
0.6
0.8
1.0
Reflection
Coefficient
Internal Reflection
72
Figure 4.9 The relative phase shift for external (top) and internal (bottom) reflection
as a function of the angle of incidence.
Jones matrices use Fresnel reflection or refraction coefficients to describe rays
reflecting from or refracting into isotropic media. If the Jones matrix is defined in {s, p}
local coordinates, the matrix is a diagonal matrix. Thus the relative phase shifts in
Fresnel coefficients are the relative phase shifts in the Jones calculus. For angles of
incidence smaller thanB , Jones calculus shows π phase shift due to the right-handed
73
local coordinate choice before and after the reflection. This value is not the proper
retardation that electric field experiences upon reflection but is the geometric
transformation of the local coordinates described.
The retardance algorithm using the P matrix yields zero retardance for internal
reflections atB and π retardance for reflections at
B C . Following three
figures plot the normalized incident electric field vector and corresponding internal
reflected electric field vector calculated from P matrices at an air-glass interface as the
angle of incidence ( ) changes from 0º to 90º. By plotting electric field vectors in global
{x, y, z} coordinates, one can truly understand reflection.
All three figures follow the following rules: the surface normal vector of the glass
surface is along the z axis, {0, 0, -1}. All the propagation vectors lie in the x-z plane.
Each incident electric field vector (dotted line) and corresponding reflected electric field
vector (solid line) pair is plotted in the same color. The arrows represent the oscillations
of the electric field vector in time. The length of the arrow represents the amplitude of
the electric field vector. For better visualization, the first two figures show s and p-
polarization states separately and the third figure shows the combination of s and p-
polarization states.
Figure 4.10 shows s-polarized incident electric field vector (dotted) and
corresponding reflected electric field vector (solid) for the external reflection as the angle
of incidence changes. Figure 4.10 shows a side view (left) and a top view (right) of the
external reflection for better visualization. Note that the reflected electric field vector has
74
smaller amplitude than the corresponding incident electric field. The s-component of the
electric field reflects more as the angle of incidence increases.
Side View Top View
Figure 4.10 The incident s-polarized states (dotted) and reflected (solid) polarization
states as angles of incidence change from normal incidence to glazing incidence. Red
arrow indicates the reflected electric field at Brewster’s angle.
Similarly, Figure 4.11 shows p-polarized incident electric field vector (dotted) and
corresponding reflected electric field vector (solid). Since the p-polarization figures
would overlap for different angles of incidence, each incident and reflected pair has been
spatially translated for better visualization. The p-component of the electric field changes
its sign after Brewster’s angle and has zero reflectance at the Brewster’s angle. This is
the origin of π retardance for reflections at B C .
s pol reflection
s pol reflection
75
Figure 4.11 The incident p-polarized states (dotted) and corresponding reflected
polarization states (solid) as the angle of incidence changes. Red arrow indicates the
reflected electric field vector at the Brewster’s angle. At the Brewster’s angle,
Fresnel reflection coefficient for p-polarization is zero.
Figure 4.12 shows incident electric field vectors at 45º between the s and p–
polarizations (dotted) and corresponding reflected electric field vectors (solid) as the
angle of incidence changes. At the Brewster’s angle, reflection of p-polarization is zero;
the reflected electric field vector is s-polarized.
p pol reflection
p pol reflection
rp = 0 at
Brewster’s
Angle
76
Side View Top View
Figure 4.12 Incident electric field vectors at 45º between s and p-polarizations
(dotted) and corresponding reflected (solid) electric field vectors are plotted as the
angle of incidence changes. Red lines are for the incident and reflected pair at the
Brewster’s angle; the top view shows that at Brewster’s angle, the reflected light
(solid red line) is s-polarized.
In summary, polarization ray tracing matrix calculus yields zero retardance upon
reflection at normal incidence and π phase shift for B C . The retardance changes
rapidly for the angle of incidence greater than C .
4.5.3 An Aluminum Coated Three-fold Mirror System
The aluminum coated three fold-mirror system is analyzed. In Figure 4.13 two
polarization states are followed through the optical system. Each mirror is aligned so that
the collimated incident light has a 45° angle of incidence. Mirrors are coated with
aluminum with a refractive index of 0.77 6.06i assumed.
45° pol reflection45° pol reflection
Reflected
beam at
Brewster’s
Angle
77
Figure 4.13 An aluminum coated three fold-mirror system.
Table 4.1 contains the propagation vector, the polarization ray tracing matrix, and
the Q matrix for each surface. The exiting propagation vector 3k is the same as the
incident propagation vector 0k , and both are along the z-axis.
q ˆqk qP qQ
1 1
0
0
0 0 1
0 -0.947 + 0.219 0
-0.849 + 0.415 0 0
i
i
0 0 1
0 1 0
1 0 0
2 0
1
0
0 -0.849 + 0.415 0
1 0 0
0 0 -0.947 + 0.219
i
i
0 1 0
1 0 0
0 0 1
3 0
0
1
-0.947 + 0.219 0 0
0 0 -0.849 + 0.415
0 1 0
i
i
1 0 0
0 0 1
0 1 0
Table 4.1 Propagation vectors, Pq’s, and Qq’s for a ray propagating through the
aluminum coated three fold-mirror system
78
The diattenuation for this ray is 0.0285. The diattenuation of the 2nd
and 3rd
mirrors are equal but 90° apart and thus cancel each other; thus the total diattenuation is
equal to the first mirror’s contribution. For the diattenuation calculation algorithm see
Section 3.1.
The system’s P matrix and Q matrix are
0 -0.549 + 0.705 0 0 1 0
-0.365 + 0.788 0 0 , 1 0 0 .
0 0 1 0 0 1
Total Total
i
iP Q
(4.5.5)
TotalP demonstrates that x-polarized incident light exits as y-polarized and y-polarized
incident light exits as x-polarized while the incident and exiting propagation vectors are
the same.
79
Figure 4.14 Local coordinate transformation using Qq's for the three fold-mirror
system. Incident ,0
ˆLx state (solid red) exits as ˆ-x polarized and incident
,0ˆ
Ly state
(dashed blue) exits as y polarized after three reflections due to the geometric
transformation.
Figure 4.14 shows how each reflection transforms the incident local coordinates,
,0 ,0ˆ ˆ{ , }L Lx y (red and blue arrows). The incident local coordinates are
,0 ,0 0ˆˆ ˆ ˆ ˆ ˆ{ , , } {- , , }L Lx y k y x z (4.5.6)
and the corresponding local coordinates in exit space are
,3 ,3 3ˆˆ ˆ ˆ ˆ ˆ{ , , } {- , , }L Lx y k x y z (4.5.7)
80
where ˆ ˆ ˆ{ , , }x y z are global coordinates. Thus one proper pairing of Jones matrix basis
vectors between entrance and exit space would be ˆ ˆ{- , }y x and ˆ ˆ{- , }x y , a result of the 90°
rotation after the parallel transport of the initial local coordinates through the system as
shown in Figure 4.4 (a) and an inversion from an odd number of reflections.
Note that the exiting local coordinates, ,3 ,3 3ˆˆ ˆ{ , , }L Lx y k are left-handed;
,0 ,0ˆ ˆ
L Lx y
in the incident space gives 0k but ,3 ,3
ˆ ˆL Lx y in the exiting space gives
3ˆ-k . Local
coordinates transformed by Q change handedness if a system has an odd number of
reflections and maintain their handedness for an even number of reflections.
,3 ,3 3ˆˆ ˆ{ , , }L Lx y k are the proper set of local coordinates for the polarization state
analyzer (as described in Section 4.3.1) for measuring the proper retardance of the system.
Multiplying TotalP by 1
Total
Q cancels the geometric transformation .
TotalΜ of the system
is
1
-0.365 + 0.788 0 0
0 -0.549 + 0.705 0
0 0 1
Total Total Total
i
i
Μ Q P . (4.5.8)
Since TotalP is homogeneous, the retardance of the system is found by calculating
eigenvalues of the TotalΜ . The eigenvalues of Eq. (4.5.8) are
2.005 2.232
1 2 30.868 , 0.8938 , 1i ie e (4.5.9)
and the eigenpolarization states associated with the eigenvalues are
81
1 2 3 0
ˆ{1,0,0}, {0,1,0}, {0,0,1}. v v v k (4.5.10)
The retardance of the system is
2 1arg( ) arg( ) 0.227 (4.5.11)
with the fast axis orientation along the global x -axis.
This value does not contain any effects from the geometric transformation.
Similar to the cancelation of diattenuation, this proper retardance is equal to the first
mirror’s contribution, since the last two mirror retardances cancel.
The retardance calculated from the Jones matrix of the first mirror is
- 0.455 2.914
1 2 2 10.945 , 0.972 arg( ) arg( ) 3.369 193.0i ie e (4.5.12)
with the fast axis orientation along the ,0Ly -axis which is the global x -axis. The
retardance from the Jones calculus and the one from the polarization ray tracing matrix
differ by π since Jones calculus uses right-handed local coordinates for data reduction.
4.6 Conclusion
This section presented a critical analysis of retardance. To calculate the true polarization
dependent phase change, the retardance, the geometric transformation needs to be
removed. P describes all polarization state changes due to diattenuation, retardance, and
geometric transformations. The parallel transport matrix Q describes the associated non-
polarizing optical system and thus keeps track of the geometric transformation.
82
1Μ Q P is a fundamental equation for calculating retardance without spurious circular
retardance arising from a poor choice of local coordinates. Μ also clarifies the meaning
of the troublesome minus sign in the Jones matrix for reflection.
The difference in eigenvalue arguments of RΜ , the unitary part of the polar
decomposed Μ , gives proper retardance. However, proper retardance cannot be
assigned to a black box whose ray propagation vectors are unknown.
83
CHAPTER 5 POLARIZATION ABERRATION
A polarization aberration is variation of the polarization properties of an optical system
with wavelength, pupil, and object coordinates. Rays propagating through systems with
optical coatings and birefringent materials experience diattenuation, polarization
dependent transmission, and retardance, polarization dependent optical path length. In
this section two ways of representing polarization aberration- Jones pupil and
polarization aberration function- are contrasted.
5.1 Jones Pupil
Many optical systems with no depolarizing elements or scattering elements can be
described by a Jones matrix. The Jones matrix of the system is a function of pupil and
field coordinates as shown in Figure 5.1. For a given field point, the Jones matrix as a
function pupil coordinates is called a Jones pupil.
Figure 5.1 A field vector on the image plane and the exit pupil vector.
84
When a grid of rays gets traced through a system and reaches the exit pupil
surface, each ray’s Jones matrix gets calculated, and a grid of Jones matrices is obtained.
Each Jones matrix is associated with a particular pupil coordinate; the grid of Jones
matrices is the Jones pupil for that object. The Jones pupil is usually decomposed into
apodization pupil, wavefront aberration pupil, diattenuation pupil, and retardance pupil.
Apodization and wavefront pupils are scalar functions, and diattenuation and retardance
pupils have magnitude and orientation45
.
Each ray’s Jones matrix is defined in its exit pupil local coordinates; each Jones
matrix on the Jones pupil is defined on a different transverse plane perpendicular to the
propagation vector of the corresponding ray.
Jones pupil at the edge of field of USA patent 289650646
from Code V47
library is
calculated and the lens parameters are in Appendix A. The system has f/1.494, maximum
FOV of 32º and is rotationally symmetric. Figure 5.2 shows the layout of the system
with seven lens elements; three field angles are shown.
85
Figure 5.2 Optical system layout of USA patent 2896506 from Code V library has
seven lens elements. The system is defined with three field angles.
Figure 5.3 shows two types of wavefront aberration maps at the exit pupil of the
USA patent 2896506 for the on-axis field. For the on-axis field, the dominant wavefront
aberration is the spherical aberration. This figure was generated by Code V.
86
Figure 5.3 Wavefront aberration maps at the exit pupil using Code V are plotted for
the on-axis field. For on-axis field, 0.3 waves of spherical aberration is the dominant
aberration.
Figure 5.4 shows amplitude part of Jones pupil at the edge of field of the system.
The shape of the pupil is due to vignetting. As the rays propagate through this high NA
and wide FOV optical system, each ray experiences different Fresnel refraction
coefficients. Thus, the diagonal elements have amplitude variation across the pupil.
Systems with only paraxial rays have zero off-diagonal components. Non-zero off-
diagonal components of Jones pupil, as in Figure 5.4, show that the y-component of the
incident electric field effects the x-component of the exiting electric field and the x-
component of the incident electric field effects the y-component of the exiting electric
field.
Waves
-0.200
.01087
-.0945
WAVEFRONT ABERRATION
USA PATENT 2896506 AZUMA
Field = ( 0.000, 0.000) DegreesWavelength = 656.0 nmDefocusing = 0.000000 mm
87
Figure 5.4 Amplitude part of Jones Pupil at the edge of field of USA patent 2896506
shows variation across the pupil as well as off-diagonal components, which often
appear for systems with high NA.
The exit pupil local coordinates where the Jones pupil is defined is shown in Figure 5.5.
Red arrows are the local x vectors in the exit pupil and the blue arrows are the local y
vectors. Depending on the local pupil coordinates, Jones Pupil changes its values.
88
Figure 5.5 Jones pupil local x and y coordinates.
Wavefront aberration maps at the edge of the field calculated from Code V are
shown in Figure 5.6. For this field, the dominant wavefront aberration is astigmatism.
16:51:12
USA PATENT 2896506
AZUMA
POSITION 1
ORA 11-Oct-11
4 Waves
WAVE ABERRATION
FIELD ANGLE - Y: 32.00 DEGREES X: 0.00 DEGREES
DEFOCUSING: 0.000000 MM
WAVELENGTH: 656.00 NM
HORIZONTAL WIDTH REPRESENTS GRID SIZE 128 X 128
16:51:12
USA PATENT 2896506
AZUMA
POSITION 1
ORA 11-Oct-11
4 Waves
WAVE ABERRATION
FIELD ANGLE - Y: 32.00 DEGREES X: 0.00 DEGREES
DEFOCUSING: 0.000000 MM
WAVELENGTH: 656.00 NM
HORIZONTAL WIDTH REPRESENTS GRID SIZE 128 X 128
89
Figure 5.6 Wavefront aberration maps at the exit pupil using Code V are plotted for
a point source at the edge of the field demonstrating three waves of astigmatism.
5.2 Polarization Aberration Function
In this section, polarization ray tracing matrices are extended to construct a polarization
aberration function to characterize the transformations of a wavefront described by a grid
of rays, as is common in optical design programs7, 9.
A polarization aberration function ( )P r generalizes the wavefront aberration
function in optical design, where r represents the position vector on the terminal surface
of the ray trace, typically the exit pupil. The wavefront’s polarization information is
contained in the polarization aberration function ( )P r and the optical path length function
( )OPL r . The polarization aberration function will give an amplitude transmission map,
a phase map within less than one wave limit, the wavefront diattenuation ( )rD , which is
90
a polarization dependent apodization map and the wavefront retardance ( ) r , which is a
polarization dependent optical path difference (OPD) map.
The exiting electric field can be calculated from the polarization aberration
function and the incident electric field,
0( ) ( ) ( )exp(- 2 / ( ))i OPL E r P r E r r , (5.2.1)
contains corrections to ( )OPL r from coatings and other polarization effects. When
coatings on optical elements change, changes but ( )OPL r does not change.
Coatings frequently add small amounts of defocus and astigmatism.
An optical system without OPD wavefront aberration can still have a polarization-
induced wavefront aberration arising, for example, from differences in the Fresnel
reflection and transmission coefficients for s-polarized light and p-polarized light.
A ray tracing program can only sample ( )P r by tracing grids of rays, not calculate
it at an infinite number of locations. Assume an optical system for which a grid of
L M rays has been traced yielding an L M grid or matrix of propagation vectors,
positions, and optical path lengths
( )P r
( )P r
91
1,1 1,2 1,
,1 ,2 ,
1,1 1,2 1,
,1 ,2 ,
1,1 1,2 1,
,1 ,
ˆ ˆ ˆ
ˆ , propagation vector grid
ˆ ˆ ˆ
, exit pupil position grid
M
s
L L L M
M
s
L L L M
M
s
L L
L M
L M
OPL OPL OPL
OPL
OPL OPL
k k k
k
k k k
r r r
r
r r r
2 ,
, optical path length grid.
L M
L M
OPL
(5.2.2)
where s stands for sampled quantity.
The sampled polarization aberration function is also an L M grid of polarization
ray tracing matrices for each ray position,
1,1 1,2 1,
2,1 2,2 2,
,
,1 ,2 ,
M
M
Total s
L L L M
P P P
P P P
P
P P P
(5.2.3)
where Total stands for ray tracing through an optical system from entrance to exit pupil.
If the system has a pupil, such as a typical circular pupil, the sampled polarization
aberration function will have three-by-three matrices filled with zeros outside of the pupil,
represented here by the digit zero,
92
2,2 2, -1
3,2 3, -1
,
-2,2 -2, -1
-1,2 -1, -1
0 0 0 0 0 0 0 0 0
0 0 0 0 0 0
0 0 0 0
.
0 0 0 0
0 0 0 0 0 0
0 0 0 0 0 0 0 0 0
M
M
Total s
N N M
N N M
P P
P P
P
P P
P P
(5.2.4)
The elements of ,Total sP are complex matrices and the argument of a complex
number is always less than 2π. Therefore, the Optical Path Length ( sOPL ) is calculated
separately from ,i jP in order to keep track of optical path length which is greater than 2π
(or one wavelength). All the wavefront polarization information can be extracted from
the polarization aberration function ,Total sP which is a grid of polarization ray tracing
matrices sampled by rays.
Figure 5.7 shows an example polarization aberration function at the exit pupil of
the USA patent 2896506, the same system as in Figure 5.4.
93
Figure 5.7 Amplitude part of a polarization aberration function (,Total sP ) at the exit
pupil of USA patent 2896506.
Each point on the Jones pupil has different local coordinates i.e., the Jones pupil
is a collection of different transverse planes. ,Total sP is defined in global coordinates; thus
shows polarization effects along the optic axis component as well as the transverse
plane components in x, y, z global coordinates. Figure 5.8 shows the phase part of the
,Total sP
94
polarization aberration function of the USA patent 2896506. Since all the lenses in this
system are uncoated, has little retardance due to different phases coming from
Fresnel refraction coefficients for different rays; Fresnel refraction coefficients for air-
glass interface are real valued. This is why the phase map in Figure 5.8 has mostly zero
or π phase values across the pupil.
Figure 5.8 Phase part of a polarization aberration function ( ) at the exit pupil
of USA patent 2896506.
,Total sP
,Total sP
95
5.2.1 Coherent Beam Combination
Many optical analysis simulations involve beams of collimated rays propagating through
one or more plane parallel crystals or optical elements. Consider an example where a ray
enters a calcite plate, ray splitting occurs, and two parallel rays, the ordinary and
extraordinary rays, exit the plate. In the region where the exiting beams overlap, it is
desired to calculate the combined polarization properties, which can be done by
combining the two P matrices from the ordinary and extraordinary rays coherently. The
individual P matrices represent polarizers, since only one linear polarization propagates
into each mode. The combined P matrix represents a retarder, a more useful description
of the calcite plate. Similar optical design calculations arise when simulating the beams
exiting an interferometer if the beams have been adjusted to exit parallel to one another.
combinedP , the coherent combination of P matrices for a pair of rays with shared incident
propagation vectors, is not merely the sum of the individual P matrices. Each P must be
multiplied by the corresponding phase factors ( exp(- 2 / )i OPL ) to account for its
different optical path lengths. Further modification is required in order for combinedP to
correctly transform the incident propagation vector ( ˆInk ) to the exiting propagation
vector ( ˆExitk ) as in Eq.(2.1.7); ˆ ˆ
Exit combined Ink P k . The dyad Dk is the outer product
matrix of ˆqk and 1
ˆqk , and represents the propagation vector components in P
96
, , , , , ,
, , , , , ,
, , , , , ,
ˆ ˆ ˆ ˆ ˆ ˆ
ˆ ˆ ˆ ˆ ˆ ˆˆ ˆ[ , ]
ˆ ˆ ˆ ˆ ˆ ˆ
x In x Exit y In x Exit z In x Exit
D Exit In x In y Exit y In y Exit z In y Exit
x In z Exit y In z Exit z In z Exit
k k k k k k
Outer k k k k k k
k k k k k k
k k k . (5.2.5)
The following equation forcombinedP takes these effects into account:
1 1 2 2( )exp(- 2 / ) ( )exp(- 2 / )combined D D Di OPL i OPL P P k P k k . (5.2.6)
The dyad is subtracted from each iP before the coherent addition so that
ˆ ˆExit combined Ink P k .
5.3 Conclusion
Jones pupil contains the polarization effects of the system as a function of exit pupil
coordinates. Similarly, a polarization aberration function ,Total sP represents the spatial
variation of the polarization ray tracing matrix over the exit pupil. It is a generalized
wavefront aberration function that characterizes the polarization dependent
transformations of a wavefront. Under certain circumstances, the matrices for rays can
be coherently combined by a modified form of matrix addition.
Dk
97
CHAPTER 6 A SKEW ABERRATION
Aberrations can be considered as deviations from the mapping of spherical waves with
uniform amplitude and polarization into spherical waves with uniform amplitude and
polarization, i.e., the ideal behavior of imaging optical systems. The main categories are
wavefront aberration48
, apodization (amplitude aberration)49 , 50 , 51
, and polarization
aberration7. Wavefront aberration is the variation in optical path length, and all the
commercial ray tracing programs calculate the wavefront aberration. Apodization is an
amplitude aberration; different rays have different transmittances due to reflection losses
and absorption. Polarization aberration, a non-uniform polarization change across
wavefronts is divided into diattenuation aberration, which is polarization dependent
transmission, and retardance aberration, which is polarization dependent optical path
length difference.
In this section, a definition of skew aberration is given and an algorithm which calculates
the skew aberration is presented. The algorithm is applied to 2383 non-reflecting optical
systems in the Code V47
patent library and the statistics of the skew aberration is
demonstrated. The skew aberration of an example system from the library is further
analyzed.
6.1 Definition
This work introduces another component of polarization aberration, “skew aberration.”
Skew aberration is a rotation of each ray’s polarization state between the entrance pupil
98
space and the exit pupil space due to intrinsic geometric transformation of local
coordinates via parallel transport of vectors. Skew aberration is independent of the
incident polarization state or the coatings applied to the optical interface.
Skew aberration is categorized as a class of polarization aberration that is distinct
from diattenuation and retardance, since its origin arises from purely geometric effects.
Skew aberration does not arise from polarization properties of optical elements. Skew
aberration occurs even for rays propagating through ideal, aberration-free, and non-
polarizing optical systems. The definition of ideal and non-polarizing optical systems is
discussed at length in ref [14]. Thus, if a ray has non-zero geometric transformation of
local coordinates, its skew aberration is also non-zero.
The skew aberration is determined solely by the ray’s propagation path, i.e., its
sequence of normalized propagation vectors 1 2{ , , ,... ,..., }In j Exitk k k k k . Ink is a
propagation vector of a ray at the entrance pupil, jk is a propagation vector after the j th
surface, and Exitk is a propagation vector of a ray at the exit pupil.
6.2 Skew Aberration Algorithm
A skew aberration map in the exit pupil elucidates whether the polarization distribution
of the exiting wavefront after a non-polarizing optical system is identical to the
polarization distribution of the incident wavefront. This change in polarization
distribution is independent of the incident polarization states. Therefore a grid of
99
reference vectors (,In ig ), defined on the entrance pupil, is traced through an ideal and
non-polarizing optical system, and compared with another grid of reference vectors
(,Exit ig ), defined on the exit pupil. Sub index i stands for i
th ray on a grid. The only
requirements for the reference vectors are,
, , , , and In i In i Exit i Exit i g k g k . (6.2.1)
Our preferred choice of reference grids are calculated by the following steps.
First, define a vector (Cg ) which is perpendicular to the center ray’s propagation vectors
on the entrance and exit pupil
, ,C In C Exit C g k k , (6.2.2)
and use Cg as a reference vector of the center ray on the entrance and exit pupil.
,In ig is a counterclockwise rotation of Cg along ,In iaxis by ,In i and ,Exit ig is a
counterclockwise rotation of Cg along ,Exit iaxis by ,Exit i ,
, , ,
, , ,
( , )
( , )
In i In i In i C
Exit i Exit i Exit i C
axis
axis
g R g
g R g (6.2.3)
, , ,where arccos( )In i In i In C k k , , , ,In i In C In iaxis k k and , , ,arccos( )Exit i Exit i Exit C k k ,
, , ,Exit i Exit C Exit iaxis k k , and ( , )axisR is a 3D rotation matrix for a counterclockwise
rotation around the 3D vector axis by .
100
This rotation method is analogous to parallel transporting Cg along a great circle
arc on a unit sphere which connects points ,In Ck and
,In ik . Therefore, the resulting grid
has radial symmetry. We call this grid a “double-pole grid.” The double-pole grid is
often used to describe linearly polarized grid of rays on a spherical wavefront.
Figure 6.1 shows an example double-pole grid on a spherical entrance pupil in
two different views.
Figure 6.1 (a) A double pole grid of reference vectors on a unit sphere viewed along
the chief ray’s propagation vector and (b) oblique view.
Once ,In ig and
,Exit ig are calculated for an optical system, the system’s geometric
transformation needs to be calculated in order to trace ,In ig through ideal, non-polarizing
optical system. Each surface in the system has a certain amount of geometric
transformation of local coordinates due to the change in ray propagation direction from
101
-1jk to jk . A parallel transport matrix
jQ14
of the j th
surface calculates the geometric
transformation of the surface. jQ for refracting surface j is equivalent to sliding vectors
from a point-1jk to a point
jk on a unit sphere following the great circle arc which
connects two points. jQ for reflecting surface j is equivalent to inverting vectors on a
point -1jk about
-1j jk k and then moving them to a point jk by parallel transport.
Equations are shown below and derivations can be found in the reference14
.
jQ for a refracting surface j is
2 2
2 2
2
2 2
(1 ) [(1 ) ] [(1 ) ]1
[(1 ) ] (1 ) [(1 ) ]
[(1 ) ] [(1 ) ] (1 )
A B B D F B D G D B C H
F B D L A B B F C B F HA
D B C L C B F G A B B C
(6.2.4)
-1 -1 -1where Norm[ ], ,{ , , } , andj j j j j jA B C D F k k k k k k
22 2 2 2 2 2
2 2
1{ , , } { , , }.
A BG H L A F A C A D
C F
jQ for a reflecting surface j is
2
, -1 ,
2
, -1 ,
2
, -1 ,
2( )1
2( )
2( )
x j x j
y j y j
z j z j
A k k B C
B A k k DA
C D A k k
(6.2.5)
1where Norm[ ],j jA k k , 1 , , 1 ,2( )( ),x j x j y j y jB k k k k , -1 , , -1 ,2( )( ),x j x j z j z jC k k k k
and , -1 , , -1 ,2( )( ).y j y j z j z jD k k k k
102
,Total iQ determines the i th
ray’s geometric transformation
, 1
In
Total i j Exit j In
j Exit
Q Q Q Q Q Q . (6.2.6)
Tracing ,In ig though the non-polarizing system yields
, , ,Exit i Total i In i g Q g . (6.2.7)
The i th
ray’s skew aberration is defined as the angle between ,Exit ig and
,Exit ig . If
,Exit ig is the counter clockwise rotation of the
,Exit ig looking into the beam, the ray has
positive skew aberration.
6.3 Example
Optical systems with high numerical aperture (NA) and wide field of view (FOV) tend to
experience larger skew aberration. A skew aberration analysis is presented on USA patent
289,650,646, which has a high skew aberration compared to other systems in the Code V patent
library. The system has f/1.494, maximum FOV of 32º and is rotationally symmetric. Figure 6.2
shows the layout of the system with seven lenses and three field angles.
103
Figure 6.2 Optical system layout of USA patent 2896506 from Code V library has
seven lenses. The system is defined with three field angles.
6.3.1 Skew Aberration at the Exit Pupil
Figure 6.3 shows the skew aberration on the exit pupil for a grid of rays with 32º field
angle. The pupil coordinates are in mm. The maximum skew aberration of 7.01º rotation
occurs for a skew ray passing the edge of the exit pupil. A skew ray at the opposite end
of the pupil (point A) has -7.01º of skew aberration.
Rays of rotationally symmetric optical systems are classified as meridional rays
and skew rays. Skew aberration only occurs for skew rays but not for meridional rays
since meridional rays remain on a plane containing the optical axis. A meridional fan of
rays through the center of the exit pupil has zero skew aberration as shown in Figure 6.3
(a).
104
Figure 6.3 (a) Skew aberration of a ray grid with 32º field angle is evaluated at the
exit pupil of the patent 2896506. The maximum skew aberration is +7.01º and the
minimum is -7.01º (ray A). Both extreme occur from skew rays at the edge of the
pupil. (b) Horizontal cross section (indicated in orange dashed line in part (a)) of
the skew aberration exit pupil map has zero skew aberration for the center ray,
which is the chief ray.
105
Skew aberration of this example has a form of a circular retardance tilt (linear
pupil dependence) plus coma (cubic pupil dependence) aberration; skew aberration has
the same Jones matrix as a circular retarder, both of them cause a rotation of the
polarization state. For the radially symmetric lenses in our study the pupil dependence is
linear near the center of the field with an increasing cubic component towards the edge of
the object.
Skew aberration contribution of ray A (Figure 6.3 (a)) at each lens surface is
shown in Figure 6.4.
Figure 6.4 Skew ray A's skew aberration contribution from each lens surface sums
to -7.01º through the system.
106
6.3.2 Skew Aberration’s Effect on Point Spread Function and
Modulation Transfer Function
When there is a variation of skew aberration across the exit pupil of the wavefront, there
are changes in polarization character of the wavefront. This variation of skew aberration
across the pupil creates undesired polarization components in the exit pupil; typically
cross polarized satellites form around the PSF45, 52, 53
. Thus the point spread function
(PSF) and an optical transfer function (OTF) changes and image quality can be degraded.
In the presence of polarization aberration, the scalar PSF can be generalized to a four-by-
four point spread matrix (PSM) in Mueller matrix notation and an OTF can be
generalized to a four-by-four optical transfer matrix (OTM)53
. Following figures show a
PSM and OTM of the example system.
Figure 6.5 is the PSM of the example system calculated by discrete Fourier
transform of the parallel transport matrix of the system at the exit pupil. The shape of the
peaks in the figure is not radially symmetric since the optical system is not isoplanatic for
this maximum field angle of 32º.
The diagonal elements are the dominant terms; the exiting wavefront after the ideal, non-
polarizing ray tracing is almost the same as the incident wavefront polarization. The off-
diagonal elements introduce polarization mixing; the exiting wavefront has polarization
components that did not exist in the incident wavefront due to rotation that are introduced
by skew aberration. Since different skew rays have different skew aberration, the exiting
107
wavefront not only has undesired polarization components but also has some variation
across the pupil.
Figure 6.5 A point spread matrix (PSM) of the example system calculated from a
discrete Fourier transform of the parallel transport matrix of the system at the exit
pupil. The elements are elliptical due to three waves of astigmatism.
Figure 6.6 shows the MTF of the example system calculated by a discrete Fourier
transform of the PSF in Figure 6.5. Each term is normalized so that the maximum is one.
108
The off-diagonal elements show the additional effects due to polarization components
that are presents on the exit pupil, which skew aberration creates.
Figure 6.6 An optical transfer matrix (OTM) of the example calculated by a discrete
Fourier transform of the PSM.
109
Due to the variation of skew aberration across the pupil, the deviation of PSM and
OTM from the ideal PSM and OTM also has variation across the pupil. And this can
degrade the image quality of the system.
6.4 Statistics – Code V Patent Library
To understand the typical magnitude of skew aberration and its importance in the families of
aberrations, the skew aberration of 2383 non-reflecting optical systems in Code V’s US patent
library was calculated and shown in Figure 6.7. The mean is 0.89º and the standard deviation is
1.37º. The maximum skew aberration is 17.45º and the minimum is -11.33º.
Figure 6.7 Histogram of the maximum skew aberration evaluated for 2383 non-
reflecting optical systems in Code V’s library of patented lenses.
110
When all polarization states of an incident wavefront experience the same amount
of rotation, the exiting wavefront polarization is simply a rotation of the incident
wavefront polarization. This is analogous to having a constant phase over the entire
wavefront such as piston aberration. Neither piston nor uniform polarization rotation
should degrade image quality or the point spread function (PSF). It is the variation of
skew aberration across the exit pupil of the wavefront that changes the polarization
character of the wavefront.
6.5 Skew Aberration in Paraxial Ray Trace
Paraxial optics is a method of determining the first-order properties of a radially
symmetric optical system that assumes all ray angles and angles of incidence are small54
.
A paraxial ray trace or first-order ray trace is a linearized approximation of real ray
behavior. The first order properties of radially symmetric systems, such as focal lengths,
magnifications, principal planes and others are defined in terms of the paraxial ray trace.
The paraxial ray trace provides an invaluable coordinate system for the description of
aberrations. Although the computer now makes it easy to trace real rays, the linearity of
paraxial ray slopes and coordinates makes them useful for solves, which specify
thickness or curvature indirectly in terms of paraxial ray properties.
Paraxial refraction and transfer equations at the qth
ray intercept are
1
1
1
1
q q q q
q q q q
q q q q
q q q q
y y w
y y w
w w y
w w y
(6.5.1)
111
where / , ,q q q q q q qt n w n u t is the distance between the qth
and the q+1th
surface
vertices along the axis, qn is the refractive index of the material following the q
th surface,
y is the marginal ray height, y is the chief ray height, u is the marginal ray angle, and u
is the chief ray angle.
The Code V paraxial ray trace table traces two rays, the paraxial marginal ray
(from center of object to edge of entrance pupil) and full-field paraxial chief ray (from
top of object to center of entrance pupil)55
. All paraxial rays can be calculated from
linear combinations of these two rays. For the paraxial skew ray from the top of the
object and edge of the pupil the paraxial marginal ray height at each surface is the x-
coordinate of the skew ray, the paraxial chief ray height is the y-coordinate of the skew
ray, and the vertex of each surface is the z-coordinate of the skew ray. Thus, the
propagation vector qk after the qth
ray intercept is along 1 1{ , , }q q q q qy y y y t . Since
the skew aberration calculation uses the normalized propagation vectors, further
manipulation provides a normalized qk
2 2
{ , ,1}
1
q q
q
q q
w w
w w
k . (6.5.2)
For paraxial ray trace, spherical polygons that the parallel transport of skew rays
trace reduces to a polygon on the transverse plane that is perpendicular to {0,0,1} . Thus,
the paraxial ray trace skew aberration is proportional to the area of the polygon. By
112
dropping the z-component of the propagation vectors, 2D propagation vectors which form
the polygon can be calculated
2 ,2 2
{ , }
1
q q
D q
q q
w w
w w
k . (6.5.3)
An area of a triangle that connects the origin, 2 ,D qk , and
2 , 1D qk is
1 1
2 2 2 2
1 1
1
2 1 1
q q q q
q
q q q q
w w w wArea
w w w w
. (6.5.4)
Further manipulations using Eq. (6.5.1) results
2 2 2 2
1 1
2 2 2 2
1 1
2 1 1
2 1 1
q q q q q
q
q q q q
q
q q q q
w y w yArea
w w w w
H
w w w w
(6.5.5)
where q q q qH w y w y is the Lagrange invariant of the system.
Therefore, the skew aberration of the system is proportional Lagrange invariant,
and is closely related to the sum of the individual surface powers (Eq. (6.5.5))
2 2 2 2
1 12 1 1
q
qq q q q
HTotal Area
w w w w
. (6.5.6)
The optical system of Section 6.3 is used to calculate the skew aberration in
paraxial ray trace. A paraxial skew ray at point A in Figure 6.3 (a) is created by paraxial
113
ray traced marginal ray height and chief ray height. The skew aberration of -2.00º
rotation occurs for a paraxial ray at point A. The skew aberration is calculated by
following the algorithm in Section 6.2. Skew aberration contribution of a paraxial ray at
point A at each lens surface is shown in Figure 6.8.
Figure 6.8 Paraxial skew ray at point A's skew aberration contribution from each
lens surface sums to -2.00º through the system.
The existence of skew aberration in paraxial regime shows the possibility to
further describe skew aberration using series expansion method, which is one of the
directions for future work.
114
6.6 Conclusion
Skew aberration is a component of polarization aberration that originates from pure
geometric effects. Pupil variation of skew aberration affects PSF and degrades image
quality. The skew aberration of a chief ray serves as a piston-like aberration i.e., overall
polarization rotation across the pupil. Skew aberration is typically a small effect in lenses
but it could be a concern in microlithography optics and other polarization sensitive
systems with high NA and large FOV.
115
CHAPTER 7 UNDERSTANDING APPARENT RETARDANCE
DISCONTINUITIES
Issues in the measurement and modeling of the order of retarders are addressed. When
measuring the retardance spectra of compound retarders, the retardance can appear to
“turn around” and avoid integer numbers of waves of retardance. In this chapter, studies
on phase unwrapping of the principal retardance for polychromatic light are presented in
order to explain the behavior of retardance of compound retarders. One approach is
using a dispersion model to describe the retardance behavior. The other approach is
considering multiple wavefronts exiting the compound retarder system i.e., multi-valued
optical path length (OPL).
A common definition describes a retarder as a device that divides a beam into two
orthogonal modes and introduces a relative phase difference δ1. Another view of
retarders is provided by the Mueller calculus and Poincaré sphere. A retarder rotates
polarization states on the Poincaré sphere by retardance δ; as light propagates through a
retarder the incident state on the Poincaré sphere is rotated about a retardance axis to
another state. In this Mueller / Poincaré picture, cascading retarders is equivalent to
cascading rotations of the Poincaré sphere. This view of retarders and its implementation
with Mueller matrices have ambiguities of 2 n where n are integers describing the order
of the retarder. In the Mueller picture, the final polarization state always ends up in the
116
right place but we only know the retardance modulo 2 . Similarly, the retardance axis
orientation is modulo to 90 .
In the following section an algorithm for finding total retardance, horizontal
retardance, 45° retardance, and circular retardance for given retarder Jones matrices is
presented. In Section 7.2 a retarder space concept is introduced to better understand
phase unwrapped retardance and compound retarder systems with multiple retarder
elements. Dispersion properties of retardance for homogeneous and inhomogeneous
compound retarders are analyzed.
7.1 Retardance Calculation for Jones Matrices
In this section an algorithm calculating retardance values – the total retardance ( ), a
horizontal component of the retardance (H ), a 45º component of the retardance (
45 ),
and a circular component of the retardance (R ) - from the Jones matrix of a pure
retarder is presented. Any Jones matrix J can be expanded with Pauli spin matrices
0 1 2 311 12
0 0 1 1 2 2 3 3
2 3 0 121 22
c c c icj jc c c c
c ic c cj j
J σ σ σ σ , (7.1.1)
Where are complex numbers, 0
1 0
0 1
σ , 1
1 0
0 1
σ , 2
0 1
1 0
σ , and
3
0
0
i
i
σ . For any given pure retarder Jones matrix, calculating ic is simple,
117
11 22 11 22 12 21 12 210 1 2 3
( ), , , .
2 2 2 2
j j j j j j i j jc c c c
(7.1.2)
The Jones matrix of a pure retarder (no amplitude, specified phase) can be
expressed as the exponential of a sum of Pauli spin matrices,
1 45 2 3
1 45 2 30
exp( ( ) / 2)
cos( ) sin( )( ),2 2
H R
H R
i
i
J σ σ σ
σ σ σσ
(7.1.3)
where 2 2 2
45H R .
Eq. (7.1.1) can be written as
31 20 0 1 2 3 0 0 1 1 2 2 3 3
0 0 0
( ) ( ),cc c
c c d d dc c c
J σ σ σ σ σ σ σ σ (7.1.4)
where 0c is the polarization independent part of the Jones matrix i.e., the absolute
amplitude and phase change.
Comparing Eq. (7.1.1) and Eq. (7.1.4)
451 2 3tan( ) , tan( ) , tan( ) ,
2 2 2
H Rd i d i d i
(7.1.5)
and using 2 2 2
45H R
2 2 2
1 2 3
31 245
2arctan( )
, , ,tan( / 2) tan( / 2) tan( / 2)
H R
-d - d - d
i di d i d
(7.1.6)
118
with the fast and slow axes along
45 45{ , , } and { , , }.fast H R slow H Ra a - - - (7.1.7)
If 1 , Eq. (7.1.6) converges to
1 45 2 32 , 2 , 2 ,H Rid id id (7.1.8)
using 0
lim 2tan( / 2)
.
When c0 is zero, Eq. (7.1.1) becomes
1 1 2 2 3 3c c c J σ σ σ (7.1.9)
and cos( )2
in Eq. (7.1.3) should be zero. Thus this Jones matrix is a half wave retarder.
Eq. (7.1.3) becomes
1 45 2 31 45 2 3sin( )( ) ( ).
2
H RH R
ii
σ σ σJ σ σ σ (7.1.10)
Using Eq. (7.1.9) and (7.1.10),
1 45 2 3, , ,H Ri c i c i c (7.1.11)
and using 2 2 2 2
1 2 3 sin ( / 2)c c c -
2 2 2
1 2 32arcsin( ).-c - c - c (7.1.12)
119
The total retardance δ from Eq. (7.1.6) has π as its maximum as shown in Figure
7.1. The red dotted line indicates the retardance value , which is the value that the total
retardance approaches.
Figure 7.1 A range of total retardance δ calculated from Eq. (7.1.6).
Retardance magnitude less π than is called the “principal retardance”.
7.2 Retarder Space
Retarders can be represented as points in a retarder space which has horizontal
component of the retardance ( H ), 45º component of the retardance (45 ), and circular
component of the retardance (R ) as axes. The Figure 7.2 shows a corresponding
location of a retarder with total retardance δ in the retarder space. The coordinate of the
point is 45{ , , }H R , where
H is the horizontal component of the retarder, 45 is the 45°
120
component of the retarder, R is the right circular component of the retarder, and
2 2 2
45H R .
Figure 7.2 A retarder space with δH, δ45, and δR as axes where δ is the distance from
the origin to a point which has a magnitude of retardance δ.
Points in the45H - plane represent linear retarders while points along the
R -
axis represent pure circular retarders. The corresponding Jones eigenpolarizations for a
retarder with retardance45, , and H R are,
45 45,
1 1
H H
R RF Si i
v v , (7.2.1)
and Stokes eigenpolarizations are
H
45
R
121
45 45
,H H
F S
R R
-
-
-
S S , (7.2.2)
where F and S stand for fast and slow modes.
The ellipticity of the eigenpolarizations is
2 2
45
1arctan( ),
2
R
H
(7.2.3)
and the orientation of major axis is
451arctan( ), .
2 2f s f
H
(7.2.4)
In the retarder space, there is no limit on the range of retardance; retardance value
can have any magnitude. Points on a sphere of radius 2π with its center at the origin
represent retarders with one wave of retardance and points on a sphere of radius 4π
represent retarders with two waves of retardance, etc.
Figure 7.3 shows two groups of identical retarder Mueller matrices with different
absolute phases in the retarder space. Each sphere represents retarders with retardance
n . The origin and spheres of radius (retardance) 2n are identity Mueller matrices.
Red points ( A and A ) are identical retarder Mueller matrices and are symmetrically
located about the origin. Similarly, green points ( B and B ) show another set of
identical Mueller matrices in the retarder space.
122
Figure 7.3 Two groups of Mueller matrices (A and B) with the same retardance
modulo to 2π are shown in the retarder space. Each point in the groups is 2π away
from each other and shares the same fast and slow axes.
For each Mueller matrix with retardance there is a series of Mueller matrices with the
retardance 2n with the same fast axes 45{ , , }H R and with the retardance 2n
with the orthogonal fast axes 45{ , , }H R- - - .
7.3 Trajectories of Jones Retarder Matrices as the Polarization State
Analyzer Rotates
As mentioned in section 4.3.1 rotation of the polarization state analyzer (PSA) affects the
retardance value measured from the Jones matrix polarimeter. Figure 7.4 shows two
H
45
2
4
I
I
I
A
A
A B
B
B
B
A
B
B
123
views of trajectories of horizontal fast-axis linear retarder (HLR) with retardance δ0 in the
retarder space as the PSA is rotated by θ from zero to 2π. Since retarders have
redundancy in every nπ, points in the retarder space repeat as the PSA rotates; an initial
point for each δ0 is on theH -axis and moves toward positive
R -axis along the trajectory.
When the trajectory reaches the boundary it moves to the origin symmetric point and
comes back to the starting point as θ reaches 2π. During 2π rotation of θ the trajectory
repeats twice.
2
0
2H
202
45
2
0
2
R
124
Figure 7.4 Trajectories of horizontal fast axis linear retarders in the retarder space
as the PSA rotates by θ from zero to 2π. Each trajectory repeats twice for 0≤ θ ≤ 2π.
For δ0 = 0 retarder, which is an identity matrix, the trajectory is a single line and the
retarder behaves as a circular retarder as θ increases. As δ0 increases trajectories start to
curve and form a spiral. For δ0 = π, which is a half wave retarder, the trajectory stays in
45-H plane, as we expected, and repeats around the circle twice keeping the total
retardance to π.
A family of parameters - rotation angle θ between the incident and exiting local
coordinates, retardance δ, eigenpolarization trajectories on the retardance space, and
2 0 2
H
202
45
2
0
2
R
0 0 0 4
0 2
30 4
0
125
eigenpolarizations of a Jones matrix - are needed to calculate correct retardance of a
Jones matrix. For simple cases, such as single retarder with rotating analyzer or s-p
reflection, there is no complexity. However, calculating correct retardance from multiple
skew reflections through compound inhomogeneous systems requires a more thorough
analysis.
7.4 Phase Unwrapping for Homogeneous Retarder Systems using
Dispersion Model
7.4.1 Dispersion Model
For monochromatic light we cannot distinguish matrices that are identical with different
absolute phase terms; for example,
2 4
1 2 3
1 0 1 0 1 0, ,
0 1 0 1 0 1
i ie e
J J J , (7.4.1)
1 2 3, , and J J J are the same polarization elements for monochromatic light. Thus the
absolute phase term is often ignored for monochromatic light source and all three
matrices are treated as an identity matrix. In the Michelson interferometer, using
polychromatic light the location of the zero optical path difference can be found. So
polychromatic light can be applied with care to distinguish absolute phases in cases such
as Eq. (7.4.1).
126
When a polychromatic wavefront propagates through a waveplate, each
wavelength of the light experiences different retardance. This is the origin of the
dispersion model for retarders. A plane wave can be written as
2 ˆ( )( )
0 0( , )n
i ti tt E e E e
k r
k rE r (7.4.2)
where k is the normalized propagation direction, is the wavelength of the plane wave,
n is the refractive index of the material, is the frequency of the plane wave. When this
plane wave propagates through a waveplate, the distance that the plane wave propagates
within the waveplate is
ˆd k r (7.4.3)
and the wavelength dependent retardance, which comes from the phase of the plane wave,
is
02( )
nd
, (7.4.4)
assuming the refractive index of the waveplate is not dependent on the wavelength. Eq.
(7.4.4) is the dispersion model of the retardance. This model is used for the phase
unwrapping of the principal retardance for homogeneous and inhomogeneous retarder
systems.
127
7.4.2 Phase Unwrapping of the Homogeneous Retarder System
In this section, a phase unwrapping algorithm of the principal retardance using the
dispersion model is introduced. For a given Jones or Mueller matrix, retarder order at
is determined using the dispersion model and pieces of principal retardance from
different orders are rearranged. Unwrapped retardance is the true retardance of the
system for a given wavelength. An exam system consisting of two horizontal linear
retarders (HLR) with shared fast axes is used to understand the phase unwrapping
algorithm.
When two HLRs are aligned to have the same fast or slow retarder axis, they form
a homogeneous retarder system since the eigenpolarizations are orthogonal to each other.
In this section two HLRs made from the same material with different thicknesses are
aligned along a shared fast axis. The retardance of each plate as a function of wavelength
has a linear relationship to each other, i.e., 2 1( ) ( )K . Consider the case when the
thickness of the second retarder is twice of the first retarder i.e., 2K . Figure 7.5 shows
the first and the second retarders’ principal retardance and the fast axis orientation as a
function of wavelength ranging from 200nm to 4μm. The principal retardance is
calculated by following the algorithm explained in Section 7.1 and the fast axis
orientation is calculated using Eq. (7.1.7). The orientation of the fast axis is along the
horizontal axis for downward sloping regions and is along the vertical axis for upward
sloping regions.
128
(a) (b)
Figure 7.5 (a) The principal retardance of the first retarder and its fast axis
orientation as a function of wavelength. (b) The principal retardance of the second
retarder and its fast axis orientation with respect to the horizontal axis as a function
of wavelength.
The total retardance is the sum of each retarder’s retardance
1 2 1 1 1( ) ( ) ( ) ( ) 2 ( ) 3 ( )Total . (7.4.5)
Figure 7.6 shows the retardance of the first retarder 1( ) in green, the retardance
of the second retarder 2( ) in blue, and the total retardance ( )Total of the system in red
as a function of wavelength in one plot. All the retardance values are principal
500 1000 1500 2000 2500 3000 3500 4000nm
0.5
1.0
1.5
2.0
2.5
3.0
rad
500 1000 1500 2000 2500 3000 3500 4000nm
0.5
1.0
1.5
2.0
2.5
3.0
rad
1000 2000 3000 4000nm
0.5
1.0
1.5
fast axis
1000 2000 3000 4000nm
0.5
1.0
1.5
fast axis
129
retardances with horizontal fast axis for downward sloping regions and with vertical fast
axis for upward sloping regions.
Figure 7.6 Retardance plots as a function of wavelength for two horizontal fast axis
linear retarders (HLR) with different thicknesses (green and blue) and a
combination of two HLRs with a shared horizontal fast axis (red). The principal
retardance has the horizontal fast axis for downward sloping regions and has the
vertical fast axis for upward sloping regions.
The principal retardance and the fast axis of the combination of two HLRs as a
function of wavelength are shown in Figure 7.7.
1000 2000 3000 4000nm
0.5
1.0
1.5
2.0
2.5
3.0
rad
130
Figure 7.7 The principal retardance of the combination of two HLRs and the
system’s fast axis orientation with respect to the horizontal axis as a function of
wavelength.
The principal retardance value oscillates between zero and π as the wavelength
gets shorter. Thus, a phase unwrapping is necessary to recover the real retardance values
at a given wavelength. This phase unwrapping method requires knowledge of the
500 1000 1500 2000 2500 3000 3500 4000nm
0.5
1.0
1.5
2.0
2.5
3.0
rad
1000 2000 3000 4000nm
0.5
1.0
1.5
fast axis
131
wavelength dependence of a retarder; the phase unwrapping algorithm uses the
retardance dispersion model shown in Eq. (7.4.4). Coming from the right side of the
principal retardance plot in Figure 7.7, the retardance increases to π with the horizontal
fast axis, then decreases with the vertical fast axis. When the retardance reaches zero,
then it increases with the horizontal fast axis. In order to phase unwrap the principal
retardance, a mode number q is assigned to each segment of the principal retardance with
different fast axis orientation; odd q’s are for the horizontal fast axis and even q’s are for
the vertical fast axis. Figure 7.8 shows segments with odd mode numbers in blue and
segments with even mode numbers in red coming from the right side of the graph (longer
wavelength). For this example, q = 1, 2, …, 16.
Figure 7.8 Each segment of the principal retardance has a mode number q to apply
phase unwrapping algorithm. Starting from the right side of the graph, blue
segments have odd mode numbers and the red segments have even mode numbers.
132
Figure 7.9 show the trajectory of the principal retardance within the π sphere as
the wavelength reduces; the upper left corner figure corresponds to the mode 1 in Figure
7.8. The red color indicates the principal retardance vector 45{ , , }H R for the longest
wavelength. As the wavelength reduces, color changes to yellow → green → blue →
purple → magenta. The R axis is coming out of the page and each segment shows the
points along the same fast axis. The upper left corner figure has the horizontal fast axis
(along H axis) and the retardance is increasing. Once the retardance reaches the π
sphere, its fast axis changed to the vertical direction (along H- axis) and moves to the
origin symmetric point. The next trajectory is continued in the upper middle figure and
so on. The origin {0,0,0} is equivalent to the identity matrix which is a full wave
retarder or 2nπ retarder for an integer n. As the wavelength reduces, the fast axis
orientation changes 15 times alternating along the horizontal and vertical directions.
133
134
Figure 7.9 Principal retardance vector trajectories are shown in the retarder space
as the wavelength changes. Each figure corresponds to a different mode number
starting from the longest wavelength (mode 1) to the shortest wavelength.
Figure 7.10 shows the principal retardance trajectory as the wavelength changes;
the system’s principal retardance gets larger as the wavelength gets shorter. When the
trajectory reaches a sphere with radius π (point x in the figure) it goes to x’, which is the
origin symmetric point of x and spirals to x’ within the sphere, instead of continuing to x.
Figure 7.10 The principal retardance trajectory in the retarder space as the
wavelength of the ray changes. Discontinuity occurs on a sphere of radius π.
R
XX
2H
45
'X
'X
135
The phase unwrapping algorithm maintains the fast axis orientation as the
wavelength reduces and calculates the true retardance at a given wavelength without the
upper limit. The basic assumption of this method is that at long enough wavelengths, for
retarders under consideration will have less than a half wave of retardance, which is the
true retardance of the retarder at that wavelength. Therefore when q = 1, the principal
retardance is the true retardance. For even q’s the true retardance is principalq while
the fast axis orientation is kept along the horizontal axis. For odd q’s the true retardance
is ( 1) principalq with the fast axis along the horizontal axis, i.e.,
when 1
when
( 1) when
principal
unwrapped principal
principal
q
q q even
q q odd
. (7.4.6)
Thus the phase unwrapped retardance range has no upper limit.
Figure 7.11 (a) shows the principal retardance trajectory in the retarder space for
two-aligned HLR system as the wavelength reduces; when the principal retardance
reaches the boundary value π the trajectory moves to the origin symmetric point on the
sphere and changes its fast axis to the orthogonal direction. Figure 7.11 (b) shows the
retardance trajectory of the same system after the phase unwrapping; the horizontal
retardance increases continuously keeping the fast axis orientation along the horizontal
direction. As shown in Figure 7.2, the distance from the origin to a point in the retarder
space is the total retardance of the system.
136
(a)
(b)
Figure 7.11 (a) Principal retardance trajectory of a two-aligned HLR system in the
retarder space as the wavelength gets shorter. (b) Retardance trajectory of the
same system after the phase unwrapping.
Retardance plots in Figure 7.6 can be phase unwrapped to Figure 7.12.
H
45
A
3 5
R
H
45R
A A’ B B’
C
B C
137
Figure 7.12 Phase unwrapped retardance as a function of wavelength for two HLRs
(green and blue) and a system of two-aligned HLR (red).
In phase unwrapped figure, the total retardance value always follows the relationship in
Eq. (7.4.5) for all the wavelengths. For homogeneous retarder systems, there is no
discontinuity in phase unwrapped retardance values.
7.5 Discontinuity in Phase Unwrapped Retardance Values for Compound
Retarder Systems of Arbitrary Alignment
For two retarders with the same fast axes, the total retardance is the sum of individual
retardance and the retardance is linear. Many interesting phenomena are associated with
500 1000 1500 2000 2500 3000 3500 4000nm
10
20
30
40
rad
138
phase unwrapping and order determination of compound linear retarders whose fast axes
are neither parallel nor perpendicular. Such retarders may result from misalignment or be
intentionally at arbitrary angles. Sequences of retarders whose axis are neither parallel
nor perpendicular are in general elliptical. We will explore phenomena in which the
retardance itself appears to be discontinuous and explain why.
If the fast axis of the second retarder is slightly misaligned from that of the first
retarder, as it always will be in practice, the total principal retardance has slightly
different behaviors from the aligned system. For example, if two horizontal fast axis
linear retarders with retardance 1( ) and
2( ) respectively, are misaligned by θ, the
Jones matrix of the system is
Total J
( [ ] [ ] ( ) [ ]
( ) [ ] ( [ ] [ ]))
(7.5.1)
Using the algorithm explained in Section 7.1, the retardance of the systems is
. (7.5.2)
From this equation, it is clear that the retardance of the system is not only dependent on
individual retarder’s retardance but also dependent on the angle between the fast axes
orientations. If both retarders are half-wave retarders with horizontal fast axes, the total
retardance will be (full wave) when two fast axes are aligned and the total retardance
139
will be zero when the fast axes are orthogonal to each other. Figure 7.13 shows how the
total retardance of the system varies as the second retarder’s fast axis orientation changes
(θ) with respect to the horizontal direction, which is the first retarder’s fast axis
orientation.
Figure 7.13 Total retardance for a system with two half-wave linear retarders is
plotted as the fast axis orientation (θ) of the second retarder changes with respect to
the first retarder’s fast axis orientation.
Note that in the figure, zero total retardance when θ = π/2 and two end points when θ = 0
and π imply different total retardance. Since the plot shows the principal retardance, two
end points are when the total retardance is 2π and the middle zero is when the total
retardance is zero.
First, the origin of apparent discontinuities in phase unwrapped retardance of
compound retarder systems is explained by separating the Jones matrix TotalJ of the
0.0 0.5 1.0 1.5 2.0 2.5 3.0rad
0.5
1.0
1.5
2.0
2.5
3.0
rad
140
compound retarder system in Eq. (7.5.1) into two parts, θ dependent Jones matrix MajorJ
and θ independent Jones matrix MinorJ .
Two linear retarders in Eq. (7.5.1) can be written as a sum of Pauli spin matrices
1 12 2 0 1
2 2 21 1 0 1 2
( , ) cos( ) sin( )2 2
( ,0) cos( ) cos(2 )sin( ) sin(2 )sin( ) ,2 2 2
i
i i
LRJ σ σ
LRJ σ σ σ
(7.5.3)
where ( , ) LRJ is a linear retarder with a principal retardance and the fast axis along
with respect to the horizontal axis. Thus Eq. (7.5.1) becomes
2 2 1 1
1 2 1 20 1
2 2 10
2 2 11
2 12
, indepen
( , ) ( ,0)
cos( ) sin( ) 2 2
2sin ( )sin( )sin( )2 2
2 sin ( )sin( )cos( )2 2
2 sin( )cos( )sin( )cos( )
dent
2 2
2
pa
sin( )cos
rtM
Total Major Minor
ajori
i
i
i
J LRJ LRJ J J
σ σ J
σ
σ
σ
2 13
, depende
(
nt pa
)sin( )sin
r
(2 2
t
)
Minor
J
σ . (7.5.4)
The MajorJ matrix is a horizontal linear retarder with retardance 1 2Major
( 1 2( ,0) LRJ ) and MinorJ can be written as
0 0 1 1 2 2 3 3
2 2 1 1 10 1 2 3
( )
2sin ( )sin( )sin( )[ cot( ) cot( )cot( ) cot( ) ].2 2 2 2
Minor c d d d
i i i
J σ σ σ σ
σ σ σ σ (7.5.5)
141
Using Eq. (7.1.6), the retardance of MinorJ is
2 2 21 1
2 21
cos ( ) sin ( )cos ( )2 22arctan( ).
sin ( )sin ( )2
Minor
(7.5.6)
The discontinuities, which occur when the principal retardance in Eq. (7.5.6) is phase
unwrapped, originate from the MinorJ matrix. The MajorJ matrix always has continuous
phase unwrapped retardance. The discontinuities get greater as gets larger since the
amplitude of Minor gets greater as gets larger.
Let’s consider a system consists of two linear retarders – a HLR with retardance
δ1 and a HLR with retardance δ2 but misaligned by π/16 from the horizontal axis. The
retardance ratio is δ1: δ2 = 1 22 2:
nd nd
= 1.1: 2 and both retarders are made from the
same material where 1d and
2d are the thickness of the retarders as in the dispersion
model (Eq. (7.4.4)). The misaligned retarder system has a Jones matrix,
2 2 1 1( , /16) ( ,0)Total J LRJ LRJ
(
). (7.5.7)
Using the dispersion model for each retarder’s retardance and Eq. (7.5.2) with = π/16,
the principal retardance of the combination ( TotalJ ) is calculated as a function of
wavelength.
142
Figure 7.14 shows principal retardance of TotalJ and the fast axis orientation as a
function of wavelength. Coming from the right side, the principal retardance increases to
π (half wave), then decreases and the axis rotates to vertical. The principal retardance
only decreases to 0.7 before turning around and increasing to π, the second maximum
from the right. At this half wave point the retardance (δ) must be 3π/2, corresponding to
the second maxima in the top figure in Figure 7.7. The retardance of the system never
has the transformation which would correspond to δ = 2π.
143
Figure 7.14 The principal retardance of TotalJ , a system of two HRLs misaligned by
π/16, is plotted as a function of wavelength. Green dotted circles in the top figure
indicate the area where the principal retardance changes its slope without going
down to zero.
144
A full wave retarder or 2nπ corresponds to the origin {0,0,0} in the retarder space.
When the retardance vectors of the compound system are plotted in the retarder system,
the trajectory misses the origin as the wavelength reduces. Figure 7.15 and Figure 7.16
show two different views of the trajectories in the retarder space as the wavelength
reduces. Two views are shown for the clarity and both figures follow the same color
scheme as Figure 7.9; red → yellow → green → blue → purple → magenta. Each
segment shows the trajectory as it approaches the π sphere boundary. For example, the
upper middle figure starts at and follows the trajectory to
as the retardance vector reaches the π sphere.
145
Figure 7.15 Principal retardance vector trajectories are plotted in the retarder
space as the wavelength reduces. Each figure corresponds to a segment of the
trajectory from the longer wavelength to the shorter wavelength as the retardance
vector approaches the π sphere.
146
Figure 7.16 The top view of the principal retardance vector trajectories in the
retarder space as the wavelength reduces.
Unlike the aligned system’s retarder space trajectory (Figure 7.9), the compound
system’s retarder space trajectory misses the origin, i.e., the phase unwrapped retardance
of the compound system increases from π to 3π without passing through 2π point, the
origin.
147
Using the principal retardance of TotalJ and the fast axis orientation, the true
retardance can be calculated by using the phase unwrapping algorithm, Eq. (7.4.6).
Discontinuities are clearly visible when phase unwrapped retardances for the misaligned
(blue) and aligned (red) system are plotted together as shown in Figure 7.17. The blue
plot has similar values as the red plot since the misalignment is small. However, the blue
plot has discontinuities whenever the retardance value crosses 2nπ boundaries, which are
plotted as horizontal blue dashed lines.
Figure 7.17 Phase unwrapped retardance plotted as a function of wavelength for the
aligned (red) and forced phase unwrapping for a misaligned (blue) two-HLR system.
148
Figure 7.18 shows phase unwrapped Major of the compound system in the top and
Minor in the bottom.
Figure 7.18 Phase unwrapped Major (top) and
Minor (bottom) plotted as a function of
wavelength. The phase unwrapped Major is the linear addition of 1 and
2 of the
each linear retarder.
149
The phase unwrapped Major follows the exact same curve as the red plot in Figure 7.17
since Major is a linear addition of
1 and 2 of the each linear retarder. The
discontinuities in Figure 7.17 occur due to the non-linear retardance coming from the
MinorJ matrix.
Figure 7.19 shows the principal retardance as a function of wavelength for a
system of two misaligned HLRs with different misalignment amounts θ; the angle
between two fast axes orientation θ varies from π/16 to π/2. Green and blue lines are
principal retardance of the first and the second retarders. Red lines show the principal
retardance of the misaligned system while purple lines indicate a half wave of retardance,
the boundary value.
150
151
Figure 7.19 The principal retardance plots as a function of wavelength for two
HLRs (green and blue) and a system of two misaligned HLRs (red). Two HLRs
have different retardances and the angle between two fast axes is θ.
Whenever the principal retardance of the misaligned HLR system has minima
other than zero, the phase unwrapped retardance has discontinuity. At 2
two HLRs
are orthogonal to each other thus, the total retardance (red) is
2 21 2 2
( ) ( )( ) ( ) ( ) ( )
2 2Total
, (7.5.8)
and the fast axis of the combined system is along the fast axis of the second retarder.
Retarders have two modes, fast and slow modes. For misaligned compound
retarder systems which consists of two retarders, multiple modes, typically more than two,
exit the system. They are F1 F2, F1 S2, S1 F2, and S1 S2 where the F and S stand for the fast
and slow modes and 1 and 2 stand for the first and the second retarders. Two of the
modes (F1 F2 and S1 S2) have most of the intensity and the other two modes (F1 S2 and S1
152
F2) have fairly small intensity. For example, F1 F2 and S1 S2 modes for TotalJ in Eq.
(7.5.7) have 96% (
) of the intensity and the other two modes have 4% of the
intensity. Therefore, the phase unwrapped retardance has a similar behavior as the
aligned system in Section 7.4. However, whenever the phase difference between F1 F2
and S1 S2 modes are multiple of 2π (full waves of retardance), effects from the other two
modes become more noticeable and thus discontinuities occur as shown in Figure 7.17.
Consider a misaligned compound retarder system as shown in Figure 7.20; the
system has two HLRs, one with retardance δ1 and the other with retardance δ2, and the
fast axes are misaligned by . When a ray enters the system, there are four modes, not
two, exiting the system with different optical path lengths (OPLs); F1F2, F1S2, S1F2 and
S1S2 have different OPLs.
Figure 7.20 A system of two HLRs misaligned by θ. Four output beams exiting the
system with four different optical path lengths are shown offset for clarity.
F1S2
S1F2
1,0 2,
F1F2
S1S2
153
A HLR can be represented as sum of horizontal and vertical polarizers with
absolute phases of optical path lengths along fast and slow axes, respectively
1 1
2 21 0 0 0HLR = ,
0 0 0 1
f si OPL i OPL
e e
(7.5.9)
and a HLR at is a sum of linear polarizers at and / 2 with absolute phases of
optical path lengths along fast and slow axes, respectively
2 2
2 2
LR[ ] =
1 cos(2 ) sin(2 ) 1 cos(2 ) sin(2 ).
sin(2 ) 1 cos(2 ) sin(2 ) 1 cos(2 )2 2
f si OPL i OPL
e e
(7.5.10)
Therefore, each mode from the system of two linear retarders at can be
calculated from the multiplication of two linear polarizers with associated OPLs as
absolute phases. For example, the first mode (1 2FF ) is the eigenstate of a system of a
horizontal linear polarizer followed by a linear polarizer at with associated absolute
phases,
1 2
1
( )
1 2
cos( )e .
sin( )
f fi OPL OPLF F
X
(7.5.11)
Similarly, the other three modes are
154
1 2
1 2
1 2
1
2
( )
1 2
( )
1 2
( )
1 2
2
cos( )e
sin( )
sin( )e
cos( )
sin( )e .
cos( )
s f
f s
s s
i OPL OPL
i OPL OPL
i OPL OPL
S F
F S
S S
X
X
X
(7.5.12)
Red X marks indicate the exiting mode from the second retarder is the fast mode and blue
X marks indicate the exiting mode from the second retarder is the slow mode.
A conventional definition of retardance takes the difference in optical path lengths
of two modes exiting the retarder. Since there are more than two modes exiting the
system with the same intensity, the compound system retardance value cannot be
calculated from the conventional eigen-analysis. Multiple modes exiting the compound
retarder system is the origin of the discontinuities in phase unwrapped retardance.
For simplicity in calculation, set 0sOPL . This does not affect retardance of the
system since the retardance is the optical path difference. Using the dispersion model
(Eq. (7.4.4)) each mode’s phase is
1 1 2 21 2
2 21 2
1 11 2
1
1
2
2
1
2
2 2arg( )
2arg( ) 0
2arg( ) 0
arg( ) 0 0 0.
f f
s f
f s
s s
n d n dOPL OPL
n dOPL OPL
n dOPL OPL
OPL OPL
X
X
X
X
(7.5.13)
155
where in is the refractive index of the i
th retarder and
id is the distance that ray
propagates within the ith
retarder.
The retardance is the optical path difference (OPD) between four modes. When
the misalignment is small, most of the intensity of the exiting light is in 21 and XX .
Therefore, the retardance of the compound system as a function of wavelength follows
the curve of the OPD between 21 and XX ,
1 1 2 21 2 1 2
2 ( )major f f s s
n d n dOPL OPL OPL OPL
, (7.5.14)
where major indicates the overall tendency of the compound system’s retardance
behavior; retardance curve of both aligned and misaligned retarder systems follow 1
curve. However, whenever the retardance of one of the retarders becomes 2nπ for an
integer n, effects from the OPLs of the other two modes (12 and XX ) become larger and
the retardance of the compound system deviates from the major curve. An example
compound retarder system, two HLRs with the π/4 misalignment in fast axes, is used for
further studies. The retardance ratio between two retarders is 2 1/ 4/ 2.2 1.82 and
two retarders are made from the same material.
Due to multi-valued OPL for this system, interesting polarization artifacts occur;
sometimes the principal retardance of the system avoids certain values such as 2 n . The
principal retardance is puzzling and discontinuity occurs due to characteristics of
156
ArcTan(λ). Figure 7.21 shows the principal retardance as a function of wavelength for
this system using algorithms in Section 7.1. The principal retardance of the system has
minima other than zero which are shown in blue dotted areas; this is the origin of the
discontinuities in phase unwrapped retardance values near 2nπ.
Figure 7.21 The principal retardance as a function of wavelength for the system
with two HLRs with the fast axes misaligned by π/4. The principal retardance has
minima other than zero.
To better understand the behavior of the fast axis orientation of the compound
system, Figure 7.22 shows the first (green) and second (blue) retarders’ retardance as
functions of wavelength along with the fast axis orientation of the compound system, θfast.
When the retardance of the second retarder becomes 2nπ, the fast axis of the system is
Arbitrary Wavelength Units
157
along θfast = 0. When the retardance of the first retarder becomes 2nπ, the system fast
axis is along θfast = π/4, the misalignment amount between two HLRs.
Figure 7.22 Phase unwrapped retardance of the first (green) and second (blue)
retarders with the fast axis orientation ( fast ) of the compound system. Xs mark
wavelengths where individual plates have integer waves of retardance, and don’t
contribute to the axis of the retarder.
Using Eq. (7.4.6), the principal retardance can be phase unwrapped assuming that
the overall behavior of the retardance is 1/λ. Figure 7.23 shows the phase unwrapped
retardance of the first HLR (green), a 45° fast axis linear retarder (blue), and the
combined system (red). Orange dotted lines indicate 2nπ.
158
Unlike the compound system with 16
misalignment, each mode in this
system has 25% of the total intensity. Therefore, the discontinuities in phase unwrapped
retardance are more apparent than the one in Figure 7.14.
Figure 7.23 Green plot shows the retardance of the HLR, δ1(λ) as a function of
wavelength. Blue plot is the retardance of the 45° fast axis linear retarder, δ2(λ).
Red plot shows the retardance of the system of the two HLRs with π/4 misalignment
between two fast axes orientation, δTotal(λ).
Note that the discontinuities in red plot occur whenever the retardance of one of the
retarders is 2nπ; when one of the retarders has multiple waves of retardance, interferences
159
between 1 2 1 2( , )S F F F and
1 2 1 2( , )FS S S become dominant while major is defined by the
OPD between 1 2 1 2( , )F F S S .
Figure 7.24 shows phase unwrapped Minor of the compound system. The phase
unwrapped Major follows the exact same curve as the
Major in Figure 7.18 since Major is
a linear addition of 1 and
2 of the each linear retarder. The amplitude of the phase
unwrapped Minor is greater than the one in Figure 7.18. This explains the greater
discontinuities in Figure 7.23 compared to Figure 7.17, where the misalignment in fast
axes was /16 .
Figure 7.24 Phase unwrapped Minor plotted as a function of wavelength.
Figure 7.25 shows the principal retardance trajectory of the compound system in
the retarder space as the wavelength reduces. A side and top views are shown for clarity.
160
The trajectory starts at a point A at and stays inside a sphere with
radius π; once the trajectory reaches the boundary of the sphere (point B) it moves to the
opposite point (point B′) on the π sphere and the fast axis changes to the orthogonal state.
Thus, point B and B′ are the same retarder within the π sphere. Each figure shows part of
the principal retardance trajectory in a continuous fashion; point A corresponds to the
retardance for the longest wavelength and point J corresponds to the retardance for the
shortest wavelength. Trajectory follows A→B→ B′→C→ C′→D→ D′→E→ E′→F→
F′→G→ G′→H→ H′→I→ I′→J.
161
162
Figure 7.25 A principal retardance trajectory of the system with two misaligned
HLRs at 45° in the retarder space as the wavelength reduces. When the trajectory
reaches the boundary of π, the trajectory moves to the opposite point on the π
sphere and the fast axis changes to the orthogonal state.
163
Figure 7.26 shows points in Figure 7.25 on phase unwrapped retardance plot as a
function of wavelength. Dotted horizontal lines indicate nπ for an integer n. Whenever
the phase unwrapped retardance crosses nπ lines, points in the retarder space moves to
the opposite points on the π sphere.
Figure 7.26 Phase unwrapped retardance of the compound system as the wavelength
changes with the corresponding points in Figure 7.25.
7.6 Conclusion
Using Pauli spin matrices as basis the principal retardance, horizontal retardance ( H ),
45º retardance ( 45 ) and circular retardance ( R ) can be calculated. These components
164
can be mapped on a retarder space which has 45, , and H R
components as axes. Each
point 1 2 3{ , , } in the retarder space represents a retarder with retardance
2 2 2
1 2 3 which is the distance from the origin to that point. The retarder space
provides an insightful tool to understand the retardance trajectories.
A retardance dispersion model 0( )
is used to phase unwrap the principal
retardance. Systems consisting of two horizontal linear retarders with shared and non-
parallel fast axes were used as examples to demonstrate the principal retardance and
phase unwrapped retardance behaviors. For the misaligned two-waveplate system,
discontinuities in phase unwrapped retardance were identified; sometimes the principal
retardance of the system avoids certain values such as 2 n . The origin of discontinuities
is the multi-valued optical path lengths for compound retarder systems; for the
misaligned two-waveplate system, four modes (1 2F F ,
1 2S F , 1 2F S , and
1 2S S ) have
different OPLs. As the wavelength reduces, individual retarder’s retardance as well as
the fast axis of the compound system changes.
For N waveplates, there will be 2N different optical path lengths. Although this
chapter only explained two linear retarder systems, mathematics for N waveplates is
similar to two-waveplate system.
165
CHAPTER 8 COHERENCE MATRIX AND POLARIZATION RAY
TRACING TENSOR
8.1 Introduction
This chapter considers polarization ray tracing with incoherent light and presents a
method that is suitable for stray light calculation. For non-polarizing systmes, eight
independent parameters – amplitude, phase, three for diattenuation, and three for
retardance – are required to describe polarization characteristics of the media. The
general case, with depolarizing elements or scattering, requires sixteen independent
parameters – amplitude, three for diattenuation, three for retardance, and additional nine
degrees of freedom for depolarization – for a complete polarization characterization of
the system.
To provide mathematical description of the polarization properties of light, a
coherence matrix of an electric field vector in global coordinates is used. A polarization
ray tracing tensor is defined. Algorithms to calculate the tensor from surface amplitude
coefficients defined in local coordinates, a Mueller matrix defined in its local coordinates,
and a three-by-three polarization ray tracing matrix defined in global coordinates, are
derived. The polarization ray tracing tensor is defined in global coordinates and is used
to ray trace incoherent light through optical systems with depolarizing surfaces. The
polarization ray tracing tensor operates on the incident electric field’s coherence matrix
166
and returns the exiting coherence matrix in global coordinates. Therefore, this method is
suitable for scattered light ray tracing and incoherent addition of light.
For the case of a collimated beam of light, polarization ray tracing tensors can be
added to get the exiting coherence matrix. Therefore, the combined polarization ray
tracing tensor is defined for a specific incident propagation vector but not restricted by
the exiting propagation vector. Polarization ray tracing tensor calculus through a volume
of scattering particles is presented as an example.
8.2 The Coherence Matrix
The coherence matrix of a light beam contains all the measurable 2nd
order
correlation information about the state of polarization, including intensity, of an ensemble
of electromagnetic waves at a point56, 57
. This positive semidefinite Hermitian 3 x 3
matrix is defined as
(
)
(
) (8.2.1)
167
where is the instantaneous electric field vector; is the transpose conjugate of
; stands for the Kronecker product; is the complex conjugate of ;
; the brackets indicate the time average of the components
∫
. (8.2.2)
Under the assumption that are stationary and ergodic, the brackets can alternatively
be considered as ensemble averaging of .
In general, the time of measurement T is much larger than the coherence time for
the partially coherent electromagnetic waves. Therefore, is suited to describe
coherency of quasimonochromatic partially polarized light.
Conventional, two-dimensional (2D) Stokes parameters are defined on a plane
perpendicular to the propagation vector of the light, using specific local coordinates. For
a plane wave propagating along the z-axis
(
). (8.2.3)
Thus the Stokes parameters associated with this plane wave are
( ), (8.2.4)
and the degree of polarization for this plane wave is
168
√
√
. (8.2.5)
Similar relations can be developed for plane waves propagating in other directions.
The three-dimensional degree of polarization57
is defined by using the
coherence matrix
|√
(
‖ ‖
‖ ‖
)|, (8.2.6)
where the Euclidean norms are
‖ ‖ ( ) ,
‖ ‖ √(∑ | |
). (8.2.7)
The 3D degree of polarization takes into account not only the degree of
polarization of the mean polarization ellipse but also the stability of the plane that
contains the instantaneous components of the electric field of the wave. Unpolarized
light with a fixed propagation vector direction has 2D degree of polarization
whereas . Further discussions on can be found in the ref [57].
169
8.3 Projection of the Coherence Matrix onto Arbitrary Planes
To understand the 2nd
order correlations that result when two wavefronts with two
different electric fields overlap at a point, each electric field vector is converted to a
coherence matrix ,
,
. (8.3.1)
Coherence matrices can be added since the addition operator and integral operator
commute. Therefore, the total coherence matrix of the two wavefronts is
. (8.3.2)
The advantage of is it provides incoherent addition of two wavefronts in
global coordinates with complete polarization information along all three axes. Therefore,
this method is particularly useful for incoherent addition of multiple wavefronts with
different propagation directions. Scattered light wavefronts do not follow law of
reflection, refraction or diffraction but have various distributions of propagation
directions depending on the type of scattering at the ray intercept. Since each coherence
matrix is defined in global coordinates, simple summation of coherence matrices
provides the incoherent addition of wavefronts with different propagation directions. 3D
electric field vectors or the coherence matrices contain full information along x, y, and z
direction. However, polarization state is defined on a 2D plane and majority of
polarization analysis or intensity measurements are done on a 2D plane. Therefore, an
170
algorithm to find a projection of the coherence matrix onto an arbitrary plane is
necessary.
Projecting onto an arbitrary plane of interest is done by using proper
local coordinates on the plane and , which are perpendicular to the plane’s
surface normal
. (8.3.3)
Then, projected onto a plane spanned by and is
(
). (8.3.4)
This matrix is written in its local coordinates as Jones matrices are often written in s and
p local coordinates. Using Eq. (8.5.4), can be written in global coordinates,
. (8.3.5)
where .
8.4 A Definition of Polarization Ray Tracing Tensor
The coherence matrix is defined in global coordinates and thus allows incoherent
addition of light by simple addition. Therefore, an operator which deals with in
171
global coordinates can provide a new tool for incoherent ray tracing through depolarizing
optical systems.
A Polarization Ray Tracing Tensor (, , ,i j k ltT ) describes a depolarizing or non-
depolarizing polarization element or an interaction at a ray intercept
(
(
) (
) (
)
(
) (
) (
)
(
) (
) (
))
.(8.4.1)
operates on the incident coherence matrix which is defined in global
coordinates
∑
(8.4.2)
yielding the output coherence matrix where i, j, k, l = x, y, z, i.e.,
(
) (
) . (8.4.3)
The main advantage of the polarization ray tracing tensor over a polarization
ray tracing matrix P is that can describe depolarizing optical systems. Therefore
having an index that indicates how depolarizing a given tensor is will be meaningful.
172
Analogous to how the Mueller depolarization index58, 59
is defined, the depolarization
index of the can be defined when the tensor is associated with a single exiting
propagation vector. However, since the definition of depolarization index for the tensor
is not complete, it is included in Appendix B.
For a ray propagating through an optical system with multiple surfaces, each
surface in the system contributes a polarization ray tracing tensor. To get a cumulative
polarization ray tracing tensor for that particular ray, cascading the tensors is necessary.
Figure 8.1 shows an example optical system with a triplet followed by a lens barrel. A
collimated grid of rays enters the optical system, propagates through the triplet, and
scatters off the lens barrel before it reaches the detector. Each ray in the grid has a
polarization ray tracing tensor at each ray intercept. In order to ray trace through the
entire system, each ray’s polarization ray tracing tensors need to be cascaded to get a grid
of cumulative polarization ray tracing tensors at the detector.
173
Figure 8.1 A triplet followed by a lens barrel. A collimated grid of rays propagates
through the triplet and scatters off from the lens barrel before it reaches the
detector plane on the right.
Considering the primary property of the polarization ray tracing tensor shown in
Eq. (8.4.2) and its dimensions ( ), cascading two polarization ray tracing tensors
and is
∑ , (8.4.4)
and the exiting coherence matrix after the is
( )
∑ (
)
∑ ∑
. (8.4.5)
Similarly, the (i, j ) component of the for a ray propagating through N
ray intercepts can be calculated by cascading summations,
∑ ∑ ∑ ∑
(8.4.6)
When a collimated N-by-N grid of rays with a propagation vector enters an
optical system with N surfaces, each ray’s cumulative polarization ray tracing tensor can
be added to get the exiting coherence matrix for the incoherent addition of the exiting
rays,
174
( )
∑ (∑
)
( )
, (8.4.7)
where q stands for the ray index, . Since all the rays in the incident grid
have the same , all the cumulative polarization ray tracing tensors can be added and
then applied to to get
. Note that the combined polarization ray tracing
tensor (∑
) is defined for a single but is not restricted for the exiting
propagation vector direction. Thus, the combined polarization ray tracing tensor can
accommodate multiple exiting propagation vector directions, and this is one of the main
advantages of using the tensor method in stray light calculus.
If the incident grid of rays were not collimated, then the exiting coherence matrix
of each ray needs to be calculated using Eq. (8.4.2) and then added incoherently in order
to get the incoherent addition of the exiting rays,
∑ ∑ ( )
. (8.4.8)
8.5 A Polarization Ray Tracing Tensor for a Non-depolarizing Ray
Intercept
Although the main purpose of using the polarization ray tracing tensor is incoherent ray
tracing through depolarizing optical systems, can be used for the incoherent ray tracing
through non-depolarizing optical systems. In this section, two ways of calculating the
tensor are presented; one is using amplitude coefficients, defined in the surface local
175
coordinates. The other is using the three-by-three polarization ray tracing matrix, defined
in global coordinates.
8.5.1 A Polarization Ray Tracing Tensor from Surface Amplitude
Coefficients
If surface reflection or transmission coefficients are given in local coordinates
as in Section 2.2 for the light propagating along , the exiting electric field vector
projected onto the local coordinate plane perpendicular to is
, ,
, ,
Out s In sss ps
sp ppOut p In p
E E
E E
. (8.5.1)
Therefore, the exiting coherence matrix in this local coordinates ( ) is
,
,
, (8.5.2)
,
,
where
Eq. (8.5.2) can be written in terms of the polarization ray tracing tensor in local
{ } and { } coordinates, ,
176
( )
(
)
∑ ( )
∑
(
(
)
(
)
(
)
(
)
)
(
)
(8.5.3)
where (
).
The polarization ray tracing tensor in global coordinates ( ) can be calculated by
applying proper coordinate transformation from the local coordinates to global
coordinates using rotation matrices. Using vectors defined in Eq. (2.2.3)
as basis vectors of the rotation matrices, the incident coherence matrix ( ) in global
coordinates is
, (8.5.4)
where { } . Thus, the incident coherence matrix in local coordinates
( can be written as a function of ’s
(
)
. (8.5.5)
177
Similarly, the exiting coherence matrix in local coordinates ( ) is
, (8.5.6)
where { } .
Inserting Eq. (8.5.5) and (8.5.6) to Eq. (8.5.3),
∑
. (8.5.7)
Therefore, the exiting coherence matrix in global coordinates is
[ {∑ (
)
} ] . (8.5.8)
Comparing Eq. (8.4.2) and (8.5.8), the components of the polarization ray tracing
tensor in global coordinates ( ) are the corresponding coefficients of for
using Eq. (8.5.8). Table 8.1 shows the polarization ray tracing tensor ( ) in
global coordinates as a function of amplitude coefficients in local coordinates and the
incident and exiting local coordinate basis vectors for,
{ } { }
{ } { }
( )
178
(
) (
)
(
) (
)
(
) (
) (8.5.9)
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
179
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
180
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
( )
Table 8.1 A polarization ray tracing tensor in global coordinates as a function of
amplitude coefficients in local coordinates. Each shows a three-by-three matrix
component of the tensor.
In Mathematica, the tensor can be found by using the following command
[ ] (8.5.10)
181
where is calculated from Eq. (8.5.8).
8.5.2 A Polarization Ray Tracing Tensor from the three-by-three
Polarization Ray Tracing Matrix P
This section presents an algorithm to calculate the polarization ray tracing tensor T that
corresponds to a three-by-three polarization ray tracing matrix P. This conversion is
straightforward since the P matrix is already defined in global coordinates. The exiting
electric field vector from the P matrix is
(
) (
)(
) (
).
(8.5.11)
Using Eq. (8.2.1), can be calculated and comparing the result with Eq.
(8.4.2), the relationship of the polarization ray tracing tensor to the P matrix is
, (8.5.12)
where i, j, k, l = 1, 2, 3. Again, a set of Ts can be added for a collimated grid of incident
rays using Eq. (8.4.7).
If a polarization ray tracing tensor T is associated with a single { pair
and is representing a non-depolarizing incoherent ray trace, associated three-by-three
polarization ray tracing matrix P can be uniquely defined to within an unknown absolute
phase . Since the tensor T is associated with intensity values of the electric field
182
vector while the P matrix is associated with the amplitude values of the electric field
vector, the absolute phase of the P matrix is lost when transforming P into T. First, the
norm of each element in P matrix is calculated. Then, the phase of each element is
calculated relative to . Expressing P’s elements in polar coordinates, the matrix
becomes
(
) (
). (8.5.13)
From Eq. (8.5.12), diagonal elements in the tensor gives the norm of the P matrix
elements
| |
, (8.5.14)
therefore,
√ . (8.5.15)
The phase of can be calculated by choosing a reference; if for a
particular i and j, its phase can be set to the absolute phase, . And all the other
phases are defined relative to the absolute phase. From Eq. (8.5.12)
[ { (
)}
. (8.5.16)
For example, if
183
[ { (
)}
. (8.5.17)
Using Eq. (8.5.15) and (8.5.16), the P matrix is uniquely defined with the absolute
phase .
8.5.3 Example Polarization Ray Tracing Tensor Calculation
This section provides an example tensor calculation where the incident light reflects from
an Aluminum coated surface with the following parameters (shown in the Figure 8.2):
{
√
} , (8.5.18)
where n is Aluminum’s refractive index at 500nm. The corresponding amplitude
reflection coefficients ’s are the Fresnel reflection coefficients,
(
) (
) (
). (8.5.19)
184
Figure 8.2 A ray propagating along the z axis reflects from an Aluminum coated
surface.
The polarization ray tracing tensor in local coordinates is
(
(
) (
)
(
) (
)
)
,
(
)
(
√
√
)
. (8.5.20)
Since the incident propagation vector is along the z-axis, any incident electric
field vector can be written as . Therefore, the incident coherence
matrix in local coordinates and the one in global coordinates are the same
(
) (8.5.21)
Using Eq. (8.5.8),
(
).(8.5.22)
185
Therefore, the polarization ray tracing tensor in global coordinates is
(
(
) (
) (
)
(
) (
) (
)
(
) (
) (
))
.
(8.5.23)
Using the algorithm in Section 2.2, a three-dimensional polarization ray tracing
matrix can be calculated for this example,
(
), (8.5.24)
and the exiting electric field vector is
(
), (8.5.25)
which gives the same as in Eq. (8.5.22).
8.6 A Polarization Ray Tracing Tensor for a Ray Intercept with Scattering
In order to ray trace through optical systems with scattering surfaces or depolarizing
surfaces, Fresnel coefficients or amplitude coefficients do not provide sufficient
186
information to describe polarization characteristics of such interactions. In general, a
Mueller matrix or a Mueller matrix BRDF is used to describe depolarizing optical
surfaces. In this section, a method to transform the Mueller matrix into a polarization ray
tracing tensor when the incident and exiting propagation vector, a Mueller matrix, and its
local coordinates for the incident and exiting space are given. The logic is analogous to
the previous section and Eq. (8.4.2) and (8.5.10) still holds. The only differences are the
intermediate steps in getting a relationship between and
.
As shown in Eq. (8.2.4), 2D Stokes parameters are related to the coherence matrix
elements. Similar to Jones vectors, 2D Stokes parameters are defined in its local
coordinates (Section 9.3). Therefore, the tensor can be calculated by representing the
incident and exiting Stokes vectors in global coordinate coherence matrix elements. 2D
Mueller calculus shows
(
)
(
)
(
)
. (8.6.1)
The incident Stokes vector has a coherence matrix in the incident local
coordinates { },
(
), (8.6.2)
187
and the exiting Stokes vector has a coherence matrix in the exiting local coordinates
{ },
(
) (8.6.3)
where ( ) ( ) ( ) ( )
( )
,
( ) ( ) ( ) ( )
( )
,
and each local coordinate basis vectors form right-handed local coordinates,
where i = In, Out.
Using the inverse of Eq. (8.5.4) can be written as a function of ’s
in global coordinates
, (8.6.4)
where { }
. Eq. (8.6.4) provides relationship between ’s and
’s.
Similarly,
188
, (8.6.5)
where { } .
Using Eq. (8.6.2) and (8.6.4), can be written as a function of ’s. Then
using Eq. (8.6.3), can be written as a function of and where i, j = x, y, z
and k, l = 0, 1, 2, 3. Then can be written as by using Eq. (8.6.5) i.e.,
can be written as a function of and .
Again, the components of the tensor are the coefficients of for
.
8.6.1 Example Polarization Ray Tracing Tensor Calculation
In this section, the example in Section 8.5.3 is revisited; the incident and exiting
propagation vectors and a Mueller matrix, which is defined in the incident and exiting
local coordinates are the given parameters,
{
√
}, (
)
(
√
√
)
(
). (8.6.6)
Using Eq. (8.6.4)
189
(
) (
). (8.6.7)
Thus,
(
)
(
( ))
(8.6.8)
From the Mueller matrix and Eq. (8.6.8)
(
( ) ( )
( ) ( )
( ) ( )
( ) ( ) )
, (8.6.9)
and
(
)
(
) (8.6.10)
Using Eq. (8.6.5), the exiting coherence matrix in global coordinates is
190
(
),(8.6.11)
and the polarization ray tracing tensor is
(
(
) (
) (
)
(
) (
) (
)
(
) (
) (
))
,
(8.6.12)
which are the same as Eq. (8.5.22) and (8.5.23).
8.7 Example Polarization Ray Tracing Tensor and Combination of
Tensors
Most clouds have various particles with different scattering properties, sizes, refractive
indices, etc. In this section, a simple and tractable but also realistic cloud model is set up
in order to understand some aspects of the complex phenomena of cloud polarization.
The polarization ray tracing tensor calculus is implemented to ray trace through an
example cloud model and incoherently added for the data analysis. The scattering
particles are spherical water droplets with refractive index of
in air with refractive index of 1.0002857; this is the simplified model of cubical cloud.
Mie scattering is assumed and the s and p polarization reflection coefficients at various
191
scattering angles are calculated from MiePlot60
. The black body radiation from the sun
with the spectrum between 380nm to 700nm is assumed for the light source. The size of
the water droplets has a normal distribution with mean of 5μm and 5% standard deviation.
Scattered light intensity is calculated by averaging over 30 different wavelengths within
the spectrum and 50 different water droplet sizes from the normal distribution as the
scattering angle changes. The 50 sample sizes are shown in Appendix C.
The geometry of the volume scattering calculation is shown in Figure 8.3 in two
different views. The darker red array of arrows indicates the incident beam of light and
different colored thicker arrows indicate sample single and double scattered ray paths
from the incident light to the detector.
Figure 8.3 A volume of water droplets in the air which scatters the incident
collimated beam of light. The incident beam of light is plotted in dark red arrows
and some of the individual scattering ray paths are shown in different colors.
192
Twenty seven scattering volumes of water droplets are positioned in a cubic grid.
The position vectors are , , and . Each water
droplet is numbered from 1 to 27 starting from to . A collimated beam of
light from the sun is incident on the scattering volume along ˆInk . A polarimeter views
the volume along the x-axis ( ˆOutk ) which is chosen to allow easy summation of many
different ray paths,
ˆ ˆ{1,1,1}/ 3, {1,0,0}.In Out k k (8.7.1)
No absorption or extinction is assumed along the ray paths. The majority of the
ray paths experience two scattering events (red, green, and orange ray paths in Figure 8.3)
and the rest experiences single scattering event (blue ray path in Figure 8.3). Ray paths
with single scattering event are called 1path and ray paths with two scattering events are
called 2path . There are 27 1path and 702 2path ray paths. By fixing the viewing angle
of the polarimeter along ˆOutk , only the scattered light along ˆ
Outk after the first scattering
for 1path , and after the second scattering for 2path , get detected by the polarimeter.
A polarization ray tracing tensor is calculated for each scattering event and for
each ray path using the reflection coefficients calculated from the Mie scattering function
at a given scattering angle as shown in Figure 8.4. Each tensor has subscripts q and r
where r stands for the first water droplet and q stands for the second water droplet from
which each ray path scatters; . When , represents the single
scattering tensor from the qth
water droplet. Scattering angles are in degrees and the s
193
polarization reflection coefficients are plotted in red whereas the p polarization reflection
coefficients are plotted in blue. The scattering angle is the angle between the incident
and scattered light propagation vectors. If the scattering angle is less than 90º, the
interaction is forward scattering since the propagation vectors are along the same
direction and if the scattering angle is greater than 90º, the interaction is backward
scattering.
Figure 8.4 s (red) and p (blue) polarization reflection coefficients as a function of
scattering angle calculated from the MiePlot program.
Then tensors representing ray paths that scatter from the qth
water droplet toward
the polarimeter are added
∑ (8.7.2)
where . Each now contains depolarization effects from scattering.
194
The last step is adding the tensors from the same colored water droplets (along the
x-axis) and calculating ,
∑ (8.7.3)
where . This step creates a nine-by-nine grid of polarization ray tracing
tensors as shown in Figure 8.5. The false color in the figure is coded to match the color
for each column of water droplets.
Figure 8.5 A nine-by-nine grid of the detector with its surface normal along the x-
axis is shown. Each false color corresponds to the summation of the polarization
ray tracing tensors along the x-axis over the same color mapped water droplets.
The polarization ray tracing tensors corresponding to detector pixels are
195
196
197
(8.7.4)
For an electric field vector with a coherence matrix
(
), the exiting coherence matrix ( ) after the propagating through
a volume of scatterers and detected at jth
pixel can be calculated from Eq. (8.4.2) and
in Eq. (8.7.4). For example, is
,
198
( ) ( ) ( ),
,
( ( ) ),
( )
( )
,
( )
( ) (8.7.5)
The coherence matrix of unpolarized light is
(
) (8.7.6)
and the . This light is equivalent to a 2D Stokes vector with the
propagation vector ˆInk . The exiting coherence matrix at each pixel on the detector is
calculated from tensors in Eq. (8.7.4)
(
),
(
),
199
(
),
(
),
(
),
(
),
(
),
(
),
(
).
(8.7.7)
Note that none of the exiting coherence matrices have x-electric field components since
the exiting propagation vector is along the x-axis. The 3D degree of polarization of each
exiting coherence matrix indicates that the exiting light is mostly unpolarized. Figure 8.6
show for each pixel number 1, 2,…, 9.
200
Figure 8.6 3D degree of polarization is calculated for each exiting coherence matrix
at the detector. The x-axis indicates the pixel number. 3D DOP shows that the
exiting light is mostly unpolarized.
Each exiting coherence matrix can be reduced to 2D Stokes vectors in its local
coordinates on the detector plane,
(
), (
),
(
), (
),
(
), (
),
201
(
), (
),
(
), (8.7.8)
where the local coordinates are
0 0 1
ˆˆ ˆ0 , 1 , 0 .
1 0 0
Loc Loc Out
-
x y k (8.7.9)
2D degree of polarization can be calculated for each 2D Stokes vector and is
plotted in Figure 8.7. Again, 2D degree of polarization indicates that the exiting light is
mostly unpolarized.
Figure 8.7 2D degree of polarization of each 2D Stokes vector is calculated at each
pixel on the detector. The values indicate that the exiting light is mostly unpolarized.
202
Both 2D and 3D degree of polarization of pixels 1, 2, 3, 4, and 7 are the same
values. The pixels 1, 2, and 3 are aligned with the bottom row (the smallest z-value) of
the water droplets and the pixels 3, 4, and 7 are aligned with the left most column (the
smallest y-values) of the water droplets, which are the closest water droplets from the
collimated incident plane wave. Therefore, most of the 2path exiting from the water
droplets that are aligned with pixels 1, 2, 3, 4, and 7 come from the backward scattering
(scattering angles > 90º) whereas other water droplets have more forward scatterings than
backward scatterings. All the single scattering paths 1path are equally distributed
among nine pixels. As shown in Figure 8.4 backward scattering reflection coefficients
are smaller than the forward scattering reflection coefficients. Therefore, 0S components
of the 2D Stokes vectors in Eq. (8.7.8) are smaller for the pixels 1, 2, 3, 4, and 7 than
other pixels as shown in Figure 8.8.
Figure 8.8 S0 components of the exiting 2D Stokes vectors at each pixel.
203
However, the diattenuation along the s-polarization for the backward scattering is
greater than that of the forward scattering as shown in Figure 8.9. Therefore, the
backward scattered light is more polarized than the forward scattered light as shown in
Figure 8.6 and Figure 8.7.
Figure 8.9 Diattenation of the scattered light calculated from the Mie Plot program.
For positive values s polarization has the greater scattering amplitudes than p
polarization.
For rays following 1path , polarization of the scattered light is s-polarized since
they experience single scattering. The s-polarization for this example is
ˆ ˆˆ {0, 1,1}/ 0.Out In - s k k (8.7.10)
204
The s-polarization is linearly polarized light at 45º in the detector’s local coordinates.
Therefore, 2S components of the 2D Stokes vectors in Eq. (8.7.8) provide how much of s-
polarization exists in the exiting light. Figure 8.10 shows the 2S components of the 2D
Stokes vectors at each pixel. Again, the pixels 1, 2, 3, 4, and 7 have the same value.
Figure 8.10 S2 components of the exiting 2D Stokes vectors at each pixel.
However, OutS at each pixel is incoherent addition of exiting vectors from 1path
and 2path . Therefore, the polarization of the exiting light is not purely s-polarized.
Figure 8.11 shows the orientation of the exiting light polarization OutS on the detector in
red arrows. Blue dashed arrows are linearly polarized light at 45º on the detector plane.
OutS is mostly polarized along 45º with little deviations,
(
). (8.7.11)
205
Figure 8.11 Exiting light polarization vectors on the detector plane are shown in red
whereas linearly polarized light at 45º are shown in dashed blue.
This example can be extended to describe the larger cubical cloud by using more
scattering water droplets. Similar example can be setup with different scattering volumes
by choosing different refractive index of the scattering particles and the atmosphere. The
incident light properties as well as the camera / detector viewing angle can be changed.
All the tensor calculation methods that have been used in this example are general and
can be modified depending on the assumptions and other conditions of the applications.
206
8.8 Conclusion
Algorithms for the incoherent polarization ray tracing through depolarizing optical
systems are presented. A coherence matrix of the incident ray’s electric field vector
( ) and a coherence matrix of the exiting ray’s electric field vector (
) at qth
ray
intercept are related by a polarization ray tracing tensor, by Eq. (8.4.2). By cascading
the polarization ray tracing tensors as shown in Eq. (8.4.6), the electric field vector’s
coherence matrix at the exit pupil or a detector can be calculated from the coherence
matrix of the incident electric field vector at the entrance pupil or the source. Since ,
and are defined in global coordinates, incoherent addition of the coherence
matrices or polarization ray tracing tensors are much less error prone than adding 2D
Stokes parameters or Mueller matrices. As shown in Eq. (8.4.7), the polarization ray
tracing tensor is not restricted by a single ˆOutk , which is a critical characteristic for
dealing with scattering or stray light analysis.
207
CHAPTER 9 THREE-DIMENSIONAL (3D) STOKES PARAMETERS
In Chapter 8 incoherent ray tracing was developed with coherence matrices and
polarization ray tracing tensors. In this chapter 3D generalization of Stokes and Mueller
matrices are explored for incoherent ray tracing, but it is still under study. At this time
we don’t recommend 3D Stokes or Mueller method since our current thinking is that
coherence matrices are more straightforward.
9.1 Definition of 3D Stokes Parameters
Components of the 3D coherence matrix define three-dimensional (3D) Stokes
parameters by expanding with Gell-Mann matrices as a basis. These are eight
generators of the SU(3) symmetry group and a 3x3 unit matrix61
. The basis matrices are
Hermitian, trace orthogonal, and linearly independent
(
) √
(
) √
(
)
√
(
) √
(
) √
(
) (9.1.1)
√
(
) √
(
)
√ (
)
For the basis matrices, the following trace-orthogonality equation holds
208
( ) . (9.1.2)
The basis matrices in (9.1.1) allow the coherence matrix to be expressed as
∑
(9.1.3)
where the nine real coefficients are the 3D Stokes parameters,
, √
,
√
( ) √ , √
√
√
√ , √
√ , (9.1.4)
√
√ , √
√ ,
√ ( )
The coherence matrix can be represented using
(
√ √ √ √ √ √
√ √ √ √ √ √
√ √ √ √ √
). (9.1.5)
is analogous to 2D Stokes parameter and both are proportional to the total
intensity of the light. is analogous to and shows predominance of the x component
of the electric field intensity ( ) or of the y component ( ). and are
209
analogous to and on the xy-plane; if projection of the field component on
the 45º bisector axis of xy-plane is dominant and if right circular polarization
component on the xy-plane is dominant. Similarly interpretation holds for and )
on xz-plane and and ) on yz-plane. represents the intensity in the xy-plane
additional to that in the z-direction.
3D Stokes parameters are derived from an ensemble of three-element electric
field vectors defined in global coordinates. Therefore, 3D Stokes parameters represents
the predominance of x, y, or z component of the field in global coordinates, regardless of
the propagation vector direction. Since a polarization state is always defined relative to a
propagation vector direction with specific local coordinates, positive 3D Stokes
parameter does not always mean the predominance of polarization states along that
particular axis. For example, a 3D Stokes vector for a right circularly polarized light
( √ ) propagating along the y-axis is
{ √
√
√ }. (9.1.6)
Note that of is negative for this right circularly polarized 3D Stokes vector
propagating along the y-axis.
The three-dimensional degree of polarization57
in Eq. (8.2.6) can be written using
the 3D Stokes parameters
|√
∑
|. (9.1.7)
210
Since the 3D Stokes parameters are defined in global coordinates, ’s provide
complete polarization information in any direction in three-dimensions. Although ’s
are real valued quantities which are measurable in real experiments, its basis matrices ( )
have imaginary values. Therefore, finding a projection operator for ’s is not easy. On
the other hand, the coherence matrix is a Hermitian matrix defined in global coordinates
and projection onto a plane is a simply inner product as shown in Eq. (8.3.5). Therefore,
the projected coherence matrix is converted to 3D Stokes parameters ( ) using Eq. (9.1.4)
to calculate incoherent addition of 3D Stokes parameters on an arbitrary plane.
9.2 Example Incoherent Additions of 3D Stokes Parameters
This section has an example which explains the basic properties of 3D Stokes parameters.
In this example, three mutually incoherent plane wavefronts meet at the origin as shown
in Figure 9.1.
211
Figure 9.1 Three plane wavefronts propagating along the x, y, and z axis are meeting
at the origin.
is propagating along the x-axis,
is propagating along the y-axis, and is propagating along the z-
axis. Each electric field vector relates to a coherence matrix and can be incoherently
added
(
), (9.2.1)
212
where
If the projection plane is on xz-plane, and (
). Using
Eq.(8.3.5) ,
(
). (9.2.2)
As expected, has no y-component. 3D Stokes parameters for this projected field
are
, ,
√
, √
, √
, (9.2.3)
,
√ √ .
Table 9.1 shows the result of the incoherent addition of different electric field
vector components with equal amplitudes projected onto xz-plane. Incoherent addition of
orthogonal polarizations results in an unpolarized 3D Stokes vector ( ) with twice
the intensity of individual electric fields. For this example, only x and z components of
213
the electric field vector contributes to 3D Stokes parameters on the xz-plane. Therefore,
all the possible combinations of orthogonal polarizations have the same unpolarized 3D
Stokes vector, . Incoherent addition of the same polarizations results a fully
polarized 3D Stokes vector with twice the intensity of the individual electric field. The
diagonal elements are left empty since they represent addition of each electric field vector
with itself. For the case of is when is a linearly polarized along 135º
axis between x and z axes, propagating along the y-axis.
Table 9.1 3D Stokes vectors from an incoherent addition of three electric field
vectors with different polarization states, measured on xz-plane are shown. All the
amplitudes are set to 1.0 for the simplicity.
The 3D Stokes vectors corresponding to the Table 9.1 entries are as follows
214
(
√
√ )
=
(
√
√ )
=
(
√ )
=
(
√
√
√ )
.
(9.2.4)
Consider if all three electric fields are right circularly polarized,
(
√
√
) (
√
√
) (
√
√
) (9.2.5)
the resulting coherence matrix is
(
), (9.2.6)
and the 3D Stokes parameters are
(
√
√
√
)
, (9.2.7)
215
which has the total intensity of three and three circular Stokes components. Using Eq.
(9.1.7), the degree of polarization of the total 3D Stokes vector is 0.5.
When is projected onto yz, xz, and xy planes,
(
)
(
) (9.2.8)
(
).
The 3D Stokes parameters for each are
(
√
√
√ )
(
√
√
√ )
(
√
√ )
(9.2.9)
All three projections give total intensity of 2; each projection gets full intensity from one
of the electric field vectors and each of the other two electric field vectors contributes ½
of the intensity. The degree of polarization of all three vectors in Eq. (9.2.9) is √ .
216
and in Eq. (9.2.9) have which is analogous to 2D Stokes parameter
and which represents the intensity in the xy-plane additional to that in the z-direction.
Although Eq. (9.2.8) does not imply any linear polarization components, corresponding
3D Stokes vectors have and . This is the confusing aspect of the 3D Stokes
parameters which requires further research.
When is projected onto a plane B which is perpendicular to √ ,
(
)
, (9.2.10)
and the 3D Stokes vector is
(
√
√
√
√
√
√
)
. (9.2.11)
The degree of polarization of this vector is .
217
9.3 2D Stokes Parameters to 3D Stokes Parameters
2D Stokes parameters are defined in local coordinates and 3D Stokes parameters are
defined in global coordinates. Therefore, the coherence matrix can be used as an
intermediate step between the 2D Stokes parameters and 3D Stokes parameters. Using
Eq. (8.6.2) and (8.5.4), the coherence matrix can be written as a function of 2D
Stokes parameters . Then using Eq. (9.1.4), 3D Stokes parameters can be
calculated from a given 2D Stokes parameters.
For example, linearly polarized at 45º in xy-plane with is in
2D Stokes representations. 3D Stokes parameter for the same light is
√
√ The vector
√
√
√
represents the same
polarization propagating along the y-axis while the vector √
√
√
represents the same polarization propagating along the x-axis.
9.4 3D Mueller Matrix
In this section, 3D generalization of Mueller matrix is presented. The main purpose of
this chapter is demonstrating the algorithm which calculates a nine-by-nine Mueller
matrix ( defined in global coordinates. The nine-by-nine Mueller matrix operates on
a 3D Stokes parameters matrix,
218
(
)
(
)
. (9.4.1)
Similar to the polarization ray tracing tensor calculation from a given 2D Mueller
matrix, the incident and exiting coherence matrices in local coordinates are calculated
first and then transformed to 3D Stokes parameters using the following equations:
(
)
(
)
,
219
(
√
√
√
√
√
√
√
√
√
√
√
√
√
√
√
√ √ )
.
(9.4.3)
Using Eq. (8.2.4) and , the incident and exiting coherence matrices
for a ray propagating along the z-axis, and
, can be written as a
function of the incident 2D Stokes parameters and ’s. This is analogous to the
method used in Section 8.6 and equations are copied for the convenience of the readers
(
), (8.6.2)
( ) ( ) ( ) ( )
220
( )
( )
( ) ( ) ( ) ( )
. (8.6.3)
A nine-by-nine matrix satisfies
, (9.4.4)
where is a flattened coherence matrix in local coordinates into a nine element vector,
(
)
(
)
. (9.4.5)
The vector assumes that the propagation vector is along the z-axis.
Using the above three equations, components of a nine-by-nine matrix can
be calculated
( )
( )
221
( )
( )
( )
,
( )
( )
( )
,
( )
( )
( )
,
( )
( ),
(9.4.6)
where (
).
Using a proper rotation matrix, a nine-by-nine matrix defined in global
coordinates can be calculated,
( ) . (9.4.7)
222
The matrix represents a surface matrix for a ray propagating along while the
matrix represents the corresponding matrix for a ray propagating along
(
)
(
)
. (9.4.8)
The can be any rotation matrix which rotates to and my
preferred choice of rotation matrix is provided in the Table 9.2. This rotation matrix is
constructed in nine-by-nine format based on a direction cosine rotation matrix which
Mathematica62
uses. In Mathematica, a three-by-three rotation matrix which rotates
to is
{ }
(
√
√
√
√
√
)
. (9.4.9)
The matrix rotates to { }. By calculating how { } changes after
the rotation, components of matrix are calculated.
223
√
0 0 0
√
0
√
0 0 0
√
√
√
0 0 0
√
√
0
√
0
√
√
0
√
224
√
√
√
√
√
√
√
0
√
0
√
√
0
√
√
√
√
√
225
√
√
√
Table 9.2 A nine-by-nine rotation matrix RotM( elements are shown. The
RotM( matrix rotates to .
The inverse of Eq. gives a nine-by-nine matrix which converts 3D Stokes
parameters to flattened coherence matrix vectors
(
)
(
)
, (9.4.10)
226
(
√
√
√
√
√
√
√
√
√
√
√
√
√
√
√
√
√
)
. (9.4.11)
Then Eq.(9.4.8) can be written as
, (9.4.12)
and therefore
. (9.4.13)
Thus, the 3D Mueller matrix ( ) is
( ) ( ) . (9.4.14)
9.4.1 Example 3D Mueller Calculation
In this section, the same example used in Section 8.5.3 and 8.6.1 is revisited; the incident
and exiting propagation vectors and a Mueller matrix, which is defined in the incident
and exiting local coordinates are the given parameters,
227
{
√
}, (
).
(9.4.15)
Using Eq. (9.4.6) the nine-by-nine Mueller matrix for the light propagating along
the z-axis is
(
)
.
(9.4.16)
From Table 9.2,
( )
(
)
228
( )
(
√
√
√
√
√
√
√
√
√
√
√
√
)
. (9.4.17)
Using Eq. (9.4.7) for this surface reflection is
(
)
.
(9.4.18)
The calculated from Eq. (9.4.8) is
229
(
)
, (9.4.19)
which is exactly the same as calculated from the tensor methods with either the
amplitude coefficients (Eq. (8.5.22)) or the 2D Mueller matrix (Eq. (8.6.11)).
Using Eq. (9.4.14) the 3D Mueller matrix is
(
)
, (9.4.20)
and the exiting 3D Stokes parameters for the incident 3D Stokes parameters are
(
)
. (9.4.21)
230
9.5 Conclusion
In this chapter 3D Stokes parameters and 3D Mueller matrix are defined. The 3D Stokes
parameters are the expansion coefficients of the coherence matrix when Gell-Mann
matrices are used as basis matrices. The 3D Mueller matrix is a generalized 2D Mueller
matrix in global coordinates. Similar to three-by-three polarization ray tracing matrix P,
the 3D Mueller matrix is designed to incorporate rays propagating in any directions.
As mentioned in the beginning of this chapter, further research is required to
better understand 3D Stokes parameters and 3D Mueller matrices. I think that coherence
matrix calculus is more straightforward and easier to understand than the 3D Stokes
parameters and Mueller matrices, but in my mind this remains an open question. I hope
that this chapter aids others in understanding the relationship and characteristics of the
nine-by-nine matrix calculus and the three-by-three-by-three-by-three tensor calculus.
231
CHAPTER 10 CONCLUSIONS AND FUTURE WORK
10.1 Summary
This work contains two major topics; coherent polarization ray tracing and incoherent
polarization ray tracing. Unlike Jones or Mueller calculus, both coherent and incoherent
polarization ray tracing methods are defined in global coordinates. Thus they provide an
easy basis to interpret the polarization properties for most systems. They avoid the
apparent rapid variation of polarization states and properties around local coordinate
singularities. However, it still remains straightforward to convert results from global
coordinates into other local coordinate bases.
Coherent polarization ray tracing uses a three-by-three polarization ray tracing
matrix P, which is the generalization of the Jones matrix. The three-by-three polarization
ray tracing matrix describes all polarization state changes due to diattenuation, geometric
transformations, and retardance. The calculation of diattenuation is achieved via singular
value decomposition of the three-by-three polarization ray tracing matrix. The parallel
transport matrix Q describes the associated non-polarizing optical system and thus keeps
track of the geometric transformation. To calculate the true polarization-dependent phase
change, also known as the retardance, the geometric transformation needs to be removed.
1Μ Q P is a fundamental equation for calculating retardance without spurious circular
retardance arising from a poor choice of local coordinates. Μ clarifies the meaning of
the troublesome minus sign in the Jones matrix for reflection. One important and not
232
initially obvious result of my analysis is that the proper retardance cannot be assigned to
an optical system inside a black box whose ray propagation vectors are unknown. Two
rays with different ray paths through an optical system can have the same polarization ray
tracing matrix but different retardances.
The three-by-three polarization ray tracing matrix contains corrections to optical
path length from coatings and other polarization effects. A polarization aberration
function ( )P r is a grid of polarization ray tracing matrices over the exit pupil. ( )P r and
( )OPL r provide a generalized wavefront aberration function that characterizes the
polarization-dependent transformations of a wavefront.
A skew aberration is a component of polarization aberration which originates
from purely geometric effects. Skew aberration of an optical system is calculated from
the parallel transport matrix Q. Skew aberration rotates polarization state as rays
propagate through optical systems, and this rotation is ray path dependent. Thus, pupil
variation of skew aberration creates the undesired polarization components at the exit
pupil and thus, affects PSF and degrades image quality. The skew aberration of a chief
ray serves as a piston-like aberration, an overall polarization rotation across the pupil.
Skew aberration is typically a small effect in lenses but it could be important in
microlithography optics and other polarization-sensitive systems with high NA or large
FOV.
One aspect of the three-by-three polarization ray tracing calculus that continues to
trouble me and my colleagues is that the P matrix cannot characterize the phase changes
233
greater than ±λ/2. We suspect that the problem arises due to the intrinsic nature of the
polar description of complex numbers. The arguments of complex numbers are located
between -π and π so the three-by-three polarization ray tracing matrix can calculate the
phase of the exiting light but cannot necessarily calculate the optical path length, if the
coating or other contributions exceed the ± λ/2. Similarly, the principal retardance is
defined between - π and π.
The purpose of optical design simulation is to predict the outcome of
measurements by devices such as interferometers, Hartmann-Shack sensors, and the like.
A laser interferometer measures phase difference between a reference and test beam. It
does not measure optical path length. It cannot distinguish between a wave with zero, or
one wave, or two waves of optical path difference. More importantly, it cannot
distinguish between partial waves with no uniquely defined optical path lengths. So
phase remains the measurable quantity in interferometry and optical path length becomes
a difficult concept in systems with multiple partial waves, such as thick multi-layer thin
film coatings. In Chapter 7, a system of two non-parallel retarders was used to
demonstrate multiple partial waves exiting a system. A phase unwrapping algorithm
using the dispersion model was applied to determine the order of the retardance greater
than π.
Since the P matrix originated from a Jones matrix, it cannot be used by itself to
ray trace and calculate the depolarizing effects from depolarizing optical components or
surfaces. Therefore, incoherent polarization ray tracing uses a polarization ray tracing
tensor T to ray trace through depolarizing optical systems. In this approach, a coherence
234
matrix of an electric field vector is used as a basis. The polarization ray tracing tensor
relates the coherence matrix of the incident light and the coherence matrix of the exiting
light. This tensor can be derived from amplitude coefficients in local coordinates of the
optical surface, a three-by-three polarization ray tracing matrix P in global coordinates, as
well as from a Mueller matrix of the surface in its local coordinates.
The addition of coherence matrices for the exiting grid of rays represents the
incoherent addition of the light. By using the coherence matrix, full polarization
information of the light along x, y, and z directions can be calculated from a single
polarization ray trace through an optical system, and the polarization information can
then be projected onto a plane for further data reduction if necessary. For a collimated
grid of incident rays, the polarization ray tracing tensors of each ray can be added. Thus,
the combined polarization ray tracing tensor can accommodate multiple exiting
propagation vector directions. This is the main advantage of the tensor calculus and why
it should be suitable for stray light analysis.
When the coherence matrix is expanded by Hermitian, trace-orthogonal, and
linearly-independent Gell-Mann matrices, the expansion coefficients are 3D Stokes
parameters. The 3D Mueller matrix can also be used for incoherent polarization ray
tracing. Both 3D Stokes parameters and 3D Mueller matrices are defined in global
coordinates. At the current time, I think the coherence matrix and the polarization ray
tracing tensor method is more straightforward than 3D Stokes or 3D Mueller calculus.
235
10.2 Future Work
Skew aberration was defined in this dissertation for the first time. In order to complete
the analysis of skew aberration’s effects on optical systems, the following goals need to
be accomplished: 1) complete understanding of skew aberration’s field and pupil
coordinate dependence and 2) thorough skew aberration statistics with larger sets of
optical systems. One way of achieving the first goal is using the series expansion of
skew aberration in a paraxial ray trace. To complete the second goal, we need to identify
further studies on the relationships between skew aberration and skewness of the ray,
numerical aperture (NA) of the optical system, and field of view (FOV) of the optical
system. Although Code V offers over 2000 optical systems, cutting-edge high NA
optical systems with wide FOV are generally not available to regular users. Current
micro-lithography optical systems would be the ideal example systems to test the effects
of skew aberration on image quality.
An example of ray tracing through a volumetric scattering medium was presented
in Section 8.7. More accurate scattering models for different materials such as clouds or
aerosol particles are necessary to implement the polarization ray tracing tensor method
for remote sensing / imaging applications. Since the basic concepts of the polarization
ray tracing tensor method are general, the method can be implemented for various studies
in polarization ray tracing through the scattering particles in the atmosphere or tissue
samples.
236
Definitions of 3D Stokes parameters and the 3D Mueller matrix are presented.
However, due to the greater complexity in 3D Stokes parameters than that of the 2D
Stokes parameters, a particular analysis devoted to 3D Stokes parameters has not been
developed. Further studies on 3D Stokes parameters and the 3D Mueller matrix can be
done by implementing the method to ray tracing through systems with volumetric
scatterers. Although the coherence matrix and the polarization ray tracing tensor
methods are more straightforward, 3D Mueller calculus has advantages of using matrix
multiplication over tensor calculations.
237
10.3 Conclusion
Several authors [4-12] have described the use of three-by-three polarization ray tracing
matrices for optical design and image formation problems. However, none of these
references contained details on their implementation of the three-by-three matrix. Now
with the conclusion of this dissertation, perhaps we see why. The straightforward and
complete implementation of a three dimensional ray tracing matrix method requires
attention to a large number of details, such as the necessity of adding additional
constraints (Eq. (2.1.7)) to uniquely define the three-by-three matrix P and the difficulties
in calculating the retardance from the three-by-three matrix (Chapter 4). Therefore I
hope that this work provides a satisfactory treatment of all the essential issues to
implement three-by-three extended Jones matrices in polarization ray tracing and optical
design. My advisor, Russell Chipman, confesses to beginning work on the three-by-three
matrix formulation of polarization ray tracing in 1989 but postponed publication due to
the large number of unresolved issues which troubled him until my work. That said,
from the present perspective, I feel that the three-by-three P matrix calculus provides a
simplification of important issues in polarization ray tracing.
Extending the three-by-three P matrix calculus to incoherent polarization ray
tracing to describe depolarizing optical systems was one of my long-standing goals in
polarization ray tracing. The concepts of 3D Stokes parameters or the coherence matrix
are well-known. However, the method of using the polarization ray tracing tensor T for
incoherent polarization ray tracing is original and novel. I believe that this method has
238
substantial potential to be used in polarization ray tracing through depolarizing optical
systems.
239
APPENDIX A USA PATENT 2,896,506 LENS PARAMETERS
The USA patent 2,896,506 lens parameters are shown in PolarisM convention.
240
Shape is the surface type, v is the vertex location, a is the axis of the surface, material 1
and material 2 are the material of the optical surfaces. In PolarisM, all the locations are
in global coordinates and the values are in mm. PolarisM calculates the next ray
intercept’s material based on in which material that the ray is currently. No coatings
were applied to optical surfaces.
241
APPENDIX B DEPOLARIZATION INDEX OF A POLARIZATION
RAY TRACING TENSOR
The ideal depolarizer is a tensor which returns completely unpolarized coherence
matrix ( ) for any incident coherence matrix, where ID stands for ideal
depolarizer
(
(
) (
) (
)
(
) (
) (
)
(
) (
) (
))
(
).
(1)
The depolarization index of is the distance between and . The Euclidean
distance of a matrix is the maximum singular value of the matrix. For polarization ray
tracing tensors its dimension is three-by-three-by-three-by-three and each three-by-three
sub-matrix ( ) relates the incident coherence matrix to the i, j component of the exiting
coherence matrix as shown in section 8.4. Therefore, it makes the most sense to calculate
each ’s Euclidean distance to calculate the distance between and . For example
the Euclidean distance between and the zero tensor is
(
). (2)
242
However, having a three-by-three matrix form distance is not as convenient as
having a scalar value that tells the distance between two tensors. Therefore, the
Euclidean distance between and is the sum of each Euclidean distances of ’s,
∑ , (3)
where is the maximum singular value of . Eq. (3) gives a scalar which
indicates how far is from .
In order to define the depolarization index (DI) between zero and one, an
appropriate denominator is required to normalize the distance between and . The
most appropriate choice is the Euclidean distance between and the zero tensor.
Therefore, the DI is defined as
. (4)
for the ideal depolarizing tensor. Different definition of the distance between two
tensors can be used and as a result, DI will change depending on how one defines the
distance between two tensors. However, at the current time, I think that Eq. (3) is the
best choice.
243
APPENDIX C WATER DROPLET SIZES FOR THE CLOUD
EXAMPLE
50 water droplet sizes from a normal distribution with mean 5μm and deviation 5 are
shown in the table.
244
REFERENCES
1. R.C. Jones, “A New Calculus for the Treatment of Optical Systems,” J. Opt. Soc. Am.,
31, 488-493, 493-499, 500-503 (1941), 32, 486-493 (1942), 37, 107-110, 110-112 (1947),
38, 671-685 (1948), 46, 126-131 (1956).
2. W. Singer, M. Totzeck, and H. Gross, “Physical Image Formation,” in Handbook of
optical systems pp. 613-620 (New York, Wiley, 2005).
3. S. G. Krantz, “The index or winding number of a curve about a point,” in Handbook of
Complex Variables pp. 49-50 (Boston, MA, 1999).
4. R. E. Knowlden, “Wavefront errors produced by multilayer thin film optical coatings,”
Ph. D. dissertation, (University of Arizona, 1981).
5. R. A. Chipman, “Polarization Ray Tracing,” Proc. SPIE 766, 61-68 (1987).
6. E. Waluschka, “A polarization Ray Trace,” Opt. Eng. 28, 86-89 (1989).
7. R. A. Chipman, “Polarization Analysis of Optical Systems,” Opt. Eng. 28, 90-99
(1989).
8. J. Wesner, F. Eisenkramer, J. Heil, and T. Sure, “Improved Polarization Ray Tracing
of Thin-film optical Coatings,” Proc. SPIE 5524, 261-272 (2004).
9. R.A. Chipman, “Mechanics of polarization ray tracing,” Opt. Eng. 34, 1636-1645
(1995).
10. P. Torok, P. Varga, Z. Laczik, and G. R. Booker, “Electromagnetic diffraction of light
focused through a planar interface between materials of mismatched refractive indices: an
integral representation,” J. Opt. Soc. Am. A 12, 325-332 (1995).
11 . P. Torok, and T. Wilson, “Rigorous theory for axial resolution in confocal
microscopes,” Optics Comm. 137, 127-135 (1997).
245
12. P. Torok, P.D. Higdon, and T. Wilson, “Theory for confocal and conventional
microscopes imaging small dielectric scatterers,” J. Mod. Optics, 45, pp. 1681-1698
(1997).
13. G. Yun, K. Crabtree, and R.A. Chipman, “Three-dimensional Polarization Ray-
tracing Calculus I, Definition and Diattenuation,” Appl. Opt. 50, pp. 2855-2865 (2011).
14 . G. Yun, S. McClain, and R.A. Chipman, “Three-dimensional Polarization Ray-
tracing Calculus II, Retardance,” Appl. Opt. 50, pp. 2866-2874 (2011).
15. R. A. Chipman, "Challenges for Polarization Ray Tracing," in International Optical
Design Conference, OSA Technical Digest (CD), paper IWA4 (OSA, 2010).
16. W. S. T. Lam, S. McClain, G. Smith, and R. Chipman, "Ray Tracing in Biaxial
Materials," in International Optical Design Conference, OSA Technical Digest (CD),
paper IWA1 (OSA, 2010).
17. G. Yun, K. Crabtree, and R.A. Chipman, “A skew aberration: a form of polarization
aberration,” Opt. Lett. 36, pp. 4062-4064 (2011).
18. H. A. Macleod, Thin-Film Optical Filters pp. 179-209 (McGraw-Hill, 1986).
19. P. H. Berning, “Theory and calculations of optical thin films,” Physics of Thin Films,
1, 69 -121 (1963).
20. M. Born and E. Wolf, Principles of Optics pp. 63-74, (Cambridge University Press,
1999).
21. S. Lu and R. A. Chipman, “Homogeneous and Inhomogeneous Jones Matrices,” J.
Opt. Soc. Am. A, 11, 766-773 (1994).
22. R. M. A. Azzam and N. M. Bashara, Ellipsometry and Polarized light pp. 67-84,
(North Holland, 1977).
246
23 . R. A. Chipman, “Polarization Aberrations,” Ph. D. Dissertation, (University of
Arizona, 1987).
24 . S. Lu and R. A. Chipman, “Interpretation of Mueller matrices based on polar
decomposition,” J. Opt. Soc. Am. A 13, 1106-1113 (1996).
25 . R. Barakat, “Conditions for the physical realizability of polarization matrices
characterizing passive systems,” J. Mod. Opt. 34, 1535-1544 (1987).
26 . P. Lancaster and M. Tismenetsky, The Theory of Matrices pp. 1535-1544,
(Academic, 1985).
27. G. H. Gloub and C. F. Van Loan, Matrix Computations pp. 70-73, (Johns Hopkins
University Press, 1996).
28. E. R. Peck, "Polarization properties of corner reflectors and cavities," J. Opt. Soc.
Am. 52, pp. 253-257 (1962).
29 . M. A. Acharekar, "Derivation of internal incidence angles and coordinate
transformations between internal reflections for corner reflectors at normal incidence,"
Opt. Eng. 23, pp. 669-674 (1984).
30. R. Kalibjian, “Stokes Polarization Vector and Mueller Matrix for a Corner-Cube
Reflector,” Optics Comm. 240, 39-68 (2004).
31. J. Liu and R. M. A. Azzam, “Polarization properties of corner-cube retroreflectors:
theory and experiment,” Appl. Opt. 36, No. 7 (1997).
32. W. A. Shurcliff, Polarized Light pp. 87-88, (Harvard University Press, 1962).
33. E. Hecht, Optics pp. 352-357, (Addison-Wesley, 2002).
34. W. A. Shurcliff, Polarized Light pp. 95-98, (Harvard University Press, 1962).
247
35. R. A. Chipman, “Polarization Analysis of Optical Systems,” Proc. SPIE, 891, 10
(1988).
36. Christian Brosseau, Fundamentals of polarized light, A statistical optics approach pp.
185-186 (Jone Wiley & Sons, Inc., NY, 1998)
37. J. M. Leinaas and J. Myrheim, “On the Theory of Identical Particles,” Il Nuovo
Cimento B 37, N. 1, 1-23 (1977).
38. S. Pancharatnam, “Generalized Theory of Interference, and its Applications,” S.,
Proc. Indian Acad. Sci. A 44, 247 (1956).
39. R. Bhandari and J. Samuel, “Observation of Topological Phase by Use of a Laser
Interferometer,” Phys. Review Lett. 60, 1211 (1988).
40. M. V. Berry, ”The Adiabatic Phase and Pancharatnam’s Phase for Polarized Light,”
J. Mod. Opt. 34, 1401 (1987).
41. D. Bone, “Fourier fringe analysis: the two-dimensional phase unwrapping problem,”
Appl. Opt. 30, 3627-3632 (1991).
42. A. Collaro, G. Franceschetti, F. Palmieri and M. S. Ferreiro, “Phase unwrapping by
means of genetic algorithms,” J. Opt. Soc. Am. A 15, 407-418 (1998).
43. G. R. Fowles, Introduction to Modern Optics p. 53, (Dover Publications, 1975).
44. A. Macleod, “Phase Matters,” SPIE’s OE magazine, June/July (2005), pp. 29-31.
45. J. Ruoff and M. Totzeck, “Orientation Zernike polynomials: a useful way to describe
the polarization effects of optical imaging systems,” J. Micro/Nanolith. MEMS MOEMS
8(3), 031404 (Jul-Sep 2009).
46. Hideo Azuma, “High aperture wide-angle objective lens,” U.S. patent 2,896,506 (July
28, 1959).
248
47. Code V Version 10.3, Synopsys, http://www.opticalres.com/cv/cvprodds_f.html
48. H. H. Hopkins, Wave Theory of Aberrations (Clarendon, London, 1950).
49. E. Sklar, “Effects of small rotationally symmetrical aberrations on the irradiance
spread function of a system with gaussian apodization over the pupil,” J. Opt. Soc. Am.
65, pp. 1520-1521 (1975).
50. J. P. Mills and B. J. Thompson, “Effect of aberrations and apodization on the
performance of coherent optical systems. I. The amplitude impulse response,” J. Opt. Soc.
Am. A 3, pp. 694-703 (1986).
51 . J. P. Mills and B. J. Thompson, “Effect of aberrations and apodization on the
performance of coherent optical systems. II. Imaging,” J. Opt. Soc. Am. A 3, pp. 704-716
(1986).
52 . M. Mansuripur, “Effects of high-numerical-aperture focusing on the state of
polarization in optical and magneto-optic data storage systems,” Appl. Opt. 50, pp. 3154-
3162 (1991).
53. J. P. McGuire and R. A. Chipman, “Diffraction image formation in optical systems
with polarization aberrations I: Formulation and example,” J. Opt. Soc. Am. A 7, pp.
1614-1626 (1990).
54. J. E. Greivenkamp, Field guide to geometrical optics p. 19, (SPIE Press, 2004).
55. Bruce R. Irving, et al., Code V, Introductory User’s Guide p. 82 (Optical Research
Associates, 2001).
56. T. Carozzi, R. Karlsson, and J. Bergman, “Parameters characterizing electromagnetic
wave polarization,” Phys. Rev. 61, pp. 2024-2028 (2000).
57. J.J. Gil, “Polarimetric characterization of light and media,” Eur. Phys. J. Appl. Phys.
40, pp. 1-47 (2007).
249
58. J. J. Gil and E. Bernabeu, “Depolarization and polarization indices of an optical
system,” Opt. Acta, 33, pp. 185-189 (1986).
59. R. A. Chipman, “Depolarization index and the average degree of polarization,” Appl.
Opt. 44, pp. 2490-2495 (2005).
60. http://www.philiplaven.com/mieplot.htm.
61. T. Setala, A, Shevchenko, and M. Kaivola, “Degree of polarization for optical near
fields,” Phys. Review E 66, 016615 (2002).
62. Wolfram Mathematica 8.0.1.0, © Copyright 1988-2011, Wolfram Research, Inc.