+ All Categories
Home > Documents > IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues...

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues...

Date post: 18-Nov-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
14
IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of Focal Distance, Age, and Brightness on Near-Field Augmented Reality Depth Matching Gujot Singh, Member, IEEE, Stephen R. Ellis, and J. Edward Swan II, Senior Member, IEEE Abstract—Many augmented reality (AR) applications operate within near-field reaching distances, and require matching the depth of a virtual object with a real object. The accuracy of this matching was measured in three experiments, which examined the effect of focal distance, age, and brightness, within distances of 33.3 to 50 cm, using a custom-built AR haploscope. Experiment I examined the effect of focal demand, at the levels of collimated (infinite focal distance), consistent with other depth cues, and at the midpoint of reaching distance. Observers were too young to exhibit age-related reductions in accommodative ability. The depth matches of collimated targets were increasingly overestimated with increasing distance, consistent targets were slightly underestimated, and midpoint targets were accurately estimated. Experiment II replicated Experiment I, with older observers. Results were similar to Experiment I. Experiment III replicated Experiment I with dimmer targets, using young observers. Results were again consistent with Experiment I, except that both consistent and midpoint targets were accurately estimated. In all cases, collimated results were explained by a model, where the collimation biases the eyes’ vergence angle outwards by a constant amount. Focal demand and brightness affect near-field AR depth matching, while age-related reductions in accommodative ability have no effect. Index Terms—Perception and psychophysics, virtual and augmented reality, human performance, depth perception. 1 I NTRODUCTION M ANY compelling applications of augmented reality (AR) require interacting with real and virtual ob- jects at reaching distances. Some examples include image- guided medical procedures (e.g., Kersten-Oertel et al. [1]), manufacturing (e.g., Curtis et al. [2]), and maintenance (e.g., Henderson and Feiner [3]). Among the factors that determine success is the accuracy with which observers can match the distance of a real object to an AR-presented virtual object. For example, a surgeon may need to cut to the depth indicated by an AR-presented tumor, or place a needle within the tumor. In order for AR to be useful for image-guided surgery of the brain, Edwards et al. [4] found that surgeons must be able to place a scalpel with a tolerance of 1 mm; and, in order for AR to be useful for a type of radiation therapy, Krempien et al. [5] found that a needle must be placed with a tolerance of 1 mm. In previous work motivated by this topic, Swan et al. [6] reported initial efforts to measure the accuracy of AR depth matching. An optical see-through AR display was used, and reaching distances of 24 to 56 cm were examined. The depth judgment was perceptual matching, where observers adjusted a pointing object in depth, until they judged it to be the same distance from themselves as a target object. Fig. 1 summarizes these results, which were collected across three experiments. The pointer was always a real object, and therefore its distance from the observer could be objectively Gurjot Singh is with Fairleigh Dickinson University. E-mail: [email protected]. Stephen R. Ellis a Consultant, and was formally with NASA Ames Research Center. E-mail: [email protected]. J. Edward Swan II is with Mississippi State University. E-mail: [email protected]. Manuscript received XXXX; revised XXXX. measured in the real world. In Fig. 1, the x-axis is the actual depth of the target object, and the y-axis is the depth error of the pointer. Here, error =0 indicates that observers placed the pointer at the same depth as the target object; error > 0 indicates overestimated depth matches, where observers placed the pointer farther in depth than the target object; and error < 0 indicates underestimated depth matches, where observers placed the pointer closer than the target object. As a control condition, Swan et al. [6] examined the accuracy of matching a real target object, and found accuracies of 1.4 to 2.7 mm (Fig. 1a, the real consistent condition). However, when they examined matching a vir- tual target object, they found that observers systematically overestimated the matching distance, ranging from 0.5 cm at near distances to 4.0 cm at far distances (Fig. 1b, the AR collimated condition). Therefore, as illustrated in Fig. 1, there was a significant difference in depth matching real and virtual targets. Swan et al. [6] determined that the likely reason for these results was that their AR display used collimating optics, which present virtual objects focused at optical infinity. They found the results to be very well described by a model where this collimation causes the eyes’ vergence angle to rotate outwards by a constant amount. Fig. 2 illustrates this model. Let the black points labelled α and α 0 be two real objects, with the first located close to the observer, and the second located farther away. And, let the red points labelled β and β 0 be two virtual objects, which are rendered to be the same distance as the real targets. α, α 0 , β, and β 0 also repre- sent the angle of binocular parallax that the eyes make when the observer fixates on each object. Therefore, when fixating on the close real object, the angle of binocular parallax is α, and if the fixation changes to the close virtual object, then c 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. arXiv:1712.00088v1 [cs.HC] 30 Nov 2017
Transcript
Page 1: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1

The Effect of Focal Distance, Age, andBrightness on Near-Field Augmented Reality

Depth MatchingGujot Singh, Member, IEEE, Stephen R. Ellis, and J. Edward Swan II, Senior Member, IEEE

Abstract—Many augmented reality (AR) applications operate within near-field reaching distances, and require matching the depth of avirtual object with a real object. The accuracy of this matching was measured in three experiments, which examined the effect of focaldistance, age, and brightness, within distances of 33.3 to 50 cm, using a custom-built AR haploscope. Experiment I examined theeffect of focal demand, at the levels of collimated (infinite focal distance), consistent with other depth cues, and at the midpoint ofreaching distance. Observers were too young to exhibit age-related reductions in accommodative ability. The depth matches ofcollimated targets were increasingly overestimated with increasing distance, consistent targets were slightly underestimated, andmidpoint targets were accurately estimated. Experiment II replicated Experiment I, with older observers. Results were similar toExperiment I. Experiment III replicated Experiment I with dimmer targets, using young observers. Results were again consistent withExperiment I, except that both consistent and midpoint targets were accurately estimated. In all cases, collimated results wereexplained by a model, where the collimation biases the eyes’ vergence angle outwards by a constant amount. Focal demand andbrightness affect near-field AR depth matching, while age-related reductions in accommodative ability have no effect.

Index Terms—Perception and psychophysics, virtual and augmented reality, human performance, depth perception.

F

1 INTRODUCTION

MANY compelling applications of augmented reality(AR) require interacting with real and virtual ob-

jects at reaching distances. Some examples include image-guided medical procedures (e.g., Kersten-Oertel et al. [1]),manufacturing (e.g., Curtis et al. [2]), and maintenance(e.g., Henderson and Feiner [3]). Among the factors thatdetermine success is the accuracy with which observerscan match the distance of a real object to an AR-presentedvirtual object. For example, a surgeon may need to cut tothe depth indicated by an AR-presented tumor, or place aneedle within the tumor. In order for AR to be useful forimage-guided surgery of the brain, Edwards et al. [4] foundthat surgeons must be able to place a scalpel with a toleranceof 1 mm; and, in order for AR to be useful for a type ofradiation therapy, Krempien et al. [5] found that a needlemust be placed with a tolerance of 1 mm.

In previous work motivated by this topic, Swan et al. [6]reported initial efforts to measure the accuracy of AR depthmatching. An optical see-through AR display was used,and reaching distances of 24 to 56 cm were examined. Thedepth judgment was perceptual matching, where observersadjusted a pointing object in depth, until they judged itto be the same distance from themselves as a target object.Fig. 1 summarizes these results, which were collected acrossthree experiments. The pointer was always a real object, andtherefore its distance from the observer could be objectively

• Gurjot Singh is with Fairleigh Dickinson University.E-mail: [email protected].

• Stephen R. Ellis a Consultant, and was formally with NASA AmesResearch Center. E-mail: [email protected].

• J. Edward Swan II is with Mississippi State University.E-mail: [email protected].

Manuscript received XXXX; revised XXXX.

measured in the real world. In Fig. 1, the x-axis is theactual depth of the target object, and the y-axis is thedepth error of the pointer. Here, error = 0 indicates thatobservers placed the pointer at the same depth as the targetobject; error > 0 indicates overestimated depth matches,where observers placed the pointer farther in depth thanthe target object; and error < 0 indicates underestimateddepth matches, where observers placed the pointer closerthan the target object. As a control condition, Swan et al. [6]examined the accuracy of matching a real target object, andfound accuracies of 1.4 to 2.7 mm (Fig. 1a, the real consistentcondition). However, when they examined matching a vir-tual target object, they found that observers systematicallyoverestimated the matching distance, ranging from 0.5 cmat near distances to 4.0 cm at far distances (Fig. 1b, theAR collimated condition). Therefore, as illustrated in Fig. 1,there was a significant difference in depth matching real andvirtual targets.

Swan et al. [6] determined that the likely reason for theseresults was that their AR display used collimating optics,which present virtual objects focused at optical infinity. Theyfound the results to be very well described by a modelwhere this collimation causes the eyes’ vergence angle torotate outwards by a constant amount. Fig. 2 illustrates thismodel. Let the black points labelled α and α′ be two realobjects, with the first located close to the observer, and thesecond located farther away. And, let the red points labelledβ and β′ be two virtual objects, which are rendered to be thesame distance as the real targets. α, α′, β, and β′ also repre-sent the angle of binocular parallax that the eyes make whenthe observer fixates on each object. Therefore, when fixatingon the close real object, the angle of binocular parallax is α,and if the fixation changes to the close virtual object, then

c©2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses,in any current or future media, including reprinting/republishing this material for advertising or promotionalpurposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrightedcomponent of this work in other works.

arX

iv:1

712.

0008

8v1

[cs

.HC

] 3

0 N

ov 2

017

Page 2: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 2

Real Consistent AR Collimated

● ● ● ● ●●● ● ● ●

●● ● ● ●

(a)

●●

●●

●●●

●●

●●

●●

(b)−1

0

1

2

3

4

5

1 2 3 4 5 1 2 3 4 5Actual Distance

Err

or (

cm),

±1

SE

M

Experiment

Exp I

Exp II

Exp III

Fig. 1. The perceptual matching depth judgments from Swan et al. [6].For Experiment I, the actual distances were 34, 38, 42, 46, and 50 cm,while for Experiments II and III the actual distances were 55, 63, 71, 79,and 87% of each observer’s maximum reach.

close objects

α β

Δv = α – β Δv = α’ – β’

α’ β’

far objects

Fig. 2. The model that explains how a constant change in vergenceangle, ∆v, leads to matched distances of virtual objects (red: β, β′),relative to real objects (black: α, α′), that are increasingly overestimatedwith increasing distance.

the collimation causes the eyes to rotate outwards, reducingthe angle to β. When placing the real pointer α at the samedepth as the virtual target β, observers’ eyes rotate inwardsand outwards as they fixate between the two objects, andtherefore observers perceive them to be located at the samedepth. The model predicts that this change in vergenceangle, ∆v = α − β, is constant for reaching distances.Therefore, when fixating on the far target, α′ < α, and thissame change in vergence angle, ∆v = α′ − β′, causes alarger depth distance between α′ and β′ (Fig. 2). This modelexplains three properties of Swan et al.’s [6] results (Fig. 1):(1) because the collimating optics cause the eyes to rotateoutwards, the depth judgments of the virtual targets areoverestimated relative to the real targets, (2) the amount ofoverestimation increases with increasing distance, and (3)the results are very well fit with a linear model.

This analysis suggests that, for accurate depth place-ment, virtual objects need to be presented with a focaldepth—also termed accommodative demand—that is consistentwith their intended depth. Then, the eyes’ vergence anglewill not be biased, and depth matches will be more accurate.This paper reports three experiments that systematicallyexamine this hypothesis.

However, it was not possible to conduct these experi-ments with the same AR display as Swan et al. [6]. Thatdisplay, an NVIS Inc. nVisor ST60 model, contains unad-justable collimating optics, which always present virtualobjects focused at optical infinity. This is consistent with thevast majority of commercially available AR displays, almostall of which have a focal distance that is set at the factory,

and unadjustable by the end user.1 Therefore, an augmentedreality haploscope—an AR display mounted on an opticalworkbench that allows accommodative demand and ver-gence angle to be independently and precisely adjusted—was developed and used for the experiments reported here.2

2 BACKGROUND AND RELATED WORK

2.1 Depth Perception and Depth CuesThe human visual system achieves a percept of perceiveddepth from depth cues—sources of perceptual information re-lated to depth. At least nine depth cues have been identified(Cutting and Vishton [9]): occlusion (a closer object occludesfarther objects), binocular disparity (an object projects to dif-ferent locations on each retina), motion perspective (objects atdifferent distances from a moving observer have differentapparent velocities), height in the visual field (starting fromthe horizon, closer objects are lower in the visual field),relative size (among objects of the same size, the fartherobject projects to a smaller retinal angle), accommodation (thelens of the human eye changes shape to bring objects intofocus), vergence (the two eyes rotate to fixate on an object(Fig. 2)), relative density (for a textured surface, at fartherdistances more objects are seen within a constant retinalangle), and aerial perspective (objects at great distances losecolor saturation and contrast).

Depth cues differ in effectiveness based on various visualcharacteristics, such as scene content and distance from theobserver. Nagata [10], and later Cutting and Vishton [9],organized the relative effectiveness of different depth cuesaccording to distance. Within near-field reaching distances,they find that the operative depth cues, in approximateorder of decreasing salience, are occlusion, binocular disparity,motion perspective, relative size, accommodation and vergence,and relative density.

Most of these depth cues can be categorized as retinal,because the information from the cue comes from the visualscene sensed on the retina. However, the cues of accom-modation and vergence are extra-retinal, because the cueinformation comes from sensors that detect the state of themuscles that control the lenses’ shape and the eyes’ ver-gence angle. In principal, the extra-retinal cues cues couldprovide absolute egocentric depth information (Gillam [11]).In contrast, retinal cues can only provide relative depthinformation between objects in the scene; these cues re-quire an external reference to establish the scene’s overallscale. However, when combined with extra-retinal cues, andan observer’s constant interpupillary distance, retinal cuescan also provide absolute depth information (Bingham andPagano [12], Mon-Williams and Tresilian [13]). In general,the way the human visual system combines informationfrom different depth cues to produce a stable percept ofdistance is subtle and not fully understood, although manytheories have been advanced and the collected evidence fa-vors some theories over others (Landy et al. [14], Singh [8]).

1. The authors of the current paper, who have been studying virtualand augmented reality for 8, 30, and 18 years, respectively, can onlyrecall a single commercially-available AR display—the MicrovisionNomad from the early 2000’s— which came with a focus adjustmentknob (Gabbard et al. [7]).

2. Portions of these experiments are reported in a PhD dissertation(Singh [8]).

Page 3: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 3

(a) normal viewing (b) vergence farther than accommodation

verg

ence

dis

tanc

e fo

cal d

ista

nce

verg

ence

dis

tanc

e fo

cal d

ista

nce

verg

ence

dis

tanc

e fo

cal d

ista

nce

(c) accommodation farther than vergence

Fig. 3. The vergence-accommodation conflict, and its effect on per-ceived depth. (a) In normal viewing of real world objects, the vergencedistance, required for zero binocular disparity, is the same as the focaldistance, required for minimal focal blur. (b) When the vergence distanceis farther than the focal distance, e.g. when viewing a virtual objectbeyond the surface of a stereo monitor, the vergence angle is biasedinwards (grey lines), and the object is seen as closer than encodedby disparity. (c) When the vergence distance is closer than the focaldistance, e.g. when viewing a virtual object in front of the surface of astereo monitor, the vergence angle is biased outwards (grey lines), andthe object is seen as farther than encoded by disparity.

2.2 Vergence and Accommodation

Visual perception requires a rapid series of precise eyemovements (Leigh and Zee [15]), including fixation (hold animage steady on the fovea by minimizing eye movement),saccadic (quick movement that projects an object of interestto the fovea), smooth pursuit (retain fixation on an object dur-ing smooth movement of either the object or head), vestibular(hold vision steady during head movements), and vergence(the two eyes rotate to fixate on an object of interest (Fig. 2)).When changing fixation from a far to a near object, the eyesconverge, the lenses become thicker, and the pupils con-strict. These three actions—vergence, accommodation, andchanging pupil size—are interlinked physiologically, andthe mechanism of these three simultaneous reflexes is calledthe near triad. Because of the interlinkage, changes in eitheraccommodation or vergence drive corresponding changesin the other two components of the triad (Semmlow andHung [16]). Apart from the influence of accommodation andvergence, pupil diameter also changes according to sceneillumination, becoming larger in dim settings and smaller inbright settings. Although these illumination-driven changesin pupil diameter affect the eye’s optical depth of field,and therefore could potentially affect accommodation, littleeffect of changing pupil diameter on accommodation hasbeen observed (Ripps et al. [17]). Therefore, in near fieldviewing, vergence and accommodation are the main depthreflexes, and the link between them is known as the vergence-accommodation reflex. Because of this reflex, accommodationand vergence operate in unison: changes in accommodationdrive changes in vergence (accommodative vergence), andchanges in vergence drive changes in accommodation (ver-gence accommodation) (Kersten and Legge [18]). Therefore,the vergence reflex is driven both by binocular disparity(the eyes rotate to bring a fixated object to a level of zerobinocular disparity), as well as accommodative vergence.Likewise, the accommodation reflex is driven both by focalblur (the lenses adjust to minimize blur), as well as vergenceaccommodation (Mon-Williams and Tresilian [19]).

Of course, the vergence-accommodation reflex is cali-brated for viewing real world objects, which present consis-tent binocular disparity and focal blur cues (Fig. 3a). Whenviewing virtual objects, the binocular disparity and focalblur cues are often inconsistent, because the focal blur cueis fixed at the screen depth, while the depth of the binoculardisparity cue varies, sometimes beyond the screen depth(Fig. 3b), and sometimes in front (Fig. 3c). This is calledthe vergence-accommodation conflict, and it is a ubiquitousaspect of all stereo displays with a single focal plane (Kruijffet al. [20]). The conflict causes visual fatigue (Gabbard etal. [7], Lambooij et al. [21]), hinders visual performance(Hoffman et al. [22]), and biases depth perception towardsthe screen depth (Fig. 3, Swenson [23], Mon-Williams andTresilian [19]).

The contribution of vergence to perceived depth de-pends upon various properties of the scene. At near-field distances, vergence has been conclusively found toprovide egocentric depth information (Brenner and VanDamme [24], Owens and Liebowitz [25], Tresilian et al. [26],Viguier et al. [27]; Foley [28] provides a comprehensivereview). Although vergence in isolation is not a very ac-curate depth cue, observers are very sensitive to changesin vergence, which generally allows accurately matchingthe depth of one object with another (Brenner and VanDamme [24]). Each individual has a different vergence rest-ing point—their dark vergence—which is the vergence anglethat their eyes assume when the controlling muscles arecompletely relaxed. In low light conditions, the egocentricdepth specified by vergence is biased towards each individ-ual’s dark vergence distance (Owens and Liebowitz [25]).As a depth cue, vergence is most effective within near-field distances of 2 meters (Viguier et al. [27]), a distancerange that encompasses ∼90% of vergence eye movements(Tresilian et al. [26]). As other retinal depth cues becomeavailable, the contribution of vergence to perceived depth isreduced, but still present (Foley [28]).

As discussed above, accommodation influences per-ceived depth through the vergence-accommodation reflex.Although some studies have found evidence that accommo-dation alone can serve as a depth cue for some observers,these experiments require careful experimental setups toeliminate other depth cues, and the consensus remainsthat accommodation influences perceived depth throughits effect on vergence (Mon-Williams and Tresilian [19]).Similar to dark vergence, each individual has a dark focus—the distance their eyes focus when the controlling mus-cles are in a relaxed state (Iavecchia et al. [29]). The darkfocus biases the eye’s focal response, resulting in a num-ber of perceptual consequences, including perceived depth(Roscoe [30]). Generally, the dark focus and dark vergencedistances vary independently, and for most individuals arenot equal (Owens and Leibowitz [25]).

2.3 Accommodation and Age

Accommodative ability, the distance range within which aviewed object can be brought into clear focus, decreaseswith increasing age (Duane [31]), a condition known aspresbyopia. It is primarily caused by hardening of the crys-talline lens, although other physiological changes in the

Page 4: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 4

lens, connective tissue, and controlling muscles also playa role (Kasthurirangan and Glasser [32]). As measured byDuane [31], presbyopia begins by the age of 12, but throughthe early 30’s the loss is minuscule—the closest distanceof clear focus declines from ∼8 to ∼13 cm. However, thedecline then accelerates, and by the age of 50 often surpasses50 cm. At some point in the 40’s, the closest distance ofclear focus often surpasses standard reading distance, andreading glasses are required. By their mid-50’s, most peoplehave lost the ability to adjust the distance of clear focus.

It seems reasonable that this loss of accommodative abil-ity would have perceptual consequences, and indeed, olderpeople are worse than younger people at many perceptualtasks (Bian and Andersen [33]). However, accommodativevergence does not diminish with age; even as the visualsystem looses the ability to adjust accommodation, the eyesstill verge properly in response to accommodative stimuli(Heron et al. [34]). Because vergence is the primary source ofdepth information from the vergence-accommodation reflex(Mon-Williams and Tresilian [19]), this suggests that depthperception could be unaffected by presbyopia. Indeed, Bianand Andersen [33] found that, when making judgments ofmedium-field egocentric distances, older people (average73.4 years) were more accurate than younger people (average22.5 years). This is one of a series of recent studies that havefound that older observers preserve their abilities in tasksrelated to distance perception (Bian and Andersen [33]).

2.4 Accommodation and Scene FlatnessAnother effect of the vergence-accommodation conflict instereo displays is that the accommodative distance changesthe perceived flatness of the scene (Andersen et al. [35], Na-gata [10], Singh [8]). Specifically, when medium- to far-fieldscenery is shown on a display, but accommodative distanceis in the near field, depth distances between scene objectsare compressed, and the scene is perceived as being a flatwindow, positioned some depth distance from the observer.However, when the same scene is shown with collimation,these depth distances are no longer compressed, and thescene objects appear to extend in depth, with some closer tothe observer and others farther. This is a reason why manyaugmented and virtual reality displays, especially thoseused for flight simulation and other far-field applications,present collimated light (Watt et al. [36]). Likewise, the NVISnVisor ST60 used by Swan et al. [6], which also presentscollimated light, was originally marketed for military train-ing and forward observer tasks, which primarily involvemedium- to far-field distances.

2.5 Depth Perception and BrightnessAmong objects of the same size and distance, the brighterappear closer than the dimmer. This principal has longbeen known in art, and is discussed by Leonardo DaVinci in his Notebooks (McCurdy [37]). The principal hasbeen thoroughly studied, at both near field (Ashley [38],Farne [39]) and medium field (Coules [40]) distances, andin both monocular and binocular conditions (Coules [40]).In addition to brightness, the contrast between an objectand the background also effects perceived depth, so a darkobject against a light background can appear closer than an

monitor

collimation*lens

minificationlens

accommodationlens

optical*combiners

height*adjustablechin*and*

forehead*restpivot*points

rotatable*assembly

rail*carriers*(adjust*inter8pupillary*distance)

tracking*fiducials

rotatable*assembly

Fig. 4. The Augmented Reality (AR) Haploscope. The physical designallows independent adjustment of vergence angle and focal distance.

object with less contrast (Farne [39]). Among the theoriesthat explain this effect are that brighter objects stimulatea larger area on the retina, and that brighter objects effectpupil size, which then biases other near triad reflexes.

2.6 Related Work in Augmented RealityTo date, including Swan et al. [6], only a small number ofpapers have examined near-field AR depth matching. Ellisand Menges [41] measured the effects of convergence, ac-commodation, observer age, viewing condition (monocular,biocular stereo, binocular stereo), and the presence of anoccluding surface. They found that accuracy is degradedby monocular viewing and an occluding surface. Using thesame experimental setup, McCandless et al. [42] addition-ally studied motion parallax and latency in monocular view-ing; they found reduced accuracy with increasing distanceand latency. Singh et al. [43] found that an occluding surfacehas complex accuracy effects, and Rosa et al. [44] foundincreased accuracy with redundant tactile feedback.

3 THE AUGMENTED REALITY HAPLOSCOPE

As motivated in Section 1, an augmented reality haploscopewas designed and engineered.3 The design was looselybased on the AR haploscopes described by Rolland et al. [45]and Ellis and Menges [41], but similar designs have a longhistory in the study of depth perception (e.g., Swenson [23]).

Fig. 4 shows the AR haploscope. The physical design hasthe following requirements: (1) provide a range of vergenceangles and accommodative demands, (2) adjust to matcha wide range of inter-pupillary distances, and (3) be rigidenough to resist inevitable bumps. To achieve these, thedevice is mounted on an optical breadboard. The primarystructure is built on three optical rails: two 12-inch railsserve as mounting bases for left-eye and right-eye opticalsystems, and both 12-inch rails are mounted on a 24-inch railusing 3-inch rail carriers, which can be adjusted to match therequired inter-pupillary distance.

3. Additional technical details, and a history of preliminary versionsand design tradeoffs, can be found in Singh [8].

Page 5: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 5

monitor

10 cm

minification lens (50 mm concave)

collimation lens (50 mm convex)

accommodation lens (38 mm concave)

virtual object

image generated by minimization

lens 10 cm

5 cm image formed at 0 to 3 D (infinity to 33.3 cm), based on power of accommodation lens

–10 D +10 D 0%to%–3%D

Fig. 5. The optical system of the AR haploscope. Changing the accom-modation lens changes the focal distance.

f

m

n

monitoropticalsystem

eye.rotational.centers.over.pivot.points

opticalcombiner

i

d

1/2α

Fig. 6. Rotating the optical systems to match the correct vergence angle.

The goal of each optical system is to collimate the gen-erated image, so the image is located at optical infinity, or0 diopters (D). Then, the collimated image can either be leftat optical infinity, or a negative power lens can reduce thefocal distance. Fig. 5 shows the optical system. The imageis first generated by a monitor. Then, the image is minifiedby a −10 D concave lens; without minification, only a smallpart of the monitor can be seen through the optical system.As shown in Fig. 5, when this −10 D lens is placed 10 cmfrom the monitor, it creates a minified image at −5 cm. Thisminified image is then collimated by a +10 D convex lens,positioned 10 cm from the image. The collimated image isthen passed through an accommodation lens. This comes froma standard optometric trial set; either a 0 D plain glass lens,which retains the collimation, or a negative power concavelens, which reduces the focal distance. In the experimentsreported here, the strongest accommodation lens used was−3 D, which resulted in a 33.3 cm focal distance. Aftergeneration, the images are reflected into the observers’eyes by 15% reflective optical combiners, mounted at 45◦

directly in front of each eye. Fig. 4 shows the monitors; theminification, collimation, and accommodation lenses; andthe optical combiners.

Fig. 6 illustrates how the haploscope matches differentvergence angles. The rail carriers are adjusted so that the

distance between the pivot points matches the observer’sinter-pupillary distance (Fig. 4). The chin and forehead restis adjusted so that these pivot points are directly below therotational centers of the observer’s eyes. As illustrated inFig. 6, when the left and right optical systems then rotateabout the pivot points, for all convergence distances theview rays from the center of the two eyes stay in linewith the principal axes of the optical systems. This allowspresenting a virtual object at any distance, near (n), medium(m), or far (f ), while the observer’s view rays continue topass through the middle of the optical system, where opticaldistortion is minimized. To display a target object at a spe-cific distance, the optical systems are rotated to the matchingconvergence angle 1/2α (Figs. 2, 6); 1/2α = arctan(i/2d),where i is the observer’s interpupillary distance, and d isthe target distance. The angle of each optical system ismeasured by a constellation of tracking fiducials attachedto each monitor (Fig. 4), which allows an ART TrackPack tomeasure the vergence angle to an accuracy of 0.01◦.

4 EXPERIMENT I: ACCOMMODATION

As discussed in Section 1, Swan et al. [6] hypothesized thatthe linearly increasing overestimation they found with col-limated AR graphics (Fig. 1), was caused by the collimationbiasing the eyes’ vergence angle to rotate outwards by aconstant amount (Fig. 2). The purposes of Experiment I wereto test aspects of this hypothesis, using the same matchingtask and within a similar range of near-field distances.Because Experiment I used a different display—the ARhaploscope—the first purpose (1) was to replicate the realconsistent and AR collimated conditions of Swan et al. [6]. IfExperiment I found similar results, that would suggest thatthese results generalize to AR more broadly, and are notspecific to the NVIS display used by Swan et al. [6]. The nextpurpose (2), the AR consistent condition, was to test whetherpresenting AR objects at a focal distance that was consistentwith the distance specified by other depth cues, especiallybinocular disparity, would result in more accurate depthmatches than what was seen in the AR collimated condition.If the depth matches are more accurate, that would furthersupport the hypothesis that collimated graphics bias theeyes’ vergence angle outwards. However, for many ARapplications, always presenting virtual objects at consistentfocal distances is unlikely to be practical. Therefore, thefinal purpose (3), the AR midpoint condition, was to testwhether presenting AR objects at a focal distance equal tothe midpoint of the tested range would result in performancesimilar to the consistent condition. If the performance issimilar, this would suggest that, for accurate depth matchingwithin reaching distances, the expense of making the focaldemand consistent for every virtual object is not necessary.

4.1 Method

4.1.1 Apparatus and TaskFig. 7 shows the experimental setup. The haploscope wasmounted on the end of an optical breadboard, 244 cm longby 92 cm wide. The breadboard was supported by a custom-built aluminum table, with six legs. Mounted to the legs ofthe table were six hydraulic jacks, which could lift the entire

Page 6: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 6

table, so the surface could be adjusted to be between 104and 134 cm above the ground. This adjustability allowedthe table to be comfortably positioned for observers of manydifferent heights. Aluminum arms extending above the tablesupported tracking cameras, as well as an overhead light(Fig. 7). Because the tracking cameras and light were at-tached to the table, when the table height was adjusted, theirdistance above the table top remained constant. Trackingwas provided by a 2-camera TRACKPACK system, fromA.R.T. GmbH.

On both sides of the table, depth adjusters—plastic pipesrunning through collars—could easily be slid back and forthin depth (Fig. 7). When the real target was presented, ithung from an arm attached to the left-hand depth adjuster.The real target was a wireframe octahedron, 5 cm wide by6 cm high, constructed of balsa wood and painted green.An electric motor rotated the target at 4 rpm. Althoughslow, the rotation gave a definite sense of three-dimensionalstructure from motion, even when viewed monocularly. Thedepth position of the real target was precisely measuredby a tracking fiducial mounted to the arm (Fig. 7). Whenan AR target was presented, the arm supporting the realtarget was removed. The AR target was identical to the realtarget: a green octahedron that rotated at 4 rpm, renderedand viewed through the haploscope optics. Only the greenchannel was used, which eliminated chromatic distortion.Careful calibration ensured that the AR target matched thereal target in size and position at all tested distances. Inaddition, because accommodation lenses of different pow-ers change the overall magnification of the optical system(Fig. 5), the calibration was repeated for every lens power.The targets were located 29 cm above the tabletop, and seenagainst a black curtain hung 1.2 meters from the observer(Fig. 7). The appearance of the real and AR targets wasas similar as possible: the lighting and color of the realtarget made it appear to glow against an otherwise darkbackground, and it did not cast any visible shadows orreflections. The table was covered with black cloth, whichcreated a smooth and featureless surface under the target.

The matching task from Swan et al. [6] was replicated.The pointer was made of green, translucent plastic, ∼4mm in diameter, with a rounded top, mounted on an armattached to the right-hand depth adjuster (Fig. 7). Observersmatched the target depth by sliding the depth adjusteruntil the pointer was directly below the bottom point ofthe rotating target. The distance between the bottom of thetarget and the top of the pointer was ∼1 cm. The depthposition of the pointer was precisely measured by a trackingfiducial mounted on the arm (not visible in Fig. 7).

4.1.2 Experimental Design

Observers: 40 observers were recruited from a population ofuniversity students and staff. The observers ranged in agefrom 18 to 38; the mean age was 20.9, and 18 were male and22 female. 10 observers were paid $12 an hour, and the restreceived course credit.

Independent Variables: Observers saw 4 different conditions:real consistent, AR collimated, AR consistent, and AR mid-point. The target object appeared at 5 different distancesfrom the observer: 33.3, 36.4, 40, 44.4, and 50 cm, which

tracking cameras

background curtain

real target

pointer

tracking fiducial

light

real target depth adjuster

pointer depth

adjuster

Fig. 7. The experimental setup. The AR haploscope was mounted onthe end of an optical breadboard. Real and AR targets were positionedat different depths from the observer. The depth of the targets wasmatched by changing the position of the pointer.

correspond to 3, 2.75, 2.5, 2.25, and 2 D. Observers saw 6repetitions of each distance.

In the real consistent condition, observers saw the realtarget object (Fig. 7), which, by definition, was alwayspresented at a focal distance that was consistent with itsactual distance. In the remaining conditions, the AR targetwas seen. In the AR collimated condition, a 0 D plain glassaccommodation lens was used, presenting the target atoptical infinity. In the AR consistent condition, the accom-modation lens power—3, 2.75, 2.5, 2.25, or 2 D—was alwaysconsistent with the target’s presented distance. Finally, inthe AR midpoint condition, the 2.5 D accommodation lenswas used, presenting the target at a focal distance of 40 cm.

Dependent Variables: The primary dependent variablewas judged distance—the measured position of the pointer(Fig. 7). In addition, error = judged distance − actual distancewas also calculated (Fig. 1).

Design: A mixed design was used, with condition varyingbetween observers, and distance and repetition varyingwithin each observer. There were 10 observers in eachcondition, and the presentation order of condition variedin a round-robin fashion, so each group of 4 observerscovered all conditions. For each observer, distance × rep-etition was randomly permuted, with the restriction thatthe distance changed every trial. Therefore, each observercompleted 5 (distance) × 6 (repetition) = 30 trials, and theexperiment collected a total of 40 (observers) × 30 (trials) =1200 data points.

4.1.3 ProcedureAfter receiving an explanation of the experimental proce-dures, an observer gave informed consent. Then, they tooka stereo vision test, which measured their sensitivity todepth changes encoded by binocular disparity. Next, theobserver’s inter-pupillary distance was measured, using a

Page 7: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 7

pupilometer set to optical infinity, and the haploscope wasadjusted to match this distance. The task was then ex-plained, using the real target and the pointer. If the observerindicated that, when working at the demonstrated dis-tances, they would normally wear corrective optics (glassesor contacts), they were instructed to wear the optics. Ob-servers then donned safety goggles, which easily fit overglasses. The googles had 3.5 cm circular openings for eacheye, and were otherwise covered with black gaffer tape. Thesize of these openings was calibrated so that, when lookingthrough the haploscope optics, observers could see thecomplete field of view provided by the optical combiners,but their peripheral view of the rest of the haploscope wasblocked. The chinrest and forehead rest were adjusted sothat the observer’s eyes were approximately centered withinthe optical combiners, and the haploscope pivot points wereapproximately centered under the eyes’ rotational centers(Figs. 4, 6). The table and chair heights were adjusted so theobserver was sitting comfortably.

The observer then completed one of the four conditions.The pointer was placed at a random position within thetrackable distance of 23 to 67 cm from the observer, and theexperimenter then displayed the first target distance. Usingtheir right hand to manipulate the pointer depth adjuster(Fig. 7), the observer moved the pointer from this startingposition to match the target’s depth. The observer thenclosed their eyes, and the experimenter displayed the nexttarget distance. The observer then opened their eyes, andmoved the pointer from the previously matched distanceto the new distance. This pattern continued until all trialswere completed. To display distances with the real target,the experimenter used the real target depth adjuster to slidethe real target to the correct position. For the AR target, theexperimenter adjusted the angle of each haploscope arm,and swapped out the accommodation lenses as needed.Regardless of condition, the procedures were as similar aspossible, and the time required for each trial was approxi-mately equal. During real consistent trials, observers lookedthrough the haploscope optics, even through the monitorswere switched off.

After the trials, the observer was debriefed. The overallexperiment took approximately one hour.

4.2 Analysis

Similar to Swan et al. [6], the data was analyzed by examin-ing the slopes and intercepts of linear equations that predictjudged distance from actual distance. Multiple regressionmethods determine if the slopes and intercepts significantlydiffer (Pedhazur [46], Cohen et al. [47]). For data withthis structure, multiple regression methods are preferableto ANOVA analysis, because multiple regression allows theprediction of a continuous dependent variable (judged dis-tance) from a continuous independent variable (actual targetdistance), as well as a categorical independent variable(condition). In contrast, ANOVA analysis only examines cat-egorical independent variables, which results in a significantloss of power when an independent variable is inherentlycontinuous (Pedhazur [46]). In addition, multiple regressionyields slopes and intercepts, which as descriptive statisticsare more useful than means, because they directly describe

functions that predict judged distances from actual targetdistances. Finally, multiple regression methods focus oneffect size, as opposed to significance; an analytic approachadvocated by many in the applied statistics community(Cohen et al. [47]).4

Figs. 8a–d and 9 show the results from Experiment I,plotted as a scatterplot of judged against actual distance(Fig. 8), as well as mean error against distance (Fig. 9). Bothfigures indicate that the data is very well fit by linear regres-sions; note the r2 values in Fig. 8. Fig. 10 shows multipleregression analysis, which compares pairs of panels fromFig. 8 against each other; each panel in Fig. 10 examinestwo independent variables: a continuous variable (actualdistance), and a categorical variable (a pair of panels fromFig. 8). To properly account for repeated measurements, foreach observer at each distance, the responses were averagedover the 6 repetitions, reducing the size of the analyzeddataset from 1200 to 200 points—note the reduced densityof points in Fig. 10 relative to Fig. 8a–d.

Each panel in Fig. 10 compares two regression equationsfrom Fig. 8. The multiple regression analysis operates in thefollowing manner: First, the slopes of the equations are testedto see if they significantly differ. If they do, as in Fig. 10a,both equations from Fig. 8 are reported as the best overalldescription of the data in the panel. If the slopes of theequations do not significantly differ, then the intercepts ofthe equations are tested to see if they significantly differ.This test first sets the slopes of the equations—which donot differ—to a common value. If the intercepts significantlydiffer, as in Fig. 10b, two regression equations, with slopesadjusted to a common value, are reported as the best overalldescription of the data in the panel. If neither the slopesnor the intercepts significantly differ, as in Fig. 10c, then thedata from both panels is combined, and a regression overthe combined data is reported as the best overall descriptionof the data in the panel. Therefore, this multiple regressionanalysis yields three possible outcomes, which by chance areillustrated in the first three panels of Fig. 10: (1) the slopessignificantly differ (Fig. 10a), (2) the slopes do not differ butthe intercepts significantly differ (Fig. 10b), or (3) neither theslopes nor the intercepts significantly differ (Fig. 10c).

In each case, the panel also indicates two measures ofeffect size: (1) the overall R2 value, the percentage of vari-ation in the panel explained by the linear regressions, and(2) dR2, the percentage of variation explained by the changein the categorical variable. If dR2 is too small, hypothesistesting is not performed, because any statistical differenceswould be too small to be meaningful (Pedhazur [46]). Basedon the results reported in this paper, hypothesis testing isonly conducted when dR2 ≥ 0.08%. Finally, for each panel,if there is a statistical difference in either slope or intercept,then the distance, d, in cm, between the fitted regressionlines is also reported. When there is a difference in slope,as in Fig. 10a, d is reported at the minimum and maximumx values (33.3 and 50 cm). When there is a difference inintercept, as in Fig. 10b, then the regression lines are thesame distance apart for every x, and only one d value is

4. Custom analysis software, developed by the third author, wasused, which implements methods described by Pedhazur [46]. A moredetailed discussion of the application of multiple regression methodsto depth perception data is available in Swan et al. [6].

Page 8: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 8

Real Consistent AR Collimated AR Consistent AR Midpoint

●●

(a)

y = 0.997x + 0.11 r2 = 99.9%

●●

●●

●●

●●

●●

(e)

y = 0.988x + 0.60 r2 = 99.0%

(i)

●●

●●

●●

● ●

●●

●●

(b)

y = 1.062x − 1.33 r2 = 96.5%

●●

●●

●●

●●

(f)

y = 1.081x − 3.21 r2 = 96.6%

●●

●●

(j)

y = 1.062x − 1.30 r2 = 95.9%

●●

(c)

y = 0.988x + 0.06 r2 = 99.3%

●●

●●

●(g)

y = 0.978x + 0.69 r2 = 99.0%

●●

●●

(k)

y = 1.029x − 1.10 r2 = 99.3%

(d)

y = 1.009x − 0.60 r2 = 99.0%

●●

●●

(h)

y = 0.982x + 0.74 r2 = 98.6%

●●

(l)

y = 1.010x − 0.25 r2 = 99.3%

30

35

40

45

50

55

30

35

40

45

50

55

30

35

40

45

50

55

Exp I

Exp II: O

lder Observers

Exp III: D

im Targets

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Fig. 8. The results for all three experiments, plotted as judged against actual distance. Each panel shows the individual data points, color codedaccording to observer, and fit with a color-coded regression line per observer. For each panel, the dotted line represents veridical performance, andthe solid black line is the overall regression line. The corresponding regression equation is at the top of each panel. The r2 values indicate that thedata in each panel is very well described by the regression equation. A separate group of 10 observers provided the data for each panel. The graphsummarizes N = 1200 (Experiments I and II) and N = 900 (Experiment III) data points.

● ● ● ● ●

(a)

●● ●

●●

(b)

● ● ● ●●

(c)

● ● ● ● ●

(d)

Real Consistent AR Collimated

AR Consistent AR Midpoint−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Err

or (

cm),

±1

SE

M

Experiment

● Exp I

Fig. 9. Experiment I, plotted as mean error against distance (N = 1200).

reported. d is a signed value; d > 0 indicates a distancefarther from the observer, and d < 0 closer to the observer.

4.3 ResultsReal consistent very accurate: Fig. 9a indicates that observerswere extremely accurate in the real consistent condition. Themean error is −0.2 mm, and the slope of the regression forFig. 9a, y = −0.003x + 0.11, does not significantly differfrom 0 (F1,48 = 1.63, p = 0.21). Note that this is statisticallyequivalent to testing whether the slope in Fig. 8a differsfrom 1.

AR collimated increasingly overestimated: When AR collimatedis compared to real consistent (Fig. 10a), the slopes sig-

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(a)

AR Collim: y = 1.062x − 1.33Real Consis: y = 0.997x + 0.11

R2 = 99.1%dR2 = 1.1%, d = +0.7, +1.8 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(b)

Real Consis: y = 0.993x + 0.28 AR Consis: y = 0.993x − 0.11

R2 = 99.7%dR2 = 0.12%, d = −0.4 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(c)

y = 1.003x − 0.25

R2 = 99.6%dR2 = 0.036%

(d)

y = 0.999x − 0.27

R2 = 99.4%dR2 = 0.035%

AR Collim vs Real Consis AR Consis vs Real Consis

AR Midpt vs Real Consis AR Consis vs AR Midpt

30

35

40

45

50

55

30

35

40

45

50

55

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Condition● Real Consis

AR Collim

AR Consis

AR Midpt

Fig. 10. Experiment I, multiple regression analysis, plotted as a scat-terplot of judged against actual distance, with N = 200 ghosted datapoints. The thin dashed lines represent veridical performance. Blue linesrepresent fitted regression equations from Fig. 8. Black and red linesrepresent the linear regressions shown in each panel. Blue lines arenot visible when overlaid by black or red lines; the degree of blue linevisibility is a graphical indication of how closely the regressions in eachpanel agree with the regressions from Fig. 8.

nificantly differ (F1,96 = 10.7, p = 0.001), indicating thatthe AR collimated targets were overestimated, from +0.7 to+1.8 cm (Fig. 9b).

AR consistent underestimated: When AR consistent is com-pared to real consistent (Fig. 10b), the slopes do not signif-icantly differ (F1,96 = 0.68, p = 0.41), but the intercepts do(F1,97 = 41.3, p < 0.001), indicating that the AR consistent

Page 9: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 9

targets were underestimated by a constant −0.4 cm (Fig. 9c).

AR midpoint equivalent to real consistent: When AR midpointis compared to real consistent (Fig. 10c), the effect size of thedifference is 0.036% of the variation, which is too small forany statistical differences to be meaningful. Therefore, thejoint data is best fit by a single equation, indicating that ARmidpoint targets were accurately matched (Fig. 9d).

AR consistent and AR midpoint equivalent: When AR consis-tent is compared to AR midpoint (Fig. 10d), the effect sizeis 0.035%, also too small for any statistical differences to bemeaningful. Therefore, matches of AR consistent and ARmidpoint targets were equivalent (Fig. 9c, d).

4.4 DiscussionThe first purpose (1) of Experiment I was to replicate the realconsistent and AR collimated conditions of Swan et al. [6](Fig. 1). The pattern in Figs. 9a, b indeed matches Fig. 1.Given the many differences between the AR haploscope andthe NVIS display used by Swan et al. [6], this replication isconsistent with the idea that this pattern of results gener-alizes to any collimated AR or stereo display. In addition,Swan et al. [6] hypothesized that collimation biases the eyes’vergence angle to rotate outwards by a constant amount(Fig. 2). For each distance, Fig. 11a shows ∆v, the changein vergence angle,5 for the 10 AR collimated observers. Forall observers ∆v changes less than 0.5◦, and the medianobserver, seen in the boxplot, changes less than 0.072◦. Thesesmall angular changes are consistent with the hypothesisthat, within these reaching distances, the vergence angle biasis constant.

The next purpose (2) was to test whether presentingAR objects at a focal distance that was consistent with thedistance specified by other depth cues, especially binoculardisparity, would result in more accurate depth matches thanwhat was seen in the AR collimated condition. Figs. 9a, b,and c, as well as the analysis in Figs. 10a and b, confirm thishypothesis: AR consistent is much more accurate than ARcollimated, and for a consistent focal distance, real and ARtargets do not differ in slope (Fig. 10b).

The final purpose (3) was to test whether presenting ARobjects at a focal distance equal to the midpoint of the testedrange would result in similar performance as the consistentcondition. Figs. 9c and d, as well as the analysis in Figs. 10cand d, indicate that, when the focus was set to the midpoint,matching was indeed just as accurate.

5 EXPERIMENT II: AGE

As discussed in Section 2.3, increasing age leads to presby-opia, a decline in the ability of the eyes to accommodateto different focal distances. Experiment I found significantnegative effects of collimation, but all of the observers wereyoung, with a mean age of 20.9, and therefore likely notpresbyopic. In addition, as discussed in Section 2.3, although

5. ∆v = α− β, α = 2 arctan(i/2x), and β = 2 arctan(i/2y), where iis the observer’s inter-pupillary distance, x is the actual target distance,and y is the judged distance (Fig. 8). Note that using x assumes thatobservers would match a real object with perfect accuracy, but the veryaccurate and precise results for the real consistent condition suggestthis assumption is reasonable.

Exp I Exp II: Older Observers Exp III: Dim Targets

● ●

● ●

●●

●●

● ● ●● ●●

●●●

●●

●●

● ● ●

●●

● ●

● ● ● ●●

●● ● ●

(a)

● ●

● ●

●●

●●

●●

● ● ●●

●● ● ●

●● ● ● ●

●● ●

●●●

●●

●●

●●

● ●

● ●

(b)

●●

● ●●

●● ● ● ●

●● ● ●

● ● ●●

●●

● ● ●● ● ● ●●

● ●● ● ●

●●

●● ●

●● ●

● ● ●●

(c)−1.2

−0.9

−0.6

−0.3

0.0

0.3

0.6

33.3 36.4 40 44.4 50 33.3 36.4 40 44.4 50 33.3 36.4 40 44.4 50Actual Distance (cm)

∆v°

Fig. 11. For the AR collimated condition, the change in vergence angle∆v = α − β (Fig. 2), when an observer has matched the depth of thevirtual target β with the real pointer α (Fig. 7). Each line in each panelis a different observer. For all N = 30 observers, ∆v is approximatelyconstant across all tested distances. The boxplot gives the value for themedian observer.

older people are worse than younger people at many per-ceptual tasks, recent studies have found that older peoplepreserve their abilities in many tasks related to distance per-ception. Therefore, it was unclear if older observers wouldreplicate the effects observed in Experiment I (Fig. 9). Fur-thermore, this work was primarily inspired by medical ARapplications, and the majority of medical professionals areold enough to suffer some degree of presbyopia. Therefore,the purpose of Experiment II was to replicate Experiment I,using presbyopic observers, aged 40 and older.

5.1 MethodOther than the age of the observers, the methods of Exper-iment II were identical to Experiment I. 40 observers wererecruited from a population of university and communitymembers. The observers ranged in age from 41 to 80; themean age was 55.6, and 19 were male and 21 female. 6observers were paid $10 an hour, 33 were paid $12 hour, andone was not paid. Each observer completed 5 (distance) ×6 (repetition) = 30 trials, and the experiment collected atotal of 40 (observers) × 30 (trials) = 1200 data points.

5.2 ResultsFig. 8e–h shows the results from Experiment II as scatter-plots; the r2 values indicate that the data continues to bevery well fit by regression equations. Fig. 12 shows theresults as error, with Experiment I’s results also shown forcomparison. Figs. 13 and 14 show the results of multipleregression analysis.

Older and younger only differ in AR collimated: Fig. 13 com-pares Experiment I to Experiment II condition by condition.For the AR collimated condition (Fig. 13b), the slopes donot significantly differ (F1,96 = 0.38, p = 0.54), but theintercepts do (F1,97 = 35.7, p < 0.001); the older observersmatched collimated AR targets a constant −1.1 cm closerto themselves than the younger observers (Fig. 12b). Forthe remaining conditions, the effect size of the differencebetween Experiments I and II, 0.013% (Fig. 13a), 0.031%(Fig. 13c), and 0.056% (Fig. 13d), is too small for any sta-tistical differences to be meaningful. Therefore, for the realconsistent, AR consistent, and AR midpoint conditions, theresults for the older observers and the younger observersare equivalent.

Page 10: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 10

●● ● ● ●● ● ● ● ●

(a) ●

●●

●●● ●

●●

(b)

● ●● ● ●● ● ● ●

●(c)

● ● ● ●●● ● ● ● ●

(d)

Real Consistent AR Collimated

AR Consistent AR Midpoint−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Err

or (

cm),

±1

SE

M

Experiment

Exp II

Exp I

Fig. 12. Experiment II: older observers, plotted as mean error againstactual distance (N = 1200). For comparison, Experiment I’s results arealso shown in light grey, offset along the x-axis for clarity.

Real consistent very accurate: Fig. 12a indicates that olderobservers were very accurate when matching the distanceof real targets. The mean error is +0.4 mm, and the slope ofthe linear model for Fig. 12a, y = −0.012x + 0.60, does notsignificantly differ from 0 (F1,48 = 2.5, p = 0.12). Note thatthis is statistically equivalent to testing whether the slope inFig. 8e differs from 1.

AR collimated increasingly overestimated: For the older ob-servers, when AR collimated is compared to real con-sistent (Fig. 14a), the slopes significantly differ (F1,96 =13.8, p < 0.001); the AR collimated errors ranged from −0.7to +0.9 cm (Fig. 12b).

AR consistent underestimated: For the older observers, whenAR consistent is compared to real consistent (Fig. 14b), theslopes do not significantly differ (F1,96 = 0.56, p = 0.46),but the intercepts do (F1,97 = 16.6, p < 0.001); the AR con-sistent targets were underestimated by a constant −0.3 cm(Fig. 12c).

AR midpoint equivalent to real consistent: For the older ob-servers, when AR midpoint is compared to real consistent(Fig. 14c), the effect size is 0.0093%, which is too small forany statistical differences to be meaningful. Therefore, theAR midpoint targets were accurately matched (Fig. 12d).

AR consistent and AR midpoint equivalent: For the older ob-servers, when AR consistent is compared to AR midpoint(Fig. 14d), the effect size is 0.036%, also too small forany statistical differences to be meaningful. Therefore, thematches of the AR consistent and AR midpoint targets wereequivalent (Fig. 12c, d).

5.3 Discussion

The purpose of Experiment II was to replicate Experiment I,using older, presbyopic observers. According to Duane [31],the younger observers in Experiment I had an expected nearfocus of ∼8.3 cm (∼11.5 D), while for these older observersthe expected near focus was ∼68 cm (∼1.5 D).

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(a)

y = 0.992x + 0.35

R2 = 99.8%dR2 = 0.013%

●●●●●●●

●●●

●●●●●

●●●

●●●●●

●●●●●

●●●

●●●●●

●●●●●●●●●●

(b)

Exp I: y = 1.072x − 1.72Exp II: y = 1.072x − 2.82

R2 = 98.0%dR2 = 0.74%, d = −1.1 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●

●●

●●●●●●●●●●

(c)

y = 0.983x + 0.38

R2 = 99.4%dR2 = 0.031%

●●●●●●●●●●

●●

●●●●●●●

●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●●●

(d)

y = 0.995x + 0.07

R2 = 99.2%dR2 = 0.056%

Real Consis: Exp I vs Exp II AR Collim: Exp I vs Exp II

AR Consis: Exp I vs Exp II AR Midpt: Exp I vs Exp II

30

35

40

45

50

55

30

35

40

45

50

55

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Experiment● Exp I

Exp II

Fig. 13. Experiment I versus II, the effect of age, multiple regressionanalysis, with N = 400 ghosted data points. See the caption for Fig 10.

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(a)

AR Collim: y = 1.081x − 3.21Real Consis: y = 0.988x + 0.60

R2 = 98.6%dR2 = 0.20%, d = −0.7, +0.9 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(b)

Real Consis: y = 0.983x + 0.81 AR Consis: y = 0.983x + 0.49

R2 = 99.5%dR2 = 0.081%, d = −0.3 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(c)

y = 0.985x + 0.67

R2 = 99.4%dR2 = 0.0093%

(d)

y = 0.980x + 0.71

R2 = 99.2%dR2 = 0.036%

AR Collim vs Real Consis AR Consis vs Real Consis

AR Midpt vs Real Consis AR Consis vs AR Midpt

30

35

40

45

50

55

30

35

40

45

50

55

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Condition● Real Consis

AR Collim

AR Consis

AR Midpt

Fig. 14. Experiment II, multiple regression analysis, older observers, withN = 200 ghosted data points. See the caption for Fig 10.

Experiment II’s results only differ for the AR colli-mated condition. For collimated targets, older observersshowed less overestimation than younger observers, withmatches shifted towards the observer by a constant −1.1 cm(Fig. 13b). Older observers had a mean error of +0.12 cm,while younger observers had a mean error of +1.2 cm(Fig. 12b), and therefore older observers were on averagemore accurate than younger observers. However, the slope,b = 1.072, is the same for both sets of observers (Fig. 13b),and differs significantly from the slope for the real consistentcondition (Figs. 10a and 14a). Therefore, for both youngerand older observers, matches of collimated targets wereinaccurate, and increasingly overestimated with increasingdistance. In addition, for each distance, Fig. 11b shows∆v, the change in vergence angle for the 10 older ARcollimated observers. For 9 of the 10 observers ∆v changesless than 0.6◦, for the outlying observer it changes 1.4◦, andthe median observer changes less than 0.25◦. These smallangular changes are consistent with the hypothesis that, for

Page 11: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 11

(a) real'target (b) bright'AR'target (c)'dim'AR'target

Fig. 15. Experiment III examined target brightness. (a) The real target.(b) The bright AR target used in Experiments I and II. (c) The dim ARtarget used in Experiment III.

both younger and older observers, the vergence angle biasis constant.

For the other conditions, the observer’ age—and there-fore the observers’ ability to accommodate to different focaldemands—made no difference. Older observers were just asaccurate as younger observers in matching the distance toreal targets, as well as to AR targets with both consistentand midpoint focal cues. These results are consistent withprevious work that has found that older observers preservetheir abilities in many tasks related to distance perception(Bian and Andersen [33]).

6 EXPERIMENT III: BRIGHTNESS

However, a conflicting finding from both experiments is thatAR consistent was underestimated, while AR midpoint wasaccurate and AR consistent was equivalent to AR midpoint.This is true for both Experiment I (Fig. 10) and Experiment II(Fig. 14). These conflicting statistical results are likely dueto the small effect size of AR consistent’s underestimation(d = −0.4 and d = −0.3 cm, respectively). Nevertheless, theunderestimation is statistically significant, and was repli-cated among 20 observers with widely varying ages.

As discussed in Section 2.5, brighter objects appear closerthan similar-sized dimmer objects. Figs. 15a and b showphotographs, taken through the haploscope optics, of thereal and AR targets used in Experiments I and II. The ARtarget appeared brighter than the real target.6 For Exper-iment III, the brightness was reduced, until the AR andreal targets appeared to have the same brightness (Fig. 15c).The purpose of Experiment III was to determine if the dimAR target would increase the accuracy of the AR consistentcondition.

6.1 Method

Other than the brightness of the AR target, the methodsof Experiment III were identical to Experiment I. Becausethe real target object did not change, that condition wasnot replicated. To facilitate comparison with Experiment I,younger observers were recruited, from a population ofuniversity students and staff. The 30 observers ranged inage from 17 to 24; the mean age was 19.8, and 21 weremale and 9 female. 6 observers were paid $12 an hour,and the rest received course credit. Each observer com-pleted 5 (distance) × 6 (repetition) = 30 trials, and the

6. Note that brightness is the perceptual experience of luminance, andcannot be directly measured or captured with a camera. The luminanceof the targets was measured (Singh [8]).

experiment collected a total of 30 (observers) × 30 (trials) =900 data points.

6.2 Results

Fig. 8j–l shows the results from Experiment III as scatter-plots; the r2 values indicate that the data continues to bevery well fit by regression equations. Fig. 16 shows the sameresults as error, with Experiment I’s results also shown forcomparison. Figs. 17 and 18 show the results of multipleregression analysis.

Dim targets differ in AR consistent and AR midpoint: Fig. 17compares Experiment I to Experiment III condition by con-dition. For the AR collimated condition (Fig. 17a), the effectsize of the difference is 0.00027%, much too small for anystatistical differences to be meaningful. Therefore, the resultsfor the dim targets and the bright targets are equivalent(Fig. 16b). For the AR consistent condition (Fig. 17b), theslopes significantly differ (F1,96 = 7.7, p = 0.007), andtherefore the dim targets were matched +0.2 to +0.9 cmfarther than the bright targets (Fig. 16c). And finally, for theAR midpoint condition (Fig. 17c), the slopes do not signifi-cantly differ (F1,96 < 0.01, p = 0.96), but the intercepts do(F1,97 = 17.7, p < 0.001), and therefore the dim targets werematched +0.4 cm farther than the bright targets (Fig. 16d).

AR collimated increasingly overestimated: When dim AR col-limated is compared to real consistent from Experiment I(Fig. 18a), the slopes significantly differ (F1,96 = 5.5, p =0.021), indicating that the dim AR collimated targets wereoverestimated from +0.8 to +1.9 cm (Fig. 16b).

AR consistent equivalent to real consistent: When dim ARconsistent is compared to real consistent from Experiment I(Fig. 18b), the effect size of the difference is 0.031%, whichis too small for any statistical differences to be meaningful.Therefore, the dim AR consistent targets were accuratelymatched (Fig. 16c).

AR midpoint equivalent to real consistent: When dim ARmidpoint is compared to real consistent from Experiment I(Fig. 18c), the effect size of the difference is 0.025%, which istoo small for any statistical differences to be meaningful.Therefore, the dim AR midpoint targets were accuratelymatched (Fig. 16d).

AR consistent and AR midpoint equivalent: When dim ARconsistent is compared to dim AR midpoint (Fig. 18d),the effect size is 0.012%, also too small for any statisticaldifferences to be meaningful. Therefore, the matches of dimAR consistent and dim AR midpoint targets were equivalent(Fig. 16c, d).

6.3 Discussion

The purpose of Experiment III was to determine if a dimAR target, which has the same apparent brightness as thereal target (Fig. 15), would increase the accuracy of theAR consistent condition. When comparing Experiment IIIto Experiment I, there was no difference in matching ARcollimated targets, but increased accuracy for both AR con-sistent and AR midpoint targets (Fig. 17). In addition, whencombined with the real consistent data from Experiment I,

Page 12: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 12

● ● ● ● ●

(a)

● ●●

●● ●

●●

(b)

●●

● ●

● ● ● ●●

(c)

● ● ● ● ●

● ● ● ● ●

(d)

Real Consistent AR Collimated

AR Consistent AR Midpoint−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

−1.0

−0.5

0.0

0.5

1.0

1.5

2.0

2.5

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Err

or (

cm),

±1

SE

M

Experiment

Exp III

Exp I

Fig. 16. Experiment III: dim targets, plotted as mean error against actualdistance (N = 900). For comparison, Experiment I’s results are alsoshown in light grey, offset along the x-axis for clarity.

●●●●●●●

●●●

●●●●●

●●●

●●●●●

●●●●●

●●●

●●●●●

●●●●●●●●●●

(a)

y = 1.062x − 1.32

R2 = 97.6%dR2 = 0.00027%

●●●●●●●●●●

●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●

●●

●●●●●●●●●●

(b)

Exp III: y = 1.029x − 1.10 Exp I: y = 0.988x + 0.06

R2 = 99.5%dR2 = 0.21%, d = +0.2, +0.9 cm

●●●●●●●●●●

●●

●●●●●●●

●●●●●●●●●●

●●●●●

●●●●●

●●●●●●●●●●

(c)

Exp III: y = 1.009x − 0.23 Exp I: y = 1.009x − 0.62

R2 = 99.4%dR2 = 0.10%, d = +0.4 cm

AR Collim: Exp I vs Exp III AR Consis: Exp I vs Exp III

AR Midpt: Exp I vs Exp III

30

35

40

45

50

55

30

35

40

45

50

55

33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Experiment● Exp I

Exp III

Fig. 17. Experiment I versus III, the effect of brightness, multiple regres-sion analysis, with N = 300 ghosted data points. See the caption forFig 10.

matches for the dim AR target were overestimated in the ARcollimated condition, but accurate in both the AR consistentand AR midpoint conditions (Figs. 18). Therefore, for the ARconsistent and AR midpoint conditions, the dim AR targetswere as accurately matched as the real targets (Fig. 16).

In addition, the pattern of AR consistent being underes-timated, while AR midpoint was accurate and AR consistentwas equivalent to AR midpoint, occurred for both theyounger observers of Experiment I (Fig. 10) and the olderobservers of Experiment II (Fig. 14). While Experiment IIIonly tested younger observers, the results are consistentwith the hypothesis that older observers would also accu-rately match the depth of dim AR consistent and dim ARmidpoint targets.

Finally, for the dim AR collimated targets, for eachdistance Fig. 11c shows ∆v, the change in vergence angle,for the 10 AR collimated observers. For all observers ∆vchanges less than 0.4◦, and the median observer changes less

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(a)

AR Collim: y = 1.062x − 1.30Real Consis: y = 0.997x + 0.11

R2 = 98.3%dR2 = 1.1%, d = +0.8, +1.9 cm

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(b)

y = 1.013x − 0.50

R2 = 99.7%dR2 = 0.031%

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

●●●●●●●●●●

(c)

y = 1.003x − 0.07

R2 = 99.7%dR2 = 0.025%

(d)

y = 1.019x − 0.67

R2 = 99.5%dR2 = 0.012%

AR Collim vs Real Consis AR Consis vs Real Consis

AR Midpt vs Real Consis AR Consis vs AR Midpt

30

35

40

45

50

55

30

35

40

45

50

55

33.3 36.4 40.0 44.4 50.0 33.3 36.4 40.0 44.4 50.0Actual Distance (cm)

Judg

ed D

ista

nce

(cm

)

Condition● Real Consis

AR Collim

AR Consis

AR Midpt

Fig. 18. Experiment III, multiple regression analysis, dim targets, withN = 200 ghosted data points. See the caption for Fig 10. The realconsistent data is repeated from Experiment I.

than 0.12◦. These small angular changes are consistent withthe hypothesis that, for the dim AR targets, the vergenceangle bias is still constant.

7 GENERAL DISCUSSION

Constant Vergence Angle Bias: As discussed in Section 1,Swan et al. [6] found that the AR collimated conditioncaused overestimation that increased linearly with distance,and proposed that this was caused by the collimation bias-ing the eyes’ vergence angle to rotate outwards by a constantamount. All of the experiments reported here replicated thisresult, and strongly support this hypothesis. These findingsare also consistent with the prediction, by Mon-Williamsand Tresilian [19], that an inconsistent accommodative cuewould bias perceived depth in the same direction as theaccommodative cue (Fig. 3). However, Swan et al. [6] didnot measure this vergence angle change, and it was notmeasured here. In a future experiment, it should be directlymeasured.

Dim Targets: The experiments found the most accuratematches for dim AR targets, which more closely matchedthe brightness of the real targets. These results are consistentwith previous work that finds brighter objects appear closerthan dimmer objects (Ashley [38], Farne [39], Coules [40]).However, it is interesting that the error for matching thedim AR targets disappeared, even though the error wascalculated between-subjects: in all of the experiments, differ-ent groups of 10 observers saw the real targets, the brightAR targets, and the dim AR targets. It would have beenless surprising to have found these errors in a within-subjects design, where observers made a judgment abouttwo targets with different brightnesses, viewed simultane-ously (e.g., Ashley [38], Farne [39], Coules [40]). The errorsmay be related to the fact that AR targets are drawn withimpoverished depth cues, and therefore brightness couldbe directly biasing the vergence angle. If this hypothesisis true, it would be another component of accurate depthpresentation that must be considered by AR practitioners. A

Page 13: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 13

future experiment should examine whether the brightnessof an AR object directly influences vergence angle.

Midpoint Accommodative Stimulus: While the AR consis-tent condition was accurate for the dim AR targets, theAR midpoint condition was accurate across all of the ex-periments. It is not clear why AR midpoint was accurateat both brightness levels, while AR consistent was not.Nevertheless, the practical implication is that, because theAR midpoint condition was at least as accurate as the ARconsistent condition, for AR applications requiring accuratenear-field depth matching, it is sufficient for the focal de-mand to be set to the middle of the working volume.

However, the positive results for the AR midpoint con-dition suggest comparison with light-field displays, whichcan simultaneously present multiple virtual objects at dif-ferent focal distances. In addition to solving the vergence-accommodation conflict, light-field displays are predicted toeventually become the dominant technology for all kinds of3D experiences (Balram [48]). However, although the tech-nology is rapidly developing, AR light-field displays facemany fundamental challenges and design tradeoffs, in areassuch as depth range, color resolution, spatial resolution,computational demands, and data throughput requirements(Wu et al. [49]). Therefore, the AR midpoint results suggestthat the level of engineering complexity required for thesekinds of displays may not be necessary, especially for ARapplications where the most important perceptual task isaccurate matching at near-field distances (e.g., Edwards [4],Krempien et al. [5]).

Future Work: As discussed in this section, errors detected inthese experiments are likely due to vergence angle biases.Therefore, useful future work would replicate these exper-iments while measuring vergence angle. Possible methodsfor making this measurement include binocular eye track-ing (Wang et al. [50]), or nonius line methods (Ellis andMenges [41]).

In addition, because the AR haploscope was mounted toa tabletop, these experiments could not examine the depthcue of motion perspective. Although some AR applications,such as the operating microscope described by Edwards etal. [4], are also mounted and therefore lack motion perspec-tive, it is a very salient depth cue (Nagata [10], Cutting andVishton [9]), and should be examined in future experiments.A head-mounted AR haploscope, such as the one used byMcCandless et al. [42], would allow a replication of theseexperiments that included motion perspective.

8 PRACTICAL IMPLICATIONS

For accurate near-field depth matching, the experimentsreported here have the following implications:

• Collimated graphics should not be used. A focal distanceset to the middle of the depth range is a good as a focaldistance optimized for every virtual object.

• The brightness of virtual objects needs to match thebrightness of real objects.

• Observers old enough to suffer age-related reductionsin accommodative ability are just as accurate as youngerobservers.

ACKNOWLEDGMENTS

This material is based upon work supported by the NationalScience Foundation, under awards IIS-0713609, IIS-1018413,and IIS-1320909, to J. E. Swan II.

REFERENCES

[1] M. Kersten-Oertel, P. Jannin, and D. L. Collins, “The State of theArt of Visualization in Mixed Reality Image Guided Surgery,”Comput. Medical Imaging and Graphics, vol. 37, no. 2, pp. 98–112,2013.

[2] D. Curtis, D. Mizell, P. Gruenbaum, and A. Janin, “Several devilsin the details: Making an AR application work in the airplanefactory,” in Proc. of Intern. Workshop on Augmented Reality (IWAR),1998, pp. 47–60.

[3] S. J. Henderson and S. Feiner, “Evaluating the benefits of aug-mented reality for task localization in maintenance of an armoredpersonnel carrier turret,” in Intern. Symp. on Mixed and AugmentedReality (ISMAR), 2009, pp. 135–144.

[4] P. J. Edwards, A. P. King, C. R. Maurer Jr., D. A. de Cunha, D. J.Hawkes, D. L. G. Hill, R. P. Gaston, M. R. Fenlon, A. Jusczyzck,A. J. Strong, C. L. Chandler, and M. J. Gleeson, “Design and eval-uation of a system for microscope-assisted guided interventions(MAGI),” IEEE Trans. on Medical Imaging, vol. 19, no. 11, pp. 1082–1093, 2000.

[5] R. Krempien, H. Hoppe, L. Kahrs, S. Daeuber, O. Schorr, G. Eggers,M. Bischof, M. W. Munter, J. Debus, and W. Harms, “Projector-Based Augmented Reality for Intuitive Intraoperative Guidance inImage-Guided 3D Interstitial Brachytherapy,” International Journalof Radiation Oncology • Biology • Physics, vol. 70, no. 3, pp. 944–952,Mar. 2008.

[6] J. E. Swan II, G. Singh, and S. R. Ellis, “Matching and reachingdepth judgments with real and augmented reality targets,” IEEETrans. on Visualization and Computer Graphics, vol. 21, no. 11, pp.1289–1298, 2015.

[7] J. L. Gabbard, D. G. Mehra, and J. E. Swan II, “Effects of ARdisplay context switching and focal distance switching on humanperformance,” article in submission, 2017.

[8] G. Singh, “Near-Field Depth Perception in Optical See-ThroughAugmented Reality,” Ph.D. dissertation, Mississippi State Univer-sity, Aug. 2013.

[9] J. E. Cutting and P. M. Vishton, “Perceiving layout and knowingdistances: The integration, relative potency, and contextual use ofdifferent information about depth,” in Handbook of Perception andCognition: Perception of Space and Motion, W. Epstein and S. Rogers,Eds. Academic Press, 1995, vol. 5, pp. 69–117.

[10] S. Nagata, “How to Reinforce Perception of Depth in Single Two-Dimensional Pictures,” in Pictorial Communication in Virtual andReal Environments, S. R. Ellis, M. K. Kaiser, and A. J. Grunwald,Eds. London: Taylor & Francis, 1989, pp. 527–545.

[11] B. Gillam, “The perception of space layout,” in Perception of Spaceand Motion, 2nd ed., W. Epstein and S. Rogers, Eds. AcademicPress, 1995, pp. 23–67.

[12] G. P. Bingham and C. C. Pagano, “The necessity of a perception-action approach to definite distance perception: Monocular dis-tance perception to guide reaching,” Journal of Experimental Psy-chology: Human Perception and Performance, vol. 24, no. 1, pp. 145–168, 1998.

[13] M. Mon-Williams and J. R. Tresilian, “Some recent studies on theextraretinal contribution to distance perception,” Perception, pp.167–181, 1999.

[14] M. S. Landy, L. T. Maloney, E. B. Johnston, and M. Young, “Mea-surement and modeling of depth cue combination: In defense ofweak fusion,” Vision Research, vol. 35, no. 3, pp. 389–412, Feb 1995.

[15] R. J. Leigh and D. S. Zee, The Neurology of Eye Movements, 5th ed.Oxford University Press, USA, 2015.

[16] J. L. Semmlow and G. K. Hung, “The near response: Theories ofcontrol,” in Vergence Eye Movements: Basic & Clinical Aspects, C. M.Schor and K. J. Ciuffreda, Eds. Butterworth, 1983, pp. 175–195.

[17] H. Ripps, N. B. Chin, I. M. Siegel, and G. M. Breinin, “The effect ofpupil size on accommodation, convergence, and the ac/a ratio.”Investigative Ophthalmology, vol. 1, pp. 127–135, Feb 1962.

[18] D. Kersten and G. E. Legge, “Convergence accommodation,”Journal of the Optical Society of America, vol. 73, no. 3, pp. 332–338,Mar 1983.

Page 14: IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 1 The Effect of ... · In principal, the extra-retinal cues cues could provide absolute egocentric depth information (Gillam [11]). In contrast,

IEEE ELECTRONIC PREPRINT (UNDER REVIEW) 14

[19] M. Mon-Williams and J. R. Tresilian, “Ordinal depth informationfrom accommodation?” Ergonomics, vol. 43, no. 3, pp. 391–404,2000.

[20] E. Kruijff, J. E. Swan II, and S. Feiner, “Perceptual Issues inAugmented Reality Revisited,” in Intern. Symp. on Mixed andAugmented Reality (ISMAR), J. Park, V. Lepetit, and T. Hollerer,Eds. Piscataway, NJ, USA: IEEE, 2010, pp. 3–12.

[21] M. Lambooij, M. Fortuin, I. Heynderickx, and W. IJsselsteijn,“Visual discomfort and visual fatigue of stereoscopic displays: Areview,” Journal of Imaging Science and Technology, vol. 53, no. 3, pp.030 201–030 201–14, 2009.

[22] D. M. Hoffman, A. R. Girshick, K. Akeley, and M. S. Banks,“Vergence–accommodation conflicts hinder visual performanceand cause visual fatigue,” Journal of Vision, vol. 8(3), no. 33, pp.1–30, 2008.

[23] H. A. Swenson, “the Relative Influence of Accommodation andConvergence in the Judgment of Distance,” Journal of GeneralPsychology, vol. 7, no. 2, pp. 360–380, 1932.

[24] E. Brenner and J. M. Van Damme, “Judging distance from ocularconvergence,” Vision Research, vol. 38, no. 4, pp. 493–498, 1998.

[25] D. A. Owens and H. W. Liebowitz, “Accommodation, conver-gence, and distance perception in low illumination,” Optometry& Vision Science, vol. 57, no. 9, pp. 540–550, 1980.

[26] J. R. Tresilian, M. Mon-Williams, and B. M. Kelly, “Increasingconfidence in vergence as a cue to distance,” Proc. of the RoyalSociety of London B, vol. 266, pp. 39–44, 1999.

[27] A. Viguier, G. Clment, and Y. Trotter, “Distance perception withinnear visual space,” Perception, vol. 30, pp. 115–124, 2001.

[28] J. M. Foley, “Binocular distance perception,” Psychological Review,vol. 87, no. 5, pp. 411–434, 1980.

[29] J. H. Iavecchia, H. P. Iavecchia, and S. N. Roscoe, “Eye accommo-dation to head-up virtual images,” Human Factors, vol. 30, no. 6,pp. 703–712, 1988.

[30] S. N. Roscoe, “Bigness Is in the Eye of the Beholder,” HumanFactors, vol. 27, no. 6, pp. 615–636, 1985.

[31] A. Duane, “Normal values of the accommodation at all ages,”Journal of the American Medical Association, vol. LIX, pp. 1010–1013,1912.

[32] S. Kasthurirangan and A. Glasser, “Age related changes in accom-modative dynamics in humans,” Vision Research, vol. 46, pp. 1507–1519, Apr 2006.

[33] Z. Bian and G. J. Andersen, “Aging and the perception of ego-centric distance,” Psychology and Aging, vol. 28, no. 3, pp. 813–825,2013.

[34] G. Heron, W. N. Charman, and C. M. Schor, “Age changes in theinteractions between the accommodation and vergence systems,”Optometry and Vision Science, vol. 78, no. 10, pp. 754–762, 2001.

[35] G. J. Andersen, A. Saidpour, and M. L. Braunstein, “Effects of Col-limation on Perceived Layout in 3-D Scenes,” Perception, vol. 27,no. 11, pp. 1305–1315, 1998.

[36] S. J. Watt, K. Akeley, M. O. Ernst, and M. S. Banks, “Focus CuesAffect Perceived Depth,” Journal of Vision, vol. 5, no. 10, pp. 834–862, 2005.

[37] E. McCurdy, The Notebooks of Leonardo da Vinci, Volume II. Reynal& Hitchcock, 1938.

[38] M. L. Ashley, “Concerning the significance of intensity of light invisual estimates of depth,” Psychological Review, vol. 5, no. 6, pp.595–615, 1898.

[39] M. Farne, “Brightness as an indicator to distance: Relative bright-ness per se or contrast with the background?” Perception, vol. 6,pp. 287–293, 1977.

[40] J. Coules, “Effect of photometric brightness on judgments ofdistance,” Experimental Psychology, vol. 50, no. 1, pp. 19–25, 1955.

[41] S. R. Ellis and B. M. Menges, “Localization of virtual objects in thenear visual field,” Human Factors, vol. 40, no. 3, pp. 415–431, 1998.

[42] J. W. McCandless, S. R. Ellis, and B. D. Adelstein, “Localizationof a time-delayed, monocular virtual object superimposed on areal environment,” Presence: Teleoperators and Virtual Environments,vol. 9, no. 1, pp. 15–24, 2000.

[43] G. Singh, J. E. Swan II, J. A. Jones, and S. R. Ellis, “Depth judgmentmeasures and occluding surfaces in near-field augmented reality,”in ACM Symp. on Applied Perception in Graphics and Visualization(APGV). New York, NY, USA: ACM, 2010, pp. 149–156.

[44] N. Rosa, W. Hurst, P. J. Werkhoven, and R. C. Veltkamp, “Visuo-tactile Integration for Depth Perception in Augmented Reality,” inProc. of ACM Intern. Conf. on Multimodal Interaction (ICMI). ACM,2016, pp. 45–52.

[45] J. P. Rolland, W. Gibson, and D. Ariely, “Towards quantifyingdepth and size perception in virtual environments,” Presence:Teleoperators and Virtual Environments, vol. 4, no. 3, pp. 24–49, 1995.

[46] E. J. Pedhazur, Multiple Regression in Behavioral Research, 2nd ed.Holt, Rinehart and Winston, 1982.

[47] J. Cohen, P. Cohen, S. G. West, and L. S. Aiken, Applied MultipleRegression/Correlation Analysis for the Behavioral Sciences, 3rd ed.Lawrence Erlbaum Associates, 2003.

[48] N. Balram, “The next wave of 3-D—light-field displays,” Informa-tion Display, vol. 30, no. 6, pp. 4,49, November/December 2014.

[49] W. Wu, K. Berkner, I. Tosic, and N. Balram, “Personal near-to-eyelight-field displays,” Information Display, vol. 30, no. 6, pp. 16–22,November/December 2014.

[50] R. I. Wang, B. Pelfrey, A. T. Duchowski, and D. H. House, “Online3D Gaze Localization on Stereoscopic Displays.” ACM Trans. onApplied Perception, vol. 11, no. 1, pp. 1–21, 2014.

PLACEPHOTOHERE

Gurjot Singh Biography text here.

PLACEPHOTOHERE

Stephen R. Ellis Biography text here.

PLACEPHOTOHERE

J. Edward Swan II Biography text here.


Recommended