Download - Epistemic Time Bias in Newcomb’s Problemmf3095/Papers_files/Fusco_pandoc.pdf · 2019-09-19 · Epistemic Time Bias in Newcomb’s Problem* Melissa Fusco Causal decision theorists

EpistemicTimeBiasinNewcomb’sProblem*MelissaFusco

CausaldecisiontheoristslikeDavidLewisholdthat,whileanagentshouldchooseactsbyusinghercurrent,ratherthananticipated,credencesovercausaldependencyhypotheses,sheshouldalsoupdatebyconditionalizationonherownactassheperformsit(Lewis1981a,1976).Thispackageleadstoanunflatteringformoftime-bias,whichcanbehighlightedbyconsideringiteratedNewcombproblems.Afterpresentingthepuzzle,IdiscussaCDTer’sbestresponse.

1. IntroductionSupposeyouwilleitherexperienceanhourofpainonTuesdayandanhourofpleasureonThursday,oranhourofpleasureonTuesdayandanhourofpainonThursday.Neitherpossibilityseemsclearlybetterthantheother—unless,thatis,todayisWednesday.Forafixedmenuofpleasuresandpains,time-biasedagentspreferdistributionswhereinmorepainsareinthepast,andmorepleasuresareinthefuture.

Figure1:BasicTimeBias.

ThetwopossibilitiesinFigure1illustratethisbasiccommitment.

AclassicdiscussionoftimebiascanbefoundinChapter8ofParfit’sReasonsandPersons(Parfit1984).1Parfitimaginesthathewakesupinthehospital,andistoldbyanursethateitherhehasjustsurvivedaverylongandpainfuloperation(whichhedoesnotremember,sincepatientsundergoinghissurgeryareregularly

*WarmthankstoArifAhmed,LaraBuchak,AlexKocurek,andJohnMacFarlane(viapandoc)fortheirhelponthispaper.

1Parfitdistinguishestwoformsofbias,futurebias—beingmoreconcernedaboutone’sfuturewelfarethanforone’spastwelfare—andnearbias—beingmoreconcernedforthenearfuturethanforthedistantfuture.Futurebiasisthoughttobethemorerationallydefensiblethannearbias(seeop.cit.,pg.158ff.)Ifocusonlyonfuturebiashere.

givenpost-operativemedicationwhichinducesamnesia),orhehasyettoundergoashorterversionofanoperationofthesametype.WhileParfit’scharactergenerallypreferstosufferless,ratherthanmore,henowfindsthathewouldmuchprefertolearnthathehasalreadyhadthelonger,earlieroperation.Hispreferencesseemtohavebeenreversedbythemerepassageoftime.Noristhisareversalbetweenoutcomeswhichareintuitivelyequal.Rather,Parfit’scharacternowprefersapossibilityinwhichhislifecontainsmoretotalhoursofsuffering.

Figure2:Parfit’sSurgeries.

InthispaperIinvestigateaformoftimebiaswhichisentailedbyacommonformofcausaldecisiontheory.Itcanbeseeninthecausalist’sresponsetoNewcomb’sproblem.

Figure3:NewcombTimeBias.

AstandardcausaldecisiontheoristinNewcomb’sproblem—anagentwhopreferstotwo-box,giventhechance—willnonethelessprefertolearnthatNewcombproblemswhichinvolveher,butoverwhichshecurrentlyhasnoagency,areonesinwhichsheone-boxed.WhenParfittianamnesiaisonthescene,timebiaseasilytakesroot:learningaboutone’spastactionsisaparadigmofbeinginvestedinadecisionoverwhichonedoesnotcurrentlyexerciseagency.2Henceanagent,locatedat𝑁𝑂𝑊,whoseesthepastasfixedbutthefutureasherstochoose,willprefertheupperstateinthefiguretothelowerone.Aswewillsee,theCDTer’stimebiasis,if2ArifAhmedpointsouttomethatanact’sbeinginthepastisnotessentialtothedifferencebetweenchoosingandlearning.TherelevanttypeofCDTerwillalsoprefertolearnthatshewillone-boxinthefuture.Thisistrue,butsuch“learnings"arepuzzlingbecauseofthedifficultymanypeople(myselfincluded)willhavereconcilingsuchforeknowledgewithfreeagency.Forthisreason,Iwillfocusinthisessayonlearningaboutthepast.

anything,strangerthanthetimebiasinvestigatedbyParfit,becauseitistraceabletoanormative,ratherthanaffective,source.

Onepuzzleaboutthissetupwillimmediatelypresentitself.Isaythat,inFigures1-3,time-biasedagentswouldpaytoswitchfromthelowerpossibilityintotheupper.But—itwillbeobjected—anagentlocatedat𝑁𝑂𝑊cannotreallymakesuchaswitch,sincethatwouldinvolvechangingherpast.Thisistrue,butIshowinsection3thatthebasicstructureofthetime-biasedpreferencesillustratedinFigure3can,withsuitablemanipulation,beturnedintoactionablepreferenceswhichoperationalizethebasicdisputebetweenthebiasedandunbiasedperspectives.

Nonetheless,thisisaconfessionalessay:ImaintaintheCDTer’scommitmentsinbothcases,despitethe(literal)costs.Damagecontrolcommencesinsection4,whereIarguethatEvidentialDecisionTheoryisalsocommittedtoaformoftimebias.Thequestion,then,iswhetherthetwoformsofbiashaveanequalclaimtobefeaturesofrationalbehavior.

2. EvidenceandDecisionInthissectionIsketchthesharedbackgroundofcausalandevidentialdecisiontheorywithaneyetotwoissues,screening-offandconditionalization,whichbearontheargumenttocome.

Iwillassumethatinalldecisionproblems,agentshavethegoalofmaximizingexpectedutility.Theycometotheseproblemsequippedwithavaluefunction𝑉𝑎𝑙(⋅)onpossibleworlds,whichcanbeliftedontopropositionscalledoutcomes;3followingstandardidealization,Iwilloccasionallyspeakasiftheunitsintherangeofthevaluefunctionareequivalenttodollars.Anagent’sdecisionproblematatime𝑡canberepresentedasatriple⟨𝒜, 𝒮, 𝐶𝑟1⟩,where

• 𝒜 = {𝑎5, … , 𝑎7}isasetofactsavailabletotheagentat𝑡,

• 𝒮 = {𝑠5, … , 𝑠:}isasetofadmissiblestatesofnature,whereasetofstatesisadmissibleinadecisionproblemonlyifeachact-stateconjunction(𝑎; ∧ 𝑠=)isassignedadeterminateimage𝑉𝑎𝑙(𝑎; ∧ 𝑠=)underthevaluefunction,and

• 𝐶𝑟1(⋅)istheagent’stime-𝑡credencefunction.4

3Theliftingofthevaluefunctioncanbeachievedasfollows.Where𝑣𝑎𝑙(⋅)isaworldlyvaluefunctionin𝑊 → ℝ,(i)𝑉𝑎𝑙({𝑤}) = 𝑣𝑎𝑙(𝑤),and(ii)𝑉𝑎𝑙(𝑝 ∪ 𝑞)isdefinedonlyif𝑉𝑎𝑙(𝑝) = 𝑉𝑎𝑙(𝑞),inwhichcase𝑉𝑎𝑙(𝑝 ∪ 𝑞) = 𝑉𝑎𝑙(𝑝).

4Inwhatfollows,Itreatthemembersof𝒜,𝒮,andthedomainof𝐶𝑟1(⋅)alikeaspropositions.ThisdiffersfromSavage’soriginalpictureoftherelevantprimitives,onwhichactsarefunctionsfromstatestooutcomes.SeeJoyce(1999Ch.2)fordiscussionofthisshift.

Atafirstpass,theagent’scredencefunction𝐶𝑟1(⋅)willbeconceivedofinBayesianterms.Iassumethatitisaprobabilityfunction,andthattheagentisdisposedtorespondtonewinformationbyconditioningthisfunctiononthestrongestpropositionofwhichsheisinformed.I’llalsoassumethatherexpectationsaregenerallyfrequentist,inthesensethatwhentheyconcerneventsinthepresentorfuture,theywill,ceterisparibus,conformtoobservedpastfrequenciesoverevent-classesofthesametype.

Againstthisbackground,wecanintroduceSavage(1972)’sdecisiontheory.Itappliesthegeneralstatisticalnotionofexpectationtothevalueofanact𝑎 ∈ 𝒜.5

Equation1(SavageExpectedUtility).𝑆𝐸𝑈1(𝑎) = 𝐶; 𝑟1(𝑠;)𝑉𝑎𝑙(𝑎 ∧ 𝑠;).Savage’snorm:maximizeexpectedutilityat𝑡bychoosinganact𝑎 ∈ 𝒜suchthat𝑆𝐸𝑈1(𝑎)ismaximal:thatis,suchthat𝑆𝐸𝑈1(𝑎) ≥ 𝑆𝐸𝑈1(𝑎′),forany𝑎′ ∈ 𝒜.

EDTandCDTcanthenbeframedastwodifferentwaysofcashingoutafinalconditionSavageputonwell-formulateddecisionproblems:therequirementthattheset𝒮ofstatesintheproblemare,attime𝑡,independentoftheagents’acts.

EDTconstruesact-stateindependenceintermsofprobabilisticconfirmation.Ingeneral,theEDTerholdsthatrationalityrequirescalculatingexpectedutilitiesintermsofact-conditionedprobabilities𝐶𝑟1(𝑠|𝑎):

Equation2(EvidentialExpectedUtility).𝐸𝐸𝑈1(𝑎) = 𝐶; 𝑟1(𝑠;|𝑎)𝑉𝑎𝑙(𝑎 ∧ 𝑠;).ConditionalProbabilitynorm:maximizeexpectedutilitybychoosinganact𝑎 ∈ 𝒜suchthat𝐸𝐸𝑈1(𝑎) ≥ 𝐸𝐸𝑈1(𝑎′),forany𝑎′ ∈ 𝒜.

ThisdoesjusticetooneattractiveglossonSavage’sindependencerequirementon𝒮:itentailsthatSavage’sequationwillapplytoanyprobleminwhich𝐶𝑟1(𝑠|𝑎)isequalto𝐶𝑟1(𝑠)—inotherwords,toanyprobleminwhichtheagenttakes𝑎tobeprobabilisticallyindependentof𝑠.

CDT,ontheotherhand,favorsaninterpretationofact-stateindependencethatiscausal.SherejectstheEDTer’sglossbecauseprobabilisticindependence—alackofprobabilistic(dis)confirmation—isneithernecessary,norsufficient,forcausalindependence.6Inthispaper,IwilluseLewis(1981a)’s“dependencyhypothesis”characterizationoftheCDT.Thistakesasprimitiveapartition𝒦ofdependencyhypotheses,which,arestipulatedtobecausallyindependentofthe

5Where𝐹isafunction,thegenericnotionofexpectedvaluesaysthat𝐸[𝐹] = 𝑓;; 𝑃𝑟(𝐹 = 𝑓;).Here,ourprobabilityfunctionisthesubjectivecredencefunction𝐶𝑟1(⋅),andthefunction𝐹is𝑉𝑎𝑙(⋅)acrossact-statepairs,wheretheactisheldfixed.

6Fornon-necessity,wecanconsideranycase(suchasNewcomb)inwhichanact𝑎probabilisticallyconfirmssomestate𝑠whosetruth-valuewasfixedbefore𝑎occurred.Fornon-sufficiency,wecanappealtoexamplessuchasHesslow(1976)’swell-knownthrombosiscase:whilebirthcontrolpillscausethrombosis,theeffectisstatisticallymaskedbecausebirthcontrolpillscounteractpregnancy,whichitselfcausesthrombosis.

agent’savailableacts,aswellassufficienttofixanoutcomevalueforeach(𝑎 ∧ 𝑘;)conjunction.7

Equation3(CausalExpectedUtility).𝐶𝐸𝑈1(𝑎=) = 𝐶; 𝑟1(𝑘;)𝑉𝑎𝑙(𝑎= ∧ 𝑘;),where𝒦 = {𝑘5. . . 𝑘7}isapartitionofdependencyhypotheses.CausalSupportnorm:maximizeexpectedutilitybychoosinganact𝑎 ∈ 𝒜suchthat𝐶𝐸𝑈1(𝑎) ≥ 𝐶𝐸𝑈1(𝑎′),forany𝑎′ ∈ 𝒜.

ThiswayofstatingCDTleavesopenthemodel-theoreticimplementationofwhatisspecialaboutthe𝑘’s—leavesopen,thatis,howdoxasticstatesmightbemodeledsoastoreflectanagent’sconvictionthatthe𝑘’sarecausallyindependentofactsin𝒜.Itisclear,however,thatthesemodelingrequirementswillexceedthoseofaustereBayesianism.8Notethat,sinceEEUispartition-invariant,theevidentialdecisiontheoristcanalsocalculateexpectedutilityusingthe𝒦-partition.9

WearenowinapositiontodefineNCProblems,decisionproblemswiththegeneralstructureofNewcomb’spuzzle.Formypurposes,anNCProblemwillbeaproblemwithtwoacts𝒜 = {𝑎∗, ¬𝑎∗}andtwocausaldependencyhypotheses𝒦 ={𝑘5, 𝑘V},inwhichadominanceargumentacross𝒦givestheagentaone-utileincentivetochoose𝑎∗;nonetheless,because¬𝑎∗indicatesamuchmorevaluableoutcome(whichI’llrefertogenericallyas“thebigprize”),theconditionalprobabilitynormfavoredbytheEDTerrecommends¬𝑎∗.HencefromtheCDTer’spointofview,thedistinctivethingtosayaboutNCproblemsisthatmaximizingexpectedutilitywillentailgettingsome“badnewsontheside"(Joyce2007pg.542)—newsthatstatisticallydisconfirmsone’sgettingthebigprize.

Hereisanexamplewhichwewillrevisitthroughout.

MoodCandles.Lightingaromaticmoodcandles(=𝐿)slightlyincreaseshappiness.Bobisdecidingwhethertolightamoodcandle.Healsofindshimselfunabletorememberwhetherhesuffersfromdepression(=𝐷).Inhiscurrentstateofignoranceandindecision,hispossibleoutcomes,andtheirvalues,are:

NotDepressed(¬𝐷) Depressed(𝐷)Don’tlight(¬𝐿) 9 0

Light(𝐿) 10 1

7Foraprobabilisticversionofthesameidea—whichdoesnotrequirethatdependencyhypothesesentailoutcomes,butmerelythattheyfixtheirchances—see§10ofLewis(1981a).

8CandidatesincludethedirectedacyclicgraphsofPearl(2000),andtheworldlydistancemetricsofLewis(1973)’streatmentofnaturallanguagecounterfactuals.

9Evidentialexpectedutilityispartitioninvariantinthesensethattheexpectedutilityassignedtoallacts𝑎 ∈ 𝒜w.r.t.acredalstate𝐶𝑟1(⋅)willbethesameinanyproblems⟨𝒜, 𝒮5, 𝐶𝑡1(⋅)⟩and⟨𝒜, 𝒮V, 𝐶𝑡1(⋅)⟩solongas⋃𝒮5 = ⋃𝒮V.SeeJoyce(1999pg.176ff.),andreferencestherein,fordiscussionofthisfeature.

There’sacatch,however.Knowledgeofthetherapeuticvalueofmoodcandlesisrare.Onlypeoplewhoaredepressedtendtohaveit,sinceonlytheyareregularlyupdatedbytheirdoctorsaboutmoodtherapies.HenceBob’scredencesaresuchthat𝐶𝑟1(𝐷|𝐿) > 𝐶𝑟1(𝐷):whilelightingmoodcandlesmodestlyconducestohappiness,itisastrongindicatorofunhappiness.

Aflatfooted(perhapsnaive;seebelow)calculationbytheEDTer’sconditionalprobabilitynormwillrecommendthatBobrefrainfromlightingamoodcandle,solongas𝐿raisesthestatisticalprobabilityof𝐷bymorethan1/9.10GiventhatBobalsobelievesthatlightingmoodcandlescannotcausedepression,however,itisunclearthatthisistherightrecommendation.

2.1 ScreeningOffTheformaldescriptionofanNCproblemisclearlycoherent,andMoodCandlesprimafaciefitsthebill.Somephilosophershaveargued,however,thattherearenoNCproblemsinreallife—atleast,notforrationalagents.Thisisacommonfactoroftwovenerablethreadsintheliterature:theso-called“Ramseythesis"((Ahmed2014Ch.8),(Ramsey1990)),andElleryEell’s“tickledefense"ofEDT(Eells1982).Thesharedideaisthataslongasaresponsibleagenttakesherfullpre-decisionevidenceintoaccount,andisnotirrationallyinfluencedbyfactorsoutsideofhercontrol,shewillbeabletoaccessevidence𝑍thatscreensoffthestatisticalsupportthatheractwouldapparentlyprovideforanystate𝑘:11

Screeningoff.𝑍screensoff𝑘from𝑎relativetoaprobabilityfunction𝐶𝑟1(⋅)justincase𝐶𝑟1(𝑘|𝑎 ∧ 𝑍) = 𝐶𝑟1(𝑘|𝑍).

10Calculation:

𝐸𝐸𝑈1(¬𝐿) > 𝐸𝐸𝑈1(𝐿)iff

𝐶;

𝑟1(𝑘;|¬𝐿)𝑉𝑎𝑙(𝑘; ∧ ¬𝐿) > 𝐶;

𝑟1(𝑘;|𝐿)𝑉𝑎𝑙(𝑘; ∧ 𝐿)iff

> [𝐶𝑟1(¬𝐷|𝐿)×10 + 𝐶𝑟1(𝐷|𝐿)×1]iff> [𝐶𝑟1(¬𝐷|𝐿)×10 + 𝐶𝑟1(𝐷|𝐿)]

Since𝐶𝑟c1 (⋅)=𝐶𝑟1(⋅ |𝜙)isitselfaprobabilityfunction,thisisequivalentto

[𝐶𝑟¬e1 (¬𝐷)×9] > [𝐶𝑟g1(¬𝐷)×10 + (1 − 𝐶𝑟e1(¬𝐷))]

Letting𝑛 = 𝐶𝑟¬e1 (¬𝐷)and𝑚 = 𝐶𝑟e1(¬𝐷),thisobtainsjustincase

9𝑛 > 10𝑚 + (1 − 𝑚)iff𝑛 > 𝑚 + 1/9.

11Eells’sdiscussionemphasizesthefirstidea—screeningevidenceisalwaysavailablepre-decision—whileRamseyemphasizesthesecond—thatanagentnotirrationallyinfluencedbyfactorsoutsideofhercontrolinfactcannotlearnanythingfromheracts.BeyondEells,theissueisdiscussedinAhmed(2014Ch.8)andBriggs(2010).

Ascreeningproblem.Anagent’sdecisionproblem⟨𝒜,𝒦, 𝐶𝑟1(⋅)⟩isascreeningproblemjustincasetheagent’sevidentialpositionat𝑡makesaccessiblesome𝑍whichscreensoffall𝑎 ∈ 𝒜fromall𝑘 ∈ 𝒦,relativeto𝐶𝑟1(⋅).

Whensomesuch𝑍isavailablepre-decision,theadditionalevidentialvalueofactuallyacting—actuallymaking𝑎insteadof¬𝑎thecase,orviceversa—isnothing.𝐶𝑟1(𝑘) = 𝐶𝑟l1(𝑘) = 𝐶𝑟l1(𝑘|𝑎)forall𝑎and𝑘,sothecharacteristicdecisionrulesofEDTandCDTwillalwaysendorsethesameact.

MoodCandlesisdesignedtosuggestscreening-off.Therearetwoparticularlyappealingcandidatesfortheproposition𝑍whichwouldscreenof𝐿from𝐷.Thefirstwilllikelyhaveoccurredtothereaderupabove:itcomesfromBob’sabilitytoreflectonwhathealreadybelievesabouttheproblemheisfacing.InMoodCandles,Bobbelievesthatlightingupgiveshimanextrautile,andthatitiscausallyindependentfromwhetherhegetsthebigprize.(Asnotedabove,exactlyhowhisdoxasticstatereflectsthisconvictionaboutcausalindependenceisaquestionIamcurrentlyleavingunanswered,buthedoesbelieveit.)ItfollowsthatBobalreadyhasevidencethatheisamemberofacertainclass:theclassofpeoplewhoareawarethatlightingmoodcandlesisefficaciousforhappiness.Thisisstatisticalevidencefor𝐷—bythedescriptionoftheproblem,onlydepressedpeopletendtobethatinformedabout(small,but)effectivewaysofelevatingone’smood.Thefirstcandidate“screener”proposition,then,supportsahighcredencein𝐷.Onthisinterpretation,Bob’smeremiddlingcredencein𝐷looksratherlikeabitofepistemicnegligence.12

Asecondcandidateisalsointrospective,butpullsintheotherdirection.ItfocusesonBob’sintrospectiveaccesstohowhefeels,ratherthanwhathebelieves.𝐷,afterall,isthepropositionthatBobisdepressedat𝑡.Surelyagentshavesomeprivilegedaccesstowhethertheyaredepressed,whichcomesfromconcentratingonhowtheyfeel.Bythedescriptionoftheproblem,though,Bobdoesn’tfeelstronglydepressed.13Reflectiononthissupportsalowcredencein𝐷.Onthisinterpretation,Bob’smeremiddlingcredencein𝐷mightstillberemiss—butbecauseitshouldbelowerthanitis,ratherthanhigher.

Ofcourse,thereisnotreallyan“either-or”inthiscase:Bobshouldbeepistemicallyresponsibleintakingintoaccountbothwhathe(already)believesandhowhe(already)feels.Bobmaybeinsuchasingularevidentialsituationvis-à-vis

12IntheNewcombliterature,thisthoughtfindsanallyinargumentthatifanagentcanreflectthather“decisionalgorithm”recommendstwo-boxing,shecanalreadyconcludethatthepredictorleftnomoneyintheopaquebox.Foraversionthisargument,seeYudkowsky(2010pg.26ff.).

13Iconstruethistobeunderlyingthefactthat𝐶𝑟1(𝐷)isn’thigh,intheoriginaldescriptionofthecase.

histotalevidencethatitisjustdifficulttosaywhatajustifiedcredencein𝐷wouldbe.

Thisiswhythesuggestionthatreal-lifeNCproblemsareallscreeningproblemshashadaequivocaleffectontheliterature.Briggsweighsinonthethesisasfollows:

WhetherthetickledefenserulesoutallcasesofconflictbetweenEDT[and]CDT...[is]controversial.Perhapsagentscanbereasonableenoughtolooktodecisiontheoryforadvicewithoutbeingasrationalandself-awareas[screening-off]requires...[However,]itseemsclearthatagreatmanysuitableagentswillhaveinformationthatrenderstheiractionsevidentiallyirrelevanttothedependencyhypotheses.Totheextentthatsituations[tothecontrary]arerare,thetaskofdecidingamongvariouspossibledecisiontheoriesisrenderedmuchlessurgent.(Briggs2010,pg.28,emphasisadded)

Twonondenominationalmorals,Ithink,emergefromconsiderationsofthescreening-offdebate.Thefirstisthatscreening-offargumentsinNCProblemswillalwayssupportthedominantoption–thatis,theactprimafaciefavoredbytheCDT,ratherthantheEDTer.14Thesecondisthatanagent’spre-decisionreflectiononherfullinformation—anactivityIwillcallepistemicentrenchment—mustbeconsideredbyboththeoriesinanyputativeNCproblem,eveniffullscreening-offiscontroversial.Thisoftenleadstosubtledisputesaboutwhatintrospectioncandeliver,fromtheagent’spointofview.

ForCDTers,entrenchmentservesasareminderthatcausalistsdonotactuallyignoretheirevidence.InNCproblems,dominancereasoningcanobscurethefactthatevenaCDT-rationalagentmusthavesomeparticularcredenceovereachdependencyhypothesisin𝒦beforesheacts.Decisionprocedureaside,shewillnotmakethebestdecisionsshecouldmakeifthesecredencesfailtoreflecthertotalevidence.15

14Why?Ina2×2NCProblem,let𝒜 = {𝑎∗, ¬𝑎∗}and𝒦 = {𝑘5, 𝑘V}.Byscreeningoff,thereissome𝑍whichtheagentcanupdateonbeforeactingsuchthat∀𝑘: 𝐶𝑟l1(𝑘|𝑎) = 𝐶𝑟l1(𝑘).Itfollowsfromthisthat𝐶𝑟l1(𝑘|𝑎∗) = 𝐶𝑟l1(𝑘|¬𝑎∗)forall𝑘 ∈ 𝒦.Callingthisnumber𝑛,wehave𝐸𝐸𝑈(𝑎∗) = 𝑛𝑉𝑎𝑙(𝑎∗ ∧𝑘5) + (1 − 𝑛)𝑉𝑎𝑙(𝑎∗ ∧ 𝑘V),and𝐸𝐸𝑈(¬𝑎∗) = 𝑛𝑉𝑎𝑙(¬𝑎∗ ∧ 𝑘5) + (1 − 𝑛)𝑉𝑎𝑙(¬𝑎∗ ∧ 𝑘V).Bydominance,𝑉𝑎𝑙(𝑎∗ ∧ 𝑘5) > 𝑉𝑎𝑙(¬𝑎∗ ∧ 𝑘5)and𝑉𝑎𝑙(𝑎∗ ∧ 𝑘V) > 𝑉𝑎𝑙(¬𝑎∗ ∧ 𝑘V).Hence𝐸𝐸𝑈(𝑎∗) > 𝐸𝐸𝑈(¬𝑎∗).ThisisnotanembarrassmenttotheEDTer’stheory,sincescreeningoffmakesgoingfordominantoptionscompatiblewiththattheory.

15ForasimilarobservationabouthowdominanceargumentscanobscureaCDTer’sevidentialduties,seeJoyce(2012pg.239).HereItakethetraditionalviewthatepistemicnormssuchasthePrincipleofTotalEvidencearesuigenerisprinciplesofepistemicrationality,whosejustificationispriortothejustificationofanypracticalmaximstheyinform(see,forexample,(HareandHedden2016pg.615)).Forargumentthatinvertsthisorderofpriority—offeringapracticalvindicationofthePrincipleofTotalEvidence—seeGood(1967);foranargumentthatEDT,incontrasttoCDT,hastroubleunderwritingGood’sTheorem,seeSkyrms(1990).

EDTers,fortheirpart,needepistemicentrenchmenttocombattheobjectionthattheirviewrecommendspatentlyabsurdactsthatwouldapparentlyresultfromgivingtoomuchweighttogeneralstatisticalcorrelations.ConsiderPearl’scomplaint:16

[AccordingtoEDT,][p]atientsshouldavoidgoingtothedoctortoreducetheprobabilitythatoneisseriouslyill;workersshouldneverhurrytowork,toreducetheprobabilityofhavingoverslept...remedialactionsshouldbebanishedlesttheyincreasetheprobabilitythataremedyisindeedneeded.(Pearl2000pg.108-9)

Thereasonablereplyhere,onbehalfoftheevidentialist,appealstoanagent’sneedtoincorporateherfullinformationintohercredalstatebeforesheacts.FocusonPearl’sexampleofhurryingandlateness(whichwecancallHurryingtoClass).GrantPearl’sassumption,thathurrying(e.g.toa10amclass)onanygivendayiscorrelatedwithlatenessonthatday:

HurryingtoClass.Myoutcomematrixandcredencesareasfollows:

NotLate(¬𝐿) Late(𝐿)

hurry(𝐻) 9 0

don’thurry(¬𝐻) 10 1

𝐶𝑟1(late|hurry) > 𝐶𝑟1(late)

Whydoesn’ttheEDTer’snormrecommendthatIavoidhurryingtoday,rightnow,asIheadtocampus?

Inthesimplestcase,Ihaveatimekeepingdevicewithme,andeitherdo,oreasilycould,glanceatit,toseethatitis(e.g.)9:46am.Relativetoanyspecifictime—afortiori,relativeto9:46am—thecorrelationbetweenlatenessandhurryingisreversed:

𝐶𝑟1(late|hurry ∧ 9: 46am) < 𝐶𝑟1(late|9: 46am)

Hencerelativetomyfullinformation,hurryingisnotcontraindicatedbymyact-conditionalizedcredences.

WhathappensifIdonothaveawatch?Itseemsthatthesamereasoningoughttohold—afterall,forebearingtohurrywhenoneisunsureofthetimeseemslikeanevenworseideathanforebearingtohurrywhenonedoesknowthetime.

Hereisawaytogeneralizethestrategy.Theargumentsketchedaboveclearlydidnotdependonthetime’sbeingexactly9:46.Rather,itdependedonthefactthattheagent(here,me)waslocatedatsomeparticulartime,whichshecouldthinkaboutinadirectway.Supposethatagentsgenerallyhavethepowertopick16ItakethequotefromPearlviaAhmed(2014pg.82).

thecurrenttimesrigidlybymeansofsomedemonstrativetoken—say,𝜇.17RelativetomyfullinformationinHurryingtoClass,Iaccept

𝐶𝑟1(late|hurry ∧ thetimeis𝜇) < 𝐶𝑟1(late|thetimeis𝜇)

Justinvirtue,then,ofthepatentlyintrospectablefactthatthetimeis𝜇,anEDTagentinaproblemlikeHurryingtoClasscanconcludethatthereisnoin-respect-of-latenessreasonnottohurry.Iwillreturntothisargumentin§4.

2.2 ConditionalizationandLewis’sPackageUpabove,IdescribedcredencebysayingthatagentsarepresumedtohaveBayesiandispositions—includingadispositiontoupdatebyconditionalizingontheanynewevidencetheyreceive.Thispresentation,however,wasneutralwithrespectthenormativestatusofthattendency.Bayesianism’sadditional,normativeclaimisthatconditionalprobabilitiesprovideadiachronicnormofbeliefrevision—thatanagentoughttorevisehercredenceinlightofevidence𝐸bymovingfrom𝐶𝑟1(⋅)to𝐶𝑟1~(⋅) = 𝐶𝑟1(⋅ |𝐸).IwillcallthisnormConditionalization(withacapital‘C’.)

Conditionalizationentails,asaspecialcase:

(CNC)InanNCproblem,oneoughttoupdatebyconditionalizingonone’schosenact𝑎.

(CNC)appearstodovetailwellwiththeCDTer’sconciliatorypositiononnewsvalues.Relativetoafixedcredalstate,propositionslike𝐿—amoodcandleislit—or𝐻—theagenthurries—carryastablenewsvaluebothbefore,andafter,thatpropositionisanavailableact.TheCDTpositionissimplythat,whene.g.𝐿isanavailableoption,itshouldbechosen(orpassedup)accordingtoitscausalexpectedutilityratherthanitsnewsvalue.

Lewishimselfseemstohaveembraced(CNC).18Theconjunctionof(CNC)andtheCDTer’sexpectedutilityequationcompriseajointCDTerpositionwhichwecancall“Lewis’spackage"(LP):

(LP)Atalltimes𝑡,oneoughtto:(i)assignexpectedutilitytoanavailableact𝑎usingone’stime-𝑡credencesacrossthedependencyhypotheses𝒦,selectingthe𝐶𝐸𝑈-maximalact;

17SeeMoss(2012)’s“Dr.Demonstrative”example(§1)forasimilarmaneuver,inthecaseofself-locatingbeliefs.

18InLewis(1976pg.302),hewritesthat“therearegoodreasonswhythechangeofbeliefthatresultsfromcomingtoknowanitemofnewevidenceshouldtakeplacebyconditionalizingonwhatwaslearned."AfootnotefromthequotedsentencerefersthereadertoTeller(1973)’sDutchBookargumentinfavorofConditionalization.LewisadvanceshisownDutchBookforConditionalizationinLewis(1999).

(ii)whenmakingpropositionsin𝒜truebyone’sactions,plantoevolve𝐶𝑟1(⋅)accordingtoConditionalization.

Conceptually,thereissomethingoddabout(LP).IfanagentwithLewis’sformofCDTisself-aware,shecananticipatewhathercredenceinthevariousdependencyhypotheses𝑘 ∈ 𝒦willbeatanyfuturetime𝑡~whenshehasperformedagivenavailableact𝑎.19ByConditionalization,thesefuturecredencesarejust

{𝐶𝑟1(𝑘|𝑎): 𝑘 ∈ 𝒦}

ByEquation2,thesearethesamenumberstheEDTerwillcurrentlyapplytocalculatetheexpectedvalueof𝑎.Infavoring𝐶𝐸𝑈over𝐸𝐸𝑈asaguidetoaction,then,theLewisianCDTerdefactoholdsthatone’scurrentcredencesinthedependencyhypothesesoverrulethecredencesoneplanstoadopt.Thisseemsintensionwiththeideathatthereissomethingmoreinformedaboutthemoreopinionatedcredencefunction𝐶𝑟1(⋅ |𝑎),invirtueofwhichitistheoneanagentshouldplan(followingConditionalization)totransitiontouponperforming𝑎.

Thistensioncanbeseeninthejuxtapositionofnormsgoverningactionandbeliefchange.FortheLewisianCDTer,alackofcausalefficacymakesitthecasethatoneshould,inchoosinganact,beunmoved,sotospeak,bythefactthattheactprovidesstatisticalevidenceforadesirablestate.Toreturntoourfirstexample,inlightofitslackofefficacyinpromoting𝐷,Bobshouldbeunmoved,inhisdecision-making,bythefactthat𝐿providesgoodstatisticalevidencefor𝐷.ButbecausetheLewisianCDTeralsothinksConditionalizationapplies,hebelievesthatifBobdoesbringabout𝐿,hemust,inaway,be“moved"byitsevidentialforceafterall:Bobisrationallyrequiredtoincreasehisconfidencein𝐷uponprocessingthefactthathemade𝐿true.20

3. TheNewcombHospitalCanthisberight?Thepresentsectionwillbedevotedtosketchinganoddconsequenceof(LP),drawingonananalogywithParfit’samnesiacases.Thesettingisahospitalfundedbyaneccentricbillionaire,whereamnesiacpatients—besetbyvarioushandicaps—gettofacedailyNewcombpuzzles.

SupposeyouwakeupintheNewcombhospitaltoday,knowingthatyoufacedNewcomb’sproblemyesterday,butunabletorememberwhatyoudid.Youarecompletelyevidentiallyindifferentonthematter,assigningaprobability1/2to

19Andhaslearnednothingmore:𝑎is,at𝑡~,hertotalevidence.

20Again,Lewisseemstohavebeencomfortablewiththisresult.AddressingNewcomb,hewriteswithanairofresignationthatatwo-boxermustexpectone-boxerstocomeoutahead:“[w]e[two-boxers]...didnotpleadsurprise.Weknewwhattoexpect."(Lewis1981bpg.378).SeeByrneandHájek(1997)formorediscussionofLewis’spackageview.

thepropositionthatyouone-boxedandaprobability1/2tothepropositionthatyoutwo-boxed.Thenewsthatyoudidtheformerwouldbeanear-to-perfectindicationthatyoumade$1,000yesterday,andthenewsthatyoudidthelatterwouldbeanear-to-perfectindicationthatyoumade$1𝑀yesterday.Henceinyourstateoftotalignoranceyouestimateyourwinningsat$500,500.ALewisianrationalamnesiac,whileatwo-boxer,muchprefersthenewsthatsheone-boxedinthepast.

Atradereflectingthispreferencecanbeengineered.

NewcombPast.Fry(aCDTer)awakens,bedriddenandwithamnesia,intheNewcombHospital.TodayisWednesday,andFryisinbed336.Bender,Fry’shospitalroommate,inbed335.WhileFrycan’trememberwhetherheone-boxedonTuesday,Bendertruthfullytellshimthathe,Bender,one-boxedonTuesday.Fry’swinningsfromyesterdayareinalockboxmarked“336-Tuesday”.Bender’sareinalockboxmarked“335-Tuesday”.For$𝛥,BenderofferstoswitchlockboxkeyswithFry(callthistrade𝜎).

Whatisthevalue𝜎?BecauseFryconditionalizesonhisevidence,whenBendertellshimthatheone-boxedyesterday,Frybecomes90%confidentthatBender’slockbox,336-Tuesday,has$1Minit.Heestimatesthevalueofhisownlockboxatamere$500,500.So𝜎,thedealBenderisoffering,isattractiveunlesstheamountofmoneyhedemandsforthetrade,𝛥,isgreaterthan$399,500.

TheattractivenessofthetradeinNewcombPastreflectsthefactthat,onLewis’spicture,itisveryfinethingtohavethepastofaone-boxer.Ofcourse,itisalsoafinethingtobeatwo-boxer,sincetwo-boxingmaximizescausalexpectedutility.Wecaneasilyengineerabetwhoseappealreflectsthispreferenceaswell,byinducingsomeuncertaintyoverwhichactisbeingperformed:

NewcombPresent.It’sstillWednesdayinthehospitalwithdailyNewcombrounds.Butnowit’stimeforFrytochoosetoday’smove.Itworkslikethis:thepredictorhasalreadydepositedeither$1Mornothinginopaquelockbox“Wednesday-336",dependingonwhethershepredictedthatFrywould,today,chooseact2𝐵(whichdumpsanextrathousandintoWednesday-336).Likewise,shehasalreadydepositedeither$1MornothinginBender’slockbox,“Wednesday-335",dependingonwhethershepredictedBenderwouldperform2𝐵(whichdumpsanextrathousandintoWednesday-335).

Unfortunately,Fryhasaninjurythatmakesspeechimpossible,andduetoamix-up,thenurseincorrectlybelievesheisamonolingualDingbatsspeaker.Thenursegiveshimtheday’sNewcombmenuwithtwooptionsprintedentirelyinDingbats:hecaneitherchoose“@”or“#".Hedoesnotknowwhichonecorrespondsto2𝐵.Infrustration,Fryrandomlycirclesthefirstoption,“@”.

Bender’sformisinEnglish—hecaneitherchoose“1𝐵"or“2𝐵"—anditisstillblank.For$𝛥,BenderofferstoswitchformswithFry(callthistrade𝜏).

HowshouldFryevaluate𝜏?ThevalueofthetradedependsonwhathewoulddowithBender’sformifhegotit.Butthatpartiseasy:fromtheCausalPointofView,atwo-boxforminaNewcombproblemisalwaysworth$1000morethanaone-boxform(itensuresthattheextrathousandisdumpedintoone’sbox).SoaCDTerlikeFry,whochooses𝜏andgetsablankformwillcertainlygoontopickthetwo-boxoption.Indeed,thetradeBenderisofferingisappealingtohimsolongas𝛥 < $500.

Argument:let𝛽betheamountofmoneyalreadyinthebox.AsintheoriginalNewcombpuzzle,becausetheidentityof𝛽iscausallyindependentofanythingFrydoesnow,wecanframetheproblemfromtheCDTer’spointofviewviaanargumentthatabstractsfromthevalueof𝛽:

𝑝(@ = 1𝐵) ¬𝑝(@ = 2𝐵)

𝜏 𝛽+(1000-𝛥) 𝛽+(1000-𝛥)¬𝜏 𝛽+0 𝛽+1000

IfFryiscompletelyindifferentontheidentityof@,then𝐶𝑟(𝑝)=.5.Hence

𝐶𝐸𝑈1(𝜏) = .5(𝛽 + (1000 − 𝛥)) + (1 − .5)(𝛽 + (1000 − 𝛥))= 𝛽 + (1000 − 𝛥)

𝐶𝐸𝑈1(¬𝜏) = .5(𝛽 + 0) + (1 − .5)(𝛽 + 1000)= 𝛽 + 500

Hence𝐶𝐸𝑈1(𝜏) > 𝐶𝐸𝑈1(¬𝜏)iff𝛥 < $500.(Forapresentationoftheargumentwithoutthevariable𝛽,seeAppendix.)

3.1 PuttingitalltogetherOurlaststeptogettingthepreferencesreflectedinFigure3istocombinethebettingbehavioroftheCDTersinNewcombPastandNewcombPresent.Thefollowingsetup,whilebaroque,willdothejob:

NewcombHospitalFusion.Asbefore,it’sWednesday.Thelockboxes,Fry’samnesia,andmutualknowledgethatBenderone-boxedyesterdayareasinNewcombPast.AsinNewcombPresent,FryhasbeengivenaDingbatsformfortoday’sroundandhasrandomlymarked“@"onit,whileBenderwasgivenaforminEnglish.BenderoffersFrya“combodeal":for$𝑛,hewillswitchlockboxkeysandformswithFry(callthistrade𝜔).

BeforeFrymakeshisdecision,helearnsoneadditionalpieceofinformationfromtheDingbats-languagechartbyhisbed:yesterday,theacthechosewas#.

HowwillFryweighthecombodeal?Thecausalexpectedutilityofchoosing“@"onthisroundisequaltothecausalexpectedutilityofchoosing“#"givenhiscurrentcredences,sinceeachacthasa50%chanceofbeingtheactofone-boxinganda50%chanceofbeingtheactoftwo-boxing.Becauseheknowshechose#yesterdayandhascurrentlymarked@onhisform,heknows,evenbeforehetrades,thatheisoneofthetwosituationsinFigure3.Foranadditional$𝛥—bychoosing𝜔—hecanguaranteetheoutcomeaLewisianCDTerconfidentlydesires:thathehasaone-box-pastandtwo-box-futureasopposedtothereverse.

NewcombTimeBias:Figure3,repeated.

AsinNewcombPresent,thepartofthecombodealthatensuresatwo-boxingpresentisappealing.Evenmoreappealingisthethepartthatmimicsbeingabletoensureaone-boxingpast—thatis,Fry’sbeingabletotradehisTuesdayboxforBender’s.SinceBenderone-boxedyesterday,hisbox,box335-Tuesday,is90%certaintocontainamilliondollars.Indeed,aslongas𝛥 <$400,500,Frywillchoosetotrade.

Argument:let𝛽betheamountofmoneythatisalreadyinFry’sWednesdaybox—thatis,theboxhewillgettoday.Nomatterwhat,FryisnottradingawayhisWednesdaybox(rather,ifhetakes𝜔heswitchesTuesdayboxesandWednesdayformswithBender.)Sothevalueof𝛽—thoughitisactdependent—is,again,causallyindependentfromwhatFrychoosestodo.

IfFrychooses𝜔,heswitchesformswithBender,soheinsulateshisownpayofffromtheidentityof@;hewillget𝛽 + (1000 − 𝛥)todayinanycase.Fry’spayoffswillthenfurtherdependonwhetherBender’sTuesdayboxcontains$1𝑀ornothing(letthesebepropositions5𝑇1and5𝑇0,respectively).

IfFrydeclinesthecombodeal,optingfor¬𝜔,hekeepsthecontentsofhisownTuesdaybox,andhispayoffswillthendependonwhetherhisTuesdayboxcontains$1𝑀ornothing(letthesebepropositions6𝑇1and6𝑇0,respectively).Moreover,histotalpayoffdoesnowdependontheidentityof@—hewillgetanextrathousandjustincase@= 2𝐵(row2,below).

I. 5𝑇1, 𝑝, 6𝑇1 II. 5𝑇1, 𝑝, 6𝑇0 III. 5𝑇1,¬𝑝, 6𝑇1 IV. 5𝑇1,¬𝑝, 6𝑇0𝜔 1𝑀 + (1000 − 𝛥)

+ 𝛽1𝑀 + (1000 − 𝛥)

+ 𝛽1𝑀 + (1000 − 𝛥)

+ 𝛽1𝑀 + (1000 − 𝛥)

+ 𝛽¬𝜔 1𝑀 + 0 + 𝛽 0𝑀 + 0 + 𝛽 1𝑀 + 1000 + 𝛽 0𝑀 + 1000 + 𝛽 V. 5𝑇0, 𝑝, 6𝑇1 VI. 5𝑇0, 𝑝, 6𝑇0 VII. 5𝑇0,¬𝑝, 6𝑇1 VIII. 5𝑇0,¬𝑝, 6𝑇0𝜔 0𝑀 + (1000 − 𝛥)

+ 𝛽0𝑀 + (1000 − 𝛥)

+ 𝛽0𝑀 + (1000 − 𝛥)

+ 𝛽0𝑀 + (1000 − 𝛥)

+ 𝛽¬𝜔 1𝑀 + 0 + 𝛽 0𝑀 + 0 + 𝛽 1𝑀 + 1000 + 𝛽 0𝑀 + 1000 + 𝛽

Utilities.

Whatoftheappropriatecausalistcredences?KnowingBenderone-boxedyesterday,Fryis90%confidentin5𝑇1,thepropositionthatthepredictorput$1𝑀inBender’sboxonTuesday.Hencehehascredence90%acrosscolumnsI-IVandcredence10%acrosscolumnsV-VIII.Withineachofthesepossibilities,heis90%confidentthepredictorpredictedhisownmove(#)correctlyyesterday,soheexpectsacorrelationof.9between¬𝑝(whichentailsthat#isone-boxing)and6𝑇1,thepropositionthatthepredictorput$1𝑀in336-Tuesday.Finally,hisoverallconfidencein𝑝is.5.Thatyields:

I. 5𝑇1, 𝑝, 6𝑇1 II. 5𝑇1, 𝑝, 6𝑇0 III. 5𝑇1,¬𝑝, 6𝑇1 IV. 5𝑇1,¬𝑝, 6𝑇0𝜔 0.045 0.405 0.405 0.045¬𝜔 0.045 0.405 0.405 0.045 V. 5𝑇0, 𝑝, 6𝑇1 VI. 5𝑇0, 𝑝, 6𝑇0 VII. 5𝑇0,¬𝑝, 6𝑇1 VIII. 5𝑇0,¬𝑝, 6𝑇0𝜔 0.005 0.045 0.045 0.005¬𝜔 0.005 0.045 0.045 0.005

Credences.

Whatoftheappropriatecausalistcredences?KnowingBenderone-boxedyesterday,Fryis90%confidentin5𝑇1,thepropositionthatthepredictorput$1𝑀inBender’sboxonTuesday.Hencehehascredence90%acrosscolumnsI-IVandcredence10%acrosscolumnsV-VIII.Withineachofthesepossibilities,heis90%confidentthepredictorpredictedhisownmove(#)correctlyyesterday,soheexpectsacorrelationof.9between¬𝑝(whichentailsthat#isone-boxing)and6𝑇1,thepropositionthatthepredictorput$1𝑀in336-Tuesday.Finally,hisoverallconfidencein𝑝is.5.Thatyields:

Acalculationshowsthat𝐶𝐸𝑈(𝜔)=900,000+(1000-𝛥)overthebaseline𝛽,while𝐶𝐸𝑈(¬𝜔)=$500,500inexcessofthesamebaseline.HenceFrywilltakethedealaslongas𝛥 <$400,500.

3.2 KarmaForetoldFromtheLewisianpointofview,time-biasedtradingattheNewcombHospitalseemstobemotivated.Ofcoursethecausaldecisiontheoristwantsthepastofaone-boxer—thisisanexcellentindicatorthatsheisalreadyamillionaire!Andofcourseshewantstotwo-boxtoday—thatguaranteesthatshewalksawaywith$1,000morethanshewouldhavegottenotherwise!21

21IhereusethecounterfactuallocutionLewisfavoredindescribingcausalistreasoning;see,forexample,(Lewis1981b).

Thesesentimentsarearticulatedfromthepointofviewofaparticulartime—forexample,thetimelabeled“𝑁𝑂𝑊"inFigure3.Butatthepointintimemarked“after"—whenbothroundslieinthepast—takingthecombodealwillseem,bytheCDTer’sownlights,likebadnews.Moretothepoint,Ifear,itlookslikeanirrationaldecision.SupposeFrywakesupamnesiaconThursday,andistoldthatBenderofferedhimthecombodealonWednesday.OnWednesday,Frycouldhavekepthis@formforfree,ensuringthattheactionheperformedonWednesdaywasdifferentinkindthantheactionheperformedonTuesday—hence,thathewasinoneofthescenariosinFigure3,withoutknowingwhich.Now,forFrytolearnwhetherhetookBender’scombodealistolearneitherthat:

(a)Inthelasttwodays,hetwo-boxedonceandone-boxedonce,insomeunknownorder,or

(b)Inthelasttwodays,heone-boxedonceandtwo-boxedonce,andwillinglygaveBender$400,000toensurethatthefirsthappenedbeforethesecond.

ThoughIamaCDTer,theideaoflearningthatIsecured(b)over(a)insuchasituationmakesmeuncomfortable.Itishardtojustifytheclaimthatonewayoforderingmyactsintimeisworth(morethan)$400,000morethantheother.22

4. CDT’sresponseTheoddityinNewcombHospitalFusionarisesbecausethetiebetweenone-boxingandbeingamillionaireisinvisibletoFry’sdeliberationsatthemomentofchoice—onlytoreappearwithavengeancelater,whenheassesseswhathebelievesisinyesterday’slockboxes.ThismakesitimpossibletostablyassumethatFrydoes,ordoesn’t,takethefactthatanagentisaone-boxertobegoodreasontoincreaseone’sconfidencethatthatagentisamillionaire.

Inmostordinarysituations,theredoesseemtobeastablefact.InMoodCandles,forexample,thecausalist’sintuitionisthatBobshouldlightacandleeventhoughcandle-lightingis—atleast,inthegeneralpopulation—correlatedwith

22Itisworthnotingthat,whilethepost-factonewsvalueof(a)exceedsthatof(b),NewcombHospitalFusionisnotaDutchBook:choosing(b)over(a)doesnotguaranteethatyouarepoorerineverypossibleworld(thoughitdoesmakeitlikely).Aworld𝑤willmaketheagentbetteroff,where

• 335-Tuesdaycontains$0in𝑤,

• 336-Tuesdaycontains$1millionin𝑤,and

• @isone-boxing.

ForadiachronicNewcombproblemforthetwo-boxerthatdoesmakeastandardCDTerpoorerineverypossibleworld,andtheextraassumptionsthisinvolves,seeAhmed’s“NewcombInsurance"cases(Ahmed2017,2014pg.202ff.).

depression.Italsoseemslike,ifhedoeslightup,hefailstoacquireagoodreasontoincreasehisconfidencethatheisdepressed.Atleast,thatiswhatIsuggestedabove:thereissomethingoddaboutthinking,withLewis,thatifBobdoeslightup,hewouldtherebyacquiregoodevidenceforthinkinghewasdepressedafterall.Heknowswhyhe’scontemplatinglightingup—heisinpursuitoftheextrautile,whichtheactwillsecurehim—andit’snotbecauseheisdepressed.

4.1 TimeBiasandEDTAsindicatedintheintroduction,myfirstdefensivemaneuveristoarguethatEDTisalsosusecptibletotimebias.ItwillbeuptothereadertodeterminewhethertheEDTer’sformofbiasisasunflatteringasFry’s.

RecallthatinthePearl-inspiredexampleHurryingtoClass(§2.1),hurryingonanygivendaywasgenerallycorrelatedwithlatenessonthatday:

𝐶𝑟1(late|hurry) > 𝐶𝑟1(late)

Nonetheless,relativetoaspecifictime—like9:46am—thatcorrelationfailed(infact,wasreversed),sothatrelativetotheagent’sfullavailableinformationhurryingwasnotafterallcontraindicated,byeitherdecisiontheory.Forcaseswheretheagentlackedawristwatchorothertimekeepingdevice,Iextendedthescreening-offargumentbyappealtointrospection—totheagent’sabilityto“lockon”tohercurrenttimebymeansofsomedemonstrative𝜇,anduse𝑍=thetimeis𝜇toscreenoffthesupportlenttolatenessbyhurrying.

Thisability,whichextendsonlytothepresentmoment,givesrisetotime-bias.SupposeanEDTer,whohasjustwokenat𝑡unabletorememberwhatshehasdoneforthelastmonth(theintervaloftime“𝑇�”),findsherselfinaniteratedversionofHurryingtoClass.Ifshecaresabouthertrackrecordofarrivingontimetoclass,shemaywell:

(i)prefertohurryat𝑡,and

(ii)prefertolearnthatshedidn’thurryatanypasttime𝑡′ ∈ 𝑇�.

TheEDTerinHurryingtoClassshouldhaveattitude(i)forthereasonsexplainedabove:overturningPearl’sargumentdependsonherleveragingher(difficult-to-express)knowledgethatitisnowthecurrenttime.Butthispowerislimitedto𝑡:withregardtoherpast,itisrationaltocontinuetomaintainattitude(ii),despitetakingupattitude(i).Thoughsomethingsimilartothethoughtinvolving𝜇wastrueinthepast—afterall,foranymorningintheagent’spastitwasthattimethenasmuchasitisthistimenow—thepropositionexpressedbytheformerthoughtdoesnothavetheforce,at𝑡,thatthelatterthoughtdoes.EDT’sappealtoscreeningoffinmanyputativeNCproblemswillcommittheviewtotimebiasincaseswhere,asinNewcombHospital,thoseNCproblemsareiterated.

Figure4:ASharedStructure(toppreferredtobottom).

Itisworththinkingaboutwhythisisso,andwhatthismeansforLewis’sviewsonupdating.AtissueinLewis’sPackageiswhetheranagent—forexample,inanNCProblem—shouldplantoconditionalizeontheactsheperforms.Onewayofthinkingaboutthisistoaskwhether,inanNCproblem,yourattitudetowardstheacts𝑎′ ∈ 𝒜youarecurrentlycontemplatingshouldbethesameasthepreferencesyouwouldhaveifyouwokeuptomorrowwithamnesiaandwereabouttobetoldwhich𝑎′ ∈ 𝒜youperformed.

Butintrospection’spowertocreateanddiscoverpropositionsconcerningthetime𝑡ofchoiceissharplydifferentacrossthetwosituations.InHurryingtoClass,Icanthinktomyself,thetimeis𝜇.Oddly,atthetimeofchoice,Iseemtogainsomeknowledgebythisexercise—knowledgethataffectsmyrationalcalculations.Later,post-amnesia,Iwillnothavethispower.Itispatentlyuninformativetobetold,“youreflectedthenthatthetimewas𝜈,whereby𝜈youmeant:thetimeitinfactwas.”

AcomparisonbetweenthissituationandaCDTer’sreasoninginNCProblemsistelling.InanNCproblem,Icantellmyself,“Iwillnowchoosethedominantoptionbecauseofthe(small)benefitIgetfromit”.Iamintrospectingmymotives,whichgivemeareasontoperform𝑎∗withoutgivingmeanextrareasontothinkthatIwon’tgetthebigprize.IfIamsubsequentlysubjectedtoamnesia,laterIwillonlybeabletolearnthatIeitherperformedthedominantoptionorIdidn’t.AnyextraknowledgeIhadaccesstoattheearliertimewillbewipedout:itfeelsuninformativetobetold“youperformedthedominantoptionbecauseitwasthedominantoption”.Whilenotvacuous,thisthoughtseems,again,obvious—likethethoughtthatifyouhurried,youhurriedatsomeparticulartime.

5. Conclusion

IbeganbyarguingthatCDTistime-biased,andgoingontoshowthatthisleadstooddchoicesintheNewcombHospital.Inresponsetothisbias,IleveledatuquoqueagainstEDT.Time-biasmaybeapervasivefeatureofdecision-making,notaweaknessinanyparticulartheoryofit.

ItwouldbebesttoclosebyreturningtoNewcombitself.EDTersmayfeelthat,inshiftingthefocustoanagent’stheact-priorviewpoint,andtheepistemicreasonsthatcanbearticulatedfromit,Ihavequietlysidesteppedtherealbiteoftheoriginalproblem.Forinthatpuzzle,whetheritispossible,beforeyouact,tocometoanevidentiallysupportedcredenceinthe𝒦-partition{Predictorpredictedyou’done-box,Predictorpredictedyou’dtwo-box}(={𝑃1,𝑃2})ispreciselywhatisatissue.Thereisawidespreadtendencyintheliteraturetoassume

(P)InNewcomb’spuzzle,theonlyevidenceavailablewithrespecttodistributingcredenceoverthepartition{𝑃1,𝑃2}ishowtheplayerhim/herselfreasonsabouttheexpectedutilityofone-boxing.

Thiswouldapparentlyrenderfullreflection,andanyattemptatscreeningoff,impossible.Ahmed,forexample,pressesthepointthat“[a]nyonefacingNewcomb’sproblemhasnoevidencethatrelevantlydistinguisheshimnowfromanyoneelsewhomthestatisticalgeneralization[𝑃𝑟(𝑃1|1𝐵)and𝑃𝑟(𝑃2|2𝐵)arehigh]covers;thatis,allotherpersonswhoeverfacethisproblem"(Ahmed2014pg.191).Thissuggests(P)’sepistemologicalmoral—thatthereisjustnoway,inNewcomb,toeventrytotakeyourfullevidenceintoaccountbeforeyouact,sincenothingcountsasrelevantevidenceotherthantheactitself.

Butwhile(P)mightcharacterizeNewcomb’sProblem,itisn’tsupported,eitherbydeductiveorabductiveconsiderations,bythebasicmechanicsofanNCproblem—wherethelatterisdescribedsimplyintermsofcausaldependencyhypotheses,credences,andoutcomestates,andwhereitisthelatterthatusefullydiagnosesthesplitbetweencausalandevidentialdecisiontheory.

6. Appendix:NewcombPresent:LongargumentAsintheoriginalNewcombproblem,Fry’sestimatehowmuchmoneyisalreadyinhisownbox—thequantitycalled“𝛽”inthemaintext—isact-dependent.Itdepends,inpart,onhowlikelyhethinksitisthathewilltakethetrade.Fryknowsthatifhechoosestotrade,thenhewillcertainlytwo-box.Thepredictorwillhavepredictedthiswithherusualaccuracy,sothat𝐶𝑟1(𝑃1|trade) = .1and𝐶𝑟1(𝑃2|trade) = .9.However,ifFrydeclinestotrade,stickingwiththewildcardmove@,thentheprobabilitythatthepredictorpredicted𝑃1isequaltotheprobabilitythatthepredictorpredicted𝑃2(thehighcorrelationsof𝑃1with1𝐵and𝑃2with2𝐵arenowbalancedbythefactthatFryhasa50%chanceofdoing1𝐵anda50%chanceofdoing2𝐵.)

𝒦 = { 𝑃1 ∧ 𝑝 , 𝑃1 ∧ ¬𝑝 , 𝑃2 ∧ 𝑝 , 𝑃2 ∧ ¬𝑝 }isadependencyhypothesispartition.

𝑃1 ∧ 𝑝 𝑃1 ∧ ¬𝑝 𝑃2 ∧ 𝑝 𝑃2 ∧ ¬𝑝𝜏 1𝑀 + (1000 − 𝛥) 1𝑀 + (1000 − 𝛥) 0𝑀 + (1000 − 𝛥) 0𝑀 + (1000 − 𝛥)¬𝜏 1𝑀 + 0 1𝑀 + 1000 0𝑀 + 0 0𝑀 + 1000

Utilities.

Wehave:

𝐶𝑟(𝑘5) = 𝐶𝑟(𝑃1 ∧ 𝑝) = 𝐶𝑟(𝑝)𝐶𝑟�(𝑃1)= 𝐶𝑟(𝑝)[𝐶𝑟�(𝑃1|𝜏)𝐶𝑟�(𝜏) + 𝐶𝑟�(𝑃1|¬𝜏)𝐶𝑟�(¬𝜏)]= 𝐶𝑟(𝑝)[.1𝐶𝑟�(𝜏) + .9𝐶𝑟�(¬𝜏)]= .5[.1𝐶𝑟�(𝜏) + .9𝐶𝑟�(¬𝜏)]= .5[.1𝐶𝑟(𝜏) + .9𝐶𝑟(¬𝜏)]

𝐶𝑟(𝑘V) = 𝐶𝑟(𝑃1 ∧ ¬𝑝)= 𝐶𝑟(¬𝑝)𝐶𝑟¬�(𝑃1)= 𝐶𝑟(¬𝑝)[𝐶𝑟¬�(𝑃1|𝜏)𝐶𝑟¬�(𝜏) + 𝐶𝑟¬�(𝑃1|¬𝜏)𝐶𝑟¬�(¬𝜏)]= 𝐶𝑟(𝑝)[.1𝐶𝑟¬�(𝜏) + .1𝐶𝑟¬�(¬𝜏)]= .5[.1𝐶𝑟(𝜏) + .1𝐶𝑟(¬𝜏)]= .5(.1) = .05

𝐶𝑟(𝑘�) = 𝐶𝑟(𝑃2 ∧ 𝑝) = 𝐶𝑟(𝑝)𝐶𝑟�(𝑃2)= 𝐶𝑟(𝑝)[𝐶𝑟�(𝑃2|𝜏)𝐶𝑟�(𝜏) + 𝐶𝑟�(𝑃2|¬𝜏)𝐶𝑟�(¬𝜏)]= 𝐶𝑟(𝑝)[.9𝐶𝑟�(𝜏) + .1𝐶𝑟�(¬𝜏)]= .5[.9𝐶𝑟�(𝜏) + .1𝐶𝑟�(¬𝜏)]= .5[.9𝐶𝑟(𝜏) + .1𝐶𝑟(¬𝜏)]

𝐶𝑟(𝑘�) = 𝐶𝑟(𝑃2 ∧ ¬𝑝)= 𝐶𝑟(¬𝑝)𝐶𝑟¬�(𝑃2)= 𝐶𝑟(¬𝑝)[𝐶𝑟¬�(𝑃2|𝜏)𝐶𝑟¬�(𝜏) + 𝐶𝑟¬�(𝑃2|¬𝜏)𝐶𝑟¬�(¬𝜏)]= 𝐶𝑟(𝑝)[.9𝐶𝑟¬�(𝜏) + .9𝐶𝑟¬�(¬𝜏)]= .5[.9𝐶𝑟(𝜏) + .9𝐶𝑟(¬𝜏)]= .5(.9) = .45

Forshorthand,westipulatethat𝔱 = 𝐶𝑟(𝜏).HenceFry’scredencesareasfollows:

𝑘1 𝑘2 𝑘3 𝑘4. 45− .4𝔱

. 05 . 4𝔱+ .05

. 45

Credences,takeI.

Letting𝑧 = .4𝔱,thisisequivalentto:

𝑘1 𝑘2 𝑘3 𝑘4. 45 − 𝑧 . 05 𝑧 + .05 . 45

Credences,takeII.

Calculatingcausalexpectedutilities:

𝐶𝐸𝑈1(𝜏) = (.45 − 𝑧)(1𝑀 + (1000 − 𝛥)) + .05(1𝑀 + (1000 − 𝛥))+(𝑧 + .05)(0𝑀 + (1000 − 𝛥)) + .45(0𝑀 + (1000 − 𝛥))= (.5 − 𝑧)(1𝑀) + (1000 − 𝛥)

𝐶𝐸𝑈1(¬𝜏) = (.45 − 𝑧)(1𝑀 + 0) + .05(1𝑀 + 1000) + (𝑧 + .05)(0𝑀 + 0) + .45(0𝑀 + 1000)= (.45 − 𝑧)(1𝑀) + (.05)(1𝑀) + (.05)(1000) + (.45)(1000)= (.45 − 𝑧 + .05)(1𝑀) + (.05 + .45)(1000)= (.5 − 𝑧)(1𝑀) + 500

Onceagain,wehavesomethingoftheform:𝐶𝐸𝑈1(𝜏) = 𝑋 + (1000 − 𝛥),while𝐶𝐸𝑈1(¬𝜏) = 𝑋 + 500.Hence𝐶𝐸𝑈1(𝜏) > 𝐶𝐸𝑈1(¬𝜏)iff𝛥 < $500.QED.

References

Ahmed,Arif.2014.Evidence,DecisionandCausality.CambridgeUniversityPress.

———.2017.“ExploitingCausalDecisionTheory.”Manuscript.

Arntzenius,Frank.2003.“SomeProblemsforConditionalizationandReflection.”JournalofPhilosophy100(7):356–70.

Baratgin,Jean,andGuyPolitzer.2010.“Updating:APsychologicallyBasicSituationofProbabilityRevision.”Thinking&Reasoning16(4):253–87.

Briggs,R.A.2010.“Decision-TheoreticParadoxesasVotingParadoxes.”PhilosophicalReview119(1):1–30.

Byrne,Alex,andAlanHájek.1997.“DavidHume,DavidLewis,andDecisionTheory.”Mind106(423).

Eells,Ellery.1982.RationalDecisionandCausality.CambridgeUniversityPress.

Good,I.J.1967.“OnthePrincipleofTotalEvidence.”TheBritishJournalforthePhilosophyofScience17(4):319–21.

Hare,Caspar,andBrianHedden.2016.“Self-ReinforcingandSelf-FrustratingDecisions.”Noûs50(3):604–28.

Hedden,Brian.2015.“Time-SliceRationality.”Mind124(494):449–91.

Hesslow,Grant.1976.“Discussion:TwoNotesontheProbabilisticApproachtoCausality.”PhilosophyofScience43(2):290–92.

Joyce,James.1999.TheFoundationsofCausalDecisionTheory.CambridgeUniversityPress.

———.2007.“AreNewcombProblemsReallyDecisions?”Synthese156:537–62.

———.2012.“RegretandInstabilityinCausalDecisionTheory.”Synthese187:123–45.

Katsuno,Hirofumi,andAlbertoO.Mendelzon.1991.“OntheDifferenceBetweenUpdatingaKnowledgeBaseandRevisingIt.”InProceedingsoftheSecondInternationalConferenceonPrinciplesofKnowledgeRepresentationandReasoning.Vol.2.

Lewis,David.1973.Counterfactuals.Oxford:Blackwell.

———.1976.“ProbabilitiesofConditionalsandConditionalProbabilities.”PhilosophicalReview85.

———.1981a.“CausalDecisionTheory.”AustralasianJournalofPhilosophy59(1):5–30.

———.1981b.“WhyAin’chaRich?”Noûs15:377–80.

———.1999.“WhyConditionalize?”InPapersinMetaphysicsandEpistemology.CambridgeUniversityPress.

Moss,Sarah.2012.“UpdatingasCommunication.”PhilosophyandPhenomenologicalResearch85(2):225–48.

———.2013.“EpistemologyFormalized.”PhilosophicalReview122(1):1–43.

———.forthcoming.ProbabilisticKnowledge.OxfordUniversityPress.

Nozick,Robert.1981.PhilosophicalExplanations.HarvardUniversityPress.

Parfit,Derek.1984.ReasonsandPersons.OxfordUniversityPress.

Pearl,Judea.2000.Causality:Models,Reasoning,andInference.CambridgeUniversityPress.

Ramsey,Frank.1990.“TruthandProbability.”InPhilosophicalPapers,editedbyD.H.Mellor.CambridgeUniversityPress.

Savage,Leonard.1972.TheFoundationsofStatistics.Dover.

Skyrms,Brian.1990.“TheValueofKnowledge.”MinnesotaStudiesinthePhilosophyofScience14(245-266).

Teller,Paul.1973.“ConditionalizationandObservation.”Synthese26:218–58.

Yudkowsky,Eliezer.2010.“TimelessDecisionTheory.”http://www.intelligence.org.