1
2016ResearchPapersCompetitionPresentedby:
MixedMembershipMartialArts:Data-DrivenAnalysisofWinningMartialArtsStyles
OtherSportsPaperID:1575
SeanR.HackettandJohnD.StoreyLewis-SiglerInstituteforIntegrativeGenomicsPrincetonUniversity,Princeton,NJ08544,USA
Abstract
Amajoranalyticschallenge inMixedMartialArts(MMA) isunderstandingthedifferencesbetween fighters that are essential for both establishing matchups and facilitating fanunderstanding. Here, we model ~18,000 fighters as mixtures of 10 data-definedprototypical martial arts styles, each with characteristic ways of winning. By balancingfighter-leveldatawithbroader trends inMMA, fighterbehaviorcanbepredictedeven forinexperiencedfighters.Beyondprovidingan informativesummaryofa fighter'sstyle, it isalso thecase thatstyle isamajordeterminantofsuccess inMMA.This isreflectedby thefactthatchampionsofthesportconformtoanarrowsubsetofsuccessfulstyles.
1. IntroductionEarly events inMixedMartialArts (MMA)were touted as a chance to determinewhich styles ofpuremartialartsweremosteffectivefordefeatinganopponent[1,2].Theseevents,epitomizedbythe mid-1990s events of the Ultimate Fighting Championship (UFC) and Vale Tudo, featuredmatchupsofstylisticallydiversemartialartistsrepresentingsportssuchasBoxing,JudoandKarate[1, 3]. As the sport ofMMA has evolved, fighters have become increasinglywell-rounded [1, 2].ModernMMA fighters need to be effective strikerswhodictatewhere the fight occurs, either bytakingdowntheiropponentsorforcingthemtofightstanding.
EachMMA fighter’s skills aredrawn fromamixofmultiplepuremartial arts thatdefinehis/herpersonalizedfightingstyle.WhilemodernMMAfightersaremorestylisticallymixedthantheirpuremartialartistforbearers,themixturesofindividualfightersdiffer.Somefighterswouldbereferredtoasstrikersandothers,asgrapplerswhospecializeinmartialartssuchasBrazilianJiu-Jitsu(BJJ)[4].Thesestylisticcharacterizationsareimportantwhenestablishingmatch-ups;buttodate,thereisnoquantitativeprocedureforcharacterizingthestyleofMMAfighters.Furthermore,becauseofthe absence of such an approach, the impact of style on success in MMA has never beenquantitativelyinvestigated.
Every sport is enriched by the behaviors that distinguish its athletes and teams.Whether one isconcernedwith a pitcher’s arsenal of pitches, the plays a football team employs or the locationsfromwhichabasketballplayerlikestoshoot,styleisanimportant,albeitoftennebulous,conceptinsports.Tomakestylemoreaccessible,data-drivenmodelingapproacheswillbeinvaluable.Thesemodelshaveenormousvalue-theycanbeusedtoidentifyprospectswhoaresimilartosuperstars,
2
2016ResearchPapersCompetitionPresentedby:
buildabalancedteam,andguidevisualizationsthatinformcasualfansofeachathlete’sstrengths.Todate,approachesthathaveaimedtodefineathletes'styleshavegenerallyattemptedtoidentifylatent factors that could serve as a representationof style [5, 6, 7]. These approacheshavebeenrestrictedtospatialsummariesofathletes(e.g.,fromwheretheyscorepoints).Consequently,theirlatent factors reflect common spatial patterns, like three-point shootingordunking in basketball[6].
While latent variable based approaches have been successfully applied to sports where a largeamountofathlete-specificperformancedataisavailable,suchmethodshavethetendencytooverfitpatternswhen little data is available orwhen complex patterns are sought (for example, if stylewere allowed to vary between seasons or between games).Most sports are not immune to suchoverfitting;forinstance,thecapacityforoverfittingobserveddatahasbeenwell-demonstratedforthedata-richapplicationofbaseballbattingaverages [8,9].Wewould intuitivelyexpect abatterwith300hitsoutof1000at-batstoperformbetterinthefuturethanabatterwith5hitsoutof10at-bats,yetthesecondbatterwouldhaveahigherbattingaverageusingtheMaximumLikelihoodEstimate (MLE)ofbatting averages. Indeed, itwasdemonstrated that theMLEwas less accuratethan the James-Stein estimate of batting averages, which shrinks observed batting averagestowards the mean of all players based on the amount of individual-specific data [8]. Bayesianapproachesof thissorthavethecapacity tobalanceathlete-specificdatawithoverallbehavior inordertoimproveprediction.
Characterizing the styles of MMA fighters is an unprecedented challenge because of both theamountandtypeoffighter-leveldataavailable.Mostprofessionalathletescompeteinupwardsof100gamesinasingleseason.Incontrast,asuccessfulMMAfightermayretireafteronly20bouts.With such limiteddata, a fighter's to-datebehavior alone is insufficient to predict his/her futureperformance.Additionally,whilethemannerbywhichafighterfinishesafight(e.g.,finishessuchaspunches,kicksorchokes)isthepurestdepictionofhis/herstyle[3],thesemetricsarecategorical.Consequently, methods for defining spatially-oriented styles from other sports are not directlyapplicable.WeaddresstheabovetwochallengesbyapplyinglatentDirichletallocation(LDA)[10,11],aBayesianmixedmembershipmodelthatnaturallyaccommodatescategoricaldataandsharespoweracrossfighters,therebybalancingfighter-specificbehaviorwithbroadertrends.Thegoalofthismodelistoestimatetwomixturesthatdefineaplausibleunderlyingstructureoffighters'styles(Figure1):(i)Distinctmartialarts(prototypes)havecharacteristicfinishesand(ii)Fightersareamixtureofprototypes(withindividualizedweightssummingtoone).
1. Striker
3. Grappler
2. Balanced
Wins using
Punches, Elbows, Punches, Kicks
Armbar, Choke, Toehold, Armbar
Punches, Choke, Armbar, Elbows
Fighters Signal
Punches, Elbows, Punches, Kicks
Punches, Elbows, Punches, Kicks
Armbar, Choke, Toehold, Armbar
Punches, Elbows, Choke, Armbar
Identify covarying finishes (prototypes) and fighter styles
Punches, Elbows, Kicks
Armbar, Choke,Toehold
PrototypesFighter styles
Striking
Grappling
Figure1:Identifyingfighterstylesbasedonperformance.WeassumethatanMMAfighter'swinsgenerallyreflecthis/hertraining,eitherasapuremartialartistorasapractitionerofmultipleprototypicalstyles.Eachofthesefightingstylesutilizesasubsetofpossiblefinishesatcharacteristicfrequencies.
3
2016ResearchPapersCompetitionPresentedby:
Usingdata from>250,000 fights,weapplied themixedmembershipmartialarts (3MA)model toidentify10prototypicalmartial arts styles, each composedof a setof characteristic finishes.Thestyleofeachof~18,000fighterswasdefinedbasedonhis/herindividualizedcombinationoftheseprototypes. These stylisticmakeups greatly improved the prediction of held out results. Beyondtheirvalueasan implicit readoutofa fighter’sabilities, fighterstylesareamajordeterminantofsuccess in thecage. Inparticular,winning fights in theupperechelonsofMMArequiresdynamicstrikingandtheability to“gothedistance”bywinning fights through judges’decision.DefendingUFC champions, the most successful athletes in the sport, generally conform to these styles,providinginsightsintohowfightingstrategyaffectssuccessinMMA.2. DataSetUsingtheSherdogFightFinder[12],weobtainedrawsummariesof257,582previousamateurandprofessionalboutsamong151,746fightersthroughOctober29,2016.Eachboutwassummarizedbased on the pair of fighters, the result of the bout (win, loss, draw, no contest) and theway inwhichtheboutwasfinished(e.g.bypunchesorunanimousdecision).Rarefinishesweremanuallycombined with the most similar high-frequency finish to generate 50 distinct high-frequencycategories (shown in Figure 2). Only fights that ended in awin for one fighterwere considered,leaving230,126boutsforthisanalysis.
3. IdentifyingpatternsinfinishusageAttheirmostbasiclevel,martialartsaresetsoftechniquesthatareusedtowinfights.Eachfightiswonbyusinga single technique,but ifwe lookata fighter's career,patternsmayemergewheresetsoffinishesshowuprepeatedlyacrossafighter'svictories.Ifthesepatternsareduetoamartialartthatissharedamongfighters,wewouldexpectsetsoffinishestoco-occurnotonlyinasinglefighterbutalsoinarecurringmanneracrossmanyindividuals.Forexample,MuayThaifightersusebothelbowsandknees,soweexpectthatfighterswhowinwithelbowstendtowinbyknees.Thus,elbowsandkneesshouldco-occur.
To determine whether any subsets of the 50 finishes are frequently used in conjunction, wequantifiedhowofteneachpairof finishes co-occurs (i.e. theyareused towin fightsby the samefighter), aggregating across all fighters. To answer this question, we compared observed co-occurrencesofeach finishpair (numberofpairsofa finishx andy ina fighter's record, summedover fighters) with the expectation under independence. Then, using permutation testing, wedeterminedhowoftensuchlargedeviationswereexpected.Outof1,225pairsoffinishes,289pairsco-occurmorefrequentlythanexpected,and292pairsco-occurlessfrequentlyatafalsediscoveryrateof0.05[13].
Pairs of finishes that significantly co-occur are far from random; rather, they are organized intocliques of generally mutually correlated finishes (Figure 2). For example, kicks/knees form onetightlyconnectedcliqueandlegsubmissions,another.Fromthisanalysis,wealsoseethatpatternsaremore strongly influencedby theposition fromwhich a submission is applied, rather thanbywhattypeofsubmissionisapplied,perse.Forexample,theanacondachoke,appliedfromafront
4
2016ResearchPapersCompetitionPresentedby:
headlockposition,isindependentfromthetrianglechoke,whichisappliedfromguard.Incontrast,trianglechokesandarmbarsarefrequentlyappliedfromguardandthustendtoco-occur.
Figure2:StructureofMMAfinishesbasedonpairwiseco-occurrence.289pairsoffinishessignificantlyco-occurredamongfighters.Thethreestrongestconnectionstoeachfinish(darkedges)wereusedtoconstructalayout;othersignificantedgesareshownasfaintbackgroundlines.
4. Determiningfighter-specificstylesBy aggregating over all fighters, we demonstrated that finishes exist in co-occurring groups.Directly applying this information at the fighter-level is challenging, however. Although wecollectively have a large amount of information about fighters’ performances, we have a moremodest amount of fighter-specific data. No fighter has won a fight with each of the 50 possiblefinishes.Infact,thevastmajorityofexperiencedfightersdonotevenhave50winstodate.Becauseweonlyhavean incompletereadoutofa fighter’scapabilities,weneedtobalancewhatweknowabout a fighter with what we do not, hedging the uncertainty in fighter-specific data by usingbroader patterns across fighters. Specifically, we use the behavior of similar fighters to definecommon finish patterns and we then characterize individual fighters in the context of howfrequentlytheyhavefoughtandtheclassesoffinishestheyhaveemployed.
4.1. Themixedmembershipmartialarts(3MA)modeloffighterstyles
Increatingamodeloffighterstyles,ourultimategoalistomodelthefinishprobabilitiesforeachfighter. Statistically, this objective amounts to estimating fighter-specific finish probabilities:Pr(Finishj|Fighteri)foreachfinishj=1,2,…,50andeachfighteri=1,2,…,17,778includedinour
5
2016ResearchPapersCompetitionPresentedby:
data set. (We restricted our analysis to fighters with four or more wins to date.) Our data iscollectedintoanI=17,778byJ=50matrixX,wherewenotethatthe(i,j)entryisthetotalnumberofwinsforfighterioffinishtypej.Wewilldenotewij=Pr(Finishj|Fighteri),whicharethevalueswewishtoestimate,andwecollectthesevaluesintotheIxJmatrixW.
Themostobviousestimateofthewijvaluesisthemultinomialmaximumlikelihoodestimate(MLE):
However,whenusedonaper-fighterbasis,theMLEissusceptibletooverfittingtheobserveddata(particularly for fighters with a smaller number of observations), and the MLE estimates thatunobserved finishes have zero probability. It also does not “borrow strength” across themanyfighters’observeddata.
Instead,inlinewithotherlatentvariablebasedapproaches(suchasprincipalcomponentsanalysisand factor analysis) [14], we assume a lower dimensional structure toW. By doing so, we areassumingthatfighters’individualizedstylescanbeattributedtoasetofprototypicalwinningstyleswhich will be smaller than the number of J = 50 finish types. We will denote the number ofprototypesbyK,whereK < J, andwewill estimate the valueofK from thedata.Weassume thefollowingdecompositionofW:
(1)
where foralliandwherefkjÎ[0,1]forallkandj.Theinterpretationofthismodelis
thatthe“prototypes”arecapturedbyfkj=Pr(Finishj|Prototypek).Theseprototypesaremappedtoeach individual fighter i through the values qik where qik = Pr(Prototypek|Fighteri), where
. Forfighter i,thevector(qi1,qi2,…,qiK)yieldshis/hermixtureofprototypes,i.e.,
thefighter’smixtureofmartialartswinningstyles.Thismodeldecompositionisoftenreferredtoasa“mixedmembershipmodel”[10]inthateachindividualiscomposedofanindividual-specificmixtureofmembershipstodifferentclasses.Here,thedifferentclassesaretheprototypes.
ThisapproachresultsingroupingofJ=50finishesintoasmallernumberofprototypesK,andthusprovides increasedgeneralizabilitywhenpredictingpreviouslyunobservedfinishes.Forexample,submissionsthattargetthelegs(e.g.,kneebarandanklelock;Figure2)shouldbeassociatedwiththe same prototype. Because of this, a fighter with only kneebar submissions would have someweight in this “leg-submission” prototype. Therefore,wewould predict that he/she also has thecapacitytoapplyanklelocks.Onelimitationofthisapproach,however,isthatifafighterhadneverusedlegsubmissions,he/shewouldhavezeroweightinthe“leg-submission”prototype.Asaresult,wewouldpredictthatthereisprobabilityzeroforhim/hertoperformanylegsubmission.
Inordertoallowforfinishescontainedinunrepresentedprototypesandtoalsolimittheinfluenceofunrepresentativepatternsoffinishes,wecanplaceaprioronmixturecomponents,inlinewith
6
2016ResearchPapersCompetitionPresentedby:
the earlier example of applying shrinkage to batting averages in baseball [8]. Latent Dirichletallocation(LDA) isapopularBayesianmodelingapproach for incorporatingsuchaprior into thecategoricalmatrixfactorizationshowninmodel(1),andfurthermore, forfittingthismodel(1)todatathroughatechniquecalledvariationalinference[10].InLDA,Dirichletpriorswithparametersa= (a1,a2,…,aK) andb= (b1,b2,…,bJ) are placed onQ andF, respectively. LDA thenutilizesvariational inference to estimate the maximum a posteriori probability (MAP) of the posteriordistributionsofQandF,Pr(Q|X)andPr(F|X).Theestimates and aresettobetheseMAP
estimates,andwefinallyform .
ThevaluesofbdonotgreatlyaffectinferenceonFbecauseFisarelativelysmallmatrix(KxJ)thatis estimated by pooling data across all fighters [15]. In contrast, choice of a greatly impactsestimation ofQ [15]. Each component ofa, denoted byak, can be interpreted as the number ofeffectiveobservations(pseudocounts)[16]ofaprototypekthatsupplementseachfighter'shistory.Theoverallmagnitudeofa,whichwedenoteby ,reflectsthedegreetowhichfighters’
prototypes will be determined by their own records (low aS) versus the average of prototypefrequenciesacrossall fighters (highaS).Treatinga asasymmetric (meaning theak valuesacrossprototypesmaydiffer) isanother importantconsiderationbecause thisallows largerandsmallerprototypestoexistratherthanarbitrarilyenforcingthatprototypesbeequallyprevalent.
ToapproximateoptimalvaluesofaSandK(sinceboththedistributionofaandthevaluesofbcanbeadaptivelyestimated),possiblecombinationsofaSandKwerecomparedtodeterminewhichsetgeneratedanestimateofWthatbestpredictedheldoutfinishes.Quantitatively,foreachpairofaSand K values, we used 20-fold cross validation to estimateW for each of 20 subsets of the fulldatasetusingthesoftwareMALLET[17].Wethencalculatedthelog-likelihoodofobservationsthatwereheldoutfromeachdataset:
where𝒮faretheindicesofthetestsetforfoldfand𝜔#$𝒮% istheestimatebasedonthetrainingsetof
foldf.
ParametersetswithhighKgenerallyresultedinheldoutlog(L)similarinmagnitudetothosewithmoderateK, but highKmodelswere less interpretable.Accordingly,we adopted aparsimoniousapproach to choosing a set of parameter values (small K, small aS) among those with goodperformance ( 95% of the difference between and multinomial likelihood model
containingnofighter-specificinformation).ThisparsimoniousapproachresultedinsettingK=10prototypes (two pairs of prototypes were combined because they contained the same majorityfinish)withaS=16.
log 𝐿 = 𝑥#$log 𝜔#$
𝒮%
$∈𝒮%#∈𝒮%
./
012
(2)
7
2016ResearchPapersCompetitionPresentedby:
4.2.Performanceandgeneralityof3MA:winsvs.lossesandamateursvs.theeliteTo evaluate the performance of 3MA and assess its broader applicability, we applied the aboveapproachboth toelitesubsetsof fightersandtodatasetsconstructed fromfighters' lossesratherthantheirwins(Table1).
Dataset Type InclusionCriteria Nfighters Nbouts K aSAllData allfighters wins 4+wins 17,778 142,775 10 16SubsetsofData
topfighters wins 4+wins,8+fights,1+fightsin"toppromotions" 2,973 40,244 11 14topvs.top wins 4+wins,8+fightsagainsttopfighters 408 3,673 14 22allfighters losses 4+losses 14,690 97,569 4 4topfighters losses 4+losses,8+fights,1+fightsin"toppromotions" 2,201 18,043 4 10
topvs.top losses 4+losses,8+fightsagainsttopfighters 485 3,824 3 8
Table 1: 3MAwas applied to six datasets constructed based on the fighters included andwhetherwinsor losseswereinvestigated.Weconsidered“toppromotion” tobeUFC,Bellator,WorldSeriesofFighting,PRIDE,DREAMandStrikeforce.
Our estimates of K and aS aresimilar regardless of whether allfighters or only elite fighters areinvestigated, suggesting thatregardless of the tier ofcompetition, similar move sets areemployedandaccordinglyamateurfightersmaybeimportanttobetterunderstand the elite. Toquantitatively evaluate thisassertion, for each of the sixdatasets described in Table 1,prediction accuracywas calculatedfor both inexperienced andexperiencedfighters(Figure3).
In all cases, 3MA greatly outperforms a multinomial model, 1 prototype model (where finishprobabilitiesforeachfighterareproportionaltofinishfrequenciesacrossallfighters).Ingeneral,itiseasiertopredicthowafighterwinsthanhowhe/sheloses(losingisnotsomethingoverwhichafighterhascontrol).Additionally,thefinishesofexperiencedfighters(thosewith>20wins/losses)canbemoreaccuratelypredictedwhenamateurfightersareincludedratherthanusingasmallerdatasetcomposedexclusivelyofelite fighters.Thissuggests thatpatternsofMMAprototypesarenearlyuniversal;accordingly,informationfromamateurscanimprovetheaccuracyofdefiningthestylesofchampions.Becausethecompletedatasetresultsinthebestpredictionoffinishaccuracyregardlessoffighterexperience,subsequentanalyseswillutilizeall~18,000fighters.
WefoundK=3orK=4forlossescomparedtoK=10forwins,indicatingthatfighterstendtoloseinmoregenericwaysthanhowtheywin.Losingprototypesaregroupedintothelargecategoriesof
losses wins
0.00 0.05 0.10 0.15 0.00 0.05 0.10 0.15
top vs. top
top fighters
all fighters
Δ log likelihood per finish, 3MA − multinomial
Number of Finishes <8 8−20 >20
Figure3:Performanceof3MAmodelrelativetomultinomialmodelwhereallfightersareassumedtousethesamestyle(1prototype).
8
2016ResearchPapersCompetitionPresentedby:
strikes,submissions,anddecisions.Becauseafighter'sweaknessescanbefaithfullyreadoff fromhis/her recordwhilewinningprototypesareconstructed fromnuancedcombinationsof finishes,wefocusourremainingattentiononwinningstylesderivedfromanalyzingallfighters.Thesestylesare reflected in theMALLET software estimates of both the fighter-specific prototypes ( ) andprototype-specificfinishes( ).
4.3.PrototypesasmixturesoffinishesBy studying theestimatedvaluesoffkj =Pr(Finishj|Prototypek),wecancharacterizewhich finishtypesmake up each prototype. We can also calculate themarginal probability Pr(Prototypek) =
𝜃#45#12 /𝐼 as a summary of each prototype's overall relevance in the data. Summaries of
Pr(Prototypek)andPr(Finishj|Prototypek)asestimatedfromthedatacanbefoundinTable2.
Prototype Description Pr(Prototype) FinishBreakdown,Pr(Finish|Prototype)
P1 Punches(TKO)/Punches(KO) 0.22 76%Punches(TKO),20%Punches(KO),3%Punches(Submission)
P2 UnanimousDecision 0.18 82%UnanimousDecision,8%SplitDecision,7%MajorityDecision
P3 Rear-NakedChoke 0.11 100%Rear-NakedChoke
P4 UnanimousDecision/SplitDecision 0.09 48%UnanimousDecision,26%SplitDecision,16%Punches(TKO),11%Other
P5 Armbar/LegSub 0.09 72%Armbar,11%HeelHook,5%Kneebar,12%Other
P6 Kicks/Knees/Punches(TKO) 0.08 27%Punches(TKO),17%Punches(KO),14%Knees,42%Other
P7 TriangleChoke/Armbar 0.07 56%TriangleChoke,38%Armbar,3%Rear-NakedChoke
P8 GuillotineChoke/Rear-NakedChoke 0.07 84%GuillotineChoke,15%Rear-NakedChoke
P9 Rear-NakedChoke/Kimura 0.06 33%Rear-NakedChoke,25%Kimura,25%Arm-TriangleChoke,17%Other
P10 Punches(Submission)/Choke 0.05 35%Punches(Submission),19%Choke,13%Strikes,33%Other
Table 2: Summary of LDA-derived MMA prototypes. Pr(Prototype) is given by the overallestimatedfrequencyofeachprototypeinfighters( 𝜃#45
#12 /𝐼).Pr(Finish|Prototype)isdeterminedfromthevaluesin .Thetopfinishesforeachprototypeareshown.
Theprototypecharacterizationbasedon3MAreproducesmuchofthestructurefoundinFigure2butbreaksthisstructure intodiscreteprototypes(Figure4).ThesediscreteprototypesappeartocapturemeaningfulstylesusedbyMMAfighters:
• Themajor strikingprototypesareP1andP6.P1 containsTKOandKOwinsbypunches, andthusrepresentsboxing.P6containspunchesaswellaselbows,kicksandotherdiversestrikeswhichareheavilyfeaturedinMuayThai,Karate,TaeKwonDoandkickboxing.
• Prototypes P2 and P4 contain wins by judges’ decision. P2 primarily reflects unanimousdecision,casesinwhichafighterdominatedhis/heropponentforthelengthofthefightandalljudges agreed thathe/shewon.P4has someweight inunanimousdecisionbut also stronglyrepresentssplitdecisions:fightsinwhichjudgesdisagreedaboutwhowon.
9
2016ResearchPapersCompetitionPresentedby:
• BJJandsubmissiongrapplingarerepresentedbyfivegenerallysmallprobabilityprototypes.P5primarily contains thearmbarand submissions that target the legs.P7are submissions fromguard. P9 has some contribution from the rear naked choke but is primarily composed ofsubmissions applied from side-control and front headlock. P8 and P3 each largely contain asinglesubmission:theguillotinechokeandrearnakedchoke,respectively.Thesesubmissions,whiletraditionallyapartofBJJ,areknownandpracticedbymostMMAfighters.
• Thefinalandsmallestprototype,P10,isahodgepodgeofunusualstoppages,generictermsandeasily defended submissions that were relatively common in circa 2000 MMA but havedecreasedinfrequencybyover10-foldsince.Thus,P10isaprototypewhichdatesafightertothe formative years ofMMA.
Thisanalysisdoesnot includewrestling, judo and othermartial arts that arespecialized in achievingtakedowns. Because thesemartial arts are not directlytied toMMA finishes,winsbytheir practitioners couldmanifest in many ways, suchas through decision (P2) orsubmissions from strongcontrolpositions(P3,P8,P9).
Figure4:3MAidentifiessetsoffinishesusedtogetherbyfighters.FinishesfromthenetworkfoundinFigure2arecoloredaccordingtoPr(Prototype|Finish)whereviaBayes’theoremwehave:Pr(Prototype|Finish)=Pr(Prototype)´Pr(Finish|Prototype).
10
2016ResearchPapersCompetitionPresentedby:
4.4.FightersasmixturesofprototypesPrototypesexhibitedabroadrangeofaveragefrequencies, ranging from 5% to 22%.Individual fighters vary greatly about theseaverages,reflectingeithertheirspecializationor neglect of a given prototype and itsassociated finishes (Figure 5). Because allpairs of prototype loadings are negativelycorrelated,themixturesofprototypesusedbyfightersarerelativelyindependent;theweightin any one prototype does not necessarilyentail higher weight in another prototype.While this trendwould be partially expecteddue to tradeoffs between mixturecomponents, the absence of strongassociationsbetweenprototypes (as couldbedealtwithusingcorrelatedtopicmodels[18])suggests that the structure of finishes isappropriatelydiscretizedusingLDA.5. Notallprototypesarecreatedequal:winningstylesinMMAMMAfightersemployavarietyofapproachestodefeattheiropponents,butitisunclearwhetherallstylesareequallyviableorifsomehavefounddisproportionatesuccessinthecage.Thesuccessofastylemayalsobecontextual.MMAfansfrequentlytalkaboutanopponentbeinga“goodmatchup”for a fighter, reflecting that a fighter’s oddsofwinningmightbe influencedbybothhis/herownstyleaswellasanopponent’sstyle.Bycreatingaquantitativerepresentationofstyle,weareabletosystematicallyassesswhethersomestylesaresuperiortoothersandwhetherthissuccessdependsuponanopponent’sstyleaswell.5.1.Theroleofmatchups:prototypesformastrictdominancehierarchyIfsomeprototypesandtheirassociatedfinishesaremoreimportantforwinningMMAfightsthanothers, wewould expect that fighterswho favor these finisheswouldwinmore frequently. Therelativestrengthofaprototypemayalsodependonanopponent'sstyle.Insuchacase,therelativestrength of a prototypemay be contextual, resulting in intransitive properties like those seen inrock-paper-scissors.
Toinvestigatehowtheprototypesperformagainsteachother,wecanconsiderwhotendstowinwhenallpairsofprototypesarematchedagainsteachother.Sinceeveryfighterisrepresentedbyaprototype mixture rather than by a pure prototype, we cannot directly count who wins duringmatchups.Instead,weconsiderthesetsofboutsℬ={(b1,b2)}whereb1isthewinningfighterandb2isthelosingfighter(sob1,b2Î{1,2,…,I}),andcharacterizetheirprototypemixturesforeachpairofprototypesm(inwinningfighters)andn(inlosingfighters)wherem,nÎ{1,2,…,K}andm¹n.To the extent that and are large, prototypem hasde facto defeatedprototypen.More
0.01
0.03
0.10
0.30
P1: P
unch
es (T
KO),
Punc
hes (
KO)
P2: U
nanim
ous D
ecisi
onP3
: Rea
r−Na
ked C
hoke
P4: U
nanim
ous D
ecisi
on,
Split
Decis
ionP5
: Arm
bar,
Leg S
ubP6
: Kick
s/Kne
es,
Punc
hes (
TKO)
P7: T
riang
le Ch
oke,
Armba
rP8
: Guil
lotine
Cho
ke,
Rear
−Nak
ed C
hoke
P9: R
ear−
Nake
d Cho
ke,
Kimu
ra
P10:
Punc
hes (
Subm
ission
),Ch
oke
Prototype
Figh
ter p
roto
type
wei
ghts
Figure5:Violinplotshowingthedistributionsoffighterprototypeabundances.
11
2016ResearchPapersCompetitionPresentedby:
generally,weweighteachmatchbasedon ,reflectingtheextenttowhichprototypemhasdefeatedprototypen.Thenthescoreforwinnermandlosern(smn)canbecomparedtothescoreforwinnernandloserm(snm)toseeifoneprototypesystematicallydominatesanother:
Todeterminewhether thesedominancescores (dmn)weresignificant, theobservedvalueofeachscore was compared to 10,000 null values where winner and loser labels were permuted, andsignificantdominator-dominatedprototypepairswerefoundatafalsediscoveryrateof0.05[13].
Visualizingthedominancesbetweenallpairsofprototypes(Figure6)suggeststhatintransitivitiesare a relativelyminor phenomenon. Instead, prototypes occupy five tiers of dominance,whereinprototypes assigned to higher tiers dominate all prototypes in tiers beneath them.The topmostsuccessful,tierofprototypescontainsonlyP2(unanimousdecision),whiletheleastsuccessfultierincludesP5(armbarandlegsubmissions),P8(guillotinechoke)andP10(circa2000submissions).
𝑠:; = 𝜃<=:<=,<? ∈ℬ
𝜃<?;
𝑑:; =𝑠:; − 𝑠;:=? BCDEBDC
(3)
Figure6:Prototypesformtiersofeffectiveness.Networkedgesthatindicateasignificantdominancescorelinkpairsofprototypes,pointingfromadominatedprototype(light)toitsdominator(dark).Groupsofprototypeswithmutuallyweakorinsignificantdominancescoresweregroupedtogetherintotiers,andtierswerearrangedfromthebestperforming(1)totheweakestperformingprototypes(5).
12
2016ResearchPapersCompetitionPresentedby:
5.2.EffectofprototypemixturesonoverallsuccessBecause prototypes are either effective or ineffective without regard to an opponent’s strategy,each fighter's performance can be explained based on his/her prototype mixture, setting asideattributes of his/her opponent. To investigate prototype performance from the perspective ofindividual fighters, we sought to evaluate the degree towhich fighterwin percentages could beexplained by prototype frequencies, i.e., Pr(win | ). One challengewith this strategy is that, byconstruction, fighterswith extreme values of any prototype component are inherently veterans;theyhavewonenough fights that theirobserved finisheshaveoverwhelmed the influenceof theprior.Bycontrast, fighterswhoseprototypeproportionsarenear thepriorproportionsgenerallyhaveveryfewfights. Therefore,theirperformanceismorerepresentativeofan“averagefighter.”Becauseprototypevaluesarepartiallyconflatedwithexperience,weneedto teaseout these twoeffectsinordertodeterminethedirectcontributionoffighterstylesonperformance.
Inordertoseparatetheinfluenceofexperienceandfighterstyles( ),usinggeneralizedadditivemodels (GAMs) with a logistic link function, the win percentage of each fighter i, Pr(win)i, wasmodeledbasedonthenumberoffightswon(binnedintointervalswhichincludeasimilarnumberof fighters)andthe10estimatedprototypemixturecomponents( ).Logisticregressionwasused because Pr(win) can be treated as Binomial successes (wins) and failures (losses);accordingly,inferenceisbestperformedonthelogoddsofwinsversuslosses[19,20].Theuseofgeneralized additive models provides flexibility, allowing for the identification of nonlinearrelationships between each explanatory variable and the log-odds of winning [21]. Fights werebinnedandmixturecomponentswerelog-transformedinordertoapproximateNormaldistributedvaluessothatpredictionwasnotundulyinfluencedbyaminorityoffighters.
From the fittedmodel (Figure7A),we can see that the oddsofwinning increasesmonotonicallywiththenumberofwinsasexpected,whiletheinfluencesofprototypeabundancesontheoddsofwinning, though highly significant (p < 10-200, ANOVA versus amodel excluding prototypes) aremorecomplicated.InFigure6,wenotedthatprototypesformtiersofdominance.Here,weseethatafighter'ssuccesswithaprototypegreatlydependsonthedegreetowhichthefighterhasinvestedinit.Basedondominance,P2isthemostimportantprototypeforsuccessoverall,butitsbenefitsactually peak once a fighter contains 25% of this style and its effectiveness decreases at highervalues.Aside fromP2, athighvaluesofotherprototypes, the rankingofprototypeeffectson theodds of winning is similar to the tiers of dominance (Figure 6). Prototypes P5, P8 and P10 stillnegatively impact theoddsofwinning,while the remainingprototypesare all generallypositive.Thesepositiveprototypeseitherhavestrongeffectsbutimpactfewfighters(e.g.,extremevaluesofP4,P6andP9)orrelativelyweakeffectsthatimpactmanyfighters(e.g.,intermediatevaluesofP2andhighvaluesofP1).Toprovideasenseofthemagnitudeoftheseeffects,consideringstylealone,a 50%:50% P6:P9 fighter and a 50%:50% P5:P7 fighter would translate to an expected winpercentageof60%and43%,respectively.
Experience and prototypes together strongly predict fighters’ true win frequencies; but, thepredictionisstillquiteremarkablewiththeroleofexperienceremoved(generatingamodelwithexperienceandthencorrectingforthisinfluenceinthefit)(Figure7B).
13
2016ResearchPapersCompetitionPresentedby:
Based on Experience and Prototypes Based on Prototypes only
25% 50% 75% 100% 25% 50% 75% 100%
50%
60%
70%
80%
True win frequency
Pred
icte
d w
in fr
eque
ncy
1
2
4
8
16
32
64
Counts
−0.2
−0.1
0.0
0.1
0.2
0.78% 1.6% 3.1% 6.2% 12% 25% 50%Fighters' prototype fraction
Δ
log
Pr(W
in)/P
r(Los
s)Prototype
P1: Punches (TKO)/ Punches (KO)P2: Unanimous DecisionP3: Rear−Naked ChokeP4: Unanimous Decision/ Split Decision
P5: Armbar/ Leg SubP6: Kicks/Knees/ Punches (TKO)P7: Triangle Choke/ ArmbarP8: Guillotine Choke/ Rear−Naked Choke
P9: Rear−Naked Choke/ KimuraP10: Punches (Submission)/ Choke
−0.1
0.0
0.1
0.2
4 5 6 7 8 9 10
11−1
2
13−1
4
15−1
6
17−1
8
19−2
0
21−2
2
23−2
4
25−2
6
27−2
8
29−3
0
31−4
0
41−5
0
>50
Number of wins
A
B
Figure7:Predictingwinsbasedonfighters’experienceandprototypes.A)Theimpactofprototypemagnitudeandnumberoffightsonfighters’oddsofwinning(versuslosing)wasestimatedusingageneralizedadditivemodel.B)Thefittedvaluesoffighters’Pr(win)iscomparedtofighters’truewinpercentage.Theimpactofnumberofwinswassubtractedfromfighters’Pr(win)togenerateanexperience-correctedestimatebasedonprototypesonly.
14
2016ResearchPapersCompetitionPresentedby:
Becauseprototypespecializationisanimportantpredictorofhowfrequentlyeachfighterwins,aninteresting question is whether the most successful MMA fighters, UFC champions who havesuccessfully defended their title at least once, utilize these effective prototypes. Looking at thepredictedwin frequencies of fighters (corrected for experience), defendingUFC champions favoreffectiveprototypemixtures(Figure8A).Visualizingthemostsuccessfulofthesechampions(thosewiththemosttitledefenses),strongdynamicstriking(representedbyP6andtoalesserextentP1)andtheabilityto“gothedistance”bywinningthroughdecision(P2)arethemajorattributesthatdistinguishthebestof thebest(Figure8B).Thesetrendssuggest thatbeingagreat fighter isnotsufficienttobecomeachampion;fightersmustalsoembraceastylethatpotentiatestheirsuccess.
Pat Milet
ich, R
onda
Rousey
Rich Fran
klin
Anthon
y Pett
is
Frank S
hamroc
k, Mau
rice S
mith
Kevin R
andle
man
Murilo
Bustam
ante,
Rafael
dos A
njos
Matt Hug
hes, M
ichae
l Bisp
ing
Benso
n Hen
derso
n, Dem
etriou
s Joh
nson
, Ren
an Bara
o, T.J
. Dilla
shaw
Brock L
esna
r
Dominic
k Cruz
, Fran
kie Edg
ar, Jo
anna
Jedrz
ejczyk
, Jon
Jone
s, Jun
ior do
s San
tos
Chris W
eidman
, Geo
rges S
t. Pier
re
Quinton
Jacks
on, R
andy
Coutur
e, Tito
Ortiz
Lyoto
Machid
a, Tim
Sylvia
Andrei
Arlovsk
i
Stipe M
iocic
Anders
on Silva
, Cain
Velas
quez
, Chu
ck Lid
dell, J
ens P
ulver,
Sean S
herk
Daniel
Cormier
, Jos
e Aldo
, Rob
bie La
wler
B.J. Pen
n
0
500
1000
1500
48%
50%
52%
54%
56%
58%
60%
62%
64%
66%
68%
70%
72%
74%
76%
Pr(Win | Prototype Mixture)
Num
ber o
f fig
hter
s
0%
10%
20%
30%
40%
Ande
rson S
ilva
Geor
ges S
t. Pier
re
Deme
trious
John
son
Jon J
ones
Jose
Aldo
Prot
otyp
e fra
ctio
n
PrototypesP1: Punches (TKO)/ Punches (KO)
P2: Unanimous Decision
P3: Rear−Naked Choke
P4: Unanimous Decision/ Split Decision
P5: Armbar/ Leg Sub
P6: Kicks/Knees/ Punches (TKO)
P7: Triangle Choke/ Armbar
P8: Guillotine Choke/ Rear−Naked Choke
P9: Rear−Naked Choke/ Kimura
P10: Punches (Submission)/ Choke
B
AFigure8:UFCchampionsfavorthemosteffectiveprototypes.A)Distributionoffittedwinprobabilitiesforallfightersbasedontheirestimatedstyle,correctingforexperience.DefendingUFCchampions(alllistednames)heavilyfavorsuccessfulprototypes.B)Prototypemixturesofthe5championswiththegreatestnumberoftitledefenses.
15
2016ResearchPapersCompetitionPresentedby:
5.3.SheddinglightonsuccessinMMAApplyingunsupervisedapproaches,wedefinedstyleinMMAandfoundthatdifferentelementsofMMAfighters'gameshaverealizedvaryingsuccessinthecage.Lessonsemergefromthisanalysis.Successfulstrategies:settingthepaceandemployingdynamicstriking• Being able to win by decision is a crucial component of every fighter's game. Winning by
decisionentailsnotonlyhavingsufficientcardiotofightfor15+minutesbutalsodoingsowithenoughofanadvantagetobedeclaredthevictor.Thus,winningbydecisionmaybeanindicatorof theoverall fitnessandcaliberofa fighter.Wealsonote that fighterswhooverspecialize insecuringdecisionvictoriesbecome lesseffective, thus securingdecisionvictories shouldonlybeapartofafighter'sstrategyratherthanhis/herprimarygoal.
• Winningbyusingkicks,elbowsandkneesisagreaterindicatorofsuccessthanvictorybyusingpunches,eventhoughpunchesare themost frequentwayof finishinga fight.ThisMuayThaistyle could be advantageous for several reasons. Kicks, elbows and knees are important forcontrollingdistance; they are generally acknowledged as beingmorepowerful thanpunches,andtheymayalsobeanindicatorofatop-levelstrikerwhowouldbedangerouswithpunches.
• Themostsuccessful submissionsare thekimura, rearnakedchoke,andotherchokesappliedwiththearms(excludingtheguillotine).Acommonfeatureofthesesuccessfulsubmissions isthat they are applied from a position of very strong control (back,mount and side control).Becauseof this, ifanopponentescapes fromoneof thesesubmissions, thesubmitting fighterwillstillbeleftinastrongposition.
Unsuccessfulstrategies:sacrificingpositionforsubmission• Fighterswhospecializeinguillotinechokesorleglocksarelesssuccessfuloverall.Acommon
featureoftheseclassesofsubmissionsisthatshouldtheapplicationoftheseattacksfail, theygenerallyleavetheattackerinaninferiorposition.
• Guillotine chokes are generally applied by pulling guard (or being forced down during atakedown).Assuch,shouldtheattackfail,anopponentisinagenerallyadvantageouspositiontopassguardandsecureastrongcontrolposition.
• Leg locksarealsonotapplied fromthetraditionaldominantgrapplingpositions.Accordingly,whentheyfail,anopponentoftenhasenoughspacetoeithersecureagoodpositionortoreverttoastandingposition.
• Themostpoisonousstylewasanensembleof finishesthatwerecommoninMMAaroundtheyear 2000, but have since decreased greatly in prevalence. These finishes could bedisadvantageous, but more likely, they simply date a fighter. If fighters have won by thesefinishes,itislikelythattheyarepastthemostcompetitiveperiodoftheircareersandthatthisisreflectedintheirrecords.
WhilemostmodernUFCchampionsarestrikersratherthangrapplersandwepointtosubsetsoftheBJJgamethatareill-suitedforMMA(theguillotinechoke,leglocks,andtoalesserextentguardattacks),wefindthatfighterswhospecializeinsubmissionsthatareappliedwithstrongpositionalcontrolcanbesuccessfulinMMA.Indeed,ratherthancommittingtoattacksthatsacrificeposition,themost successfulBJJpractitioners inMMA (likeRonaldo “Jacare” Souza,RafaelDosAnjos, andDemianMaia)blendBJJwithstrikingbychallenging(oroutrightdefeating) theiropponentswith
16
2016ResearchPapersCompetitionPresentedby:
strikingbeforemovingtotheground.Then,whenontheground,thesefightersapplychokesfromstrongpositions.While themannerbywhicha fighterwins is thepurest reflectionofhis/her individualizedstyle,andthissummarycanbeobtainedforamassivenumberoffighters,afighter'sstylecontributesnotonlytohowhe/shewinsaboutbutalsotohowhe/sheperformsthroughoutthefight.Suchdataisincreasingly captured, at least for top competitors, through analysis of video footage byorganizationssuchasFightMetric[22].Thesedatacouldimprovetheaccuracywithwhichstyleisdefined for top fighters as well as capture other elements of a fighter's style, such as wrestlingability,thatdonotdirectlyleadtocharacteristicfinishes.6. BroaderapplicationsinsportsanalyticsAcommonelementof latentvariablebasedapproachesthathavesoughttodefineathletes’styleshasbeentofindtrendsacrossathletesthatsummarizeeachathlete'sperformance.Onelimitationof such methods is that they are directly fit to the data, and thus do not account for stylisticcomponentsthathaveyettomanifest(orhave,bychance,onlybeenweaklydemonstrated)inanathlete's performance.When applying principles in stylistic inference toMMA, a sport that is anextreme case of little athlete-specific data, it is crucial to share power across fighters usinginferencegroundedinBayesianmodeling.Whileothersportsarelikelymoreinsulatedfromsuchpathologies by virtue of possessingmore per-athlete data, evenwhen a large amount of data isavailable,directlyfittingtoper-athleteobserveddatamightnotmostaccuratelyestimatethetruepropertiesoftheathlete(ashasbeendemonstratedforbaseballbattingaverages).
By harnessing latent variable based methods, we characterized MMA fighters' styles based onprototypesthatnaturallygroupfinishes.Ouranalysiscanbeappliedtoanycategoricalmeasureofperformance, including to other martial arts where fighters score points or win by using anexpressivesetofpossiblemoves(e.g.,BJJ,TaeKwonDo,Sumo,JudoandFencing).Ineachofthesesports, the 3MA approach could help to systematize the styles of competitors and reveal factorsgoverningsuccess.
References[1] R. S. Garcıa and D. Malcolm, “Decivilizing, civilizing or informalizing? The internationaldevelopmentofMixedMartialArts,”InternationalReviewfortheSociologyofSport,vol.45,pp.39–58,Mar.2010.[2] S. H. Bishop, P. La Bounty, and M. Devlin, “Mixed Martial Arts: A Comprehensive Review,”JournalofSportandHumanPerformance,vol.1,no.1,pp.28-42,2013.[3]G.J.Buse,“Noholdsbarredsportfighting:a10yearreviewofmixedmartialartscompetition,”BritishJournalofSportsMedicine,vol.40,pp.169–172,Feb.2006.[4] A.HiroseandK.K.-h.Pih, “MenWhoStrikeandMenWhoSubmit:HegemonicandMarginal-izedMasculinitiesinMixedMartialArts,”MenandMasculinities,vol.13,pp.190–209,Sept.2009.[5]K.GoldsberryandE.Weiss,“TheDwighteffect:AnewensembleofinteriordefenseanalyticsfortheNBA,”MITSloanSportsAnalyticsConference,2013.[6]A.Miller,L.Bornn,R.Adams,andK.Goldsberry,“FactorizedPointProcessIntensities:ASpatialAnalysisofProfessionalBasketball.,”Proceedingsofthe31stInternationalConferenceonMachineLearning,vol.32,2014.
17
2016ResearchPapersCompetitionPresentedby:
[7]X.Wei,P.Lucey,S.Morgan,M.Reid,andS.Sridharan,“TheThinEdgeoftheWedge”:AccuratelyPredicting Shot Outcomes in Tennis using Style and Context Priors,” MIT Sloan Sports AnalyticsConference,2015.[8]B.EfronandC.Morris,“DataAnalysisUsingStein’sEstimatoranditsGeneralizations,”JournaloftheAmericanStatisticalAssociation,vol.70,no.350,pp.311–319,1975.[9] S. T. Jensen, B. B. McShane, and A. J. Wyner, “Hierarchical Bayesian modeling of hittingperformanceinbaseball,”BayesianAnalysis,vol.4,pp.631-652,2009.[10] D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent dirichlet allocation,” the Journal of machineLearningresearch,vol.3,pp.993–1022,Mar.2003.[11] D. M. Blei, “Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models,”AnnualReviewofStatisticsanditsApplications,vol.1,no.1,pp.203-232,Jan.2014.[12]Sherdog,“SherdogFightFinder:http://www.sherdog.com/stats/fightfinder.”[13]J.D.StoreyandR.Tibshirani,“Statisticalsignificanceforgenomewidestudies.,”ProceedingsoftheNationalAcademyof Sciences of theUnited States ofAmerica, vol. 100, pp. 9440–9445,Aug.2003.[14] W. Hao, M. Song, and J. D. Storey, “Probabilistic models of genetic variation in structuredpopulationsappliedtoglobalhumanstudies.,”Bioinformatics,vol.32,pp.713–721,Mar.2016.[15]H.M.Wallach,D.M.Mimno,andA.McCallum,“RethinkingLDA:WhyPriorsMatter,”AdvancesinNeuralInformationProcessingSystems,vol.23,pp.1973–1981,2009.[16]G.Heinrich,“Parameterestimationfortextanalysis,”Tech.rep.,UniversityofLeipzig,2009.[17]A.K.Mccallum,MALLET:AMachineLearningforLanguageToolkit,2002.[18]D.BleiandJ.Lafferty,“ACorrelatedTopicModelofScience,”TheAnnalsofAppliedStatistics,vol.1,no.1,pp.17-35,2007.[19] V.S.Y.Lo, J.Bacon-Shone,andK.Busche,“TheApplicationofRankingProbabilityModelstoRacetrackBetting,”ManagementScience,vol.41,no.6,pp.1048-1059,June1995.[20] S. M. Crowe and J. Middeldorp, “A Comparison of Leg Before Wicket Rates Between Aus-tralians and Their Visiting Teams for Test Cricket Series Played in Australia, 1977-94,” TheStatistician,vol.45,no.2,p.255,1996.[21]T.HastieandR.Tibshirani,“GeneralizedAdditiveModels,”StatisticalScience,vol.1,no.3,pp.297-310,1986.[22]R.Genauer,“FightingwithData:CreatingStatisticsfromScratchinMixedMartialArts,”inMITSloanSportsAnalyticsConference,2012.