
Hierarchical Bayesian Model Specification Model is specified by the Directed Acyclic Network (DAG)...

Date post: 05-Jan-2016
Upload: rosa-potter
Page 1: Hierarchical Bayesian Model Specification Model is specified by the Directed Acyclic Network (DAG) and the conditional probability distributions of all.

Hierarchical Bayesian Model Specification

•The model is specified by a directed acyclic graph (DAG) and the conditional probability distributions of all nodes given the values of their parents

•The topology of the DAG defines the conditional dependencies among all variables through the directed Markov property, which states that, given the values of its parents, a variable in the model is independent of all its non-descendants

•The DAG and the local distributions together define the joint probability distribution of the data and all parameters in the model

•In our case this distribution cannot be characterized explicitly, but it can be estimated with a Markov chain Monte Carlo approach (the Gibbs sampler)
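The way a DAG and its local distributions determine the joint distribution can be written out as the standard factorization below. The symbols $v_1,\dots,v_m$ (all nodes, data and parameters alike) and $\mathrm{pa}(v_i)$ (the parents of node $v_i$) are notation introduced here for illustration and do not appear on the slide.

```latex
p(v_1, \dots, v_m) \;=\; \prod_{i=1}^{m} p\bigl(v_i \mid \mathrm{pa}(v_i)\bigr)
```

Each factor is one of the local conditional distributions attached to a node, so specifying the graph and those distributions is enough to specify the whole model.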


Uses and Misuses of Clustering

•Define a statistical model that facilitates clustering of genes based on similarities of their expression profiles

•Define the method-selection criterion that allows estimation of the "correct" number of clusters

•Show that inappropriate "pre-filtering" can fool the statistical model in the same way it fools the casual observer

•Show appropriate ways to use cluster analysis and illustrate the importance of using the "best available treatment"


Clustering of gene expression profiles

[Figure: gene expression profiles plotted across microarrays, shown in several panels; x-axis: microarrays (arrays), y-axis: differential expression]


“Patterns” of Expression - Finite Mixture Model

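[Figure: expression profiles plotted across arrays; x-axis: arrays]

Pattern $i$: $\boldsymbol{\mu}_i = (\mu_{1i}, \mu_{2i}, \dots, \mu_{11,i})$

$\text{Data}_{ik} \sim \text{iid } N(\boldsymbol{\mu}_i, \sigma)$, $k = 1, \dots, n_i$

$n_i$ = number of genes generated by Pattern $i$

$\pi_i = n_i / n$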


“Patterns” of Expression - Finite Mixture Model

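Any gene profile $\mathbf{x} = (x_1, x_2, \dots, x_{11})$

$$\mathbf{x} \sim \sum_{i=1}^{G} \pi_i \, N(\boldsymbol{\mu}_i, \sigma)$$

$$p(\mathbf{x}) = \sum_{i=1}^{G} \pi_i \, p_N(\mathbf{x};\ \boldsymbol{\mu}_i, \sigma)$$

All data $\mathbf{x}_1, \mathbf{x}_2, \dots, \mathbf{x}_n$

$$p(\mathbf{x}_1, \dots, \mathbf{x}_n;\ \boldsymbol{\mu}_1, \dots, \boldsymbol{\mu}_G,\ \pi_1, \dots, \pi_G,\ \sigma) = \prod_{k=1}^{n} \sum_{i=1}^{G} \pi_i \, p_N(\mathbf{x}_k;\ \boldsymbol{\mu}_i, \sigma)$$

(Together these define the Finite Mixture Model)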
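The likelihood of all data under the finite mixture model can be evaluated directly. The following is a minimal sketch in base R, not code from the slides: the function name `mixture_loglik`, its argument names, and the toy patterns and mixing proportions are all hypothetical, and `sigma` is assumed to be a common standard deviation shared by every array and every pattern.

```r
# Log-likelihood of a finite mixture of spherical Gaussians (hypothetical helper).
#   X     : n x d matrix, one expression profile per row
#   mu    : G x d matrix, one pattern (mean profile) per row
#   prop  : length-G vector of mixing proportions pi_i (assumed to sum to 1)
#   sigma : common standard deviation (an assumption of this sketch)
mixture_loglik <- function(X, mu, prop, sigma) {
  d <- ncol(X)
  # comp[k, i] = pi_i * p_N(x_k; mu_i, sigma)
  comp <- sapply(seq_len(nrow(mu)), function(i) {
    z <- sweep(X, 2, mu[i, ])                      # x_k - mu_i for every row k
    log_dens <- -0.5 * rowSums(z^2) / sigma^2 - d * log(sigma * sqrt(2 * pi))
    prop[i] * exp(log_dens)
  })
  comp <- matrix(comp, nrow = nrow(X))             # keep matrix shape when n = 1
  sum(log(rowSums(comp)))                          # sum over genes of log mixture density
}

# Toy data: 30 genes from a flat pattern and 10 genes from a shifted pattern
set.seed(1)
mu <- rbind(rep(0, 11), rep(2, 11))
X  <- rbind(matrix(rnorm(30 * 11), ncol = 11),
            matrix(rnorm(10 * 11, mean = 2), ncol = 11))
mixture_loglik(X, mu, prop = c(0.75, 0.25), sigma = 1)
```

With a single pattern (G = 1) the function reduces to the ordinary Gaussian log-likelihood, which is a convenient sanity check.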


[Figure: one-dimensional mixture. Profiles of Pattern 1 and Pattern 2 across arrays, together with the distribution of differential expression on Microarray 1: the component densities $N(\mu_{11}, \sigma)$ and $N(\mu_{12}, \sigma)$, and their mixture with mixing proportions $1/4$ and $3/4$]


MCLUST

> library(mclust)
> # 5000 simulated genes on 15 arrays, pure noise (no real clusters)
> SimData<-matrix(rnorm(5000*15),ncol=15)
> ColLabels<-c(paste("Tumor_",1:8,sep=""),paste("Control_",1:7,sep=""))
> heatmap(SimData,labCol=ColLabels)

> # restrict the fitted models to EEI, then compute BIC for 1 to 10 clusters
> .Mclust$hcModelNames<-c("E","EEI")
> .Mclust$emModelNames<-c("EEI")
> BIC.emclust<-EMclust(SimData,1:10)
> BIC.emclust

BIC:
         EEI
1  -213490.3
2  -213624.9
3  -213753.0
4  -213880.7
5  -213993.7
6  -214121.0
7  -214243.4
8  -214351.6
9  -214481.4
10 -214588.7

> plot(BIC.emclust)
EEI
"1"


Determining the number of patterns

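$$\mathrm{BIC} = 2 \log p(\mathbf{x}_1, \dots, \mathbf{x}_n;\ \hat{\boldsymbol{\mu}}_1, \dots, \hat{\boldsymbol{\mu}}_G,\ \hat{\pi}_1, \dots, \hat{\pi}_G,\ \hat{\sigma}) - N_{\text{parameters}} \log(n)$$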

[Figure: BIC versus number of clusters (1 to 10) for the EEI model fitted to the full simulated data set; x-axis: number of clusters, y-axis: BIC]

[Figure: heatmap of SimData; columns appear in the order Control_5, Tumor_1, Tumor_2, Control_1, Tumor_5, Tumor_6, Tumor_8, Control_4, Control_3, Tumor_4, Control_2, Tumor_3, Tumor_7, Control_6, Control_7]
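To make the BIC concrete, the one-cluster case can be computed by hand. This is a sketch in base R under stated assumptions, not the mclust implementation: it assumes the convention BIC = 2 log-likelihood minus (number of parameters) times log(n), and it assumes that the one-cluster EEI model is a Gaussian with a diagonal covariance matrix, so that with d arrays there are d means and d variances. The function name `bic_one_cluster` is hypothetical.

```r
# BIC of a one-cluster Gaussian model with diagonal covariance, written out by hand.
bic_one_cluster <- function(X) {
  n <- nrow(X)
  d <- ncol(X)
  mu_hat  <- colMeans(X)                           # maximum-likelihood means
  var_hat <- colMeans(sweep(X, 2, mu_hat)^2)       # maximum-likelihood variances (divide by n)
  # maximized log-likelihood: each column contributes -n/2 * (log(2*pi*var) + 1)
  loglik  <- -0.5 * n * sum(log(2 * pi * var_hat) + 1)
  n_par   <- 2 * d                                 # d means + d variances
  2 * loglik - n_par * log(n)
}

set.seed(1)
SimData <- matrix(rnorm(5000 * 15), ncol = 15)     # same dimensions as the slide's simulation
bic_one_cluster(SimData)
```

Because the data are regenerated with a different random seed, the value will not reproduce the slide's number exactly, but it should be of the same order of magnitude. Each extra cluster adds parameters to the penalty term, so on pure noise the BIC is expected to decrease as clusters are added, as in the table above.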


MCLUST

> # two-sample t-test (tumor vs. control) for every gene
> p.value<-apply(SimData,1,function(x) t.test(x[1:8],x[9:15],var.equal=TRUE)$p.value)
> # "weak filter": keep genes with p < 0.05
> SigData<-SimData[p.value<0.05,]
> dim(SigData)
[1] 242  15
> heatmap(SigData,labCol=ColLabels)

> BIC.emclust<-EMclust(SigData,1:10)
> BIC.emclust

BIC:
          EEI
1  -10599.485
2   -9647.645
3   -9685.897
4   -9729.239
5   -9796.119
6   -9849.109
7   -9912.601
8   -9973.645
9  -10037.436
10 -10077.862

> plot(BIC.emclust)
EEI
"1"


Determining the number of patterns

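$$\mathrm{BIC} = 2 \log p(\mathbf{x}_1, \dots, \mathbf{x}_n;\ \hat{\boldsymbol{\mu}}_1, \dots, \hat{\boldsymbol{\mu}}_G,\ \hat{\pi}_1, \dots, \hat{\pi}_G,\ \hat{\sigma}) - N_{\text{parameters}} \log(n)$$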

[Figure: heatmap of SigData; columns appear in the order Tumor_6, Tumor_2, Tumor_3, Tumor_1, Tumor_7, Tumor_8, Tumor_4, Tumor_5, Control_4, Control_2, Control_3, Control_7, Control_5, Control_1, Control_6]

[Figure: BIC versus number of clusters (1 to 10) for the EEI model fitted to the filtered data; x-axis: number of clusters, y-axis: BIC]


Summary

•The "weak filter" based on selecting "sub-significant" differentially expressed genes created artificial clusters

•When the whole dataset was used, the Bayesian information criterion did the right thing by estimating the correct number of clusters to be equal to one

•Take-home message: when "filtering" before clustering, make sure that appropriate statistical significance levels have been used
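The mechanism behind the artificial clusters can be seen in a small simulation. The sketch below follows the slides' setup (5000 noise genes, 8 "tumor" and 7 "control" arrays, a p < 0.05 filter) in base R; the random seed and the variable `diff_sel` are choices made here for illustration.

```r
set.seed(1)
SimData <- matrix(rnorm(5000 * 15), ncol = 15)     # pure noise: no gene is truly different
p.value <- apply(SimData, 1, function(x)
  t.test(x[1:8], x[9:15], var.equal = TRUE)$p.value)
SigData <- SimData[p.value < 0.05, ]

# Under the null, about 5% of genes pass the filter by chance alone
nrow(SigData) / nrow(SimData)

# Tumor-minus-control mean difference for every gene that passed the filter
diff_sel <- rowMeans(SigData[, 1:8]) - rowMeans(SigData[, 9:15])
table(sign(diff_sel))                              # an "up" group and a "down" group
range(abs(diff_sel))                               # no selected gene has a difference of zero
```

The filter keeps only genes whose tumor and control means happen to differ, so the retained genes split into an apparent "up" group and an apparent "down" group even though the data contain no structure. A clustering method applied to SigData then finds two clusters that the filter itself created.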


Using clustering to find "patterns" among differentially expressed genes

•Cluster analysis is preceded by a rigorous statistical analysis

•For example, identify genes that were "differentially" expressed in at least one experimental comparison. Among these genes, some will behave similarly across all experimental conditions

•Clustering is a way of organizing behavior of differentially expressed genes across different experimental conditions


Using clustering to find "patterns" among differentially expressed genes

Up-Regulation

Weak Uniform Up-Regulation

Strong Uniform Up-Regulation

Weak Early Down-Regulation

Strong Uniform Down-Regulation

Down-Regulation

Strong Later Up-Regulation


Using clustering to find "patterns" among all genes

•No filtering is performed

•You can perform the "quality filtering"

•Trying to identify statistically significant patterns

•Using the best available method becomes extremely important


Does it matter which clustering procedure we use?

[Figure: heatmaps of the clustered yeast expression profiles for the two methods; column groups in each heatmap: SK1 0h–12h, W303 0h–12h, Cell cycle 1, Cell cycle 2]

Simple Commonly Used Method (Euclidean Distance Based Hierarchical Clustering)

"Complicated" Method (Context-specific Infinite Mixtures)

•5685 Yeast Genes Across Two Experiments (Cell Cycle and Sporulation)

•NO VARIABILITY BASED FILTER

•135 Genes with closest co-expression partners


"Objective" Performance Assessments Using KEGG as the Gold Standard

•There is a large imbalance between the total numbers of negative and positive pairs: there are 17 times more negative pairs than positive pairs, so even a small false positive rate (FPR) can still produce more false positives than true positives
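A short worked calculation shows how the 17-to-1 imbalance plays out. The symbols $N_{+}$ and $N_{-}$ (numbers of positive and negative pairs) and the rates in the last line (TPR = 0.5, FPR = 0.05) are illustrative assumptions, not values from the slides.

```latex
\begin{aligned}
N_{-} &= 17\,N_{+} \\
\mathrm{TP} &= \mathrm{TPR}\cdot N_{+}, \qquad
\mathrm{FP} = \mathrm{FPR}\cdot N_{-} = 17\,\mathrm{FPR}\cdot N_{+} \\
\mathrm{FP} > \mathrm{TP} &\iff \mathrm{FPR} > \mathrm{TPR}/17 \\
\mathrm{TPR} = 0.5,\ \mathrm{FPR} = 0.05 &\;\Longrightarrow\;
\mathrm{TP} = 0.5\,N_{+}, \quad \mathrm{FP} = 0.85\,N_{+}
\end{aligned}
```

In this hypothetical case a method that looks good on both rates still reports more false pairs than true ones, which is why the rates have to be read together with the class sizes.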


Summary

•Using clustering alone, one can identify "significant" patterns of expression when using appropriate methodology

•For example, the yeast data clustered in this example did not have any replicates, so the traditional analysis to identify differentially expressed genes before clustering was not feasible

•Statistical significance of resulting clusters needs to be carefully examined


Infinite Bayesian Mixtures

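[Figure: DAG of the model; recoverable node labels: X, C, M, r, w]

$\mathbf{M} = (\boldsymbol{\mu}_1, \dots, \boldsymbol{\mu}_K)$

$\sigma = (\sigma_1, \dots, \sigma_K)$

$\pi = (\pi_1, \dots, \pi_K)$

$C = (c_1, \dots, c_N)$

$c_i \in \{1, \dots, K\}$

$$p(c_i = j \mid c_{-i}) = \frac{n_{-i,j}}{N - 1 + \alpha}$$

$$p(c_i \neq c_j,\ j \neq i \mid c_{-i}) = \frac{\alpha}{N - 1 + \alpha}$$

$$p(\mathbf{x}_i \mid c_i = j,\ \boldsymbol{\mu}_1, \dots, \boldsymbol{\mu}_K,\ \sigma_1^2, \dots, \sigma_K^2) = f_N(\mathbf{x}_i \mid \boldsymbol{\mu}_j,\ \sigma_j^2 \mathbf{I})$$

$$p(c_i \mid \pi_1, \dots, \pi_K) = \prod_{j=1}^{K} \pi_j^{I(c_i = j)}$$

$$p(\boldsymbol{\mu}_j \mid \boldsymbol{\lambda}, r) = f_N(\boldsymbol{\mu}_j \mid \boldsymbol{\lambda},\ r^{-1} \mathbf{I})$$

$$p(\sigma_j^{-2} \mid \beta, w) = f_G\!\left(\sigma_j^{-2} \,\middle|\, \frac{\beta}{2},\ \frac{\beta w}{2}\right)$$

$$p(w \mid \sigma_x^2) = f_G\!\left(w \,\middle|\, \frac{1}{2},\ \frac{\sigma_x^{-2}}{2}\right)$$

$$p(\beta) = f_G\!\left(\beta \,\middle|\, \frac{1}{2},\ \frac{1}{2}\right)$$

$$p(r \mid \sigma_x^2) = f_G\!\left(r \,\middle|\, \frac{1}{2},\ \frac{\sigma_x^2}{2}\right)$$

$$p(\boldsymbol{\lambda} \mid \boldsymbol{\mu}_x, \sigma_x^2) = f_N(\boldsymbol{\lambda} \mid \boldsymbol{\mu}_x,\ \sigma_x^2 \mathbf{I})$$

$$p(\pi_1, \dots, \pi_K \mid \alpha, K) = f_D\!\left(\pi_1, \dots, \pi_K \,\middle|\, \frac{\alpha}{K}, \dots, \frac{\alpha}{K}\right)$$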


Conditional posterior distributions and Gibbs Sampler

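$$p(c_i = j \mid c_{-i}, \mathbf{x}_i, \boldsymbol{\mu}_j, \sigma_j^2) = b\, \frac{n_{-i,j}}{T - 1 + \alpha}\, f_N(\mathbf{x}_i \mid \boldsymbol{\mu}_j,\ \sigma_j^2 \mathbf{I})$$

$$p(c_i \neq c_j,\ j \neq i \mid c_{-i}, \mathbf{x}_i, \boldsymbol{\mu}_x, \sigma_x^2) = b\, \frac{\alpha}{T - 1 + \alpha} \int f_N(\mathbf{x}_i \mid \boldsymbol{\mu}_j,\ \sigma_j^2 \mathbf{I})\ p(\boldsymbol{\mu}_j, \sigma_j^{-2} \mid \boldsymbol{\lambda}, r)\ d\boldsymbol{\mu}_j\, d\sigma_j^{-2}$$

$$p(\boldsymbol{\mu}_j \mid \mathbf{c}, \mathbf{X}, \sigma_j^2, \boldsymbol{\lambda}, r) = f_N\!\left(\boldsymbol{\mu}_j \,\middle|\, \frac{r^{-1}\, \bar{\mathbf{x}}_j + \frac{\sigma_j^2}{n_j}\, \boldsymbol{\lambda}}{r^{-1} + \frac{\sigma_j^2}{n_j}},\ \frac{r^{-1}\, \frac{\sigma_j^2}{n_j}}{r^{-1} + \frac{\sigma_j^2}{n_j}}\, \mathbf{I}\right)$$

$$p(\sigma_j^{-2} \mid \mathbf{X}, \mathbf{M}, \beta, w) = f_G\!\left(\sigma_j^{-2} \,\middle|\, \frac{M n_j + \beta}{2},\ \frac{s_j^2 + \beta w}{2}\right)$$

$$f(\boldsymbol{\lambda} \mid \boldsymbol{\mu}_1, \dots, \boldsymbol{\mu}_Q, r) = f_N\!\left(\boldsymbol{\lambda} \,\middle|\, \frac{\frac{r^{-1}}{Q}\, \boldsymbol{\mu}_x + \sigma_x^2\, \frac{1}{Q}\sum_{i} \boldsymbol{\mu}_i}{\frac{r^{-1}}{Q} + \sigma_x^2},\ \frac{\frac{r^{-1}}{Q}\, \sigma_x^2}{\frac{r^{-1}}{Q} + \sigma_x^2}\, \mathbf{I}\right)$$

$$f(r \mid \boldsymbol{\mu}_1, \dots, \boldsymbol{\mu}_Q, \boldsymbol{\lambda}) = f_G\!\left(r \,\middle|\, \frac{MQ + 1}{2},\ \frac{\sum_{i} (\boldsymbol{\mu}_i - \boldsymbol{\lambda})'(\boldsymbol{\mu}_i - \boldsymbol{\lambda}) + \sigma_x^2}{2}\right)$$

$$f(w \mid \sigma_1^{-2}, \dots, \sigma_Q^{-2}, \beta) = f_G\!\left(w \,\middle|\, \frac{Q\beta + 1}{2},\ \frac{\beta \sum_{j} \sigma_j^{-2} + \sigma_x^{-2}}{2}\right)$$

$$f(\beta \mid \sigma_1^{-2}, \dots, \sigma_Q^{-2}, w) \propto \Gamma\!\left(\frac{\beta}{2}\right)^{-Q} \exp\!\left(-\frac{1}{2\beta}\right) \left(\frac{\beta}{2}\right)^{\frac{Q\beta - 3}{2}} \prod_{j} \left(\sigma_j^{-2} w\right)^{\frac{\beta}{2}} \exp\!\left(-\frac{\beta\, \sigma_j^{-2} w}{2}\right)$$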
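One step of the Gibbs sampler can be sketched in a few lines. The example below draws a cluster mean from its conditional posterior, assuming a normal prior on the mean with prior mean `lambda` and prior precision `r`, and a spherical within-cluster variance `sigma2j`; under those assumptions the posterior mean is a weighted average of the cluster's sample mean and the prior mean. The function name `draw_mu_j` and all argument names are hypothetical, and this is not the authors' implementation.

```r
# One Gibbs draw of a cluster mean mu_j (hypothetical helper).
#   Xj      : n_j x M matrix of the profiles currently assigned to cluster j
#   sigma2j : within-cluster variance sigma_j^2
#   lambda  : length-M prior mean of mu_j
#   r       : prior precision of mu_j (prior variance is 1 / r)
draw_mu_j <- function(Xj, sigma2j, lambda, r) {
  nj   <- nrow(Xj)
  xbar <- colMeans(Xj)
  s    <- sigma2j / nj                             # sampling variance of xbar
  post_mean <- (xbar / r + s * lambda) / (1 / r + s)
  post_var  <- (s / r) / (1 / r + s)
  rnorm(length(xbar), mean = post_mean, sd = sqrt(post_var))
}

# Toy usage: 20 profiles on 11 arrays centered at 3, prior centered at 0
set.seed(1)
Xj <- matrix(rnorm(20 * 11, mean = 3), ncol = 11)
draw_mu_j(Xj, sigma2j = 1, lambda = rep(0, 11), r = 1)
```

As the cluster grows, `s` shrinks and the draw concentrates on the cluster's sample mean; for a small cluster the prior mean `lambda` pulls the draw toward the overall center.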


Gibbs Sampler Result

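Sequence: $(c_{k,1}, \dots, c_{k,n})$, $k = 1, \dots, k_{\max}$

such that

$$(c_{k,1}, \dots, c_{k,n}) \sim p(c_1, \dots, c_n \mid \mathbf{x}_1, \dots, \mathbf{x}_n) \quad \text{for } k > k_{\text{burn-in}}$$

•Posterior distribution summarized through “posterior pairwise probabilities of co-expression” $p(c_i = c_j \mid X)$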
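The pairwise probabilities can be estimated from the sampler output by counting how often two genes share a label after burn-in. This is a minimal base R sketch; the function name `pairwise_coexpression`, the layout of the matrix `C` (one row per Gibbs sweep, one column per gene), and the toy labels are assumptions made for illustration.

```r
# Posterior pairwise probabilities of co-expression from Gibbs output.
#   C       : kmax x n matrix; C[k, i] is the cluster label of gene i at sweep k
#   burn_in : number of initial sweeps to discard (must be smaller than kmax)
pairwise_coexpression <- function(C, burn_in) {
  kept <- C[seq(burn_in + 1, nrow(C)), , drop = FALSE]
  n <- ncol(kept)
  P <- matrix(0, n, n)
  for (k in seq_len(nrow(kept))) {
    # adds 1 to entry (i, j) whenever c_i == c_j in sweep k
    P <- P + outer(kept[k, ], kept[k, ], "==")
  }
  P / nrow(kept)                                   # fraction of kept sweeps with c_i == c_j
}

# Toy output: 4 sweeps, 3 genes; the first sweep is treated as burn-in
C <- rbind(c(1, 1, 2),
           c(1, 1, 2),
           c(1, 2, 2),
           c(3, 3, 1))
P <- pairwise_coexpression(C, burn_in = 1)
P[1, 2]   # 2/3: genes 1 and 2 share a label in two of the three kept sweeps
```

Only agreement between labels is counted, so the estimate does not depend on how clusters are numbered from one sweep to the next.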


Properties

•“Pooling” information from the whole dataset by estimating both “patterns” and “assignments”, similar to K-means (K-means is actually equivalent to a special case of the mixture models with a known number of clusters)

•Does not require specification of the right number of clusters (unlike K-means)

•Gives direct estimates of statistical significance (unlike anything else on the market)

•Instead of lamenting over which distance measure to use, focus on the appropriate statistical model, which is a well-defined problem

•Works for any type of data


Finding important functional groups for up-regulated genes

Using the "Ease" annotation tool http://david.niaid.nih.gov/david/

We obtained the following significant gene ontologies: Up_DexANDNE2ANDirr_381_GO.htm

Homework:
1) Download and install Ease
2) Select the top 20 most significantly up-regulated genes in our W-C dataset and identify significantly over-represented categories (using the three-way ANOVA analysis)
3) Repeat the analysis with 30, 40, 50 and 100 up-regulated and down-regulated genes
4) Prepare questions for the next class regarding problems you run into

