Hierarchical Bayesian Model Specification Model is specified by the Directed Acyclic Network (DAG)...

Post on 05-Jan-2016

222 views 1 download

transcript

Hierarchical Bayesian Model Specification

•Model is specified by the Directed Acyclic Network (DAG) and the conditional probability distributions of all nodes given values of its parents•Topology of the DAG defines the conditional dependencies of all variables through the Markov directed Markov property which states that given the values of its parents, a variable in the model is independent of all its non-descendents•DAG and local distributions define the joint probability distribution of data and all parameters in the model•In our case this distribution can not be explicitly characterized but it estimates using Markov Chain Monte Carlo approach (Gibbs sampler)

Uses and Miss-Uses of Clustering

•Define a statistical model that facilitates clustering of genes based on similarities of their expression profiles

•Define the method-selection criteria that allows for estimating the "correct" number of clusters

•Show that inappropriate "pre-filtering" can fool the statistical model in the same way it fools the casual observer

•Show appropriate ways to use cluster analysis and illustrate the importance of using the "best available treatment"

Clustering of gene expression profiles

Microarrays

Diff

eren

tial E

xpre

ssio

n

0 2 4 6 8 10 12

-20

-15

-10

-50

5

1 2 3 4 5 6 7 8 9 10 11

Microarrays

Diff

eren

tial E

xpre

ssio

n

0 2 4 6 8 10 12

-20

-15

-10

-50

5

1 2 3 4 5 6 7 8 9 10 11

Microarrays

Diff

ere

ntia

l Exp

ress

ion

0 2 4 6 8 10 12

-20

-15

-10

-50

5

1 2 3 4 5 6 7 8 9 10 11

0 1 2 3 4 5 6 7 8 9 10 11 12

arrays

-20

-15

-10

-5

0

5

10

0 1 2 3 4 5 6 7 8 9 10 11 12

arrays

-20

-15

-10

-5

0

5

10

“Patterns” of Expression - Finite Mixture Model

0 1 2 3 4 5 6 7 8 9 10 11 12

arrays

-20

-15

-10

-5

0

5

10

Patterni i=(1i, 2i,…, 11i)

Dataik ~ iid N(i, ), k=1,…,ni

ni=number of genes generated

by the Patterni

i=ni/n

“Patterns” of Expression - Finite Mixture Model

Any gene profile x = (x1,x2,…,x11)

),( i

G

1ii σNπ μ

x ~

), ; ( )( iN

G

1ii σpπp μx x

All data x1, x2,…, xn

x xx

n

σpπp1k

ikN

G

1iiG1G1n1 ), ; ( ) ,,..., ,,..., ;,...,( μμμ

} Finite Mixture Model

0 1 2 3 4 5 6 7 8 9 10 11 12

arrays

-20

-15

-10

-5

0

5

10

0 1 2 3 4 5 6 7 8 9 10 11 12

arrays

-20

-15

-10

-5

0

5

10

One-dimensional mixturePattern 1

Pattern 2

Differential Expression on Microarray 1

Differential Expression on Microarray 1

Differential Expression on Microarray 1

N(11, )

N(12, )

),(4

3 ),(

4

12111 σμNσμN

MCLUST

> library(mclust)> SimData<-matrix(rnorm(5000*15),ncol=15)> ColLabels<-c(paste("Tumor_",1:8,sep=""),paste("Control_",1:7,sep=""))> heatmap(SimData,labCol=ColLabels)

> .Mclust$hcModelNames<-c("E","EEI")> .Mclust$emModelNames<-c("EEI")> BIC.emclust<-EMclust(SimData,1:10)> BIC.emclust

BIC: EEI1 -213490.32 -213624.93 -213753.04 -213880.75 -213993.76 -214121.07 -214243.48 -214351.69 -214481.410 -214588.7

> plot(BIC.emclust)EEI "1" >

Determining the number of patterns

log(n) -)ˆ ,ˆ,...,ˆ ,ˆ,...,ˆ ;,...,( parametersG1G1n1 NpBIC xx μμ

1

1

1

1

1

1

1

1

1

1

2 4 6 8 10

-21

46

00

-21

44

00

-21

42

00

-21

40

00

-21

38

00

-21

36

00

number of clusters

BIC

Con

trol

_5

Tum

or_1

Tum

or_2

Con

trol

_1

Tum

or_5

Tum

or_6

Tum

or_8

Con

trol

_4

Con

trol

_3

Tum

or_4

Con

trol

_2

Tum

or_3

Tum

or_7

Con

trol

_6

Con

trol

_7

52989344774512493102425422215264375516372494316910433126249049832607884381414313427116726912359366427382527156964913213721623185742183089253725454542633154108986190515504422326749642817160543552339189828098302312603484237071889468622112363547721137153212662072601354412268439743167289933145347435012443297348147462854789191111333044911425146424853371693620786686596360311036723514361403914543060146619551034200826634400470515841523414523234819588882054366638272176114040812110135695437631727611446421273424053612193487185017399687363537348345602869448140977763725227719252572486012251887297329425541306134336343418268836491282200539514485890412336181413440113631942615388755524472928252233696421727943571469420933193407939201476399445254311353985489437216319245901364993509333242322681218013971963161536640273569243991037521888466237674571165325962774829387740043988112311911552212445824424297528783100203743242743271931112146146937421959135486613133219205726634862247126123963378161730901654612699307529104680105958212113332085563265338823692297010202905290746659433948191937964803310514501642317261594405510962434399245030884542482633461229127213124059217011807095464227260010362487208457820824899256845132461123837851440480439938386283759227466134677142544631591119282821923148400135737584503273130840572782227026651520264442902292463937531501407045594300749319166519256530992488427446614321365180146212093617488437908634865230406385730851222253636041995232033581209613298212665429003283077221247791012242925444833437412324252214135154523826274745939049271638173744933747920635335211121964276345453155266816593637314361369049534521167723883438439468114212593327157011342737833184334193740205364733920104093464017971875497412445624764535640181941341686412643454952139846382091592275417946523399149526587781081961323441314258109148391582356352944021896336484020496582501333510531193269080830574518225349941823894461738091859429939378642362275520262065161048314742451630342218775423881612499932704699117635614021299689871461117702128272715071391928133034583656766133791630972240496928933550117335525273946164174282917653435302616311473416240383596376305246594313139433614454291342389127841585204436963600150043932324753372332967442639852805406231527577202329317186110129402583924374520772953404212832495162142571522343612884809441529221458740602381303425423144723175840724533380535031684218228543108180946281648384811174475236137334213452991148472617394416284005431818911383813518426436314731291479428532230160340344237314918465201364311644384725126235453942465725122113237914984891391449583164440321294876109230043874466933373415369140004428134614771015165026761434531433642823852362918442891237428503248985374841681962397348947569951203819181328024757235223496396093766414147412121120535885528994812478382530191504217516984978788412147162443128324166246629462859319237494742196031197715894960392619773532115523094225241419991904344331182629102944961116456223172498268762025411994190735598109292472976239412398811626526783539386118651913359916393295261129365806434771190930282238136246263683159534421803227156446012468343074247928454383344918122474371117033424341883297930334609327643843253200015973228258449423323162460037199192407357850624774563171481816813136434347354878317039784272465819303586195634283519481593144835604517423936715283382490718672717769282621483408363256631063485245611889644301180115131926246715331981295628844823196655724691830443580577016832291363949464035132928653492442088012791293327218832812034842223167394987396031421070377449523029794259375131071934156719521803345718485232569340110691325265921392602165236424393708270031174031200433564843405318993468203303662135973522215510551289532286030202916499644451544204839518223680196813272515282436873880122044432393797447432752667482921181051469247504679140612529044631239825081840373630724572146419173055367318807674580300139669722926408622594060484543132715737105835204078378614414772417134094101368922164204631662444632031718131175548562741097338613582526130739834212265047742823894796124129111392166322548731172957521424343522272402427633301252016644441179269515056592664203145542811079447277723433421432629571094763459733454447818421949153052366299846107947414230270911284857354122262371388831903517113545084935459956350514635693236250328943404732583276747654379260839391749430426203256404047614405474833914799379736449489636462528217216631375353339673091348446562056242122893930334711022234455274030232827417442654479452334701541255521441587277242021315261236824854791235749623135430733842881269713703098175453826224136220372246973659125164935874019112249755415654458288331049894349131914091301255598202422833297133243323996876102348493734240107491993125562014422019572052361284947733814566207444765404673670185528521276450941134808242528634710352810853580212045161940168547642660195406446442815280714172999177328424178160721154576272329724745476623715555212363165165437353605239062542333568362822652119622208568342435933368446074288741736731311056134848684117397741961714127527852539914629194798721814135133148356342954427024973611343741942525382514971297148818694359257348163425155830663293069394142227325849749134932073156825332103106079123008111891485278337704946487047086851871668295042934112404541956383933933176433064740311325064177990368176524339563266365531884390257840232927483846432437350747491775279631273971328739153286120145774128370678748633803500027441265727467846067199441772322423921208478222447262122460269342352060167014532820495021832662259938041589112150719651031249133859341991479023301090324121571305254223210032147534867468454994217052151191636102846749612062023307263622229811805912304415164858399555625335327912444148204013733862706172821954209244133628417114010115636822884223959197847920974594251930071795405311185814681494217843713353914350322500378434563093173628101946497330148252096373846162340424230474016272625321657234446833124361644194286176723916982654412547121795624277611392757164738964962748286497013535742111834495426343316293012152218474386935163992259186296441847023788865496828371574495121974314453149238837749229355992282128254038326553128677331217604817455644883103238622424844355392458128342081201743641839203047063237140313823543216920187806533322258929921910347998300523211087288926401371427262445982286326543604253345440081419344624629179231844214645316019356371524956340728902703296449492266325431451645440285812743501104033197226916201064353829243271329140541340203213172331222384148110211504088494048662114319933141756369539054967299341303284367512421658449238228354852236332053823345913724305515153395237917803932997100392924771622793062145527893313418312904025409491284927622196229256017874083251248541381982236739485944982104198941553711235138473519299614604802933351827243619281342787352372303536602370381116253056402899989520294314292519589382016160840182801292922317340666232502133841929037962871193166435571414399447132404043191394310346662410484145281304683245845732554982158126185022825814150295125614006122348791900760492547134065468928513096139239821631432001243245363643489022521559121031471828246043283959231641901008450426812283863974362630634932934396274682312671785428835894937278742103829298039474411937141860521124619218724734995186239862877159243182347183744393536064624828224972325522610322129271885415061129406725578222602876406853943352455199644672389222745831655394026314215547303840414653982673492346034877354766719501148404432894788213038946341984855355144111150151868825223484851443231401551153638122312272526894596258624681676703103291739109061399464541993572172382811114347315123353709462325772298161220924153180418313412081234323923264116873277326254328681592194136660029203890764108103710452319673447347446117351510210619922932294814993406184739023591452410262094284661946481984121441332023303177938453209138645554806267614135533715383806349743625613181248645842478256290824363776355322674482354946273721254817082944351242628113130392331831634719157933192997209053010252100473621824714500419824121068179533444238527281388323914613928399844372218854363159149861447237510813413287429784009408729887718338182839384111034233679429434902529114320073076406123191426214923273388173228852574204614783697241213843573432799447810396062732325035342201114138601042219310651838392229901007414926138124312682607260432312675950475149814165370413682742420027702745422235465610824881147241722779424537826921293443748216277384002205037564098453934912179178846718891321042241144403615734641573446610802760291927494668471518514716252413422887193233921688352722610661094276531831263131813032932535137834115054759121313774760482031572994151230721902923124724123337052360150243953548200488242501352127148244898186427593531178933033872814422810240468749334002033322922201922445216029083755341515905307346223912403283425544998274784362410114889448728801884484536931310724487045302244996452710487501973199313344690358315451876268621674085351136404011353307029894104328530113635432223816120673080144492749562547227612682673125934456614700411033972538470119491833283117343730322625304972198333651882361418861410468812341408325165615261521239710283109164183419982134786444824264537127383718562909259857313809633815319730843884365242739231853707401535234502516269632151816406333963141181125420191151168015082806326434229023393754306737272126117722543917306139765121932509444036222994532146726092952457864822904143455334992962255941841139501067221439015504747446343461642234614562420429247341445297747143401195145137941929170627971442145726133216597436946353153225546743461734186385837746461559663908377733344837347646672066312362215032041075236929354451841627353511832009347320423911401313222645287400774811424452103674305349882718169021323810393630171260301830104206352529651818153521544032020512035317517076692143215804685387386640322262265536625031471344384440993841378738353156156135083235350419336262281120048111454037337021354442265278191384513651027380023182914394460530502245283493022353121776334815613083724207145741912330241753290725285730512296159015173064431912513502353611054653193817661515721264143346603469211635241022086446938684047986468239134723387855348069024963023331841336317291439190349701646104430684179427628381700186118741005409641472322142312545191870443527718003184620636472030432656423819235771730338121734308498945871281339531322243222249941341633708295942834316220594701443442682165721146886103053267520134348202579161818075863452273533624501201232622311270301416653954519480581616241002377518311412323035004887455241394883220193621174717339441273972631386435947075221695110124752730151604184949262278330517461324363637914641250427110993975439817444595383497121684732278834883878492442691894385287231372886179077324164277163022044376296618144615187114004071653779346730211602197946042792488035654115249448929584754424128141063240620555842264493918973341486453117234514179131583024176112962415799491449456119624132625155330084724206311154330367641136549254118133296023427062861277521163428662523259741643226332591261549331300347220708483260374172813092915215687523833249326215483217304522564994353171615627773471199250538082125375560140562840101629684581304124112685199748501733357914893938343942661632207510043268174042466722510456853324003855267745544729459049902098147033517125826322470384637124480167247671302286962775226883517774325383692180658549341435407643312983564120336253667258128184540292927141918387333764348264711861071325911752758392295105446633150233884322093820955548365371042806293781418818902816227385226042713039302573423538133513387521234897360742420212234898210116014804770246415121006179932114397940486433904105432072702143017983278186839692453207633551093146244362061437819694303124335104641124826315115352947251729397293734471139123222110211241038356147381951201226572109277145035064297282473947032378497741293502358309597838561041369306520283032489373930782701468457120487918323293595161323343455336933114111283634026104470168917821420291723733885258825212271340517113343134617242236782452517118771215254954279816222622888364749413677380139995772672152174131023320988220741403714262314630490821021778241865146813389363043024586123109529181046297446181719153146702646376564617814758381713503451250740869743581360218441803204226700437723248002294379247289942011210521581725248432442454174334663030106218101109141123284976120231481825121162926511376438526521802980471431933834033384917135123154548449790197327420412194468324328194225737723177184108442342984463338335404380277816003686424843872089759514813132039493798892783147516913083117390348916604637125315761152271612863299432779179688995390323417694761661355437784216445927802442334454526146553200119490100019453440761458913713854175021331189197116333562469534034848270895237612185160649217381438456926351754454132884333118222754762504231330374595252562225116038331153184248464913554366511421171181464058756167132632199138424921715267145583590409227514544878114711682435379949443881682513396525013742253079105034162045182286212245923042380766210224381602333636233737140734544231821119915306793331209528794159270428417133957558205839588017171534745116927201492707174838537841675310136613701126918307041724297134943702382112729213816305848219774421154731333139428538701432390022584651223912573895433143914197387128433071216416792628228432693869135968319421336574167237641705754885133394139814073699185422933013410510152949573238287514624440443512856604186322973449977479474916339832011486337343724526276110734998111417122516457519531019352437004464221335917839264426230349632898832483686770187319444153215940518622246495591517684582941261678467366312848216603379478041073238108646014163633752244446515984625392720593441591161217731542364695672198366928212906316816972630231130033325490558866822507583176173142291479323134311836418919904231579207949212228161982616433909837418134143189317217933867930219124658099392213283025644090354245228683806941138642423622024069445723557611943411083430656447759183115269322242237293135584146427967643651049894119044132582474100133714292546270531781414445028333350407536456713862329229376017921975115940032287325549282137398019411326258517632669321288511492143399736211052395431238654886261933093354337738502108466434532324121246930343743441621711369178637934650316215496648101596169431295104945104271531414557369916674046134117832331224107426388221078256723052441443332424433195163647304929570241771794534611837494830543223334048754798439224014192222129914260487136154778230490795855946107734781644121714044254334270235568519024356277314349201076393485915658474979284816994412450510333081831251828973204485419704022431543671285118289678928472248526490026424531601138943393789112645673430293748314249125614914787129442474077279012302736226187317851147214731432024819141162262721113471808119636031052306300745726496824902188157249801474607105753712494263445240481586300930921422903397020254490691354647093333172016821751214219875713213158849431349294522632233281911314704335786045502967429814591948296939873247337438221973526473785336844026463643422020183510142801447211217093648244049112017744984114537203650337243101390440814294591279325261111843433156028704089393125391017479334291804687123737031328483436024931320821502587128027325141866499129463246981383280320393251420515144124478420119062648329841213463361316942033308136131613838377119863644116400469348018202739140143533257384314448637695083575449227043330250480311003339267421743049726122111571125405266955323683380960267030944622338348883120372927643027140516114786425714416120643182124518952650446231961892402421225753899293340334776294926054430416040123343308720223764142823772534549131134102637245798332071490273322293831741146577981908208822411482321816403945179420832698444932458047213609242725764308401537276635831143627175742871853576139533664551438172139682236332934123698764145223061635420733101893208749031416317449122511385921537622448500366318781976268423802758177147337053121733415648223585480140141539362374210737269843601781437129552867713368564447524092038429145072768316688474344176239613495318637952882396245474208161612415712489415138971701290136684296140417593582364848624157130123085604906418549614417312227303978193317332135841527371015562433208914449471710969266128223465391624233566243015094465144935704672278613961206401738303768280848721154375034743876175284414934909154343862756458527412145209924403658369447523608127341444409411614433906992873472273041583006291276831852995255326265444193285519212483155724243840388622743781915437342044510213689749384781201539891920409511858822666117441428504632497120432387373117222053688382442612069355581537623984268035942210456446911549131417034832491864123014338149636414484336410133360148324625143964181719021904768294240804041254912183754490426449012408257190948833029448323557317823349139345616142892846438919012961973308245123842402116661437231083632739822341479726794094298647692711301632063002298813166702299307327124647287521271563137947851753793475524520803991498510094429182713572280373294385137174317443924811860122755911652752372849363012332393722194676156614241582324633261702268318521133242885838894895695302115328015991764284262501216481580647444267371339903739402045112152407163141514871980134417471130245934813426298722061609401157447785717043498304044442350226948742721018129546121136343435212976272224492251967397948305874727181131871277410314367591112963447324461879279346142006856499290054182961635094424137408243824084774259540911974337537021865127641001988280444104275104711662003591495925662721216149651107422125701583365344832723598356723482621123528915461779317940302347271017452800308623324176324790470167428954375174123115195931068022345449119283450256331121339914382125585514696228127843684746220524828726783214103548921098363389296233624802571387309377334961164157524051985288949486132613757167834201010436629432173152140464929832550412231989924106416945064874869195429343102325192739073000110437802729258075378339182812807181576360337463592932452032583301198371547922421939402920473595680119252411978722004370477715404337358130591845166932822904528436513302213451578435413854812062260627502531314444141883413220271341216245293823992713461365247952753357324042953202213126141943448304842113460415429853138331518263671154256231461784482738396841692192346084309281347727914289498489637181672285410966642564570457948433537854244618407424191118107212873893391945652694356344063328157739633904255123653953187239624382594298145885381703921965444132134753849817429535303300800290362043881061480738794191395510882692165117424141484142325902769432918814624023802451516962036467513231779933367491011631103110616628442403182411081367278304639253482411945464407332424632337415227951207113241874172068221738311113363811583417608169321892592349442740503898200246541554998874

MCLUST

> p.value<-apply(SimData,1,function(x) t.test(x[1:8],x[9:15],var.equal=T)$p.value)> > SigData<-SimData[p.value<0.05,]> dim(SigData)[1] 242 15> heatmap(SigData,labCol=ColLabels)>> BIC.emclust<-EMclust(SigData,1:10)> BIC.emclust

BIC: EEI1 -10599.4852 -9647.6453 -9685.8974 -9729.2395 -9796.1196 -9849.1097 -9912.6018 -9973.6459 -10037.43610 -10077.862

> plot(BIC.emclust)EEI "1" >

Determining the number of patterns

log(n) -)ˆ ,ˆ,...,ˆ ,ˆ,...,ˆ ;,...,( parametersG1G1n1 NpBIC xx μμ

Tum

or_6

Tum

or_2

Tum

or_3

Tum

or_1

Tum

or_7

Tum

or_8

Tum

or_4

Tum

or_5

Con

trol

_4

Con

trol

_2

Con

trol

_3

Con

trol

_7

Con

trol

_5

Con

trol

_1

Con

trol

_6

151161311742322932151426418267572198123522110024119133194522021351382071411222381895024085318413411323420417716775139918617026111150761053517637107169841721403413020821612510496147481399901681971528279162278718220494042889820217112122333153392795592810268118218301431981572092004710114424218719521016489431602374203181196193851031192183205123156558225921462372152551226233231831069345362111524691861086521344206229178239166701322249711277725691592284123011660115554154179786621621417318514521411719017512012817136119217149212718838201148109422129416515580236631143126118021113110912416319911019171126664615812722273137

1

11

1

1

1

1

1

11

2 4 6 8 10

-10

60

0-1

04

00

-10

20

0-1

00

00

-98

00

number of clusters

BIC

Summary

•The "weak filter" based on selecting "sub-significant" differentially expressed genes created artificial clusters

•When the whole dataset was used, the Bayesian information criteria did the right thing by estimating the correct number of clusters to be equal to one

•Take home message: When "filtering" before clustering make sure that appropriate statistical significance levels have been used

Using clustering to find "patterns" among differentially expressed genes

•Cluster analysis is preceded by a rigorous statistical analysis

•For example-identify genes that were "differentially" expressed on at least one experimental comparison. Among all these genes some will have similar behavior across all experimental conditions

•Clustering is a way of organizing behavior of differentially expressed genes across different experimental conditions

Using clustering to find "patterns" among differentially expressed genes

Up-Regulation

Weak Uniform Up-Regulation

Strong Uniform Up-Regulation

Weak Early Down-Regulation

Strong Uniform Down-Regulation

Down-Regulation

Strong LaterUp-Regulation

Using clustering to find "patterns" among all genes

•No filtering is performed

•You can perform the "quality filtering"

•Trying to identify statistically significant patterns

•Using the best available method becomes extremely important

Does It matter which clustering procedure we use?

SK

1 0h

–12

h

W30

3 0h

–12

h

Cel

l cyc

le 1

Cel

l cyc

le 2

SK

1 0h

–12

h

W30

3 0h

–12

h

Cel

l cyc

le 1

Cel

l cyc

le 2

SK

1 0

h –

12h

W30

3 0

h –

12h

Cel

l cyc

le 1

Ce

ll cy

cle

2

SK

1 0

h –

12h

W30

3 0

h –

12h

Cel

l cyc

le 1

Ce

ll cy

cle

2

Simple Commonly Used Method(Euclidian Distance Based Hierarchical Clustering)

"Complicated" Method(Context-specific Infinite Mixtures)

•5685 Yeast Genes Across Two Experiments (Cell Cycle and Sporulation)•NO VARIABILITY BASED FILTER•135 Genes with closest co-expression partners

"Objective" Performance Assessments Using KEGG as the Gold Standard

•Due to a large imbalance between the total number of negative and positive pairs:There are 17 times more negative pairs than positive pairs - a small FPR can still produce more false positive than true positives

Summary

•Using clustering alone, one can identify "significant" patterns of expression when using appropriate methodology

•For example, Yeast data clustered in this example did not have any replicates so the traditional analysis to identify differentially expressed genes before clustering is not feasable

•Statistical significance of resulting clusters needs to be carefully examined

Infinite Bayesian Mixtures

X

C M

r w

M=(1,…, K)

=(1,…, K)

=(1,…, K)

C=(c1,…,cN)

ci{1,…, K}

1-N

n ) jc ( ji,

ip

1-N

)cc ( jip

)σ, | ()σ,...,,σ,..., j,c| ( 2jjiK1K1ii Iμxμμx Nfp

K

1j

j)I(cjK1i

i π )π,...,π|c(p

)r,|()r,|( -1jj Iλμλμ Nfp

)2

βw,

2

β|σ()wβ,|σ( 2-2-

jj Gfp

)2

σ,

2

1|w()σ|w(

-22 xx Gfp

)2

1,

2

1|β()β( Gfp

)2

σ,

2

1|r()σ|r(

22 xx Gfp

)σ,|()σ,( 22 Iμλμ|λ xxxx Nfp

)K

α,...,

K

α|π,...,π( )Kα,|π,...,π( K1K1 Dfp

Conditional posterior distributions and Gibbs Sampler

I)μxμxc-2jji

ji,2jjiii σ, | (

1-T

nb )σ, , , | jc ( Nfp

2jj

1-2jj

2jji

2iiji dσd r,σ,( σ, | (

α1-T

αb)σ, , , | ij ,cc ( μ)λ|μI)μxμxc xx-

pfp N

)

n

σr

n

σr

,

n

σr

n

σr

| ( )r, , ,σ ,| (

j

2j1

j

2j1

j

2j1

j

2j

j1

j2jj I

λx

μλXcμ

Nfp )2

βw,

2

βMn | σ ( )wβ,, | σ (

2jj2-

j2-

j

sfp GMX,

)

Q

Q

,

Q

Q

r

| ( )r,,..., | (1-

2

1-2

1-2

1-i

i2

Q1 I

μ

μ

λμμλ

x

x

x

xx

Nff)

2

σ)-)`(-(

,2

1MQ |r ( ),..., |r (

2

iii

Q1

xλμλμ

λ,μμ

Gff

)2

σσβ

, 2

1Qβ | w( )β,σ,...,σ | w(

j

-2-2j

2-Q

2-1

x

Gff

j

-2j2

β2-

j

1-2

3)-(Qβ2-

Q2-

1 2

)wβσ-exp )(wσ

2

βexp)

2

β)(

2

β( )w,σ,...,σ | β ( f

Gibbs Sampler Result

in-burnn1n1nk,k,1 kk;),...,c(c~)cc( for xx|,...,,..., p

Sequence: (ck,1,….,ck,n), k=1,…,kmax

such that

•Posterior distribution summarized through

“posterior pairwise probabilities of co-

expression” p(ci=cj|X)

Properties•“Pooling” information from the whole dataset by estimating both “patterns” and “assignments” – similar to K-means (K-means is actually equivalent to a special case of the mixture models with known number of clusters)•Does not require specification of the right number of clusters (unlike K-means)•Gives direct estimates of statistical significance (unlike anything else on the market)•Instead of lamenting which distance measure to use – focus on the appropriate statistical model which is a well-defined problem•Works for any type of data

Finding important functional groups for up-regulated genes

Using the "Ease" annotation tool http://david.niaid.nih.gov/david/

We obtained following significant gene ontologiesUp_DexANDNE2ANDirr_381_GO.htm

Homework:1) Download and install Ease2) Select top 20 most-signficianly up-regulated genes in our W-C dataset and identify significantly over-represented categories (using the three-way ANOVA analysis)3) Repeat the analysis with 30, 40, 50 and 100 up-regulated and down-regulated gene4) Prepare questions for the next class regarding problems you run into