June 2016 - Zuogong

Post on 11-Apr-2017

51 views 0 download

transcript

Statistical inference of regulatory networks for circadian regulation

— Methodology Part

Zuogong Yue

Pizzaclub, 15th June 2016

Authors: Andrej Aderhold, Dirk Huskier, Marco Grzegorczyk

o Mathematical formulation of transcriptional regulation1

Problem Formulation

2

1 Barenco, M., Tomescu, D., Brewer, D., Callard, R., Stark, J., & Hubank, M. (2006). Ranked prediction of p53 targets using hidden variable dynamic modeling. Genome Biology, 7(3), R25.

o Regulatory networks (bipartite structure)

yg1

yg2

yg3

xg1

xg2

xg3

xg4

xg5

Methods

3

o Graphical Gaussian Models (GGM)

The components corresponding to two genes are stochastically independent conditional on the remaining system

if and only if the corresponding element in the inverse covariance matrix is zero.

Methods

4

o Sparse Regression (LASSO and Elastic Net)

(LASSO)

(Elastic Net)

Methods

5

o Time-varying Sparse Regression (Tesla)

Methods

6

o Hierarchical Bayesian Regression Model (HBR)

linear regression model:

prior:

then getting the posterior:

and the marginal likelihood:

Methods

7

o Hierarchical Bayesian Regression Model (HBR) (cont.)

Finally we get the marginal posterior distribution on

Maximizing the above posterior by Markov chain Monte Carlo (MCMC)

and

Methods

8

o Non-homogeneous Hierarchical Bayesian Model

Applying HBR on a multiple change-point process:

Divide the target variable into sub vectors

Methods

9

o Automatic Relevance Determination (ARD) - Sparse Bayesian Regression (SBR)

Using the prior distribution: (choosing appropriate hyper parameters can lead to sparse solutions):

The marginal likelihood:

Maximize the marginal likelihood by Expectation Maximization (EM) method

Methods

10

o Bayesian Spline Autoregression (BSA)

The original covariates are augmented with B-spline basis functions.

To encourage network sparsity, a slab-and-stick-like Bayesian variable selection scheme2 is used.

2 Smith, M., & Kohn, R. (1996). Nonparametric regression using Bayesian variable selection. Journal of Econometrics, 75(2), 317–343.

Methods

11

o Gaussian Processes (GP)

where is the well-known kernel function.

Calculate the posterior:

Marginalize and perform maximization:

Methods

12

o Mutual Information Methods (ARACNE)

The mutual information (MI) is given by

A pruning mechanism by Margolin (2006):

Methods

13

o Mixture Bayesian Network Models (MBN)

Representing as a Gaussian mixture model (GMM):

Maximize the likelihood of the conditional GMM:

Methods

14

o Gaussian Bayesian Network (BGe)

Calculate the posterior distribution of and perform maximization

Assume that

Impose a normal-Wishart prior:

15

Thank you!