June 2016 - Zuogong

transcript

Statistical inference of regulatory networks for circadian regulation

— Methodology Part

Zuogong Yue

Pizzaclub, 15th June 2016

Authors: Andrej Aderhold, Dirk Huskier, Marco Grzegorczyk

o Mathematical formulation of transcriptional regulation1

Problem Formulation

1 Barenco, M., Tomescu, D., Brewer, D., Callard, R., Stark, J., & Hubank, M. (2006). Ranked prediction of p53 targets using hidden variable dynamic modeling. Genome Biology, 7(3), R25.

o Regulatory networks (bipartite structure)

Methods

o Graphical Gaussian Models (GGM)

The components corresponding to two genes are stochastically independent conditional on the remaining system

if and only if the corresponding element in the inverse covariance matrix is zero.

Methods

o Sparse Regression (LASSO and Elastic Net)

(LASSO)

(Elastic Net)

Methods

o Time-varying Sparse Regression (Tesla)

Methods

o Hierarchical Bayesian Regression Model (HBR)

linear regression model:

prior:

then getting the posterior:

and the marginal likelihood:

Methods

o Hierarchical Bayesian Regression Model (HBR) (cont.)

Finally we get the marginal posterior distribution on

Maximizing the above posterior by Markov chain Monte Carlo (MCMC)

Methods

o Non-homogeneous Hierarchical Bayesian Model

Applying HBR on a multiple change-point process:

Divide the target variable into sub vectors

Methods

o Automatic Relevance Determination (ARD) - Sparse Bayesian Regression (SBR)

Using the prior distribution: (choosing appropriate hyper parameters can lead to sparse solutions):

The marginal likelihood:

Maximize the marginal likelihood by Expectation Maximization (EM) method

Methods

o Bayesian Spline Autoregression (BSA)

The original covariates are augmented with B-spline basis functions.

To encourage network sparsity, a slab-and-stick-like Bayesian variable selection scheme2 is used.

2 Smith, M., & Kohn, R. (1996). Nonparametric regression using Bayesian variable selection. Journal of Econometrics, 75(2), 317–343.

Methods

o Gaussian Processes (GP)

where is the well-known kernel function.

Calculate the posterior:

Marginalize and perform maximization:

Methods

o Mutual Information Methods (ARACNE)

The mutual information (MI) is given by

A pruning mechanism by Margolin (2006):

Methods

o Mixture Bayesian Network Models (MBN)

Representing as a Gaussian mixture model (GMM):

Maximize the likelihood of the conditional GMM:

Methods

o Gaussian Bayesian Network (BGe)

Calculate the posterior distribution of and perform maximization

Assume that

Impose a normal-Wishart prior:

Thank you!

June 2016 - Zuogong

Science