Verification and Testing Group, DCS, University of Sheffield Language based approach in biological...

Post on 15-Jan-2016

212 views 0 download

Tags:

transcript

Verification and Testing Group, DCS, University of Sheffield

Language based approach in biological modelling

Marian Gheorghe

University of Sheffield

MIPNETS Liverpool06/2003

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

Summary

Formal languages and biology

L systems

DNA sequences

Membrane computing; X machines

Molecular X machines

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

Formal languages/linguistics and biological models

FL/Linguistics and biology modern era started in 1950s

Both benefited from a mathematical approach

Around 30 years of almost independent development

1980s – Chomsky-like approach to molecular biology

Later – DNA computing, aqueous computing, membrane computing …

Verification and Testing Group, DCS, University of Sheffield

Mipnets Liverpool 06/03

L systems introduced as a model of development of simple multicellular organisms, such as blue-green bacteria Anabaena catenula.

axiom or initial set of elements

set of rewriting rules

yields a language

A Lindenmayer

L systems

Simulated model

Verification and Testing Group, DCS, University of Sheffield

Mipnets Liverpool 06/03

Comparison between a microscope picture of a fern gametophyte Microsorium linguaeforme (left) and a simulated model using L systems (right).

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

LS = (Vocabulary, Axioms, Rules)Rewriting rules are applied in parallel to all occurrences

Axiom: aRules: a → aba

a; aba;abababa; abababababababa …

Definition. Example

Verification and Testing Group, DCS, University of Sheffield

Mipnets Liverpool 06/03

Axiom: A

Rules: A → F[+A][-A]FA F → FF

Graphics

http://www.cpsc.ucalgary.ca/Research/bmv/lstudio/whatis.html

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

Mipnets Liverpool 06/03

The language of genes

Verification and Testing Group, DCS, University of Sheffield

Formal languages applied to biological sequence analysis

Biologically inspired linguistic formalism extensions

Mipnets Liverpool 06/03

Language based DNA modelling

Verification and Testing Group, DCS, University of Sheffield

Use of Chomsky grammars to model structure & interactions of biological macromolecules

Intramolecular and intermolecular interactions

DNA = sequences (strings) of basic nucleotides

adenine thymineguanine cytosine

x and y are complementary elements on a DNA sequence

Mipnets Liverpool 06/03

Definition

Verification and Testing Group, DCS, University of Sheffield

Alphabet: ΣDNA = {a, g, t, c}; a=t’, g=c’

Ideal DNA sequence entails pairing between nucleotide basis of ΣDNA

Let w = agtgc then u=gcact (the reversed complement)=w‘R

a g t g c

t c a c g

Mipnets Liverpool 06/03

The language definition

Verification and Testing Group, DCS, University of Sheffield

The language contains words wu, i.e. ww’R

A context-free grammar G: S → bSb’|ε, where b is any element of ΣDNA

b’ its complement and ε is the empty word – generates the language

agtgcgcact is obtained as

S=>aSt=>agSct=>agtSact=>agtgScact=> agtgcgcact

Mipnets Liverpool 06/03

The intramolecular language of genes

Verification and Testing Group, DCS, University of Sheffield

Previous language is linear (between regular and context-free)

Realistic stem-loop patterns might containi) unpaired elements and ii) arbitrarily folded branches

(i) obtained by adding to G: S → bSb’|ε, rules S → A and A → bA| ε

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

Mipnets Liverpool 06/03

…folded branches

Verification and Testing Group, DCS, University of Sheffield

a

g

t

cat t

a a

a

g

t

c

t

ag

tc

Mipnets Liverpool 06/03

Derivation-like

Verification and Testing Group, DCS, University of Sheffield

a

g

t

cat t

a a

a

g

t

c

t

ag

tc

S

SS S

S

S

SS

S

S

S S

S

S

Mipnets Liverpool 06/03

The formal grammar

Verification and Testing Group, DCS, University of Sheffield

The initial grammar rules of G: S → bSb’|ε and the new ruleS → SS

The derivation and derivation tree are used to model secondary structures of biological macromolecules

Mipnets Liverpool 06/03

The grammars of intermolecular structure

Verification and Testing Group, DCS, University of Sheffield

Restriction enzymes cut DNA sequences at specific substrings

Enzyme MboI cuts just before gatc

Cut language: let w1δw2… δwn then the language contains sets {w1,w2,…wn}

Recombinant behaviour of DNA molecules (splicing systems)

Mipnets Liverpool 06/03

Language based models of cell

Verification and Testing Group, DCS, University of Sheffield

Cell: complex body containing compartments delimited by membranes; inside of each region: ions, DNA molecules

Cell behaviour: interactions, transfer – biochemical rules

Membrane roles: help compartmentalize, regulate transport

Mipnets Liverpool 06/03

Membrane characteristics

Verification and Testing Group, DCS, University of Sheffield

Bi-layer structure

Two sides have different electrical charges

Trans-membrane transfer: passive or active

Communication channels

Mipnets Liverpool 06/03

Cell model (membrane computing)

Verification and Testing Group, DCS, University of Sheffield

A hierarchical arrangement

Each membrane delimits a region

Each region contains a multiset of elements (simple molecules, DNA sequences…)

The elements evolve in time according to some (rewriting/combination) rules specific to each region or may be moved across the membranes

The rules may also dissolve/create membranes

Mipnets Liverpool 06/03

A computation in a membrane system

Verification and Testing Group, DCS, University of Sheffield

Initial configuration: multisets of initial elements inside of regions

Current configuration: the rules are applied in parallel in each region to the elements obtained in the previous configuration

The result is not a set of words like in usual language based approaches, but a set of multisets

Mipnets Liverpool 06/03

The rules

Verification and Testing Group, DCS, University of Sheffield

Rewriting/interaction rules but applied to multisets (interactions inside a region)

Communication rules – membrane crossing

Rewriting/interaction rules: catalysts, inhibitors, priorities

Communication rules: different electrical charges, passive (direct) vs active (mediated);symport/antiport…

Mipnets Liverpool 06/03

Outcomes

Verification and Testing Group, DCS, University of Sheffield

Computing competence

Efficiency (SAT, HPP problems in polynomial time)

Decidability

State machine with input, memory and output sets - and basic processing functions

Verification and Testing Group, DCS, University of Sheffield

k

1… k … n

1 … k

Memorym m’

h-1 (m”,k-1)= (k-1 ,m) ; k (m ,k)= (k ,m’ ) ; m0 – initial memory

k-1

k

Mipnets Liverpool 06/03

X (Eilenberg) machines

Mipnets Liverpool 06/03

Molecular X machines

Verification and Testing Group, DCS, University of Sheffield

Computationally complete

Finite state based with input/output streams

Structured hierarchically organized memory

Provide in every state specific sets of rules acting in parallelin various parts of the memory

Verification and Testing Group, DCS, University of Sheffield

(Ri,1,…, Ri,m)(Rj,1,…, Rj,m)

Variant 1:

structured memory

distributed rules

Mipnets Liverpool 06/03

Mipnets Liverpool 06/03

Verification and Testing Group, DCS, University of Sheffield

Variant 2:

set of machines

derived components

Mipnets Liverpool 06/03

Application

Verification and Testing Group, DCS, University of Sheffield

Behaviour of ant colonies (Monomorium pharaonis):Pheromone deposition rate; trail pheromone volatility; attraction to trail; population size

Different individual behaviour

Mipnets Liverpool 06/03

Conclusions

Verification and Testing Group, DCS, University of Sheffield

Formal grammars used to model general forms of inter/intra-molecular structure

New approaches, concepts, models

Biological relevance

Mipnets Liverpool 06/03

Links

Verification and Testing Group, DCS, University of Sheffield

Molecular X machineshttp://www.dcs.shef.ac.uk/~bernardf/molxm/index.htm

Membrane computinghttp://psystems.disco.unimib.it/

DNA computinghttp://www.liacs.nl/home/pier/webPagesDNA/