Date post: | 09-Apr-2017 |
Category: |
Engineering |
Upload: | negar-hashemian |
View: | 155 times |
Download: | 2 times |
preprint version of published journal: N. Hashemian and A. Armaou, Simulation, model-reduction and
state estimation of a two-component coagulation process, AIChE J., 2016, 62, pp 1557-1567,
DOI:10.1002/aic.15146
Simulation, Model-Reduction and State Estimation of
a Two-Component Coagulation Process
Negar Hashemian and Antonios Armaou
Dept. of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802
Abstract
We address the issue of state estimation of an aggregation process through (i) using model
reduction to obtain a tractable approximation of the governing dynamics, and (ii) designing a
fast moving-horizon estimator for the reduced-order model. We first use the method of moments
to reduce the governing integro-differential equation down to a nonlinear ordinary differential
equation (ODE). This reduced-order model is then simulated for both batch and continuous
processes and the results are shown to agree with constant Number Monte Carlo (cNMC) sim-
ulation results of the original model. Next, the states of the reduced order model are estimated
in a Moving Horizon Estimation (MHE) approach. For this purpose we first employ Carleman
linearization and represent the nonlinear system in a bilinear form. This representation lessens
the computation burden of the estimation problem by allowing for analytical solution of the
state variables as well as sensitivities with respect to decision variables.
INTRODUCTION
In chemical engineering, material science and biology there are a multitude of processes character-
ized by dispersed phenomena. Systems involving such processes merit a particle population study
Correspondence concerning this article should be addressed to A. Armaou at [email protected]
rather than the traditional mass balance performed for continuous media. There are numerous
examples of these processes ranging from industrial applications to biological reactions. Crys-
tallization, polymerization, viral infections, and collocalization of enzymes in cells are just a few
instances of these processes that occur in our everyday life and motivate the work in this article.
The common characteristic of all these processes are the individual members of the population or
the particles. These particles are distinguished by their type, size and/or composition. Mostly, a
population balance governs the dynamic behavior of particulate systems. This results in complex
mathematical models, specifically for systems in which the particles are characterized by two or
more properties. The simplest system, in this area, is a two-component aggregation system with no
chemical reactions. This system is mostly used in pharmaceutical applications where through use
of a solvent called excipient the particles in a drug powder adhere together and form granules. In an
ideal granulation process the components, solvent and solute, are well mixed and the distribution of
the solute mass is uniform. The deviation from the mean of the solute mass in aggregates quantifies
the granulation quality which is referred to as blending degree. However, this output property is
not easily measurable during evolution of the process. This article focuses on these bi-component
granulation processes and estimation of the distribution of components in the product.
The population balance equation for granulation systems determines the dynamics of a bivariate
distribution function in form of an integro-differential equation. The variables of this distribution
function are size and composition of the aggregates. However, the weighting function in calculating
the integrals called coagulation kernel is mostly a function of particle size and independent of
composition. Solving this equation has been subject of many research efforts in the literature .1–4
For a general kernel, there is no analytical solution to the population balance equation. Matsoukas
et al. proposed the constant-Number Monte Carlo (cNMC) algorithm to simulate these dynamics
in both batch processes and semi-batch processes where there is a continuous inflow of binder.5,6
In the cNMC method, there should always be a constant number of particles in the “simulation
box”. Therefore, the box expands/contracts as the population decreases/increases to maintain the
same number of particles. Smith and Matsoukas showed this method is more accurate and faster
than the traditional Monte Carlo algorithms where a finite number of particles are considered in
2
a fixed-volume simulation box.7 More specifically, cNMC method reduces the decay of accuracy
in comparison to the traditional Monte Carlo simulation method, from square-root dependence to
logarithmic dependence on the number of coagulation events. In this article, we extend the use of
cNMC algorithm to continuous processes and utilize it for simulation of a bi-component coagulation
system.
Although extremely useful, stochastic simulations are computationally cumbersome. Subse-
quently, stochastic modeling is sometimes impractical, specially in the applications demanding high
computational costs such as state estimation, optimization, and process control. One solution is to
derive and use deterministic models that instead of microscopic behavior of the individual elements
focus on bulk statistics of the stochastic process under study. The method of moments is a powerful
technique in this matter. It can be used to transform the integro-differential governing coagula-
tion processes into a set of ODE. In this method, the closure of the obtained ODE set is critical.
Although, this closure issue is nonexistent in the examples with constant or additive coagulation
kernels, it introduces a challenge in more complicated examples such as the case of a Brownian
kernel. Direct quadrature method of moments is an efficient technique for single-component coag-
ulation simulations.8,9 However, Marshal el al. showed this method is not as accurate as cNMC
simulation for bi-component coagulation processes.10 Yu el al. employed Taylor expansions to de-
rive a closed finite dimensional ODE system that models a single-component coagulation process.11
The accuracy of the system improves with increasing the Taylor expansion’s order, which is equal
to the number of moments considered in the model. In this manuscript, we exploit the idea in Yu
el al.’s work but for bi-component coagulation processes. This results in a tractable ODE model
of the process. The accuracy of this reduced-order model is verified in a simulation study, when it
is compared against stochastic simulation results of the process. The verified model is finally used
for online estimation of the internal dynamics of the process, which is the second objective of this
work.
In the reduced-order model obtained from application of the method of moments, the states are
statistical moments of the particles’ distribution. Knowledge of these states yields comprehensive
information on different output properties, such as the blending degree. This information is also
3
very critical for process control purposes. The scientific literature is abundant with numerous
observer design techniques for estimation of the unknown state variables of a dynamic system using
measurements of the input and output trajectories. Moving Horizon Estimation (MHE) is one
of these design techniques that is specifically powerful in handling unknown disturbance signals
acting on the system or unknown noise signals corrupting the output measurements.14–18 MHE
uses the past and current output measurements and defines an error cost function over a moving
time window. It then finds the most probable realization of the state variables at the current time
by minimizing a cost function. Although this method has many advantages over other alternative
methods, it is computationally demanding. In this article, in order to deal with this issue, we use
a modified MHE design proposed in a previous work by the authors.23 In this modified design,
first, Carleman linearization is used to represent nonlinear systems in a bilinear form. Not only
we can reduce computational cost by analytically solving for the response of the bilinear system,
but also the KKT matrix of the optimization problem is found analytically which decreases the
computational complexity even further. This design framework is employed for the coagulation
process. Lastly, the designed estimator is shown to closely track the evolution of the statistical
moments of the particles using only noisy measurements of the particle population in the exit flow.
It is worth mentioning that there are many observer and controller design methods in the
literature for particulate processes.24,26–29 For example Kalani and Cristofides modeled a Titania
aerosol reactor in which a combination of a chemical reaction, coagulation and convective transport
happens. In their model, the collision kernel includes a Brownian and a turbulent shear collision
frequency function. Then, they used method of moments to control this process. However, because
of the Brownian kernel, the fractional moments appear in the ODEs. To address this issue, they
assumed the aerosol particles have a lognormal distribution.24 Hulburt and Katz, in another
work, used Laguerre polynomial expansions to obtain closed ODEs from the infinite set of moment
equations.25 Chiu and Christofide also employed this idea to approximate a crystallizer’s PDE
model by a finite ODE set. They next employed the reduced model to maximize the mean size
of the particles.26 Mesbah et al. used method of moments in a batch polymorphic crystallization
process and reduced the integro-differential model to a nonlinear differential algebraic equation
4
set.29 Then, they designed a stochastic model predictive controller to maximize the desired product
while meeting some probabilistic constraints. However, to the best knowledge of the authors there
has been no work on the estimation and control of bi-component coagulation processes.
In this manuscript, we describe and mathematically represent the coagulation process. Next, the
cNMC algorithm and the method of moments are explained. The article reviews and implements the
modified MHE design technique on the coagulation process. The last section starts with comparing
the simulation results obtained by cNMC and method of moments in both batch and continuous
processes. This section concludes with estimating the critical moments in the continuous process
in the presence of noise in the feed flow concentration and the output measurement.
Multi-Component Mass Balance
Each particle in a binary-component aggregation process is composed of two components: the
solute (drug) and the solvent (excipient). The total mass of a particle is denoted by p which is
also representative of the particle’s size. Moreover, s denotes the mass of the solute content in the
particle. The ratio of s to p is indicative of the particle’s composition. The probability distribution
function f(p, s) describes the population distribution of the particles as a function of their mass and
their solute content. In other words, the number of particles with a total mass within the differential
range [p− dp2 , p+ dp
2 ] and a solute mass in the range [s− ds2 , s+ ds
2 ] is given by f(p, s)dpds. For
simplicity, we combine the total and solute mass variables in an augmented vector, r = (p, s). Also,
hereafter, we refer to this augmented vector as characteristic size, or size in short.
During the coagulation process, the particles may collide and form a larger particle. The
aggregation kernel, k12 = k(r1, r2) is a function reflecting the dynamic behavior of particles in
the system. More specifically, it determines the rate of formation of particles of size r1 + r2 from
collision of particles of size r1 and r2. Conservation of particle and solute mass leads the mass
balance formulation in.5 This formulation considers an infinitesimal range of characteristic particle
size characterized by [r1 − dr12 , r1 + dr1
2 ]. As Fig. 1 suggests particles may appear in or disappear
from this range because of the following two reasons:
5
1. The particle population in the range increases when smaller particles stick together and form
a particle in this range. Say one of the particles has a size r2 < r1, then if it collides into a
particle of size r1 − r2 > 0 the result would be a new particle in the range considered. This
particle formation happens with a rate equal to k(r1 − r2, r2).
2. The particles within the range may form bigger particles when colliding with an arbitrary
particle of size r2 with a rate of k(r1, r2).
Taking the above two phenomena into account yields the net rate of change of the aggregate
population in the range as follows.
∂f(r1)
∂t=
1
2
r1∫0
k(r1 − r2, r2)f(r1 − r2) f(r2)dr2 −∞∫
0
k(r1, r2)f(r1)f(r2)dr2
(1)
It is important to note that for any particle the total mass is always greater than the mass of solute.
Therefore,
f(s, p) = 0 for s > p
If dependence of the distribution function f(.) on the composition of particles is ignored, then
the integro-differential equation obtained above simplifies to the so called Smoluchowski coagulation
equation. When the coagulation kernel is either constant or additive, the integro-differential equa-
tion can be analytically solved. However, the nature of the particle agglomeration process is more
complicated and the real life examples are modeled by more complex kernels such as Brownian.
For these complicated kernels, there is no explicit analytical solution for the distribution function.
This article next addresses this issue following two different approaches. Specifically, we employ a
cNMC algorithm and method of moments to simulate bi-component coagulation processes in batch
and continuous mode.
6
Constant-Number Monte Carlo Algorithm
This section utilizes a cNMC algorithm to numerically simulate the evolutions in a bi-component
agglomeration process. Constant-Number Monte Carlo shows two advantages over the traditional
Monte Carlo approaches: First, it is faster; second, the simulation accuracy decays at a much lower
rate.7 Figure 2, schematically shows how this algorithm works. The simulation box always includes
a fixed number of particles. As a result, when dilution or thickening occurs, the volume of the box
increases or decreases, respectively. This section, first explains cNMC as was originally presented
in5 for a batch process. Then, we modify this algorithm and extend its use to continuous processes.
Batch Process
As explained before, the simulation box considers a constant number of particles denoted by N.
Each of these particles is characterized by a size vector r. For any pair of particles, i and j,
there exists a kernel, ki,j , expressing their chance of collision and forming a larger particle. The
maximum and average of these kernel functions over all possible pairs of particles are denoted by
kmax and kave, respectively. At every sampling time we randomly pick two particles. The ratio
of the corresponding kernel to kave shows the agglomeration probability of the selected pair of
particles. We then generate a random number ρ ∈ (0, 1) as a metric for acceptance or rejection
of the agglomeration of the candidate pair. When ρ >ki,jkave
, the pair is rejected and the random
pair selection is repeated until the acceptance criterion is met. In the case, ρ ≤ ki,jkave
, the pair is
accepted. In order to systematically update the population with the accepted pair, we reassign the
properties of the ith particle as follows.
ri + rj → ri
Because after an agglomeration event the number of particles per unit volume decreases, next we
need to introduce a new particle into the simulation box. To do so, we randomly select an existing
particle in the box and copy its properties to redefine the jth particle. This addition of a new
7
particle affects the total volume of the box in the following manner:
ptotalptotal + pj
V → V (2)
where ptotal is the total mass of the particles before the update, and pj is the total mass of the new
jth particle after the update.
For a general Poisson process, the mean time elapsed before occurrence of the next random
event is given by:30
∆t =1
V∑ki,j
(3)
where V is the volume of the simulation box. Then, specifically for this aggregation process, the
elapsed time for a large enough N is approximately equal to:
∆t ≈ 1
V(n2
)kave
≈ 2
N2V kave(4)
Continuous Process
This article modifies the simulation technique described above to be used in a continuous process.
As opposed to the batch process, here we consider three possible events: (i) introduction of new
particles to the system as a feed, (ii) coagulation of particles as before, and (iii) particles exiting
the system as an output flow. At every simulation step, the stochastic simulation algorithm first
needs to determine which of these events occurs and how long it takes.
The events occur with different probabilities:
Pevent =Revent
Rin +Rcoag +Rout, event ∈ {in, coag, out} (5)
where P and R denote the probability and the rate of a particular event, respectively. The occur-
rence rates of all the events are listed in Table 1.
A random variable ρ with a uniform distribution over [0, 1] is used to determine which event
happens at any simulation step. Also, similar to the batch simulation, the average time elapsed
8
before the occurrence of the next event is obtained from Eq. (3):
∆t =1
Rin +Rcoag +Rout(6)
After the random selection of the event, the algorithm needs to simulate occurrence of the
selected event and update the particle population accordingly. Since simulation of the coagulation
event is the same as before, we focus on explaining the input and output flows and how to simulate
their effects on the particle population. For this purpose, let n represent the number of particles
that enter/leave the reactor during any simulation step.
The input flow has a particle distribution that is potentially different from the mixture in the
reactor. Define Cf to be the number of particles in the input flow and α to be the input/output
flow rate, where both the quantities are normalized by the reactor’s volume. Now, when n particles
enter the reactor, to maintain the number of particles constant, we randomly remove n particles
from the simulation box and replace them by the new entering particles. As a result, the volume
of the simulation box shrinks according to the following expression:
ptotal + pin − pdptotal + pin
(V +n
Cf)→ V (7)
where ptotal, pin and pd are the total mass of particles before the update, the total mass entered to
the box, and the mass of the particles removed from the box, respectively.
Now consider the output flow of the reactor. Since we assume that the reactor is well-mixed,
properties of output flow is the same as the reactor’s content. Hence, there is no change in the
particle distribution of particles in the box in this event. Consequently, the volume of the simulation
does not change.
Table 1 gives the occurrence rate and the corresponding volume change for each event.
9
Model Reduction by Method of Moments
As discussed before, the original integro-differential equation does not have an explicit analytical
solution. Monte Carlo simulation is a reliable approach to study the dynamical behavior of the
particles, but it is computationally demanding. This section seeks for an alternative and more
efficient solution by relying on model reduction techniques. More specifically, applying method
of moments to Eq. (1), we derive an ODE system that describes the dynamics the population’s
probabilistic moments.
To accomplish this goal, we first define mixed moment variables as follows:
Mij =
∫ ∞0
∫ p
0pisjf(p, s) dpds (8)
Therefore M00,M10, and M01 represent the total number of particles, the total mass, and the total
solute mass in a unit of volume, respectively. In a batch process, M10 and M01 have constant
values. However, in a continuous process because of the input and output streams these moments
may change in time.
To obtain rate of change of Mij with i, j ∈ Z+, Eq. (1) is multiplied by pisj and then integrated
over the region used in the definition of moments in Eq. (8). Matsoukas et al.5 show that after
some algebra and simplifications this results in the following equation:
d[Mij ]
dt=
1
2
∞∫0
∞∫0
drdr′[A(r + r′)−A(r)−A(r′)]f(r)f(r′)k(r, r′) (9)
where the function A(r) = pisj .
Here, we assume the aggregation kernel between particles is Brownian and calculated as:
k12 = k0 (2 + (p1
p2)13 + (
p2
p1)13 ) (10)
where k0 is a constant coefficient. Incorporating the Brownian kernel and using the moments’ def-
inition, Eq. (9) determines the dynamics of the population’s probabilistic moments. Theoretically,
10
all the moments are needed to recover the exact particle distribution. But, for practicality reasons,
we restrict i, j to the index set I = {0, 1, ...,m − 1}. This results in a total of m2 states which
construct the state vector x of the reduced-order model.
x = {Mij |i, j ∈ I}
In calculation of the time derivative of Mij , the left-hand side of Eq. (9) depends on some mixed
moments Mk,j with fractional indices.
x = F (x,Mkj) (11)
where k ∈ Q+ and j ∈ I. This expression of the system, however, is an approximation of the actual
dynamics and becomes more accurate as more probabilistic moments are considered. We deal with
the fractional moment Mkj , by utilizing a truncated Taylor expansion of pk about a desired point
p = p:
pk =
∞∑l=0
(k
l
)pk−l(p− p)l (12)
where
(k
l
)=
k∏n=1
l − n+ 1
n. It is more convenient to expand the system around the average mass
of the particles i.e. p = M10M00
.
By applying the first m terms of Eq. (12) in (8), the moments with fractional order can be
approximated by a function of x. Here, we use the second order Taylor expansion:
Mkj ≈M0j pk(1 +
k2
2− 3k
2) +M1j p
k−1(2k − k2) +M2j pk−2k
2 − k2
(13)
By increasing the order of Taylor expansion, the approximation in (13) becomes more accurate.
However, it requires the geometric increase of the number of states in the dynamic system (11)
which makes the computations more complex.
Moreover, note that for the continuous process the input and output streams should be consid-
11
ered in the population balance equation:
∂f(r1)
∂t=
1
2
r1∫0
k(r1 − r2, r2)f(r1 − r2) f(r2)dr2 −∞∫
0
k(r1, r2)f(r1)f(r2)dr2
+ α[fin(r1)− f(r1)]
(14)
where function fin gives the distribution of particles in the input stream.
We can close these ODE sets at any desired approximation level of the original infinite dimen-
sional ODEs, denoted by m. Appendix gives a closed ODE system for the batch and continuous
system employing the approximation. This approximation considers the first nine moments, Mi,j
where i, j ∈ {0, 1, 2}. However, since M10 and M01 moments are constant in the batch process,
the states reduce to 7. The simulation results in the next section, indicate the number of moments
with integer indices employed to reduce the system suffice for the underlying coagulation process.
In what follows, we will discuss how to use this derived ODE system to design an estimator for the
output properties. Also, the next section discusses about the moments estimation in the presence
of noise in the feed flow stream.
System State Estimation
This section implements the MHE approach to estimate the unmeasurable output properties. In
particle processes, one of the important output properties is the total variance of excess solute. Let
the average solute per particle mass be denoted by φ = M01M10
. So, the squared excess solute in a
particle is:
ε(r) = (p− φs)2 (15)
and the sum squared of excess is:
ε =
∫ ∞0
ε(r)dr = M02 − 2φM11 +M20 (16)
12
Therefore ε can be obtained by state estimation of the dynamic system. The system derived in the
previous section is presented in the compact form as follows:
x = F (x) +Dw
y = x1 + ν(17)
where w ∈ Rn and ν ∈ Rp are noise signals on the system dynamics and output measurements.
Moreover, D has an appropriate dimension. To simplify the computations, we transform the states
so that the desired nominal operating point denoted by x0 and the corresponding noise-free output
is at the origin.
MHE uses the history of the output trajectory, to update the states estimated in the past
and also obtain the most probable values for the current states. To bound the dimension of the
optimization problem, MHE defines an objective function over a moving window:
J(x0, {wi}Ni=1) = ‖x0 − x0‖2Π +
N∑i=1
(‖νi‖2Q + ‖wi‖2R)
Subject to:
wi ∈W, xi ∈ R+ and νi ∈ V
(18)
where νi = yi − yi, ‖z‖2M = zTMz; Q,R, and Π are tuning parameters with a symmetric positive
definite structure; and W and V are compact sets. The moving time window includes the N recent
unknown state vectors. This window moves one step forward at each sampling time. We assume the
noise wi and νi remain constant over the time interval (ti, ti+1]. Also, an uncertain prior knowledge
about the initial point x0 is available, which we denote by x0. We use the estimated state x1 at
t− 1 as the prior knowledge, namely x0. This estimated state would be out of the moving window
at the next sampling time, t.
The nonlinear dynamic optimization in (18) makes MHE slow. In oder to accelerate the opti-
mization computations, the following section employs the Carleman linearization and approximates
the nonlinear system given by Eq. (17) in a bilinear form.
13
Carleman Linearization
Using Taylor expansion the nonlinear system in Eq. (17) can be represented in a polynomial form:
x =
∞∑k=0
Akx[k] +Nw
y = x1 + ν
(19)
where Ak = 1k!∂kf∂xk|x=0, and x[k] denotes the kth Kronecker product calculated recursively from
x[k] = x ⊗ x[k−1] and x[0] = 1. Carleman linearization method truncates this expansion at the ηth
term and also defines an augmented state vector as follows:
x⊗ =
[xT x[2]T · · · x[η]T
]T(20)
This notation helps us present the approximated system in a compact bilinear form:23
x⊗ 'Ax⊗ +Ar +n∑l=1
[Dlx⊗wl +Dl0wl]
y 'Cx⊗ + ν
(21)
where A,Ar, Dl and Dl,0 matrices have the following structure:
A =
a1,1 a1,2 · · · a1,η
a2,0 a2,1 · · · a2,η−1
0 a3,0 · · · a3,η−2
......
. . ....
0 0 · · · aη,1
, Ar =
a1,0
0
0
...
0
14
Dl =
0 0 · · · 0 0
dl,2 0 · · · 0 0
0 dl,3 · · · 0 0
......
. . ....
...
0 0 ... dl,η 0
, Dl0 =
dl,1
0
0
...
0
ai,k =i−1∑α=0
I [α]n ⊗Ak ⊗ I [i−1−α]
n
dl,i =
i−1∑α=0
I [α]n ⊗Nl ⊗ I [i−1−α]
n
and Nl is the lth row of matrix N .
Note that with the current structure of x⊗, some states are repetitive. This causes an unneces-
sary increase in the system’s dimension. To avoid the excessive computation efforts, we eliminate
the repetitious states and modify the corresponding linearization matrices. Therefore, the dimen-
sion of the bilinear system reduces to:
r =
η∑i=1
(n− 1 + i
n− 1
)
Also, in general, the relation between the states in definition (20) may be lost during the system’s
time evolution. As a result, it is required to reset the augmented state vector x⊗ using the original
state variables.31 However, MHE updates x0 at every sampling time and calculates the extended
state using the updated original state variables.
Simulation
In this section, to implement the proposed MHE design, we first evaluate the performance of the
method of moments. To perform this evaluation, we use the result obtained by cNMC simulation
as a benchmark and compare it with reduced-order system derived by method of moments for
both batch and continuous processes. For the batch and continuous processes we consider 5000
15
particles in the Monte Carlo simulation. This number was seen to be sufficient for obtaining the
expected values in these stochastic phenomena, as when the simulations are repeated the results
do not change. We assumed k0 = 0.01 in all the simulations. However, at every sampling time it
is required to obtain kave, which makes the simulation very slow for this population of particles.
To deal with this issue, we estimate kave using only the random particles considered during the
selection of the reacting particles for the next event. The simulation assumes that initially half of
the particles are only composed of solute, and the other half is only solvent. Additionally, at the
beginning, all the particles have a mass equal to 1 and the number of particles in the unit volume
is 1. For the continuous process, α = 0.5, Cf = 2 and a uniform feed flow is assumed with particles
of characteristic size r = (1, 0.1). Lastly, the moments for the cNMC simulation at any sampling
time are calculated according to the following expression.
Mij =1
V
N∑l=0
pilsjl (22)
where (pl, sl) are the characteristic size of the ith particle.
Figure 3 and Figure 4 show the moments obtained by both the methods in the batch process and
the continuous process, respectively. The reduced model predicts the system’s behavior properly.
Also, the results show that only the first three terms of Taylor expansion suffice to approximate
the fractional moments as a function of the state variables of the system.
In Figure 3, we observe that the zeroth order moment monotonically decreases and finally
converges to 1 particle per unit volume. This is expected, as the batch process does not reach
an equilibrium and the particles continue merging and growing until only one particle remains.
Furthermore, M10 and M10 are seen to remain constant in Fig. 3. This results from the conservation
of solvent and salute mass in the reactor.
Figure 4 demonstrates the simulation results for the continuous process. There are clearly two
phases in the response plots of all the moments in this figure. First, there is a transient phase
which lasts an interval proportional to the residence time of the reactor. Next, all the moments
reach their corresponding steady state level after all the transient effects have died out.
16
Switching our focus to the estimation problem, we only consider continuous processes. The
number of the particles in the output flow, M00 can be measured. This is because in practice
online measurement of the other output properties moments is not as easy. During the simulation
of our estimator’s performance, we assume the feed flow concentration, Cf , is corrupted by a white
noise signal uniformly distributed over the interval [−0.5, 0.5]. Also, there is another zero-mean
uniformly distributed white noise signal affecting the output measurements with a maximum value
of 0.025. The MHE design parameters are chosen to be:
Q = 100, R = 10−4,Π0 = 0.1I
Moreover, the sampling time and the estimation horizon are chosen to be ∆T = 0.25 and N = 3,
respectively.
As discussed before, the measured output of the system is M00 (with noise degradation) shown
in Fig. 5.A. Additionally Fig. 5.B shows that the estimation error is very small in comparison with
the measurement and process noise. Also, Fig. 6 presents the estimated and true blending degrees
using Eq. (16). This figure shows that small changes in concentration may have an important effect
on ε, which necessitates design of a controller in parallel with this estimator.
The estimation was performed in MATLAB using a 3.4 GHz Intel Core i7-3770 processor. The
simulation used the fmincon function to find the optimal estimation variables at each sampling step.
Four different variations of the estimation problem were considered, with all of them giving the same
results as those in Fig. 5. In the most basic version, the nonlinear reduced-order model derived in
was directly employed. In another version, this nonlinear model was replaced with its second order
Carleman linearization. The other two versions also used the Carleman linearized model. However,
instead of relying on fmincon to numerically approximate the gradient and Hessian of the problem’s
objective function, these quantities were accurately calculated using analytical expressions. The
two first cases were solved using the ‘trust-region-reflective’ algorithm, whereas, the‘interior-point’
algorithm was used in the last two cases. The latter method was more efficient than the former
as it benefits from knowledge of the gradient of the cost function. Table 2 reports the simulation
17
runtimes for all these four cases.
According to this table, use of the Carleman linerized model and analytical calculation of the
derivatives of the optimization problem’s objective function, together, accelerates the simulation
more than 8 times. This is while the optimal estimates are very minimally affected and the method
has almost the same performance level as the nonlinear MHE.
Conclusions
This article addresses the problem of state estimation in a coagulation process. The distribution
of particles in this process is described as a function of the particles’ mass and composition. The
particle balance for this system results in an integro-differential equation which does not have a
closed form analytical solution in general.
Using the method of moments, the original model is represented as an ODE set, where the states
are distribution moments of the particles. This representation enables the application of estimation
methods to the coagulation process. This work also exploited Taylor expansions to approximate
some moments in terms of other moments to guarantee the closure of the reduced-order model.
The simulation results of the ODE set are seen to be in agreement with the results of Monte Carlo
simulations.
Blending degree is one of the important characteristics of the process, which can be directly
expressed as a function of the reduced order model of the process. However, there are noise signals
acting on the process dynamics and affecting the measurements of the system’s output. Therefore,
MHE is a proper approach to estimate these states, while taking into account the constraints of
the system. However, this method needs significant online numerical computational power to solve
the associated dynamic optimization in real time. This manuscript used Carleman linearization to
approximate the nonlinear continuous system in a bilinear form. This form admits an analytical
solution, which allows for the analytic calculation of the gradient and Hessian matrix of the cost
function with respect to the decision variables. The results showed MHE combined with Carleman
linearization reduces the running time by more than 8 times in comparison with the nonlinear
18
MHE.
Acknowledgments
This work was supported by the National Science Foundation, CBET Award 12-634902. Also, the
authors would like to thank Themis Matsoukas for helpful discussions on Monte Carlo simulations
of bi-component coagulation processes.
References
[1] Lushinkov AA. Evolution of coagulation systems: III. Coagulating mixtures. J of Colloid and
interface Sci. 1976;54:94-101.
[2] Gelbard FM, Seinfeld JH. Coagulation and growth of a multicomponent aerosol. J of Colloid
and Interface Sci. 1978;63:472-479.
[3] Krapivsky PL, Ben-Naim E. Aggregation with multiple conservation laws. Phys. Rev. E.
1996;53:291-298.
[4] Vigil RD, Ziff RM. On the scaling theory of two-component aggregation. Chem Eng Sci.
1998;53:1725-1729.
[5] Matsoukas T, Lee K, Kim T. Mixing of components in two-component aggregation, AIChE J.
2006;52:3088-3099.
[6] Marshall CL, Rajniak P, Matsoukas T. Multi-component population balance modeling of gran-
ulation with continuous addition of binder. Powder Technol. 2013;236:211-220.
[7] Smith M, Matsoukas T. Constant-number Monte Carlo simulation of population balances.
Chem Eng Sci. 1998;53:1777-1786.
[8] Fox R O. Optimal moment sets for multivariate direct quadrature method of moments. Ind.
Eng. Chem. Res. 2009;48:9686-9696.
19
[9] Marchisio D L, Fox R O. Solution of population balance equations using the direct quadrature
method of moments. J. Aerosol Sci. 2005;36:43-73.
[10] C. L. Marshall, P. Rajniak, and T. Matsoukas, “Numerical simulations of two-component
granulation: Comparison of three methods,” Chem. Eng. Res. Des., vol. 89, no. 5, pp. 545-
552, 2011.
[11] Yu M, Lin J, Chan T. A new moment method for solving the coagulation equation for particles
in Brownian motion Aerosol Sci. & Technol. 2008;42:705-713.
[12] Tatiraju S, Soroush M. Nonlinear state estimation in a polymerization reactor, Ind. Eng.
Chem. Res. 1997;36:2679-2690.
[13] Soroush M. State and parameter estimations and their applications in process control. Comput.
Chem. Eng. 1998;23:229-245.
[14] Jazwinski A H. Stochastic processes and filtering theory, New York: Academic Press, 1970.
[15] Jang S S, Joseph B, Mukai H. Comparison of two approaches to on-line parameter and state
estimation of nonlinear systems. Industrial & Engineering Chemistry Process Design and De-
velopment 1986;25:809-814.
[16] Rao C V, Rawlings J B, Lee J H. Constrained linear state estimation - A moving horizon
approach. Automatica 2001;37:1619-1628.
[17] Rao C V, Rawlings J B. Constrained process monitoring: Moving-horizon approach. AIChE
J. 2002;48:97-109.
[18] Rao C V, Rawlings J B, Mayne D Q . Constrained state estimation for nonlinear discrete-
time systems: stability and moving horizon approximations. IEEE Transactions on Automatic
Control 2003;48:246-258.
[19] Kalman R E. A new approach to linear filtering and prediction problems. Transactions of the
ASME-Journal of Basic Engineering 1960;82:35-45.
20
[20] Smith G L, Schmidt S F, McGee L A. Application of statistical filter theory to the optimal
estimation of position and velocity on board a circumlunar vehicle. Washington, D.C.: National
Aeronautics and Space Administration, 1962.
[21] Haseltine E L, Rawlings J B. Critical evaluation of extended Kalman filtering and moving-
horizon estimation. Ind Eng Chem Res. 2005;44:2451-2460.
[22] Schneider R, Georgakis Ch. How to not make the extended Kalman filter fail, Ind. Eng. Chem.
Res. 2013;52(9):3354-3362.
[23] Hashemian N, Armaou A. Fast moving horizon estimation of nonlinear processes via Carleman
linearization. American Control Conference, Chicago, IL , 2015.
[24] Kalani A, Christofides P D. Modeling and control of a Titania aerosol reactor. Aerosol Sci. &
Technol. 2000;32:369-391.
[25] Hulburt H M, Katz S. Some problems in particle technology: A statistical mechanical formu-
lation. Chem Eng Sci. 1963;19:555-574.
[26] Chiu T Y, Christofides P D. Robust control of particulate processes using uncertain population
balances. AIChE J. 2000;46:266-280.
[27] Shi D , El-Farra N H , Li M , Mhaskar P, Christofides P D. Predictive control of particle size
distribution in particulate processes. Chem Eng Sci. 2006;61:268-281.
[28] Larsen P A, Rawlings J B. The potential of current high-resolution imaging-based particle size
distribution measurements for crystallization monitoring AIChE J. 2009; 54: 896-905
[29] Mesbah A , Streif S, Findeisen R, Braatz R D. Stochastic nonlinear model predictive control
with probabilistic constraints. American Control Conference, Portland, OR, 2014.
[30] Cinlar, E. Introduction to stochastic processes. Englewood Cliff, NJ: Prentice-Hall, 1975.
[31] Fang Y, Armaou A. Nonlinear model predictive control using a bilinear Carleman linearization-
based formulation for chemical processes. American Control Conference, Chicago, IL , 2015.
21
Appendix
This Appendix, derives the two ODE sets approximating the batch and continuous coagulation
processes, by incorporating Eq. (13) in Eq. (1) and (14), respectively.
Batch Process
Here, to simplify the equations, we scale M10 to unity and therefore M01 = φ. Also, we define the
state vector of the reduced model in the following form:
x =
[M00 M11 M20 M02 M12 M21 M22
]T
Consequently, the expression for evolution of the moments is given by the following ODE set:
x1 =−k0
81x2
1(−2x21x
23 + 13x1x3 + 151)
x2 =k0
81(x1x2 + 310φ+ 23x1φx3 + 4x2
1x6 − 19x21x2x3 + 5x3
1x3x6)
x3 =k0
81(26x1x3 − 4x2
1x23 + 302)
x4 =k0
81(−4x2
6x41 + 34x2x6x
31 − 8x6x
21φ− 70x2
2x21 + 70x2x1φ+ 302φ2)
x5 =k0
81(x1x5 + 4x2
1x7 − 8φ2/x1 − 32x1x22 + 10x3
1x26 + 662φx2 + 310x4
+ 38x1φx6 + 23x1x4x3 − 22x21x2x6 − 19x2
1x5x3 + 5x31x3x7)
x6 =k0
27(−14φ/x1 + 4x1x6 + 131φx3 + 217x2 − 19x1x2x3 + 5x2
1x3x6)
x7 =k0
81(12x1x7 + 38φ2/x2
1 + 728x22 + 38x2
1x26 − 152φx2/x1 − 42x4/x1 + 796φx6
+ 393x4x3 + 651x5 − 152x1x2x6 − 57x1x5x3 + 15x21x3x7)
(23)
22
Continuous Process
For the continuous process, we also consider the moments M01 and M10 as state variables:
x =
[M00 M10 M01 M11 M20 M02 M12 M21 M22
]T
Similar to Eq. (14), because of input and output flow, we modify Eq. (9) for this case:
d[Mij ]
dt=
1
2
∞∫0
∞∫0
drdr′[A(r + r′)−A(r)−A(r′)]f(r)f(r′)k(r, r′) + α[uij −Mij ] (24)
where uij is the moment variable obtained from the particles distribution in the feed flow.
23
Therefore, implementing method of moments in Eq. (24) yields:
x1 =k0
81(2f4x2
5 − 13f2x1x5 − 151)− α(x1 − u00)
x2 = −α(x2 − u10)
x3 = −α(x3 − u01)
x4 =k0
81(5f3x5x8 − 19f2x4x5 + 4fx8x1 + 23fx3x5 + x4x1 + 310x3x2)− α(x4 − u11)
x5 =k0
81(−4f2x2
5 + 26x1x5 + 302x22)− α(x5 − u20)
x6 =k0
81(−4f4x2
8 + 34f3x4x8 − 8f2x3x8 − 70f2x24 + 70fx3x4 + 302x2
3)− α(x6 − u02)
x7 =k0
81(10f3x2
8 + 5f3x5x9 − 22f2x4x8 − 19f2x7x5 + 4fx9x1 + 38fx3x8 − 32fx24
+ 23fx6x5 + x7x1 + 662x3x4 + 310x6x2 − 8x23/f)− α(x7 − u12)
x8 =k0
27(5x5x8f
2 + 4x8x1 − 19x4x5f + 217x4x2 + 131x3x5 − 14x3x2/f)− α(x8 − u21)
x9 =k0
81(38f2x2
8 + 17x5x9f2 − 152fx4x8 − 73fx7x5 + 19x9x1 + 796x3x8 + 595x7x2
+ 728x24 + 389x6x5 − 152x3x4/f − 56x6x2/f + 38x2
3/f2)− α(x9 − u22)
(25)
where f = x1/x2.
24
Sensitivity Analysis
We assume the noise vector in a small enough time interval is constant. Hence, the bilinear system
derived by Carleman linearization method has an analytical solution in the interval [ti, ti+1]:
xi,⊗ = exp(∆TAi)xi−1,⊗ + A−1i (exp(∆TAi)− I)(Ar +
m∑j=1
Dj0wi,j) (26)
where ∆T = ti − ti−1, Ai = A+∑m
j=1Djwi,j and wi =
[wi,1 wi,2 ... wi,m
]T.
Consequently, using chain rule the first derivatives of the estimation error, vi with respect to
the elements of x0 and wi vectors are given by:
∂vi∂x0,k
= −CEiEi−1...E1∂x0,⊗∂x0,k
(27)
and,
∂vi∂wp,k
= −CEiEi−1...Ep+1
(∂Ep∂wp,k
xp−1
+ [A−1p
∂Ep∂wp,k
+∂A−1
p
∂wp,k(Ep − I)](Ar +
m∑j=1
Dj0wi,j) + A−1p (Ep − I)Dk0
) (28)
where Ei = exp(∆TAi) and p ≤ i. Also, the estimation error is independent to the noise signal in
future and therefore, for p > i,∂vi∂wp,k
= 0.
Similarly, the second order derivatives of vi are available analytically:
∂2vi∂x0,l∂x0,k
= −CEiEi−1...E1∂2x0,⊗
∂x0,l∂x0,k(29)
∂2vi∂x0,l∂wp,k
= −CEiEi−1...Ep+1∂Ep∂wp,k
Ep−1...E1∂x0,⊗∂x0,k
(30)
25
∂2vi∂wp,l∂wp,k
= −CEiEi−1...Ep+1
(∂2Ep
∂wp,l∂wp,kxp−1 +
[∂A−1p
∂wp,l
∂Ep∂wp,k
+∂A−1
p
∂wp,k
∂Ep∂wp,l
+ A−1p
∂2Ei∂wp,l∂wp,k
+∂2A−1
p
∂wp,l∂wp,k(Ep − 1)
](Ar +
m∑j=1
Dj0wi,j)
+[A−1p
∂Ep∂wp,k
+∂A−1
h
∂wp,k(Ep − I)
]Dl0 +
[A−1p
∂Ep∂wp,l
+∂A−1
p
∂wp,l(Ep − I)
]Dk0
),
(31)
and
∂2vi∂wp,l∂wq,k
= −CEiEi−1...Ep+1∂Ep∂wp,l
Ep−1...Eq+1
(∂Eq∂wq,k
xq−1
+ [A−1q
∂Eq∂wq,k
+∂A−1
q
∂wq,k(Eq − I)](Ar +
m∑j=1
Dj0wi,j)
) (32)
where q < p ≤ i.
Using information provided by Eq. (27-32) in form of gradient and Hessian accelerated the
dynamic optimization in (18).
26
Table 1: Possible events in a continuous process
Event Rate
Coagulation Rcoag = N2/(2V kave)
Inlet stream Rin = αCfV/n
Outlet stream Rout = αN/n
27
Table 2: Run Time in the Simulations
gradient Hessian 2ndorder nonlinear
No No 21.0(s) 47.7(s)Yes No 6.4(s) -Yes Yes 5.7(s) -
28
Figure 1: Two possible scenarios in coagulation process for particles in a specific range
29
Figure 2: cNMC algorithm
30
0 5 10 15 200.7
0.8
0.9
1
time
M00
Monte CarloMethod of Moments
0 5 10 15 200
0.5
1
1.5
2
time
M12
0 5 10 15 200.5
1
1.5
2
2.5
time
M21
0 5 10 15 200.4
0.5
0.6
0.7
0.8
time
M02
0 5 10 15 200.5
1
1.5
2
time
M20
0 5 10 15 200.25
0.5
0.75
1
time
M11
0 5 10 15 200
1
2
3
4
5
time
M22
Figure 3: Comparison of method of moments and cNMC simulation results for a batch process; solidlines and circles represent the moments obtained by method of moments and cNMC simulation,respectively. 31
0 5 10 15 201
1.2
1.4
1.6
1.8
2
time
M00
Monte CarloMethod of Moments
0 5 10 15 20
1.2
1.6
2
time
M10
0 5 10 15 20
0.25
0.5
time
M01
0 5 10 15 200.2
0.3
0.4
0.5
time
M11
0 5 10 15 20
1.2
1.8
2.4
time
M20
0 5 10 15 200
0.1
0.2
0.3
0.4
0.5
time
M02
0 5 10 15 200
0.1
0.2
0.3
0.4
0.5
time
M12
0 5 10 15 200.25
0.3
0.35
0.4
0.45
0.5
time
M21
Figure 4: Comparison of method of moments and cNMC simulation for the continuous process; solidlines and circles represent the moments obtained by method of moments and cNMC simulation,respectively. 32
A.
0 1 2 3 4 5
time
1.8
1.9
2M
00
TrueMeasuredEstimated
B.
0 1 2 3 4 5
1.9
2
2.1
time
M10
True Moment
Estimated Moment
0 1 2 3 4 5
0.19
0.2
0.21
time
M01
0 1 2 3 4 50.22
0.23
0.24
time
M11
0 1 2 3 4 52.2
2.3
2.4
time
M20
0 1 2 3 4 50.022
0.023
0.024
0.025
time
M02
0 1 2 3 4 50.29
0.31
0.33
time
M21
0 1 2 3 4 5
0.03
0.032
time
M12
0 1 2 3 4 50.05
0.052
0.054
time
M22
Figure 5: (A) True, measured and estimated first moment in the continuous process; solid black,dashed black and gray lines represent the measured, estimated and true outputs, respectively. (B)Moments estimation using Carleman linearization in the presence of noise on the feed concentration(Cf ) and the output measurements (M00); black and gray lines represent the estimated and truemoments, respectively.
33
0 1 2 3 4 52.2
2.3
2.4
2.5
time
ε
Figure 6: True and estimated blending degree in the continuous process; black and gray linesrepresent the estimated and true ε, respectively.
34