We show that the circuit will work!
Being the first to analyse the repression-based antithetic integral controller architecture
Conducting pre-experimental parameter analysis to optimise design strategy
We constructed and implemented a variety of complex models to guide our experimental design, to answer fundamental questions about our system, and to probe aspects of our project that we could not investigate experimentally.
We used a variety of modelling techniques, tailoring each modelling effort to answer a specific question efficiently. Using multiple approaches to model the system also allowed us to verify each individual model; this let us assess which assumptions are valid and capitalise on the strengths of each approach.
Specifically, we first considered the ordinary differential equations (ODEs) that describe the time evolution of the species in our system. To derive these ODEs we considered the fluxes that determine the concentration of each species. To support our analytical derivation, we used SimBiology to build the model and confirm our ODEs. This model included both transcription and translation.
The parameters of the ODEs were obtained from the literature where possible, or otherwise from first-principles calculations. These ODEs could then be solved deterministically and stochastically to obtain the time evolution of the species in the system under a variety of perturbations (described in SimBiology as events).
In parallel, we used a reduced set of ODEs (combining the effects of transcription and translation) to explore the stability conditions of our system using the mathematics of control theory. We also performed extensive parameter optimisation and sensitivity analysis, which guided our experimental design throughout the project; this was crucial in order to reduce the sheer volume of circuits we needed to build and test in the finite time available. The information obtained from these models fed into our experimental design, and the experimental results were in turn incorporated into our models by refining our parameter estimates, allowing us to go through many design-build-test-learn cycles. See section ‘Control Theory Modelling’ for a complete description of the model, including results and how they were integrated into our wider project by influencing our experimental design.
Our project is furthermore heavily based on the mathematics of control theory and therefore we felt it was important to include a description of the relevant mathematics that we used to design our project and build our models. See ‘Theoretical Background’ for a summary of the most important concepts that we used and built upon in our modelling endeavours.
For the documentation of our code, please visit the following GitHub repository.
Here the GRN of the antithetic feedback motif was used to simulate the response of our system to perturbation. Modelling efforts in the literature (such as Briat et al. 2019) have focused on a simplified system in which the dynamics of transcription and translation are combined. This is a common simplification when simulating genetic circuits, since the difference in time scales between the two processes often allows transcription to be assumed to be at steady state (as it occurs more rapidly). However, a more realistic model including both transcription and translation was used here for two primary reasons. Firstly, to test whether robust adaptation is still achieved when transcription and translation are treated separately, and thus whether the assumptions made in previous models (and in separate modelling efforts within this project) are valid. It is especially important to test these assumptions in cases where the system is highly non-linear, as ours is. Secondly, by using this more realistic model of the network, we can understand the parameters of the system better: we can set parameters specific to transcription and translation rates (which can be determined from the literature or 'back-of-the-envelope' calculations), which are easier to interpret than a combined 'production rate' parameter. This lets us insert parameters that reflect the characteristics of the parts we will use more accurately, which is important when choosing parts for our assemblies; post-experiment, we can then fit this more detailed model to our results to deduce these more physically meaningful parameters (such as the translation rate of a specific protein).
Here we first use Hill activation in the model, for the activation of Z2 by X and of X by Z1, and then simplify the model by linearising the Hill activation function to give linear activation.
Presented below are the model derivation, a detailed description of the model, the simulation results, and a discussion of their implications, including how they influenced our experimental work and helped us to understand our lab results, together with a consideration of future work and next steps.
Here we describe the features and derivation of the ODE-based mathematical model of the proposed antithetic integral control system GRN.
Variables:
Symbol | Variable |
---|---|
[MZ1] | mRNA concentration for Z1 |
[MZ2] | mRNA concentration for Z2 |
[MX] | mRNA concentration for X |
[MR] | mRNA concentration for reporter |
[PZ1] | Z1 protein concentration |
[PZ2] | Z2 protein concentration |
[PX] | X protein concentration |
[PR] | reporter protein concentration |
[C] | Z1-Z2 complex concentration |
Parameters (Hill activation):
Symbol | Parameter | Units |
---|---|---|
χ0,i | Basal rate of transcription for species i | nM min−1 |
χ1,i | Maximal rate of transcription for species i | nM min−1 |
ki | Rate of translation of species i from its mRNA | min−1 |
γi | Rate of mRNA degradation for species i | min−1 |
δi | Rate of protein degradation for species i | min−1 |
ni | Hill coefficient for activation by species i | - |
Ki | Dissociation constant for activation by species i | nM |
η+ | Rate of association of Z1-Z2 complex | min−1 nM−1 |
η− | Rate of dissociation of Z1-Z2 complex | min−1 |
Equations (Hill activation):
$$\frac{d[M_{Z1}]}{dt}=\chi_{0,Z1} - \gamma_{Z1}[M_{Z1}]$$
$$\frac{d[P_{Z1}]}{dt} = k_{Z1}[M_{Z1}] - \delta_{Z1}[P_{Z1}] - (\eta^+[P_{Z1}][P_{Z2}] - \eta^-[C])$$
$$\frac{d[M_{Z2}]}{dt} = -\gamma_{Z2}[M_{Z2}] + \frac{\chi_{1,Z2}[P_X]^{n_X}}{K_X^{n_X} + [P_X]^{n_X}} + \chi_{0,Z2}$$
$$\frac{d[P_{Z2}]}{dt} = k_{Z2}[M_{Z2}] - \delta_{Z2}[P_{Z2}] - (\eta^+[P_{Z1}][P_{Z2}] - \eta^-[C])$$
$$\frac{d[M_X]}{dt} = \frac{\chi_{1,X}[P_{Z1}]^{n_{Z1}}}{K_{Z1}^{n_{Z1}} + [P_{Z1}]^{n_{Z1}}} - \gamma_X[M_X] + \chi_{0,X}$$
$$\frac{d[P_X]}{dt} = k_X[M_X] - \delta_X[P_X]$$
$$\frac{d[M_R]}{dt} = \frac{\chi_{1,R}[P_{Z1}]^{n_{Z1}}}{K_{Z1}^{n_{Z1}} + [P_{Z1}]^{n_{Z1}}} - \gamma_R[M_R] + \chi_{0,R}$$
$$\frac{d[P_R]}{dt} = k_R[M_R] - \delta_R[P_R]$$
$$\frac{d[C]}{dt} = (\eta^+[P_{Z1}][P_{Z2}]-\eta^-[C]) - \delta_C[C]$$
Parameter choice (Hill activation):
Parameter | Baseline Value | Reference |
---|---|---|
χ0,i | 90 nM min−1 | Stricker et al. (2008) |
χ1,i | 1800 nM min−1 | Stricker et al. (2008) |
ki | 81 min−1 | Stricker et al. (2008) |
γi | 0.54 min−1 | Stricker et al. (2008) |
δi | 0.03 min−1 | Annunziata et al. (2017) |
nZ1 | 2 | U. Alon, An Introduction to Systems Biology |
KZ1 | 3000 nM | Rhodius et al. (2013) |
nX | 2.3 | Meyer et al. (2019) |
KX | 4300 nM | Meyer et al. (2019) |
η+ | 0.0108 min−1nM−1 | Stricker et al. (2008) |
η− | 0.000108 min−1 | Stricker et al. (2008) |
Note that the parameters chosen for the promoter induced by the X species were those associated with the CinR-inducible promoter, as documented in Meyer et al. (2019), because this is the inducible promoter we are using for the activation-based antithetic integral feedback motif (see section: Circuit design).
Below are the simulations showing the response of the system to perturbations in the concentration of X, in the rate of translation of X and in the rate of degradation of X. In each case the system is able to adapt to the perturbation and return to the steady state reference point.
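To make the simulation setup concrete, below is a minimal, self-contained sketch of how such a perturbation simulation can be reproduced outside SimBiology, using SciPy's stiff ODE solver. The two-phase handling of the perturbation event (a 1.5x jump in [P_X]) and the use of one shared value per parameter class are our own illustrative choices; the parameter values are those from the tables above.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Baseline parameters from the tables above (units: nM and min)
chi0, chi1 = 90.0, 1800.0           # basal / maximal transcription rates
k, gamma, delta = 81.0, 0.54, 0.03  # translation, mRNA decay, protein decay
n_Z1, K_Z1 = 2.0, 3000.0            # Hill parameters for activation by Z1
n_X, K_X = 2.3, 4300.0              # Hill parameters for activation by X
eta_p, eta_m = 0.0108, 0.000108     # complex association / dissociation

def hill(p, K, n):
    p = max(p, 0.0)                 # guard against tiny negative excursions
    return p**n / (K**n + p**n)

def rhs(t, y):
    M_Z1, P_Z1, M_Z2, P_Z2, M_X, P_X, M_R, P_R, C = y
    seq = eta_p * P_Z1 * P_Z2 - eta_m * C   # Z1-Z2 sequestration flux
    return [
        chi0 - gamma * M_Z1,
        k * M_Z1 - delta * P_Z1 - seq,
        chi0 + chi1 * hill(P_X, K_X, n_X) - gamma * M_Z2,
        k * M_Z2 - delta * P_Z2 - seq,
        chi0 + chi1 * hill(P_Z1, K_Z1, n_Z1) - gamma * M_X,
        k * M_X - delta * P_X,
        chi0 + chi1 * hill(P_Z1, K_Z1, n_Z1) - gamma * M_R,
        k * M_R - delta * P_R,
        seq - delta * C,
    ]

# Phase 1: run to (near) steady state; phase 2: relax after the event
sol1 = solve_ivp(rhs, (0, 2000), np.zeros(9), method="LSODA", rtol=1e-6)
y_pert = sol1.y[:, -1].copy()
y_pert[5] *= 1.5                    # perturbation event: 1.5x jump in [P_X]
sol2 = solve_ivp(rhs, (0, 2000), y_pert, method="LSODA", rtol=1e-6)
```

Plotting `sol2.t` against `sol2.y[5]` then shows the relaxation of [P_X] after the perturbation.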
In GSA, model quantities are varied together to simultaneously evaluate the relative contribution of each quantity to a model response. Here, GSA was used to determine which parameters the response of X is most sensitive to. This helped us make an informed decision on how best to perturb the system in the lab, by choosing perturbations that we know the concentration of our protein of interest, represented by the variable X, is most sensitive to.
Furthermore, by identifying which parameters X is most sensitive to, we can focus efforts on accurate characterization of the parameters in question since uncertainties in these parameters will be more important and influential than uncertainties in parameters that X is less sensitive to.
SimBiology performs a decomposition of the model output (response) variance by calculating the first- and total-order Sobol indices. The first-order Sobol indices give the fractions of the overall response variance that can be attributed to variations in an input parameter alone. The total-order Sobol index gives the fraction of the overall response variance that can be attributed to joint parameter variations.
The first-order sensitivity index indicates the effect of an individual parameter, whereas the total-order index also captures the interactions of that parameter with the other parameters and their combined influence on the output. The difference between the first-order and total-order indices therefore indicates to what extent the inputs interact with each other.
To this end, a global sensitivity analysis was performed using 20 inputs, the parameters of our circuit model, with the output variable being the concentration of protein X at steady state (t = 1000 s). The parameters were sampled from a uniform distribution spanning an order of magnitude below and above the values used in the deterministic and stochastic simulations (with the exception of the Hill coefficients, which were sampled from a normal distribution over a smaller range, as the uncertainty in these is lower).
Table 4 documents the Sobol index values for the parameters used in the GSA.
Parameter | First order index | Total order index |
---|---|---|
χ0,Z1 | 0.1237 | 0.6514 |
χ0,Z2 | 0 | 0 |
χ0,X | 0.0169 | 0.0141 |
χ1,Z2 | 0 | 0 |
χ1,X | 0 | 0 |
kZ1 | 0 | 0 |
kZ2 | 0.1243 | 0.6581 |
kX | 0.1574 | 0.7491 |
γZ1 | 0.1243 | 0.6581 |
γZ2 | 0.1237 | 0.6516 |
γX | 0.0061 | 0.0021 |
δZ1 | 0 | 0 |
δZ2 | 0 | 0 |
δX | 0.0488 | 0.0290 |
nZ1 | 0 | 0 |
nX | 0 | 0 |
KZ1 | 0 | 0 |
KX | 0 | 0 |
η+ | 0 | 0 |
η− | 0 | 0 |
Table 5 ranks the influence of the input parameters on the output X according to their first order Sobol index.
Rank (highest to lowest) | Parameter |
---|---|
1 | kX |
2 | kZ2 |
3 | γZ1 |
4 | γZ2 |
5 | χ0,Z1 |
6 | γX |
7 | χ0,X |
8 | δX |
9 | χ0,Z2 |
10 | KX |
11 | η+ |
12 | nX |
13 | KZ1 |
14 | kZ1 |
15 | δZ2 |
16 | nZ1 |
17 | δZ1 |
18 | χ1,X |
19 | χ1,Z2 |
20 | η- |
Table 6 ranks the influence of the input parameters on the output X according to their total order Sobol index.
Rank (highest to lowest) | Parameter |
---|---|
1 | kX |
2 | kZ2 |
3 | γZ1 |
4 | γZ2 |
5 | χ0,Z1 |
6 | δX |
7 | χ0,X |
8 | γX |
9 | η+ |
10 | χ0,Z2 |
11 | δZ2 |
12 | χ1,Z2 |
13 | χ1,X |
14 | kZ1 |
15 | KZ1 |
16 | nX |
17 | KX |
18 | nZ1 |
19 | δZ1 |
20 | η- |
These results show that for the most important parameters (those with Sobol indices greater than 0 to 4 decimal places), the order of importance does not change between first-order and total-order indices, suggesting that interactions between parameters do not greatly affect the sensitivity of X to those parameters. That said, the index values do increase for some parameters, so interactions between parameters are not completely absent. This analysis, and the resulting ranking of parameter influence, can be used to choose which parameters to change in order to perturb the system: specifically, the highest-ranked parameters, because X is most sensitive to variation in these at steady state and changing them would influence X. We want to perturb the system so that we can then see whether it adapts using our integral control motif; to test whether our control circuit is effective we thus need a clear, strong perturbation.
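SimBiology computed the Sobol indices for us, but the underlying estimator is easy to sketch. Below is a minimal Monte Carlo estimate of first-order Sobol indices, applied to a toy two-parameter model (not our circuit) whose exact indices are known; the toy response and sample size are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

def model(a, b):
    # Toy response with a known variance decomposition:
    # Var = Var(a) + 4*Var(b), so for independent Uniform(0, 1) inputs
    # the exact first-order indices are S_a = 0.2 and S_b = 0.8.
    return a + 2.0 * b

N = 100_000
A = rng.uniform(0, 1, size=(N, 2))   # first independent sample matrix
B = rng.uniform(0, 1, size=(N, 2))   # second independent sample matrix

fA = model(A[:, 0], A[:, 1])
var = fA.var()

S = []
for i in range(2):
    # C_i: matrix B with column i replaced by column i of A, so that
    # f(A) and f(C_i) share only input i; their covariance estimates V_i.
    Ci = B.copy()
    Ci[:, i] = A[:, i]
    fCi = model(Ci[:, 0], Ci[:, 1])
    # First-order index: fraction of output variance explained by input i
    S.append((np.mean(fA * fCi) - fA.mean() * fCi.mean()) / var)
```

With a large enough sample, `S` converges on the analytical values (0.2 and 0.8 here); the same estimator applied to a circuit simulation would reproduce the indices in Table 4.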
To simplify the model we make the additional assumption:
Hill activation dynamics can be approximated by linear mass action kinetics.
This reduces the system of ODEs to the following:
Parameters (linear activation):
Symbol | Parameter | Units |
---|---|---|
χ0,i | Basal rate of transcription for species i | nM min−1 |
χ1,i | Maximal rate of transcription for species i | nM min−1 |
ki | Rate of translation of species i from its mRNA | min−1 |
γi | Rate of mRNA degradation for species i | min−1 |
δi | Rate of protein degradation for species i | min−1 |
η+ | Rate of association of Z1-Z2 complex | min−1nM−1 |
η− | Rate of dissociation of Z1-Z2 complex | min−1 |
Equations (linear activation):
$$\frac{d[M_{Z1}]}{dt}=\chi_{0,Z1} - \gamma_{Z1}[M_{Z1}]$$
$$\frac{d[P_{Z1}]}{dt} = k_{Z1}[M_{Z1}] - \delta_{Z1}[P_{Z1}] - (\eta^+[P_{Z1}][P_{Z2}] - \eta^-[C])$$
$$\frac{d[M_{Z2}]}{dt} = -\gamma_{Z2}[M_{Z2}] + \chi_{1,Z2}[P_X] + \chi_{0,Z2}$$
$$\frac{d[P_{Z2}]}{dt} = k_{Z2}[M_{Z2}] - \delta_{Z2}[P_{Z2}] - (\eta^+[P_{Z1}][P_{Z2}] - \eta^-[C])$$
$$\frac{d[M_X]}{dt} = \chi_{1,X}[P_{Z1}] - \gamma_X[M_X] + \chi_{0,X}$$
$$\frac{d[P_X]}{dt} = k_X[M_X] - \delta_X[P_X]$$
$$\frac{d[M_R]}{dt} = \chi_{1,R}[P_{Z1}] - \gamma_R[M_R] + \chi_{0,R}$$
$$\frac{d[P_R]}{dt} = k_R[M_R] - \delta_R[P_R]$$
$$\frac{d[C]}{dt} = (\eta^+[P_{Z1}][P_{Z2}]-\eta^-[C]) - \delta_C[C]$$
The parameter values used in this system of equations remain the same as those given above.
The purpose of linearising the system is to allow it to be modelled stochastically, as detailed in the following section.
A stochastic simulation is a simulation of a system whose variables can change randomly, with defined probabilities. Stochastic simulations are in many ways more realistic than deterministic ODEs: they work with molecule numbers rather than concentrations (which is important when dealing with small populations of a species) and they model the inherent randomness of all biological reactions.
One algorithm that is well suited to stochastic simulation of biological networks is the Gillespie algorithm. It is particularly useful for simulating reactions within cells, where the numbers of reactants are low and tracking every individual reaction event is computationally feasible.
Briefly, the Gillespie algorithm captures the stochastic nature of biological networks by, at each iteration, drawing 1. the time until the next reaction and 2. the type of reaction that occurs, from probability distributions weighted by the propensity functions of the reactions. See Gillespie (1977) for more details.
This method is the basis of the stochastic simulation algorithm used in SimBiology.
The Chemical Master Equation (CME) describes the dynamics of a chemical system as the time evolution of a probability distribution; however, directly solving for this distribution is impractical for most realistic problems. The stochastic simulation algorithm (SSA), used by the stochastic solver in SimBiology, instead efficiently generates individual realisations that are consistent with the CME by simulating each reaction according to its propensity function. Analysing an ensemble of stochastic simulations to estimate the probability distribution is thus more efficient than solving the CME directly.
Both the Gillespie algorithm and the stochastic simulation algorithm become computationally impractical for large populations and fast reactions (as was the case for our more complex network).
Because stochastic simulations draw on the probabilistic nature of biological reactions, multiple runs are required, and an ensemble of runs can then be analysed. The stochastic simulation of the more complex model, including both transcription and translation, took too long to run for an ensemble simulation to be practical. We therefore used the simpler model, which combines transcription and translation, to run a stochastic simulation ensemble.
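For illustration, a minimal Gillespie SSA for the reduced antithetic motif can be written in a few lines. The rate constants below are illustrative placeholders, not our fitted parameters; species are tracked as integer molecule counts.

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative rate constants for the reduced motif (species: x, z1, z2)
mu1, theta1, theta2 = 10.0, 0.5, 0.1
eta, gamma_p = 0.02, 0.05

# State-change vectors, one row per reaction
stoich = np.array([
    [ 0,  1,  0],   # 0 -> Z1             (rate mu1)
    [ 1,  0,  0],   # Z1 -> Z1 + X        (rate theta1*z1)
    [ 0,  0,  1],   # X  -> X + Z2        (rate theta2*x)
    [ 0, -1, -1],   # Z1 + Z2 -> 0        (rate eta*z1*z2)
    [-1,  0,  0],   # X -> 0              (rate gamma_p*x)
])

def propensities(s):
    x, z1, z2 = s
    return np.array([mu1, theta1*z1, theta2*x, eta*z1*z2, gamma_p*x])

def gillespie(state, t_end, max_steps=200_000):
    t, times, traj = 0.0, [0.0], [state.copy()]
    for _ in range(max_steps):
        a = propensities(state)
        a0 = a.sum()
        if a0 == 0 or t >= t_end:
            break
        t += rng.exponential(1.0 / a0)     # 1. time to the next reaction
        r = rng.choice(len(a), p=a / a0)   # 2. which reaction fires
        state = state + stoich[r]
        times.append(t)
        traj.append(state.copy())
    return np.array(times), np.array(traj)

times, traj = gillespie(np.array([0, 0, 0]), t_end=500.0)
```

Running `gillespie` repeatedly with different seeds yields the ensemble of trajectories analysed in the figures below.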
Depicted are the stochastic model simulations of the number of X molecules from t = 0 to t = 8000 seconds, with a perturbation of a 1.5x increase in the number of X molecules at t = 5000 seconds. Fig.5 shows an individual run of the stochastic simulation and Fig.6 shows an ensemble simulation comprising 10 runs. These results show that our model circuit is able to adapt robustly to perturbations, as in the deterministic case, even when stochasticity is considered. The perturbations and the system response are above the level of noise, and the trajectory of the number of X molecules over time does not vary significantly between successive stochastic simulation runs.
Control theory is a branch of engineering that deals with the control of dynamical systems, with the aim of designing systems that are stable, compensate for uncertainty, reject disturbances and attenuate noise, leading to good overall performance. Control theory is applied across a wide range of engineering disciplines: in electrical engineering, for example, capacitors, resistors and operational amplifiers are combined into circuits that give a desired response, while in mechanical engineering the movement of systems is described by well-defined systems of differential equations. Despite this breadth, applying control theory in synthetic biology to create genetic circuits has been difficult due to the intrinsic non-linearity and unpredictability of biological systems. Here we take you on a journey through the analysis of the antithetic integral controller using control theory, and we hope you will come to appreciate both the usefulness and the limitations of applying control theory in synthetic biology.
The aim of the antithetic integral controller is to keep X expressed at the same level even in the face of parameter uncertainty or perturbation. The controller comprises the species Z1 and Z2. Z1 acts as an actuator, directly controlling the amount of X produced, while Z2 acts as a sensor, its production depending on the amount of X present. Z1 and Z2 annihilate each other, and this annihilation reaction computes the error signal of the system.
The equations of our system are as follows:
For activation:
$$\dot x = \theta_1 z_1 - \gamma_p x$$
$$\dot z_{1} = \mu_1 - \eta z_1 z_2 - \gamma_c z_1$$
$$\dot z_{2} = \theta_2 x - \eta z_1 z_2 - \gamma_c z_2$$
For repression:
$$\dot x = \frac{\alpha}{\theta_{1} z_{1} + 1} -\gamma x$$
$$\dot z_{1} = \mu_{1} - \eta z_{1} z_{2}$$
$$\dot z_{2} = \frac{\mu_{2}}{\theta_{2} x + 1} - \eta z_{1} z_{2}$$
The following images (Figure 7 and Figure 8) show the Chemical Reaction Network (CRN) of the activation and the repression system respectively.
In control theory, block diagrams are often utilised to represent a system. Our system can be summarised with the following block diagram (Figure 9).
In this section, we explain the reasoning behind the steps we take towards analysing the system with the control theory toolbox, as well as the mathematics of applying them to our system.
The equilibrium point of a system of ODEs is the state at which all time derivatives vanish, and to which a stable system converges. In a simple system the equilibrium can be found analytically; in more complicated systems, numerical methods have to be used, as there may be no closed-form solution. The equilibrium position is especially important for non-linear systems, because linearisation about the equilibrium is necessary for analysing the stability of the control system.

The equilibrium positions of x, z1 and z2 are plotted over the range γc = 0 to γc = 0.1 for both activation (Figure 11) and repression (Figure 12). When γc = 0, an analytical solution for the equilibrium positions can easily be obtained.
For activation:
$$x_{eq} = \frac{\mu_1}{\theta_2}$$, $$z_{1eq} = \frac{\mu_1 \gamma_p}{\theta_1 \theta_2}$$, $$z_{2eq} = \frac{\theta_1 \theta_2}{\eta \gamma_p}$$
For repression:
$$x_{eq} = \frac{\mu_{2} - \mu_{1}}{\mu_{1} \theta_{2}}$$, $$z_{1eq} = \frac{\alpha \mu_{1} \theta_{2} - \gamma \left(\mu_{2} - \mu_{1}\right)}{\gamma \theta_{1} \left(\mu_{2} -\mu_{1}\right)}$$, $$z_{2eq} = \frac{\gamma \theta_{1}\left( \mu_{1} \mu_{2} - \mu_{1}^{2} \right)}{\alpha \eta \mu_{1} \theta_{2} - \gamma \eta \left(\mu_{2} - \mu_{1}\right)}$$
With γc not equal to 0, however, an analytical solution is difficult to obtain. Numerical solutions were therefore obtained with odeint in Python, and the dependence of the equilibrium position on γc was plotted. This range was chosen because γc = 0.00038, as calculated from the dilution rate of E. coli, lies within it. It is interesting to note that the equilibrium position follows a monotonic trend for both activation and repression as γc varies over this physically relevant range.
The following parameter values are used when analysing the activation system: θ1 = 0.005, γp = 0.00038, η = 0.018/60, µ1 = 0.7, θ2 = 0.0005.
The following parameter values are used when analysing the repression system: α = 10/60, θ1 = 0.05, γp = 0.00038, η = 0.018/60, µ2 = 1.5, µ1 = 0.7, θ2 = 0.005.
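As a sketch of the numerical procedure described above, the following code (using odeint, as in the text) sweeps γc for the activation system and reads off the equilibrium as the final state of a long integration; the initial condition and integration horizon are our own illustrative choices, and the γc = 0 case can be checked against the analytical value x_eq = µ1/θ2.

```python
import numpy as np
from scipy.integrate import odeint

# Activation-system parameters from the text
theta1, gamma_p, eta = 0.005, 0.00038, 0.018 / 60
mu1, theta2 = 0.7, 0.0005

def activation(state, t, gamma_c):
    x, z1, z2 = state
    dx  = theta1 * z1 - gamma_p * x
    dz1 = mu1 - eta * z1 * z2 - gamma_c * z1
    dz2 = theta2 * x - eta * z1 * z2 - gamma_c * z2
    return [dx, dz1, dz2]

# Integrate long enough for the slow modes to decay, then take the
# final state as the equilibrium for each gamma_c in the sweep.
t = np.linspace(0.0, 200_000.0, 2000)
equilibria = []
for gamma_c in np.linspace(0.0, 0.1, 11):
    sol = odeint(activation, [1000.0, 100.0, 20.0], t, args=(gamma_c,))
    equilibria.append(sol[-1])
equilibria = np.array(equilibria)   # rows: (x_eq, z1_eq, z2_eq) per gamma_c
```

Plotting each column of `equilibria` against γc reproduces the monotonic trends described above.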
Systems are often characterised by ODEs, and these ODEs are often non-linear and thus very difficult to solve. To utilise the control systems toolbox, the system has to be linearised about its equilibrium position using the Jacobian matrix. Linearisation is akin to zooming in on the dynamics of the system at the fixed point and assuming that, in a small neighbourhood of this point, the dynamics are linear. Figure 13 shows graphically how linearisation works. Further from the fixed point, the linearisation no longer holds; the system therefore has to operate close to the fixed point for linearisation to be justified. After the ODEs are linearised, the system can be written in state-space form, i.e. a system of linear ODEs arranged in the specific format below.
$$\dot x = Ax + Bu$$
$$y = Cx + Du$$
where x is the state of the system, u is the input and y is the output. A, B, C and D are the matrices describing the system of ODEs. By taking the Laplace transform, the transfer function of the system can be found.
In a dynamical system, the present output y(t1) depends not only on the present input u(t1) but also on past inputs u(t) for t ≤ t1; this is the system's memory. The states x(t) are a set of variables chosen to represent this memory, and the choice of state is not unique. Essentially, x(t0) summarises the effect of inputs and states prior to t = t0 on the future.
There are standard ways to choose suitable states for a system. One can choose states intuitively by considering the independent energy-storage elements of the system. For instance, in electrical circuits the voltages across capacitors or the currents in inductors are reasonable choices; in mechanics, the positions and velocities of masses are often chosen. In our system, the numbers of the species X, Z1 and Z2 intuitively capture the memory of the system and are therefore chosen as the states.
To see whether the system can robustly adapt, we take the input of the system to be a disturbance d to x. Here d = 0.75θ1, where θ1 is the production rate of x per z1 molecule. It would not be realistic to set d too high: the dynamics would then deviate far from the equilibrium point, and examining the response and stability of the linearised system would no longer be justified.
For the activation system, after linearisation about its equilibrium position:
$$\begin{bmatrix}
\dot{\delta x} \\
\dot{\delta z_1} \\
\dot{\delta z_2}
\end{bmatrix} =
\begin{bmatrix}
-\gamma_p & \theta_1 & 0\\
0 & -\eta z_2 - \gamma_c & -\eta z_1 \\
\theta_2 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}
+
\begin{bmatrix}
1 \\
0 \\
0
\end{bmatrix}
d$$
$$y =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}$$
where
$$A =
\begin{bmatrix}
-\gamma_p & \theta_1 & 0\\
0 & -\eta z_2-\gamma_c & -\eta z_1 \\
\theta_2 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}$$,
$$B =
\begin{bmatrix}
1 \\
0 \\
0
\end{bmatrix}$$,
$$C =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}$$
For the repression system, after linearisation about its equilibrium position:
$$\begin{bmatrix}
\dot{\delta x} \\
\dot{\delta z_1} \\
\dot{\delta z_2}
\end{bmatrix}
=
\begin{bmatrix}
-\gamma_p & -\frac{\alpha \theta_1}{(\theta_1 z_{1eq} + 1)^2} & 0\\
0 & -\eta z_2 - \gamma_c & -\eta z_1 \\
-\frac{\mu_2 \theta_2}{(\theta_2 x_{eq}+1)^2} & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}
+
\begin{bmatrix}
1 \\
0 \\
0
\end{bmatrix}
d$$
$$y =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}$$
where
$$A =
\begin{bmatrix}
-\gamma_p & -\frac{\alpha \theta_1}{(\theta_1 z_{1eq} + 1)^2} & 0\\
0 & -\eta z_2 -\gamma_c & -\eta z_1 \\
-\frac{\mu_2 \theta_2}{(\theta_2 x_{eq}+1)^2} & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}$$,
$$B =
\begin{bmatrix}
1 \\
0 \\
0
\end{bmatrix}$$,
$$C =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}$$
Systems described by ODEs can be solved and visualised in the time domain. However, while the time domain shows us the actual response, it offers few tools for analysing the system. In the Laplace domain (s domain), by contrast, many tools from the control theory toolbox can be used to analyse the system in a simpler way. In the Laplace domain, instead of time, the variable is s (a complex number).
Laplace transform is defined as the following:
$$\mathscr{L}\{f(t)\}=\int_{t=0}^{\infty}f(t)e^{-st}dt$$
Although the Laplace transform expression looks complicated, transforming an ODE is straightforward:
$$\mathscr{L}\{\dot x\}=sX(s) - x_0$$
where X(s) is the Laplace transform of x(t) and x0 is the initial condition.
Taking the Laplace transform of the ODE yields a transfer function linking the input of the system (U(s)) to its output (Y(s)). A transfer function is the Laplace transform of the impulse response of the system, i.e. of how the system responds to an impulse in the time domain. By examining the transfer function and manipulating it with the control systems toolbox, one can determine the system's stability, and its amplitude and phase response to inputs at different frequencies.
Let G(s) be the transfer function, U(s) the input and Y(s) the output in the Laplace domain; then Y(s) = G(s)U(s). In a single-input single-output system, the transfer function can be found by taking the Laplace transform of the single differential equation. For a multi-input multi-output system, a state-space representation is required, from which the transfer function is obtained as
$$G(s) = C(sI - A)^{-1}B + D$$
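As a sketch, the formula G(s) = C(sI − A)⁻¹B + D can be evaluated numerically with SciPy's `ss2tf`; here we apply it to the activation-system state-space matrices at the γc = 0 equilibrium, using the parameter values listed earlier. The denominator of the resulting transfer function is the characteristic polynomial of A, so its roots are the poles of the system.

```python
import numpy as np
from scipy.signal import ss2tf

# Activation-system state-space matrices at the gamma_c = 0 equilibrium
theta1, gamma_p, eta = 0.005, 0.00038, 0.018 / 60
mu1, theta2 = 0.7, 0.0005
z1_eq = mu1 * gamma_p / (theta1 * theta2)
z2_eq = theta1 * theta2 / (eta * gamma_p)

A = np.array([
    [-gamma_p, theta1,       0.0],
    [0.0,      -eta * z2_eq, -eta * z1_eq],
    [theta2,   -eta * z2_eq, -eta * z1_eq],
])
B = np.array([[1.0], [0.0], [0.0]])  # the disturbance enters through x
C = np.array([[1.0, 0.0, 0.0]])      # we observe x
D = np.array([[0.0]])

num, den = ss2tf(A, B, C, D)         # G(s) = C (sI - A)^{-1} B + D
```

`den` holds the coefficients of the closed-loop characteristic polynomial, and `np.roots(den)` gives the poles analysed in the stability section.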
An open-loop system is one without any feedback, i.e. the output does not influence the input. In principle, given a plant P(s), it is possible to choose a desired overall transfer function G(s) and use a controller C(s) = G(s)/P(s) in series to obtain Y(s) = G(s)U(s). In practice this does not work, because it requires an exact model of the plant/process without any disturbances or uncertainties. Therefore, a feedback system is needed. The block diagram of an open-loop system is shown in Figure 13.
Negative feedback is important in this case to deal with the effects of uncertainty.
The closed-loop transfer function for a negative feedback loop is H(s) = G(s)/(1+KG(s)), where K is the feedback gain of the system. In this case we define the return ratio L(s) = KG(s). As the block diagram shows, negative feedback compares the output at each moment to the reference value, and the error signal is fed back into the system. Looking at the denominator of the closed-loop transfer function, the closed-loop characteristic equation is 1+KG(s) = 0, so the stability and characteristics of the closed-loop system can be very different from those of the open-loop system (discussed further below).

The block diagram of a negative feedback system is shown in Figure 14. To see how a negative feedback system rejects disturbances, we look at its sensitivity function (explained below). Essentially, the sensitivity function takes the form 1/(1+KG(s)), so a higher gain K gives a lower sensitivity and hence better disturbance rejection. Although negative feedback rejects disturbances and can further reduce the steady-state error as the gain increases, in biological systems the achievable gain is limited, so the steady-state error can never reach 0 by feedback gain alone.
In the Laplace domain, an integrator contributes a factor 1/s to the transfer function. The presence of an integrator inside the controller ensures that there is no steady-state error, provided the closed-loop system is asymptotically stable, even in the presence of constant disturbances and demands. An integrator is essentially the memory of the error signal through time.
To examine the integrator of the system, an additional state representing the error signal can be added to the state-space representation. The detailed steps are discussed below.
The block diagram of an integrator is shown in Figure 15.
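To illustrate why an integrator removes steady-state error, here is a toy simulation of a hypothetical first-order plant (not our genetic circuit) under integral control with a constant disturbance; all values are illustrative.

```python
# First-order plant  y' = -y + u + d  under integral control  u = ki*z,
# with integrator state  z' = r - y  (forward-Euler discretisation).
# Despite the constant disturbance d, the output settles at the
# reference r, because the integrator accumulates error until e = 0.
r, d, ki = 1.0, 0.5, 0.5   # reference, disturbance, integral gain
dt, T = 0.01, 200.0

y, z = 0.0, 0.0
for _ in range(int(T / dt)):
    z += (r - y) * dt            # integrator: memory of the error signal
    y += (-y + ki * z + d) * dt  # plant step
```

At the end of the run, y has converged to r exactly; a pure proportional controller with the same gain would instead leave a residual offset set by d.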
After understanding the different types of systems, it is time to simulate them to see how they respond to perturbation and whether they robustly adapt.
In this section, the control theory toolbox is used to characterise the system; the most important concepts behind it are equilibrium and stability.
The eigenvalues of A are found from the characteristic equation:
$$\det(\lambda I - A) = 0$$
The Routh-Hurwitz criterion states that for the roots of the characteristic equation to have negative real parts, $$a_{1}a_{2} > a_{0}a_{3}$$ where $$a_{1}a_{2} = (\gamma_p + \eta (z_{1eq}+z_{2eq}))(\eta \gamma_p (z_{1eq}+z_{2eq}))$$ and $$a_{0}a_{3} = \eta \theta_1 \theta_2 z_{1eq}$$
Substituting $$z_{1eq} = \frac{\mu_1 \gamma_p}{\theta_1 \theta_2}$$ gives $$a_{0}a_{3} = \eta \mu_1 \gamma_p$$ and
$$a_{1}a_{2} = z_{1eq} z_{2eq} \eta^{2} \gamma_p + \dots = \mu_1 \gamma_p \eta + \dots$$
where the omitted terms are non-negative, since all the parameters are non-negative. This proves that $$a_{1}a_{2} > a_{0}a_{3}$$ and hence that the system, linearised about the equilibrium point, is stable.
For the parameter values we used for simulation, the poles of the system are:
−0.0386 + 0.0000i, − 0.0002 + 0.0014i, − 0.0002 − 0.0014i
With γc > 0, there’s no close form analytical solution to the steady state value
of the ODE. However, given that γc is very small and that without degradation
$$a_{1}a_{2} > a_{0}a_{3}$$. It is reasonable to deduce that for the range of value of γc we
have, the equilibrium point will be stable.
For the system we have used for simulation, we can analyse its stability by looking at the poles.
The poles of the system are:
−0.0389 + 0.0000i, − 0.0004 + 0.0014i, − 0.0004 − 0.0014i
As all the poles have a negative real part, the equilibrium point of the system
is stable.
The eigenvalues of the repression system lead to a very complicated expression. Here, we analyse the stability of the system by looking at the poles of the system we are simulating, to show that for realistic parameter values the equilibrium point is stable.
The poles of the system are:
−0.0436 + 0.0000i, − 0.0002 + 0.0007i, − 0.0002 − 0.0007i
As all the poles have a negative real part, the equilibrium point of the system
is stable.
Similarly, the eigenvalues of the repression system do not have a closed-form solution. Therefore, as above, we analyse the stability of the system by looking at the numerical values of the poles.
The poles of the system are:
−0.0387 + 0.0000i, − 0.0004 + 0.0008i, − 0.0004 − 0.0008i
As all the poles have a negative real part, the equilibrium point of the system
is stable.
In this section, we simulate the systems of non-linear ODEs with Python's odeint and see how the system responds to perturbations. We then vary the magnitude of the perturbation to show how the system behaves.
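A minimal sketch of such a simulation for the activation system without degradation, assuming hypothetical parameter values (not the ones behind the figures here): the system is first driven to steady state, then a constant disturbance is applied to x, and x adapts back to the set point μ1/θ2.

```python
import numpy as np
from scipy.integrate import odeint

# Hypothetical parameter values, chosen for illustration only
gamma_p, gamma_c, eta = 0.04, 0.0, 0.01
theta_1, theta_2, mu_1 = 0.02, 0.02, 0.1

def antithetic(state, t, d):
    """Closed-loop activation system; d is a constant disturbance on x."""
    x, z1, z2 = state
    dx  = theta_1 * z1 - gamma_p * x + d
    dz1 = mu_1 - eta * z1 * z2 - gamma_c * z1
    dz2 = theta_2 * x - eta * z1 * z2 - gamma_c * z2
    return [dx, dz1, dz2]

t = np.linspace(0.0, 20000.0, 4001)

# Run to steady state, then apply the disturbance and run again
pre  = odeint(antithetic, [0.0, 0.0, 0.0], t, args=(0.0,))
post = odeint(antithetic, pre[-1], t, args=(0.00375,))

# With gamma_c = 0 the integral action is perfect: x adapts back to mu_1/theta_2
setpoint = mu_1 / theta_2
print(pre[-1, 0], post[-1, 0], setpoint)
```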
The graphs below show the response of the system to small perturbations of varying magnitude for both the Activation and Repression systems, with and without degradation.
Comparing the system with and without degradation, one can see that in the non-degradation case the system is able to robustly adapt, returning to its original steady-state value. In the degradation case, however, the system reaches a new steady state, and does so faster given the same parameter values and the same disturbance.
An integrator is a controller element that ensures zero steady-state error. While zero steady-state error is ideal, an integral controller has slow dynamics. A leaky integrator is a type of integrator that allows for faster dynamics and increased stability, but at the expense of a small steady-state error. Essentially, it allows some memory of the error to be lost over time, just as in our system where the controller species are degraded, leading to a leaky integrator instead of a perfect integrator.
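The difference between a perfect and a leaky integrator can be illustrated on a minimal first-order plant under integral control; all values below are hypothetical and chosen only to make the effect visible:

```python
import numpy as np
from scipy.integrate import odeint

# Minimal first-order plant under integral control (illustrative values)
k, gamma_p, r, d = 0.1, 0.05, 1.0, 0.02

def loop(state, t, gamma_c):
    x, q = state
    dq = (r - x) - gamma_c * q      # leaky integrator: gamma_c > 0 forgets error
    dx = k * q - gamma_p * x + d    # plant with constant disturbance d
    return [dx, dq]

t = np.linspace(0, 5000, 1001)
perfect = odeint(loop, [0.0, 0.0], t, args=(0.0,))[-1, 0]   # gamma_c = 0
leaky   = odeint(loop, [0.0, 0.0], t, args=(0.05,))[-1, 0]  # gamma_c > 0

print(perfect)   # settles exactly at the reference r
print(leaky)     # settles with a small steady-state offset below r
```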
Here, we analyse the activation case only, using Nyquist diagrams and sensitivity functions to show how much variability the system can tolerate in the values of θ1 and θ2. To investigate this, we first need to obtain the open-loop transfer function of the system, treating θ1 and θ2 as feedback gains to investigate how large θ1 and θ2 can be while the system remains stable.
To investigate the open-loop behaviour, we break the loop open; this is explained in Figure 25.
The system of equations characterising the open-loop system is:
$$ \dot x = u - \gamma_p x$$
$$ \dot z_{1} = \mu_1 - \eta z_1 z_2 - \gamma_c z_1$$
$$\dot z_{2} = \theta_2 x - \eta z_1 z_2 - \gamma_c z_2$$
Representing this in state-space form after linearisation:
$$\begin{bmatrix}
\dot \delta x \\
\dot \delta z_1 \\
\dot \delta z_2
\end{bmatrix}
=
\begin{bmatrix}
-\gamma_p & 0 & 0\\
0 & -\eta z_2 - \gamma_c & -\eta z_1 \\
\theta_2 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}
+
\begin{bmatrix}
-1 \\
0 \\
0
\end{bmatrix}
u$$
$$y =
\begin{bmatrix}
0 & 1 & 0
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}$$
where
$$A =
\begin{bmatrix}
-\gamma_p & 0 & 0\\
0 & -\eta z_2-\gamma_c & -\eta z_1 \\
\theta_2 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}$$,
$$B =
\begin{bmatrix}
-1 \\
0 \\
0
\end{bmatrix}$$,
$$C =
\begin{bmatrix}
0 & 1 & 0
\end{bmatrix}$$
Thus the open-loop transfer function of the system, with θ1 as the feedback gain, is given by $$H(s) = C(sI - A)^{-1}B$$
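H(s) can be obtained numerically from the (A, B, C, D) matrices, for example with scipy's ss2tf; the equilibrium values z1 and z2 below are hypothetical placeholders, not our fitted values:

```python
import numpy as np
from scipy import signal

# Hypothetical equilibrium values and parameters (illustration only)
gamma_p, gamma_c, eta, theta_2 = 0.04, 0.0, 0.01, 0.02
z1, z2 = 10.0, 1.0                   # linearisation point

# Open-loop state space, loop broken at theta_1 (as derived above)
A = np.array([[-gamma_p, 0.0, 0.0],
              [0.0, -eta * z2 - gamma_c, -eta * z1],
              [theta_2, -eta * z2, -eta * z1 - gamma_c]])
B = np.array([[-1.0], [0.0], [0.0]])
C = np.array([[0.0, 1.0, 0.0]])
D = np.array([[0.0]])

# H(s) = C (sI - A)^{-1} B as a numerator/denominator coefficient pair
num, den = signal.ss2tf(A, B, C, D)
print("num:", num)
print("den:", den)   # den is the characteristic polynomial of A

# With gamma_c = 0 one pole of the open loop sits at s = 0: the integrator 1/s
print("open-loop poles:", np.roots(den))
```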
To obtain the open-loop system with respect to θ2, we again break the loop open; this is explained in Figure 26.
The system of equations characterising the open-loop system is:
$$ \dot x = \theta_1 z_1 - \gamma_p x$$
$$\dot z_{1} = \mu_1 - \eta z_1 z_2 - \gamma_c z_1$$
$$\dot z_{2} = u - \eta z_1 z_2 - \gamma_c z_2$$
Representing this in state-space form after linearisation:
$$\begin{bmatrix}
\dot \delta x \\
\dot \delta z_1 \\
\dot \delta z_2
\end{bmatrix}
=
\begin{bmatrix}
-\gamma_p & \theta_1 & 0\\
0 & -\eta z_2 - \gamma_c & -\eta z_1 \\
0 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}
+
\begin{bmatrix}
0 \\
0 \\
-1
\end{bmatrix}
u$$
$$y =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta z_1 \\
\delta z_2
\end{bmatrix}$$
where
$$A =
\begin{bmatrix}
-\gamma_p & \theta_1 & 0\\
0 & -\eta z_2-\gamma_c & -\eta z_1 \\
0 & -\eta z_2 & -\eta z_1 - \gamma_c
\end{bmatrix}$$,
$$B =
\begin{bmatrix}
0 \\
0 \\
-1
\end{bmatrix}$$,
$$C =
\begin{bmatrix}
1 & 0 & 0
\end{bmatrix}$$
Thus the open-loop transfer function of the system, with θ2 as the feedback gain, is given by $$H(s) = C(sI - A)^{-1}B$$
The sensitivity functions for the parameters θ1 and θ2 are shown in the following graphs. As we can see, the peaks of the sensitivity function of θ1 are at 0 dB, 7.03 dB and 7.3 dB for the non-degradation and degradation cases respectively, meaning that the closed-loop system can be quite sensitive to the value of θ1. For θ2, however, the peaks of the sensitivity function are 1.53 dB and 1.52 dB for the non-degradation and degradation cases respectively, showing that the system will only become unstable if θ2 reaches a certain value. Moreover, the frequencies where the sensitivity function rises above 0 dB are at the high-frequency end, meaning that disturbances will very likely be rejected, as disturbances usually have low frequencies.
A Nyquist diagram provides an easy way to visualise the stability of a feedback system and to assess the maximum feedback gain that can be applied before the system becomes unstable. It allows us to deduce properties of the closed-loop system from properties of the open-loop system. It essentially plots the value of the open-loop transfer function G(s) in the complex plane at all frequencies from ω = 0 to ω = ∞. Assuming that the open-loop system is stable, the closed-loop system will be stable if the Nyquist diagram has no net encirclement of the -1/K point. In the analysis of our system, K = 1, so if there is no encirclement of the -1 point, the closed-loop system is stable.
Remember that the denominator of a closed-loop transfer function is 1 + L(s). If L(s) = -1, the denominator will be 0 and the system's response will go to infinity; the system is therefore unstable at that specific frequency.
Apart from whether the Nyquist diagram encircles the -1 point, there are other properties of the Nyquist diagram that are important for analysing the robustness of the system: the gain margin, the phase margin and the peak of the sensitivity function. The gain margin is the amount of change in the open-loop gain required to make the closed-loop system unstable, i.e. the distance between the -1 point and the point where the Nyquist diagram crosses the real axis. The phase margin is the amount of change in the open-loop phase required to make the closed-loop system unstable, where phase is strongly related to the time delay of the system.
In control theory, the sensitivity function is a very important concept and is given by $$S = (1 + L(s))^{-1}$$. The properties of the sensitivity function can be obtained by looking at the Nyquist diagram of the system. Essentially, the closer the Nyquist plot of L(jω) is to the point -1, the less robust the system will be when the loop is closed. If there is uncertainty in parameters such that the process or controller gain is larger than what we modelled, the Nyquist diagram will magnify. Therefore, the closer the Nyquist diagram is to the -1 point, the less it can magnify before it crosses the -1 point, leading to instability. Moreover, the Nyquist diagram also contains phase information: time delay causes the Nyquist diagram to rotate, which can bring parts of the diagram closer to the -1 point.
The peak of the sensitivity function is the reciprocal of the minimum distance from the -1 point to the Nyquist plot. Therefore, when looking at the sensitivity function, the magnitude of the peak gives us information about robustness. Remember that $$S = (1 + L(s))^{-1}$$, so the higher S is, the closer L is to -1 at that particular frequency, and the more likely the system is to become unstable. Apart from the peak of the sensitivity function, it is also good practice to look at its frequency content. Ideally we want the sensitivity function to have low magnitude at low frequencies, as this is closely associated with reference tracking and disturbance rejection.
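The relationship between the Nyquist plot and the sensitivity peak can be sketched numerically for the activation open loop derived above; the parameter and equilibrium values are hypothetical placeholders:

```python
import numpy as np

# Hypothetical parameters and linearisation point (activation case,
# loop broken at theta_1, as derived above)
gamma_p, eta, theta_2 = 0.04, 0.01, 0.02
z1, z2 = 10.0, 1.0
theta_1 = 0.02                       # feedback gain that closes the loop

A = np.array([[-gamma_p, 0.0, 0.0],
              [0.0, -eta * z2, -eta * z1],
              [theta_2, -eta * z2, -eta * z1]])
B = np.array([[-1.0], [0.0], [0.0]])
C = np.array([[0.0, 1.0, 0.0]])

# Loop gain L(j*omega) = theta_1 * C (j*omega*I - A)^{-1} B on a frequency grid
omega = np.logspace(-4, 1, 2000)
L = np.array([theta_1 * (C @ np.linalg.inv(1j * w * np.eye(3) - A) @ B)[0, 0]
              for w in omega])

# Peak of the sensitivity function = 1 / (minimum distance from L(j*omega) to -1)
dist = np.abs(1.0 + L)
S_peak = 1.0 / dist.min()
print("sensitivity peak:", 20 * np.log10(S_peak), "dB")

# Closing the loop (u = -theta_1 * y) should recover a stable closed-loop system
A_cl = A - theta_1 * (B @ C)
print("closed-loop poles:", np.linalg.eigvals(A_cl))
```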
The interpretation of the gain margin, phase margin and peak of the sensitivity function is shown in the following diagram.
As the antithetic circuit acts as an integral controller, it is important to look at the integrator of the system. The integrator accumulates the error signal of the system, which in this case is governed by the difference between z1 and z2. In order to visualise the integrator, new states have to be created.
Including the integrator as a state, where $$q_1 = z_1 - z_2$$ and $$q_2 = z_1 + z_2$$, we get:
$$\dot x = \theta_1 \frac{q_1 + q_2}{2} - \gamma_p x$$
$$\dot q_1 = \mu_1 - \theta_2 x + \gamma_c (z_2 - z_1)$$
$$\dot q_2 = \mu_1 + \theta_2 x - 2\eta z_1 z_2 - \gamma_c(z_1 + z_2)$$
Substituting:
$$ z_1 = \frac{q_1 + q_2}{2}$$
$$ z_2 = \frac{q_2 - q_1}{2}$$
We obtain:
$$\dot x = \theta_1 \frac{q_1 + q_2}{2} - \gamma_p x$$
$$\dot q_1 = \mu_1 - \theta_2 x - \gamma_c q_1$$
$$\dot q_2 = \mu_1 + \theta_2 x - \frac{\eta}{2} (q_{2}^{2} - q_{1}^{2}) - \gamma_c q_2$$
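This change of variables can be verified numerically: simulating the original (x, z1, z2) system and the (x, q1, q2) system from matching initial conditions, the combination z1 − z2 should track the integrator state q1 at all times. Parameter values below are hypothetical:

```python
import numpy as np
from scipy.integrate import odeint

# Hypothetical parameters (activation case)
gamma_p, gamma_c, eta = 0.04, 0.0, 0.01
theta_1, theta_2, mu_1 = 0.02, 0.02, 0.1

def original(s, t):
    x, z1, z2 = s
    return [theta_1 * z1 - gamma_p * x,
            mu_1 - eta * z1 * z2 - gamma_c * z1,
            theta_2 * x - eta * z1 * z2 - gamma_c * z2]

def integrator_coords(s, t):
    # q1 = z1 - z2 (the integrator state), q2 = z1 + z2
    x, q1, q2 = s
    return [theta_1 * (q1 + q2) / 2 - gamma_p * x,
            mu_1 - theta_2 * x - gamma_c * q1,
            mu_1 + theta_2 * x - eta / 2 * (q2**2 - q1**2) - gamma_c * q2]

t = np.linspace(0, 5000, 1001)
z0 = [1.0, 2.0, 0.5]
a = odeint(original, z0, t)
b = odeint(integrator_coords, [z0[0], z0[1] - z0[2], z0[1] + z0[2]], t)

# z1 - z2 from the original coordinates should match the integrator state q1
err = np.max(np.abs((a[:, 1] - a[:, 2]) - b[:, 1]))
print(err)
```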
The response of the integrator to small perturbations of varying magnitude is shown in Figure 32 for the non-degradation case and Figure 33 for the degradation case.
For the repression case, the system of ODEs visualising the integrator becomes:
$$\dot x = \frac{\alpha}{\theta_1 \frac{q_1 + q_2}{2} + 1} - \gamma_p x$$
$$\dot q_1 = \mu_1 - \frac{\mu_2}{\theta_2 x + 1} - \gamma_c q_1$$
$$\dot q_2 = \mu_1 + \frac{\mu_2}{\theta_2 x + 1} - \frac{\eta}{2}(q_{2}^{2}-q_{1}^{2}) - \gamma_c q_2$$
where q1 is the integrator.
Writing it in state-space representation after linearisation about the equilibrium, we obtain:
$$\begin{bmatrix}
\dot \delta x \\
\dot \delta q_1 \\
\dot \delta q_2
\end{bmatrix}
=
\begin{bmatrix}
-\gamma_p & -\frac{\alpha \theta_1 / 2}{(\theta_1 \frac{q_1+q_2}{2} + 1)^2} & -\frac{\alpha \theta_1 / 2}{(\theta_1 \frac{q_1+q_2}{2} + 1)^2} \\
\frac{\mu_2 \theta_2}{(\theta_2 x + 1)^2} & -\gamma_c & 0 \\
-\frac{\mu_2 \theta_2}{(\theta_2 x + 1)^2} & \eta q_1 & -\eta q_2 - \gamma_c
\end{bmatrix}
\begin{bmatrix}
\delta x \\
\delta q_1 \\
\delta q_2
\end{bmatrix}$$
evaluated at the equilibrium values of x, q1 and q2.
The response of the integrator to small perturbations of varying magnitude is shown in Figure 36 for the non-degradation case and Figure 37 for the degradation case.
As seen above, we analyse the system through linearisation. Analysing the response of the linearised system is only justified if the dynamics remain close to the equilibrium point about which we linearised. Here we show how different the responses of the linearised and non-linearised systems can be if the disturbance is set too high, pushing the system away from the equilibrium point.
Let us first look at the response of the system when the disturbance is d = 0.00375 for both the linearised and the non-linearised system. We can see that the responses are very similar, and thus analysing the system response with the linearised system is justified.
From this, we can see how linearisation makes calculations easier and allows us to apply the control theory toolbox to analyse the stability of the system about the linearisation point. However, we should bear in mind that if the disturbance is big enough to push the system away from the equilibrium, analysing the response of the linear system is no longer valid, as its behaviour may deviate substantially from that of the non-linear system.
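Such a comparison can be sketched as follows: the non-linear activation system is started at its equilibrium and subjected to the constant disturbance d = 0.00375, and its response is compared with that of the system linearised at the same point. Parameter values are hypothetical:

```python
import numpy as np
from scipy.integrate import odeint

# Hypothetical parameters; equilibrium of the activation system (gamma_c = 0)
gamma_p, eta, theta_1, theta_2, mu_1 = 0.04, 0.01, 0.02, 0.02, 0.1
x_eq = mu_1 / theta_2
z1_eq = gamma_p * x_eq / theta_1
z2_eq = mu_1 / (eta * z1_eq)

def nonlinear(s, t, d):
    x, z1, z2 = s
    return [theta_1 * z1 - gamma_p * x + d,
            mu_1 - eta * z1 * z2,
            theta_2 * x - eta * z1 * z2]

# Jacobian at the equilibrium; the disturbance enters on the x equation
A = np.array([[-gamma_p, theta_1, 0.0],
              [0.0, -eta * z2_eq, -eta * z1_eq],
              [theta_2, -eta * z2_eq, -eta * z1_eq]])

def linearised(ds, t, d):
    return A @ ds + np.array([d, 0.0, 0.0])

t = np.linspace(0, 3000, 601)
d = 0.00375
non = odeint(nonlinear, [x_eq, z1_eq, z2_eq], t, args=(d,))
lin = odeint(linearised, [0.0, 0.0, 0.0], t, args=(d,))

# Gap between the x-deviations predicted by the two models (small disturbance)
gap = np.max(np.abs((non[:, 0] - x_eq) - lin[:, 0]))
print(gap)
```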
It is also important to note that throughout this control-theoretic analysis we have used only 3 ODEs to describe the system, nesting various dynamics into single variables. In real biological systems there are multiple intermediate steps; for instance, the parameter θ1 represents the action of z1 on the promoter, which affects the transcription rate, but the strength of the RBS, and thus the translation rate, also plays a role. To make our model more realistic we could add extra ODEs to describe these dynamics, but this would make the model more complicated and more difficult to analyse.
In this part, we have discussed one of the recent advances in synthetic biology: using an antithetic integral controller to reduce steady-state error. We have applied the classical control theory toolbox to design robust genetic circuits, making them more reliable in the presence of noise and less sensitive to variability. Over the past years, genetic-circuit-based controllers have been evolving, from the single transcription unit level (e.g. autoregulatory feedback) to the single-cell level (e.g. the incoherent feedforward loop and the antithetic integral controller) to the control of single or multiple cell populations. Recently, in-silico methods are being investigated to build more sophisticated controllers. With such evolution over the past years, we are hopeful that control theory will be applied even more widely in synthetic biology in the future.
In order to find the parameters of the model, we searched the relevant literature. We first referred to Aoki et al. (2019) for the values used in their model. μ is a collective parameter that includes promoter activity and RBS strength, and since we only know the relative strengths of the promoters and RBSs, we found it difficult to find appropriate values to put into our model. For θ, not only the transcription and translation rates matter; the ability of the species to bind to the promoter also matters, for which we require the Kd of the system. We were able to find the translation rates of the RBSs B0032m (medium) and B0034 (strong) on the iGEM website, but not for B0033. In terms of promoters, promoter strengths are always relative or given in fluorescence units. Without knowing the conditions under which a promoter was characterised, we are unable to convert promoter activity into a transcription rate. Despite this, we came across a paper by Kannan et al. (2018) that provides insight into converting fluorescence into transcription rates. However, to do this conversion, the translation rate, biomass, time-series fluorescence data, maturation time of the fluorescent protein and degradation rates of all the species have to be known. Due to time constraints, we were unable to do this characterisation ourselves.
We also attempted to use the Promoter Calculator from the Salis Lab to find the strengths of the inducible promoters without the activator or repressor bound. In the paper by Rhodius et al. (2013), fold changes in fluorescence under sigma factor binding are characterised. Similarly, for species X, the paper by Meyer et al. (2019) clearly characterises the Kd and relative promoter units of the promoters with and without activators and repressors. We tried to use the fold change to get the promoter strength when the activator/repressor is bound, and then compared the results to the well-characterised Anderson promoters to obtain relative strengths for the promoters we are using. However, this method yielded counter-intuitive results, in which a promoter activated by an activator becomes far stronger than a promoter with no repressor bound.
These attempts show how difficult it is to put parameter values into a model, especially if the part itself is not well characterised. We believe that in the future, given enough time, we could characterise the parts ourselves and compare them with well-characterised promoters to obtain a more reasonable range of parameters for the specific promoters we are using.