MODELLING KINETICS OF HALOCARBON DEGRADATION

The conventional degradation of halocarbons consists of two distinct steps:

Conversion of halocarbons to an intermediate via Cytochrome P450CAM under anaerobic conditions.
Subsequent conversion of that intermediate to oxidised products via the action of Toluene Dioxygenase under aerobic conditions.

To further our understanding of microbial halocarbon degradation pathways, a kinetic enzyme model based on a system of ordinary differential equations was built. With the help of this model, the kinetics of the system were studied, the time scale of the enzymatic degradation steps was estimated, and the number of bacteria in the system was modelled as a function of time. A conceptual understanding of the kinetics of microbial halocarbon degradation played a critical role in experimental design to engineer the chassis (Pseudomonas putida), and to determine requirements for the design of the bioreactor.

INTRODUCTION

In our project, Halocleen, we aim to degrade halocarbons into environment-friendly products which can be consumed by bacteria. The conventional process of degradation was briefly described in the previous section; however, the overall process is very complex, and therefore a simplified model was developed.

Bacterial species like Pseudomonas putida and various Methylotrophs have been found to degrade halocarbons naturally or when engineered synthetically. However, neither the kinetics nor the mechanism of the degradation is fully known.

Therefore, in order to fully understand the mechanism of degradation and the conversion pathway, a kinetic model was developed. kinetic models describe, in general, how states and properties of a system vary with time and provide excellent simplified estimates of the complex processes occurring in the system. Some aspects of the bacterial system, such as the decay rate of the bacteria and the time required for oxygen concentration to drop to 0, could not be accurately estimated in a limited time frame, so the model was used to make predictions regarding these. Thereby, the assaying techniques and bioreactor models were optimised.

GENERAL SCHEMATIC OF ENZYME SYSTEM

The general schematic of the enzyme model involves two enzymes working in sync, Cytochrome P450CAM, and Toluene Dioxygenase. The Halocarbon substrate is degraded under anaerobic conditions by Cytochrome P450CAM to give an intermediate which is degraded under aerobic conditions by Toluene Dioxygenase. The organic products obtained are consumed via bacterial metabolic processes while F-/Cl- ions are released by the bacteria. Cytochrome P450CAM under aerobic conditions, however, gives a side product which cannot be further degraded by the bacteria. The plasmid with the genes coding for Cytochrome P450CAM has a hypoxic promoter. In the presence of oxygen, oxygen binds to the promoter. This prevents the transcription of the plasmid, thereby preventing the formation of the enzyme.

APPROACH

To build a model for Halocarbon degradation, enzyme mechanisms, bacterial growth, and toxicity were studied, and published experimental data was gathered. The system was mathematically described via a system of differential equations based on this knowledge. These describe the change in the substrate, the intermediate, the product, and the side product concentrations and the number of bacteria as a function of time. Subsequently, numerous simulations were performed to obtain the model, which minimised the difference between experimentally obtained data and theoretical simulations. Thereby, the kinetics of Halocarbon degradation were studied, and the most influential parameters affecting Halocarbon degradation were identified to provide a time scale for the experimental setup.

A BASIS FOR OUR MODEL

The skeleton of the model, including side products and intermediates, is based on the hypothesis and data obtained by Dr Lawrence Wackett et al.[1][2][3] Data regarding the efficiencies of each of the enzymes under varying concentrations of oxygen were used to develop the kinetics of the model.[4]

Various parameters in the model were determined using published data regarding the bacterial metabolism of halocarbons[1][3] and the Brenda software cum enzyme database. We also looked at certain models developed by previous iGEM Teams, especially that by Team Wageningen 2021.[11]

SKELETON OF OUR MODEL

The variation in concentration of various species in the system with time was modelled using a system of differential equations. The Michaelis Menten rate equation was used to describe the various enzymatic conversions. It involves the following assumptions:

First order with respect to all species involved
Steady-state approximation for Enzyme Substrate Complex
Formation of a single Enzyme Substrate Complex
Ein = E + ES

These assumptions hold for this system.[1][2][3] Additionally, the following assumptions are made:

That there is a very small delay in the consumption of the intermediate after its formation. Hence, the conversion is treated as a two-step series enzyme reaction.
That the rate of formation and the rate of degradation is the same for the enzymes. Hence, the concentration of enzyme per bacteria remains constant.

The total concentration of enzymes in the system, however, is a function of the number of bacteria.

$$E_0 =N \cdot \epsilon_0$$

Where:

$E_0$ is the concentration of enzyme overall
$\epsilon_0$ is the concentration of enzyme in one bacterium
$N$ is the number of bacteria as a function of time.

It has been assumed that the number of bacteria follows a logistic growth curve, and an additional term was added to account for the death of the bacteria due to possible accumulation of species like $ \ce{F-}/\ce{Cl-} $ ions or the reactant, intermediate, side product or the product.

$$\frac{d[N]}{dt}=rN(N_0-N)-f(P,S)N$$

Where:

$N$ is the number of bacteria as a function of time.
$N_0$ is the initial number of bacteria.
$r$ is a constant describing the rate of bacterial growth.
$f(P,S)$ is a factor which accounts for overall toxicity due to various species in the medium.
$P$ refers to product concentration
$S$ refers to substrate concentration.

It is assumed the toxicity increases linearly with the concentration of the compounds; hence, $f(P,S) = \lambda P + \lambda 'S$. Pseudomonas putida can survive in anaerobic conditions for limited durations. Hence the effect of $ \ce{[O2]} $ can be neglected.

Diffusion of HCFCs:

Henry’s law is used in order to describe the partial pressure of the halocarbons. Since halocarbons are volatile substances, they exist partially in liquid form and partially in gaseous form. Hence, it is important to model the partial pressure for the assay performed, which involved a liquid cell culture and a stock solution of the halocarbons. The bioreactor design, however, is based on a solid-state bioreactor model, and hence, does not require this part of the model for its analysis. Hence we have the following:

$$\frac{S_{gas}}{S_{liq}}=K_H$$

The time delay due to the diffusion of HCFCs throughout the liquid is neglected. Hence, it is considered to be homogeneously distributed. This is enforced through the use of a magnetic stirrer in all of the experiments.

The halocarbons enter the bacteria via specified proteins existing naturally for the uptake of toluene. This data was provided via the insights given by Dr LP Wackett et al who worked with this system earlier. We use the following differential equation via Fick’s law, giving:-

$$\frac{d[S]}{dt} = k_d ([S_{liq}]-[S])$$

Consumption of $ \ce{O2} $:-

Oxygen is consumed in two parts, one portion of it is consumed by the bacteria for its metabolism, and the other is used up by Toluene Dioxygenase for conversion of the intermediate to the product. It is also possible that oxygen remains in the system during the consumption of substrate by Cytochrome P450CAM. The enzyme, under aerobic conditions, uses up oxygen with the substrate to give a non-degradable side product.

We consider a factor $ \alpha $ which is a function of oxygen concentration. $ \alpha $ is the fraction of substrate which is acted upon by Cytochrome P450CAM under anaerobic conditions. As the concentration of oxygen increases, $\alpha$ decreases, and the amount of side product increases as well. An approximation of $\alpha$ is given by: \[ \alpha = \frac{1}{(1+b[\ce{O2}])} \]

where b is a constant that accounts for the minimum threshold amount of oxygen consumable by the bacteria.

THE DIFFERENTIAL EQUATIONS

$$\ce{\alpha [HCFC] + E_1 -> (E_1 - \alpha[HCFC])_{complex} -> I + E_1} \require{mhchem}$$

This is performed by Cytochrome P450CAM, under anaerobic conditions. In this:

$ \ce{\alpha[HCFC]} $: concentration of the substrate available for the anaerobic reaction.
$ \ce{E1} $: enzyme Cytochrome P450CAM

$$\ce{I + O2 + E2 -> [E2IO2]_{complex} -> E2 + P}$$

This is performed by Toluene Dioxygenase. Here, P: Final Products.

Using the steady state approximation, we get:

$$\ce{(E1-\alpha[HCFC])_{complex} = \frac{\alpha[HCFC]\cdot[E_1]_0}{K_m + \alpha[HCFC]}} \tag{1}$$

In this:

$\alpha$: fraction of substrate reacting in anaerobic conditions

$ K_m $: Michaelis-Menten constant

$ \ce{[E1]0} $ is initial concentration of Cytochrome P450CAM.

$ \ce{(E1-\alpha[HCFC])_{complex}} $: concentration of the substrate enzyme complex.

Substituting the concentration of the complex from Equation 1, we get the rate of change of concentration of HCFC due to the anaerobic action of Cytochrome P450CAM to be:

$$ \ce{\frac{d\alpha [S]}{dt} = k_1 \alpha [S][E1]0 + \frac{(k_1\alpha [S] + k_{-1})\alpha [S][E1]0}{K_m + \alpha [S]}} \tag{2}$$

Here, $S = \ce{[HCFC]}$

Using the steady state approximation, we get:

$$\ce{[E2IO2]_{complex} = \frac{(1-\alpha)[S][E1]0[O2]}{K_{m}' + \alpha [S][O2]}} \tag{3}$$

Hence, we get the following equation for the rate of formation of the product:

$$\ce{\frac{d[P]}{dt} = \frac{k_2'[I][O2][E2]_0}{K_m'+[I][O2]}} \tag{4}$$

Here, $ \ce{[E2]0} $ is the initial concentration of Toluene Dioxygenase. Hence, we finally get the rate of change of the concentration of the intermediate to be:

$$\ce{\frac{d[I]}{dt}=\frac{k_2\alpha [S][E1]0}{K_m + \alpha [S] } +\frac{k_{-1}[I][O_2][E2]0 - k_1'[I][O2][E1]0}{K_m' + [I][O2]}} \tag{5}$$

Now, to account for the remaining HCFC content:

$$\ce{(1-\alpha)[HCFC] + E1 -> (E1-\alpha[HCFC])_{complex} -> P_{side} + E1}$$

This is performed by Cytochrome P450CAM under aerobic conditions. Here $ \ce{P_{side}} $ is the undegradable side product formed.

Using the steady state approximation, we get:

$$\ce{[E1SO2] = \frac{[SO2][E1]0[O2]}{K_m'+\alpha [S][O2]}} \tag{6}$$

Here, $ [\ce{SO2] = (1-\alpha)[S]} $

Similar to the procedure for determining the rate of change of HCFC concentration by anaerobic action, we can do so for aerobic action of Cytochrome P450CAM, using Equation 6 for the concentration of the complex.

Finally, we write the rate of change of concentration of $ \ce{O2} $ as follows.

$$\ce{\frac{d[O2]}{dt}=k_{-1}"[E1SO2]-k_1"[S][O2][E1]0 + k_1 "[S][O2][E1SO2] + k_1'[E2][O2]-k_1'[E2]0[I][O2]-\mu N}$$

Here, $\mu$ is a constant that quantifies the metabolic oxygen consumption by the bacteria. This is treated as a zero-order term relative to the oxygen concentration.

Note that here S is the concentration of substrate in the bacteria. This has to be related to the concentration of substrate in the medium and in the gas. To relate the substrate concentration in the bacteria with the concentration of substrate in the medium, we use the diffusion equation through the porin and to relate the concentration of the substrate in the medium with the gas, we use Henry’s law equation.

$$K_m = \frac{k_2 + k_{-1}}{k_1}$$

For each enzyme-substrate:

$$\ce{K_H\frac{d[S_{liq}]}{dt}=\frac{d[S_{gas}]}{dt}}$$

$$\ce{\frac{d[S]}{dt} = k_d([S_{liq}]-[S])}$$

FINAL DIFFERENTIAL EQUATIONS

$$\ce{K_H\frac{d[S_{liq}]}{dt}=\frac{d[S_{gas}]}{dt}}$$

$$\ce{\frac{d[S]}{dt} = k_d([S_{liq}]-[S])}$$

$$\ce{\frac{d[O2]}{dt}=k_{-1}"[E1SO2]-k_1"[S][O2][E1]0 + k_1 "[S][O2][E1SO2] + k_1'[E2][O2]-k_1'[E2]0[I][O2]-\mu N}$$

$$\ce{\frac{d[P]}{dt} = \frac{k_2'[I][O2][E2]_0}{K_m'+[I][O2]}}$$

$$\ce{\frac{d[I]}{dt}=\frac{k_2\alpha [S][E1]0}{K_m + \alpha [S] } +\frac{k_{-1}[I][O_2][E2]0 - k_1'[I][O2][E1]0}{K_m' + [I][O2]}}$$

$$ \frac{d\alpha [S]}{dt} = k_1 \alpha [S][E_1]_0 + \frac{(k_1\alpha [S]+k_{-1})\alpha [S][E_1]_0}{K_m + \alpha [S]}$$

Results

Graphs

Please scroll the table if you cannot see the entire graph.


Oxygen concentration as a function of time The amount of oxygen in the system falls off approximately linearly with time. According to this, it can be expected that after 1hr of oxygen insertion, the oxygen should fall off to acceptable levels. This result helps in the switching between hypoxic and aerobic phases.	Substrate concentration as a function of time The concentration of the substrate decays exponentially which fits the results by Wackett et al.[1]

Concentration of Intermediate as a function of time The concentration of the Intermediate is predicted to increase exponentially time. Here, however, it is approximately linear	Concentration of Product as a function of time The concentration of the Product increases exponentially with time, as predicted.

Number of bacteria as a function of time The number of bacteria (N) increases logistically and then decays due to the product toxicity factor. Thus, a culture with an OD of 0.7 would be expected to stay alive for nearly 2 hours. This set the time scale for the assays. This graph is consistent with the experimental data which indicates that the bacteria die off in a span of 2hrs.	Compilation of all the graphs Shows the results of all the graphs in a collated format. The graph helps in inferring the relevent quantities simultaneously. The legend from the graph is given below.

Graphs are plotted using MATLAB. Source code for the plots is available here:

Codes

Graph Parameter	Legend
Oxygen concentration	O
Concentration of Substrate	S
Concentration of Intermediate	I
Concentration of Product	P
Number of bacteria	N

Values of Parameters

It was not possible to obtain all parameters for our model via a literature survey; therefore, numerous approximations were made, and grid searches were used in order to obtain the orders of magnitude and the values of each parameter. In this process, the sensitivity of the system to the values of each parameter was also obtained.

Parameter	Use	Value (in correct units)	References
b	Oxygen threshold constant	10	Obtained via grid search on matlab
$k_d$	Diffusion coefficient	0.8	Obtianed via grid search on matlab
$K_H$	Henry's constant	0.0001	( Page 4680-4700 of [12] )
$N_0$	Initial no of bacteria	10^8	OD off 0.7 is to be used everywhere, it can be approximated to 10^8
$[E_1]_0$	Cam conc in bacteria	10	Otained via grid searches on Matlab and using [13]
$[E_2]_0$	Tod conc in bacteria	10	Obtianed via grid search on matlab
r	Growth constant for bacteria	0.00002	Using extrapolation of graphs in [14]
$\lambda$	Constant for Product toxicity	90000	Obtianed via grid search on matlab
$\lambda'$	Constant for substrate toxicity	90000	Obtianed via grid search on matlab
$\mu$	Constant for bacterial oxygen metabolism	0.006	Obtained via extrapolation of graphs in [14]
$k_1,k_1,k_m,k_2$	Enzyme constants for cam in aerobic conditions	0.001,0.001,0.001,0.001	Obtained using Brenda enzyme database [13]
$k_{1d},k_{-1d},k_{md},k_{2d}$	Enzyme constants for cam in anaerobic conditions	0.002,0.001,0.001,0.02	Obtained using Brenda enzyme database [13]
$k_{1e},k_{-1e},k_{me}$	Enzyme constants for tod	0.001,0.001,0.001	Using extrapolation of graphs in [3]

MOLECULAR DOCKING AND DYNAMICAL SIMULATIONS

INTRODUCTION

Molecular Docking is a powerful tool used to study and make predictions regarding ligand-protein binding, the pose and site, and the relative strength of binding. A molecular docking software performs a search based on an algorithm that evaluates the system's energy due to electrostatic and Van der Waals interactions for different ligand conformations until it is minimised. Based on the predicted pose, an affinity score is calculated. Given a list of docking candidates, this score can be used to rank them. It is used extensively in drug design applications to screen potential drug candidates.

Due to the varying needs based on the variety of available ligands and receptors, several different software packages have been developed. Hence, a crucial part of the docking process is understanding the capabilities of different available software, their options, and deciding the protocol to use.

Molecular Docking, however, has a broader scope than just predicting binding poses and scoring ligands; nowadays, many software packages are capable of predicting binding poses while accounting for side-chain flexibility, predicting active sites, and also determining binding and docking positions in the presence of water molecules. This simulates the natural interactions better and provides better predictive power. Some of the capabilities have been used to understand the enzymes' behaviour and generate qualitative and quantitative predictions crucial for this project.

A thorough literature review was conducted to understand the features offered by the various docking software, including Autodock 4, Autodock Vina, GOLD, Flexx, and Glide, amongst others. Based on the current literature[15], the Autodock suite of tools was found to be the most suitable due to the high accuracy of pose prediction and better accessibility and reproducibility because it is freely available. Autodock is also one of the most highly cited docking software applications and is used by Professor Nagasuma Chandra's lab, which also helped with understanding the different applications and benefits of Autodock.

The protocol for different experiments for the enzyme was decided based on several considerations with reference to published and cited protocols. The protocol broadly follows [15], with modifications made when necessary. The protocols relevant to the experiments conducted are given below.

Protocol

In case the pdf does not render, please click here:

Protocol

Brief Procedure:

The enzyme database entries were found on the RSCB and Uniprot databases and cross-verified using BLAST search. Based on parameters such as resolution and presence of co-crystallised ligands, a suitable PDB file was selected with the best possible resolution and the minimal number of ligands co-crystallised (with necessary cofactors, redox factor, supporting ions and natural ligands).

Suitable ligands were identified based on common Halocarbons used in the refrigerant industry, and availability for experimentation were selected. The SDF files were obtained from PubChem. The files were converted to PDB format using Pymol. In addition, Camphor and Toluene were also selected as a ligand for control and comparison.

Ligand and Receptor coordinate preparation was carried out as listed in the protocol (2.1). Note that the Fe atom in Heme was assigned a Partial charge of +3 based on the literature on Cytochrome P450 enzymes and their mechanism. Similarly, in Toluene Dioxygenase, the Fe atoms (both in ferredoxin and ions) were assigned a charge of +2 and the S atoms a partial charge of -2. (refer to Protocol 2.1 Step 8)

Active sites were obtained from the literature [17] and were visualised in visualisation software. The list of possible active sites for Reductive Dehalogenation reactions in Cytochrome P450cam includes F87, Y96, F98, T101, T185, L244, V247, V295, D297, I395, and V396.
Active sites were then predicted from AutoLigand (Protocol 2.3). Docking was carried out (Protocol 2.2) with Camphor and Toluene, and the docked and crystallographic positions were visualised and compared along with the active site and active site predictions. The comparison is used to verify the protocol.
Following verification, docking was carried out for all the other ligands of interest. (Note: Hydrated dockings were avoided as the protein and ligand interact via hydrophobic interactions)
All the obtained results were visualised on a suitable visualisation software the results were tabulated.

Results:

Docking with Cytochrome P450cam (Camphor-5-monooxygenase)

The results obtained from docking the various ligands are presented below in a tabular format. Docking with camphor is also shown as a verification of the protocol.

Please scroll the table to the right to view the entire table.

Image of the enzyme Cytochrome P450cam →	Video Displaying Docking with Camphor and Active Site Pocket

Affinity of Camphor = -7.0 kcal/mol

Table with Docking Results

Please scroll the table to the right to view the entire table.

Ligand	Classification	Afffinity	Predicted Product
R-22	HCFC	-3.2	$ \ce{CO2} $ through $ \ce{COF2} $ intermediate
R-12	CFC	-3.7	Unknown
R-123	CFC	-4.7
R-110	Halocarbon	-5.0
R-112a	CFC	-5.2
R-113	CFC	-5.3
R-141b	HCFC	-3.8	Acetic acid (under aerobic conditions)
R-134a	HFC	-4.0	Trifluoroacetic acid (under aerobic conditions), and 1,1-difluoroethane (under anaerobic conditions)
R-124	HCFC	-4.7
R-142b	HCFC	-3.7	Unknown
TCE	Halocarbon	-3.8	Trichloroethanol and Trichloroacetic acid

Observations: We observe that CFC's and CFC like halocarbons have highest negative affinity. Meanwhile HCFC's have lower affinity and HFC's are even lower. The lower affinity for HFCs and Methane based halocarbons may explain their reluctance to undergo the enzymatic reaction.

Docking with Toluene Dioxygenase (Benzene 1,2-dioxygenase)

Please scroll to the right to view the entire table.

Image of the Enzyme Toluene Dioxygenase	Image displaying Docking with Toluene and Active Site Pocket

Affinity of Toluene = -5.7 kcal/mol

Table with Docking Results

Please scroll to the right to view the entire table.

Initial Substrate	Afffinity	Predicted Product
R-110	-4.5	Glyoxylic acid/Oxalic Acid
R-112a	-4.6	Glyoxylic acid/Oxalic Acid
R-113	-4.6	Glyoxylic acid/Oxalic Acid
R-123	-4.0	Glyoxylic acid/Oxalic Acid
R-124	-4.0	Glyoxylic acid/Oxalic Acid
R-134a	-3.5	Glycolic acid
TCE	-3.9	Glyoxylic acid/Oxalic Acid

Observations We observe that all the end-products are as expected and can be utilised in metabolism of the bacteria.

Inferences:

The docking images show that the predicted active site matches the active site pocket in the literature. In addition, the binding site of camphor and our halocarbon substrates is the same. An essential aspect of this binding site is allowing interaction with the heme group. The reversible reduction and oxidation of the iron atom in the Heme from Fe(III) to Fe(II) is essential for the proposed mechanism, as elaborated on the design page.

Similarly, the dockings and active site for Toluene Dioxygenase are closely matched with that for the native ligand, Toluene. In addition, the docked positions are in close proximity with the $ \ce{Fe^2+} $ ion, which reversibly oxidises to $ \ce{Fe^3+} $ thereby enabling the reactions to occur as per the proposed mechanisms. Hence, the docking experiments were essential in providing critical insights into our mechanism and helped provide credibility and better predictions for the reaction products.

The affinity results also help us interpret the feasibility of our proposed reactions which are essential for proof of concept. Literature provides evidence for reductive dehalogenation reactions for substrates like Hexachloroethane and a few CFCs. The affinities are reflective of the binding energies of the ligand and protein in the active site.

More negative values indicate a greater strength of binding; hence, the affinities are indicative of the feasibility of the reaction. R-110, i.e., Hexachloroethane, has a binding affinity of -5 kcal/mol, while some of the other refrigerants we are interested in degrading (CFCs and HCFCs) have a range from 4 kcal/mol to 5kcal/mol. This indicates that our proposed reactions are suitable from an energetic basis.

Similarly, for Toluene Dioxygenase, TCE (trichloroethylene) has been shown to degrade experimentally. The binding affinity for the same was found to be -3.9 kcal/mol from docking experiments. Most of the other substrates for the enzyme (after Cytochrome P450cam) have similar, if not higher binding affinities indicating a high likelihood of degradation by the enzyme.

The docking experiments proved to be an engineering success by taking inputs from the WetLab team and providing critical information for the proposed reactions and substrates.

Taking our Docking Results Further

One of the challenges faced in enzyme kinetic modelling was the unavailability of specific parameters since the reactions are not well studied. One critical parameter was the Dissociation constant, $ k_d $, which describes how well the ligand binds to the enzyme. An important result from statistical modelling of the ligand-receptor system gives us the equation:

$$ p_{bound} = \frac{c/k_d}{1+c/k_d} \tag{1} $$ $$ k_d = c_0e^{\beta \Delta E} \tag{2}$$ $$\Delta E = E_{binding} – E_{solvation} $$

Now, from the Molecular Docking results, we obtain the Binding energy $ E_{binding} $, and hence we require the solvation energy in order to estimate the Dissociation constant. For obtaining the value of $ E_{solvation} $, we have modelled the dynamics of our substrates in water using Molecular Dynamics simulations.

A Physical Interlude[18]

Here a brief derivation of Equations 1 and 2 will be shown. Professor Sumantra Sarkar, Department of Physics, IISc, has been instrumental in guiding us through modelling the Ligand-Receptor model.

Consider the system with a fixed temperature $ T $ and let the energy fluctuate about an average energy $ <E> $. Under these assumptions, we can apply the Boltzmann Distribution for probabilities of a microstate with energy $ E_i $ as

$$ P(E_i) = \frac{exp \left(- \frac{E_i}{k_BT} \right)}{\sum_i exp \left(- \frac{E_i}{k_BT} \right)} $$

Considering our system of multiple ligands and a receptor, we have two possible states: ligand-bound and ligand-unbound. Consider the system's energy in the bound state to be $ E_1 $ and in the unbound state to be $ E_2 $. Hence we have

$$ p_{bound} = \frac{e^{-\beta E_1}}{ e^{-\beta E_1} + e^{-\beta E_2}} = \frac{e^{-\beta (E_1 – E_2)}}{1+ e^{-\beta (E_1 – E_2)}} $$

Here $ \beta = 1/k_BT $

Now consider the system where there are L ligands and N possible states a ligand can occupy in the solution. In the unbound case, all the ligands are in the solution, and that gives a multiplicity of $ m_{ub} = \frac{N!}{(N-L)!L!} $, and the energy is $ LE_{sol} $. When one of the ligands is bound to the receptor, the number of ligands in the solution becomes $ N-1 $. This gives a multiplicity of of $ m_b = \frac{N!}{(N-L+1)!(L-1)!} $, and the energy is $ (L-1)E_{sol} + E_{bind} $. Applying the Boltzmann formulation along with the multiplicities gives us:

$$\Delta E = (L-1)E_{sol} + E_{b} – LE_{sol} = E_{sol} – E_{b} $$

$$ p_{bound} = \frac{m_be^{-\beta \Delta E}}{m_{ub} + m_b be^{-\beta \Delta E}} $$

$$p_{bound} = \frac{\frac{L}{N-L+1}e^{-\beta \Delta E}}{1 + \frac{L}{N-L+1}e^{-\beta \Delta E}} $$

As $ N>>L, N-L+1 \approx N $. Hence, this gives us

$$ p_{bound} = \frac{\frac{L}{N}e^{-\beta \Delta E}}{1 + \frac{L}{N}e^{-\beta \Delta E}} $$

Now, assume that the ligand has some volume $ V_0 $ and the total volume of the container here is $ V $. Then, $ \frac{L}{N} = \frac{L}{NV_0 \times c_0} $ where $ c_0 = 1/V_0 $ is the concentration of 1 molecule per ligand volume. Also, $ L/NV_0 = L/V = c $ where $ c $ is the concentration of the enzyme. Hence, this would give us $ \frac{L}{N} = \frac{c}{c_0} $ and

$$p_{bound} = \frac{c/k_d}{1+c/k_d}$$ $$k_d = c_0 e^{\beta \Delta E}$$

Hence, we have derived the equations we are going to use here.

Molecular Dynamics Simulation

The solvation energies for our molecules of interest were unavailable on existing databases; hence, prediction by simulation was necessary. Out of several methods employed in the literature, Molecular Dynamics simulations are commonly used with accurate predictions.

To perform the simulations, we used the package GROMACS[19], along with the tutorials, which were helpful in performing the simulations. It uses Free Energy Perturbations (or alchemical free energy computation)[20] to evaluate the difference in free energy between two states. In order to perform the simulations, the Jupyter notebook was followed with the GROMACS configuration file and the topology file obtained from the Virtual Chemistry database[21]. The topology files used Generalized Amber Force Field (GAFF) and the Optimised Potential for Liquid Simulations for All Atoms (OPLS/AA). GAFF, OPLS/AA along with SPCE(a water model) yield good results as is well characterised in literature.

In order to perform successful molecular dynamics simulations in water, the topol.top files need to be edited. The files available from online sources or those obtained by using 'pdb2pqr'. In particular, the following changes are neccessary for the topol files to be compatible with solvation experiments: necessary:

Inclusion of the following code block at the top of the file

;#define _FF_OPLS
                #define _FF_OPLSAA
                  #include "oplsaa.ff/forcefield.itp"

Commenting out the [ defaults ] section
Inclusion of the following code block at the end of the document above the [ system ] section.

                ; Include SPC water topology
                  #include "oplsaa.ff/spc.itp"

In addition, file name changes must be made in the Jupyter notebook depending on the naming of the files. The topology file should be saved as topol.top.

The molecular dynamics simulation was run for Hexachloroethane and Camphor. Running the computation for all the ligands was prohibitive because of the high runtime and computational power required. Based on the simulations and docking with Cytochrome P450cam, the results obtained are tabulated below:

Ligand	Solvation Energy (kcal/mol)	Binding Affinity (kcal/mol)	$ k_B T $ at 300K (kcal/mol)	$ c_0 \; (M) $	$ K_d \; \mu m^{-1} $	Literature value ( $ \mu M^{-1} $ )[17]
Camphor	1.9	-7	0.593	1	0.3	0.24
Hexachloroethane (R110)	3.0	-5	0.593	1	1.38	1.09

Our predictions based on molecular simulation methods give us a close match, hence verifying the capability of the methods and protocols. The simulations require intensive computational power. For better predictions, the program can be run with higher number of steps which would give more consistent results. Experiments run may yield slightly different results due to the fact that the calculations are perturbative and the approximations of statistical mechanics may not hold in every case. Regardless, the method helps in contributing not only to WetLab, but can also help in providing better constants needed for future implementations.

Active Side modification

The active site of the enzyme is the region where the substrate binds and undergoes a chemical reaction. The active site is usually a pocket in the enzyme, which is lined with amino acids that are involved in the chemical reaction. The residues around the active site define the specificity and the reactivity of the enzyme. Altering the residues in this region directly influences the interactions made with the substarte which can alter the specificty of the enzyme. The shape and size of the pocket determine what substrate the enzyme can accomodate and also determine the orientation in which the substrate binds. This in turn influences contacts with Catalytic residues, which may either increase or decrease the efficiency with which the reaction catalysis occurs.

We modelled active site modifications in-silico in order to analyse if engineering the protein can help facilitate the reactions required. We performed this analysis for Cytochome P450cam, which is involved in the first catalytic step. The active sites of the enzyme were reviewed and existing literature on active site modifications was studied. Based on that, it was shown that modification of the residues 87 (F -> W), 247 (V -> L) and 96 (Y -> F) can help in increasing the efficiency of the reaction [17]. Experimental evidence showed that F87W-V247L was the most active mutant, however, it was specualted that the Y96F mutation may also help in increasing the efficiency of the reaction further. We have carried out the three amino acid modifications and tested the change in binding affinity via Molecular Docking.

In order to carry out the modification, the sequences were edited and the structure prediction was carried out by AlphaFold2[26] and the predicted structure was used for docking of Camphor, Hexachloroethane (R-110), and HFC-134a as per the protocol described above. The results are tabulated below along with images of the enzyme and docked ligand.


Image of Overlap of Modified Protein (green) with Original Protein (pink)	Image of Overlap of Modified Protein Active Site (green) with Original Protein Active Site(pink) with labelled amino acid modifications

AlphaFold Predictions Validations

Ligand	Binding Affinity (kcal/mol) [Unmodified Enzyme]	Binding Affinity (kcal/mol) [With Active Site Modification]	$ k_d $ ($ \mu mol^{-1} $ )[Unmodified Enzyme]	$ k_d $ ($ \mu mol^{-1} $ )[With Active Site Modification]
Camphor	-7.0	-7.8	0.3	0.08
Hexachloroethane	-5.0	-5.6	1.38	0.5
HFC 134a	-4.0	-4.1	-	-

Observations: From the above table, we can observe that active site modification did indeed reduce the dissociation constant for Hexachloroethane as predicted in [17] and the activity is about 2.5 times higher as predicted. However, the same increase in Affinity is not observed for HFC-134a. With this small change, the ratio of the Dissociation constant after modification is $ e^{-\beta/10} = 1.06$ which is not a significant change.

From the observations, we can infer that the active side modification does help in the case of hexachloroethane and possibly other halocarbons, but it did not appear to work for HFC-134a, a very popular refrigerant. Hence, while active site modifications may help in increasing reactivity, properties like activation energy reduction and incorporation of stronger redox partners like Spinach ferredoxin [27]

Bioreactor and Financial Modelling

We have also modelled our implementation as a bioreactor model. In addition, to understand the impact of our model and building the model of the Bioreactor according to the local requirements, we have done financial modelling. To read more about our Bioreactor model and Financial modelling, please click here.

REFERENCES

[1] Hur, H. G., Sadowsky, M. J., & Wackett, L. P. (1994). Metabolism of chlorofluorocarbons and polybrominated compounds by pseudomonas putida g786(phg-2) via an engineered metabolic pathway. Applied and Environmental Microbiology, 60(11), 4148–4154. https://doi.org/10.1128/aem.60.11.4148-4154.1994

[2] Chaudhry, G. R., & Chapalamadugu, S. (1991). Biodegradation of halogenated organic compounds. Microbiological Reviews, 55(1), 59–79. https://doi.org/10.1128/mr.55.1.59-79.1991

[3] Zylstra, G. J., Wackett, L. P., & Gibson, D. T. (1989). Trichloroethylene degradation by escherichia coli containing the cloned pseudomonas putida F1 toluene dioxygenase genes. Applied and Environmental Microbiology, 55(12), 3162–3166. https://doi.org/10.1128/aem.55.12.3162-3166.1989

[4] Kinetics of toluene degradation by toluene--oxidizing bacteria as a function of oxygen concentration, and the effect of nitrate | FEMS Microbiology Ecology | Oxford Academic (oup.com)

[5] Guengerich, F. P. (2012). Mechanisms of cytochrome P450-mediated reactions. Encyclopedia of Drug Metabolism and Interactions. https://doi.org/10.1002/9780470921920.edm007

[6] de Jong, H., Casagranda, S., Giordano, N., Cinquemani, E., Ropers, D., Geiselmann, J., & Gouzé, J.-L. (2017). Mathematical modelling of microbes: Metabolism, gene expression and growth. Journal of The Royal Society Interface, 14(136), 20170502. https://doi.org/10.1098/rsif.2017.0502

[7] Monod, J. (1949). The growth of bacterial cultures. Annual Review of Microbiology, 3(1), 371–394. https://doi.org/10.1146/annurev.mi.03.100149.002103

[8] Chang, Wk., Criddle, C.S. Biotransformation of HCFC-22, HCFC-142b, HCFC-123, and HFC-134a by methanotrophic mixed culture MM1. Biodegradation 6, 1–9 (1995). https://doi.org/10.1007/BF00702293

[9] DeFlaun, M., Ensley, B. & Steffan, R. Biological Oxidation of Hydrochlorofluorocarbons (HCFCs) by a Methanotrophic Bacterium. Nat Biotechnol 10, 1576–1578 (1992). https://doi.org/10.1038/nbt1292-1576

[10] Criddle, C. S. (1993). The kinetics of cometabolism. Biotechnology and Bioengineering, 41(11), 1048–1056. https://doi.org/10.1002/bit.260411107

[11] Modeling Dynamics of Coupled Nitrification and Aerobic Denitrification, iGEM Wageningen 2021 https://2021.igem.org/Team:Wageningen_UR/Model/Nitrogen

[12] Alberto Redondas, Virgilio Carreño, Sergio F. León-Luis, Bentorey Hernández-Cruz, & et. al. (2018). EUBREWNET RBCC-E Huelva 2015 Ozone Brewer Intercomparison. Atmos. Chem. Phys. Discuss. https://doi.org/10.1038/nbt1292-1576

[13] Information on EC 1.14.15.1 - camphor 5-monooxygenase - BRENDA Enzyme Database (brenda-enzymes.org)

[14] Alagappan, G., & Cowan, R. M. (2004). Effect of temperature and dissolved oxygen on the growth kinetics of pseudomonas putida F1 growing on benzene and toluene. Chemosphere, 54(8), 1255–1265.https://doi.org/10.1016/j.chemosphere.2003.09.013

[15] Pagadala, N. S., Syed, K., & Tuszynski, J. (2017). Software for molecular docking: a review. Biophysical reviews, 9(2), 91–102.https://doi.org/10.1007/s12551-016-0247-1

[16] Forli, S., Huey, R., Pique, M. et al. Computational protein–ligand docking and virtual drug screening with the AutoDock suite. Nat Protoc 11, 905–919 (2016). https://doi.org/10.1038/nprot.2016.051

[17] Walsh, M.E., Kyritsis, P., Eady, N.A.J., Hill, H.A.O. and Wong, L.-L.(2000), Catalytic reductive dehalogenation of hexachloroethane by molecular variants of cytochrome P450cam (CYP101). European Journal of Biochemistry, 267: 5815-5820. https://doi.org/10.1046/j.1432-1327.2000.01666.x

[18] Phillips, R., Kondev, J., Theriot, J., Garcia, H., & Kondev, J. (2012, October 29). Physical Biology of the Cell (2nd ed.). Garland Science.

[19] GROMACS, ree and open-source software suite for high-performance molecular dynamics and output analysis.https://www.gromacs.org/

[20] GROMACS, Free Energy of Solvation Tutorialhttps://tutorials.gromacs.org/free-energy-of-solvation.html

[21] Virtual Chemistry Databasehttps://virtualchemistry.org/index.php

[22] Forli, S., Huey, R., Pique, M. et al. Computational protein–ligand docking and virtual drug screening with the AutoDock suite. Nat Protoc 11, 905–919 (2016). https://doi.org/10.1038/nprot.2016.051

[23] Morris, G. M., Huey, R., Lindstrom, W., Sanner, M. F., Belew, R. K., Goodsell, D. S. and Olson, A. J. (2009) Autodock4 and AutoDockTools4: automated docking with selective receptor flexiblity. J. Computational Chemistry 2009, 16: 2785-91.

[24] O. Trott, A. J. Olson, AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading, Journal of Computational Chemistry 31 (2010) 455-461

[25] The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC.

[26] Jumper, J., Evans, R., Pritzel, A. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021). https://doi.org/10.1038/s41586-021-03819-2

[27] Ferredoxin-Mediated Electrocatalytic Dehalogenation of Haloalkanes by Cytochrome P450cam, Marc Wirtz, Josef Klucik, and Mario Rivera, Journal of the American Chemical Society 2000 122 (6), 1047-1056 DOI: 10.1021/ja993648o

Parameter	Use	Value (in correct units)	References
b	Oxygen threshold constant	10	Obtained via grid search on matlab
\(k_d\)	Diffusion coefficient	0.8	Obtianed via grid search on matlab
\(K_H\)	Henry's constant	0.0001	( Page 4680-4700 of [12] )
\(N_0\)	Initial no of bacteria	10^8	OD off 0.7 is to be used everywhere, it can be approximated to 10^8
\([E_1]_0\)	Cam conc in bacteria	10	Otained via grid searches on Matlab and using [13]
\([E_2]_0\)	Tod conc in bacteria	10	Obtianed via grid search on matlab
r	Growth constant for bacteria	0.00002	Using extrapolation of graphs in [14]
\(\lambda\)	Constant for Product toxicity	90000	Obtianed via grid search on matlab
\(\lambda'\)	Constant for substrate toxicity	90000	Obtianed via grid search on matlab
\(\mu\)	Constant for bacterial oxygen metabolism	0.006	Obtained via extrapolation of graphs in [14]
\(k_1,k_1,k_m,k_2\)	Enzyme constants for cam in aerobic conditions	0.001,0.001,0.001,0.001	Obtained using Brenda enzyme database [13]
\(k_{1d},k_{-1d},k_{md},k_{2d}\)	Enzyme constants for cam in anaerobic conditions	0.002,0.001,0.001,0.02	Obtained using Brenda enzyme database [13]
\(k_{1e},k_{-1e},k_{me}\)	Enzyme constants for tod	0.001,0.001,0.001	Using extrapolation of graphs in [3]

INTRODUCTION

GENERAL SCHEMATIC OF ENZYME SYSTEM

APPROACH

A BASIS FOR OUR MODEL

SKELETON OF OUR MODEL

Diffusion of HCFCs:

Consumption of \( \ce{O2} \):-

THE DIFFERENTIAL EQUATIONS

FINAL DIFFERENTIAL EQUATIONS

Results

Graphs

Codes

Values of Parameters

MOLECULAR DOCKING AND DYNAMICAL SIMULATIONS

INTRODUCTION

Protocol

Protocol

Brief Procedure:

Results:

Docking with Cytochrome P450cam (Camphor-5-monooxygenase)

Table with Docking Results

Docking with Toluene Dioxygenase (Benzene 1,2-dioxygenase)

Table with Docking Results

Inferences:

Taking our Docking Results Further

A Physical Interlude[18]

Molecular Dynamics Simulation

Active Side modification

Bioreactor and Financial Modelling

REFERENCES