The goal of metabolic engineering for industrial applications is the overproduction of metabolites with the help of microorganisms. While the field originated from single modifications in metabolic pathways, today’s approaches to metabolic engineering include a much more systematic view of biological systems. Computational tools based on flux balance analysis (FBA) to analyze genome-scale metabolic models (GSMMs) have significantly advanced our capability to engineer microorganisms for industrial applications. We applied FBA to GSMMs for two purposes: First, we used FBA to find and evaluate suitable growth media for MonChassis’s engineered yeasts that maximize the flux towards monoterpenoid synthesis. We screened 125 and 53 carbon sources for the production of monoterpenoids with Saccharomyces cerevisiae and Yarrowia lipolytica, respectively. As in upscaling aerobic biotechnological processes the oxygen uptake rate is a key rate-limiting factor, we further ranked the top 20 carbon sources by their required oxygen uptake rate. Based on this, we decided on oleic acid as the most promising carbon source for the potential upscaling of the monoterpenoid production with MonChassis’s yeasts. However, wet lab testing of oleic acid as the sole carbon source did not show any α-pinene production. This was likely because of improper execution of the monoterpenoid extraction.
The second application of FBA to GSMMs was the identification of further genetic engineering targets for MonChassis’s
yeast platform. To this regard, we applied flux scanning based on enforced objective flux (FSEOF) algorithm. We found
that the cytosolic production of monoterpenoids in
For efficient biotechnological fermentation processes it is crucial to find optimal growth media. Growth media
and related procedural parameters like oxygen transfer are not only driving cost factors, but have great influence
on metabolic pathway regulation, too. Therefore, we optimized our growth media with flux balance analysis (FBA),
focusing specifically on the carbon source to scale up the production of monoterpenoids in MonChassis. Using FBA
we developed a simulation algorithm, with which we generated a tailored ranking of growth media carbon sources for
each of our proposed strains. Carbon source candidates were ranked based on their ability to facilitate
In the process of finding optimal reaction conditions for monoterpenoid production in MonChassis there are challenges regarding both the technical process engineering as well as the biological parameters. Within the latter, especially growth media are important to optimize, as they play a key role for the operation efficiency. First, the growth medium influences regulation of major metabolic pathways. Also, when considering oxygen as part of the growth medium, it determines whether the organisms can grow aerobically. Lastly, growth medium components are deciding cost factors. All these parameters get increasingly more relevant when the fermentation is scaled-up. As scale-up is an integral part of MonChassis, we aim to find optimal growth media for our proposed production strains.
Growth media are composed mainly of minerals, essential amino acids, metal ions, dissolved oxygen, and a carbon source. Especially important is the carbon source, as it is one of the most deciding factors for the metabolism next to oxygen availability. Over the past century, the most widely used carbon source glucose has proven to be ideal for the growth of many microorganisms. However, glucose is not ideal for every microorganism and every fermentation goal. It has been shown in the literature that alternative substrates can rival or even outperform glucose as the main carbon source in specific applications (Boonyanit et al., 2011; Farhi et al., 2011). Due to the similarity to our approach, an especially relevant case is presented in (Farhi et al., 2011), where the authors tested oleic acid as sole carbon source for peroxisomal terpene production in Yarrowia lipolytica. With oleic acid, product yields almost 20% higher than with glucose were reached. This underlines the importance to optimize growth media.
When it comes to composing growth media, there are close to infinite substrate candidates and combination
possibilities. Furthermore, different fermentation processes require different quantities of dissolved oxygen.
Thus, the space of growth media compositions suited to grow microorganisms is vast. Exploring this space
experimentally in the wet lab would require great financial and labor costs, as well as consuming much time,
making this option infeasible for us. However, by in silico simulation of carbon source alternatives and
their oxygen demands we narrowed down the screening space very efficiently. For all four of our strains,
Saccharomyces cerevisiae cytosol and peroxisome and
The method we used to analyze the models is flux balance analysis (FBA). FBA allows to analyze GSMMs by calculating the fluxes of all reactions that are part of the metabolic network. We used this method for the purpose of simulating the performance of different carbon sources in the growth media. The concept of FBA is as follows: First, the metabolic network of the considered organism is translated into a stoichiometric matrix S of n x m dimensions, containing n reactions and m metabolites. This forms the basic GSMM. This matrix is multiplied in a second step with a vector v representing the reaction fluxes v1 – vn. This vector contains the coefficients of interest that we want to determine. Finally, two integral constraints are added. The first one is the assumption that the metabolism is in complete steady state. This means that the fluxes of all metabolites are not zero, i.e.. the production of a metabolite must equal its consumption. Thereby, we obtain the linear equation S × v = 0, constituting a solvable linear equation system. Second, an objective function is added. As for matrices with n > m there is more than one solution possible, to find an optimal solution we must add an objective function as the last constraint. This function is to be defined as the goal of the optimization problem. It is simply the reaction whose flux we aim to maximize. The solution can then be found by linear programming (Orth et al., 2010), the solver we used was glpk. This concept is depicted in figure 1.
With the formulated solvable optimization problem of maximizing the
To find optimal carbon sources for the growth media of our production strains, we ranked all considered carbon
source candidates regarding the maximal
The resulting rankings from the first step of this analysis and the oxygen demand analysis results can be seen in
figure 2. For
The ranking shows that oleic acid facilitates the highest overall
Following the simulative instance presented above, we translated the results into a wet lab experiment. To decide
on the carbon source to test we oriented on the calculated selection. However, this decision was made only after
additional research about the properties of the candidates that were not regarded up until this point. These
include practical necessities like solubility, toxicity and finally the price. Oleic acid, one of the best
performing carbon sources regarding maximal
With this approach we aimed to experimentally measure the actual
For the implementation of this experiment, we chose three different mixtures with varying compound ratios. The
first mixture (1) contained 100% SD-Medium but with glucose being completely replaced by oleic acid. Oleic acid
was used in the same concentration as glucose before (20 g/l). To ensure better solubility of the fatty acid, we
added 0.1% (v/v) Tween 80. The second mixture (2) contained the exact same but with 1% (v/v) Tween 80. This was
done to check which concentration of it would solubilize better. The third mixture (3) contained only 50%
SD-Medium without glucose and again 20 g/l oleic acid. In this reaction we aimed to test whether oleic acid could
also partially replace other expensive components of the growth media, like amino acids. Lastly, we added a
control reaction with 100% regular SD-Medium (including glucose). All four reaction mixtures were inoculated
in triplicates with the same preculture of strain
In the last step we extracted the cell ingredients with ethyl acetate and performed GC-MS measurements, following the extraction protocol.
Unfortunately, we were not able to measure
The chosen approach has the major advantage that costly and time-consuming wet lab experiments can be reduced to a minimum. As difficult-to-know enzyme kinetics are not required for the analysis of GSMMs with FBA, prior acquisition of experimental data could be omitted. Due to this simplicity and the power of linear programming solvers, we were able to perform the simulative instance of this approach within seconds on a normal laptop. This allowed us to explore the space of possible carbon sources much faster and with much less experimental effort, saving both time and resources. Only the final testing in the wet lab was necessary to validate the prior prediction.
However, the underlying mathematical concept of FBA has clear limits. While abstracting from kinetic parameters sped up the implementation significantly as touched on above, at the same time it alleviated the accuracy of the prediction in terms of exact product concentrations. Also, by using FBA, any physiochemical interactions between metabolites and/or non-metabolite structures outside of known stoichiometric reactions were not regarded. Lastly, properties like solubility, stability, and toxicity of compounds were dismissed. The lack of toxicity consideration for instance resulted in the analysis ranking ethanol in positions higher than we would expect from a biological standpoint of view. Therefore, applying FBA to GSMMs for the considered purpose of media optimization can give us a profound first suggestion, but we must further validate the computed predictions.
We conclude that by using in silico modeling it is possible to gain valuable insights into the properties
and performances of growth media compounds. The identified carbon sources like oleic acid, stearate and other
long-chain fatty acids could be promising carbon source alternatives to glucose for the production of
monoterpenoids. When scaling up the fermentation process, we could additionally benefit from the oxygen demand
analysis that we performed. Even if the maximal
In addition to optimizing the growth media for
Many tools are available to identify genetic identification targets (Gu et al., 2019). Most tools, though, are only suitable for predicting the effect of knockouts, as the phenotype of knockouts is easier to predict (Choi et al., 2010). A prominent algorithm to identify overexpression and knockout targets is flux scanning based on enforced objective flux (FSEOF) (Choi et al., 2010). As the growth media optimization, FSEOF relies on constraint-based FBA of GSMMs and, thus, works with the same assumptions and constraints as explained before.
FSEOF was performed as described before (Choi et al., 2010; Park et al., 2012) with the same GSMMs that we used
for the growth media optimization. The first step of the FSEOF algorithm is the calculation of the initial fluxes
through the biomass objective function vBiomass and the reaction of interest
vTarget_initial, which in this case was the formation of
We identified similar targets for overexpression and downregulation, depending on the compartment but independent
from the yeast species (Fig. M2). To produce
Even though the FSEOF algorithm was primarily developed to find overexpression targets, it can also identify targets for downregulation (Choi et al., 2010). Highly similar downregulation targets were identified for all four GSMMs subjected to the FSEOF algorithm. An overview of the top five overexpression and downregulation targets for our four yeast models can be found below. The highest-ranked downregulation targets catalyze reactions of the Krebs cycle and consumption reactions of cofactors. It is, again, important to note that FBA-based methods do not necessarily provide biologically meaningful results in the sense that enforcing an increased flux through a reaction of interest can eliminate growth. Downregulation of the Krebs cycle is likely to be detrimental to the growth of yeasts. Thus, new downregulation targets should be identified using more sophisticated tools. Additionally, the following steps of finding further engineering targets should focus on the cofactor supply of the mevalonate pathway. For this, specialized tools, like cofactor modification analysis (Lakshmanan et al., 2013) can be used.
The results presented here already provide valuable insights into the limiting steps within our respective designed pathways. Further, by modifying the identified genetic targets, we overcome these limitations and even further improve MonChassis’s yeast strains to make large-scale monoterpenoid production finally possible.
Overview on the identified genetic engineering targets
Bröker, J.N. et al. (2018) ‘Upregulating the mevalonate pathway and repressing sterol synthesis in Saccharomyces cerevisiae enhances the production of triterpenes’, Applied Microbiology and Biotechnology, 102(16), pp. 6923–6934. Available at: https://doi.org/10.1007/s00253-018-9154-7.
Boonyanit, T., Sroisiri, T. and Doan Minh, T. (2011) ‘Fermentation of various sugars and sugar substitutes by oral microorganisms’, Asian Pacific Journal of Tropical Biomedicine, vol. 1, no. 2, Supplement, pp. S258–S260 [Online]. DOI: 10.1016/S2221-1691(11)60166-4.
Choi, H.S. et al. (2010) ‘In Silico Identification of Gene Amplification Targets for Improvement of Lycopene Production’, Applied and Environmental Microbiology, 76(10), pp. 3097–3105. Available at: https://doi.org/10.1128/AEM.00115-10.
Ebrahim, A., Lerman, J. A., Palsson, B. O. and Hyduke, D. R. (2013) ‘COBRApy: COnstraints-Based Reconstruction and Analysis for Python’, BMC Systems Biology, vol. 7, no. 1, p. 74 [Online]. DOI: 10.1186/1752-0509-7-74.
Farhi, M., Marhevka, E., Masci, T., Marcos, E., Eyal, Y., Ovadis, M., Abeliovich, H. and Vainstein, A. (2011) ‘Harnessing yeast subcellular compartments for the production of plant terpenoids’, Metabolic Engineering, vol. 13, no. 5, pp. 474–481 [Online]. DOI: 10.1016/j.ymben.2011.05.001.
Gu, C. et al. (2019) ‘Current status and applications of genome-scale metabolic models’, Genome Biology, 20(1), p. 121. Available at: https://doi.org/10.1186/s13059-019-1730-3.
Lakshmanan, M. et al. (2013) ‘Cofactor Modification Analysis: A computational framework to identify cofactor specificity engineering targegts for strain imporvement’, Journal of Bioinformatics and Computational Biology, 11(06), p. 1343006. Available at: https://doi.org/10.1142/S0219720013430063.
Lu, H., Li, F., Sánchez, B. J., Zhu, Z., Li, G., Domenzain, I., Marcišauskas, S., Anton, P. M., Lappa, D., Lieven, C., Beber, M. E., Sonnenschein, N., Kerkhoven, E. J. and Nielsen, J. (2019) ‘A consensus S. cerevisiae metabolic model Yeast8 and its ecosystem for comprehensively probing cellular metabolism’, Nature Communications, Nature Publishing Group, vol. 10, no. 1, p. 3586 [Online]. DOI: 10.1038/s41467-019-11581-3.
Mishra, P., Lee, N.-R., Lakshmanan, M., Kim, M., Kim, B.-G. and Lee, D.-Y. (2018) ‘Genome-scale model-driven strain design for dicarboxylic acid production in Yarrowia lipolytica’, BMC Systems Biology, vol. 12, no. Suppl 2, p. 12 [Online]. DOI: 10.1186/s12918-018-0542-5.
Nielsen, J. (2017) ‘Systems Biology of Metabolism’, Annual Review of Biochemistry, vol. 86, no. 1, pp. 245–275 [Online]. DOI: 10.1146/annurev-biochem-061516-044757.
Orth, J. D., Thiele, I. and Palsson, B. Ø. (2010) ‘What is flux balance analysis?’, Nature Biotechnology, Nature Publishing Group, vol. 28, no. 3, pp. 245–248 [Online]. DOI: 10.1038/nbt.1614.
Park, J.M. et al. (2012) ‘Flux variability scanning based on enforced objective flux for identifying gene amplification targets’, BMC Systems Biology, 6(1), p. 106. Available at: https://doi.org/10.1186/1752-0509-6-106.