For a synthetic biologist it is fundamental to develop a deep understanding of the cellular mechanisms. Through the improvement of experimental techniques for the manipulation of the cell’s genotype, it has been possible to engineer cells in order to exhibit a new programmed behavior that could be useful to produce molecules of industrial interest. The capability of building complex biological constructs has made possible to solve environmental, health and agricultural problems around the world (Chandran et al., 2008).
The chassis selection is a crucial step during the design process. Two of the most common chassis were selected to be analyzed in the context of our project. However, we asked for advice from an expert in metabolic engineering and the production of recombinant proteins. Thus, we added to our list the well-known emergent chassis Pichia pastoris.
To start with the chassis selection, we identified two fundamental characteristics that had to be accomplished to successfully produce our proteins. First, these peptides are short and come from a eukaryotic organism. Then, the chassis should be capable of folding and assembling eukaryotic proteins. Second, these peptides have disulfide bonds. Then, the chassis should make post-translational modifications correctly.
Due to the characteristics of the chassis Pichia pastoris, this methylotrophic yeast was selected as the host cell of our gene construct. Additionally, P. pastoris has enormous advantages such as obtaining high recombinant protein production yields.
Figure 1. Chassis selection
Throughout the design step it is essential to ideate creative solutions to solve the possible problems and disadvantages that could affect the development of the project. Thus, the design step is a constant search for improvement. Six different designs of genetic circuits were proposed at the beginning of the project. We classified these circuits in two types, the ones that use methanol and the methanol free systems. From the analysis it was determined that the more feasible designs were the ones that use methanol.
Figure 2. Methanol System vs Free Methanol System
Three of the original genetic circuit designs were selected to perform a second analysis process. Each option proposes the protein production in a different manner. In the first one, two different strains are employed to produce the peptides. In the second one, the production of both peptides happens in the same strain using the same inducible promoters. In the third option, two different promoters will be employed to the protein production, avoiding a possible metabolic overload (Balbas, 2004). From this analysis, the circuit with simultaneous expression was discarded.
Figure 3. Number of cells and promoters
The most optimal genetic circuit designs were selected from the previous screening process. The main goal of this project is the development of a system where the expression happens in different cells. Thus, we assemble in silico circuits, design and optimize lab protocols and perform an experimental process to develop this option.
Two genetic circuits were designed to express each peptide in different strains. However, both have the same general fundamentals and behavior. The pPICZαA plasmid was selected to assemble the two genetic circuits. The protein expression will be regulated by the well-characterized and strong promoter PAOX1. This promoter is induced with methanol and repressed with glycerol, ethanol, and glucose. This promoter contains a Kozak consensus sequence that serves as the initiation site for transcription. After it, an alpha factor secretion signal will target recombinant proteins to the growth medium, simplifying the purification step. After these parts, both circuits can be divided into three sections: a purification section, a fusion protein section, and a detection section.
Figure 4. Genetic circuits
This section contains an 8X His tag domain head that will be used in the purification step. This tag was selected because it is unlikely to affect the protein function and has well-established protocols (Young et al., 2012). However, the peptides U1-theraphotoxin-Sp1a and Omega-hexatoxin-Hv2a only have 34 and 45 amino acids (UniProt, 2021) thus the interference of the tag with its function is probable. In order to overcome this possible issue, we proposed two solutions. First, we added a flexible linker of four amino acids (Gly-Gly-Ser-Gly) that allows the tag mobility, decreasing the probability of an interference. Second, in case the linker does not work as expected, the removal of the tag and linker after purification is possible due to the insertion of the Xa factor that once translated will be recognized by the protease (Young et al., 2012).
It contains one of the venom insecticidal peptides OAIP-1 or AcTx-Hv2a fused to a lectin. The snowdrop lectin has been characterized as a transporter of molecules with biological activity throughout the hemolymph of the insects after oral ingestion. In addition, a 3X Ala linker was added in between to ensure flexibility and no affection of the peptide function (Powell, 2020).
Lastly, a c-Myc sequence was added after lectin for further detection and quantification of the protein expression.
Figure 5. Purified protein
The Green Fluorescent Protein (GFP) will be employed as a reporter protein. it will confirm the expression of the fusion protein; also, it will help us select high producers and obtain protein expression data for the math model. To achieve coexpression, we added the “self cleaving” peptide T2A before the eGFP sequence. The cleaving mechanism of T2A is the ribosome skipping in the formation of a glycyl-prolyl peptide bond at its C-terminus (Liu, 2017). The eGFP is not part of the fusion protein, avoiding its possible interference with the venom peptides. Also, the GFP will not be secreted to the growth medium.
In addition, the circuits have two restriction enzymes sites. One for KpnI and one for Cfr421
Figure 6. Protein production induced by methanol
A common standardized way to present the behavior of a genetic circuit in synthetic biology is through boolean operators (AND, NOT and OR). These operators use biological inputs such as the presence or absence of a reagent and translated to an algebraic meaning that can 1 or 0 respectively.
In this project the circuits iUAM_PICZαA_OAIP and UAM_PICZαA_Hv2a behave as a NYPLY gate which is the union between an AND and a NOT gate. Thus, the AND gate must have two 1 in the input to produce a 1 in the output that means the production of heterologous proteins. In order to produce these inputs, glycerol should be absent and methanol present (Fig. 7).
Figure 7. Boolean representation of the genetic circuits
This behavior allows a growth phase with glycerol and an expression phase with methanol, avoiding a metabolic overload and an efficient utilization of carbon sources in each phase (Fig. 8).
Figure 8. Protein production with methanol induction
The build step of the design cycle is a great opportunity to put hands-on work. Therefore, this could be one of the most challenging phases of the project development. The results obtained from the experimental phase are highly important for the next steps. The data obtained during this phase could be fundamental to propose new improvements.
An experimental process was designed and proposed to assemble the gene constructs. We aim to obtain a strain of the yeast Pichia pastoris capable of executing the instructructions programmed in the expression cassette that will be integrated into the genome of this microorganism.
Figure 9. Experimental overview
The test phase helped us to understand the overlook points in our design and improve them:
During the test phase we noticed that restriction enzymes needed to exchange the fragments inside the circuit were expensive and unaffordable. Thus, we could not use them.
In addition, the Gibson Assembly does not recommend using primers with His tag sequences. In our case, one of our primers had an His tag sequence; these could be the cause of the little amount of transformant colonies. To learn more about it visit the results link.
Although the test helped us identify the points a and b to improve, we were able to appreciate our achievements. One of them was the use of eGFP, which helped us identify the expressing colonies.
Overall, these are key points to improve in further designs:
Avoid His tag sequences in Gibson Assembly or add extra pair bases at the beginning .
Exchange the restriction enzyme sites for a more common pair (KpnI and XhoI).
Additionally, these are key points that should be conserved in further designs:
The eGFP is a good tool to identify the expressing colonies.
Thus, the optimized final circuit is:
Figure 10. Optimized circuit
Chandran, D., Copeland, W., Sleight, S., & Sauro, H. (2008). Mathematical modeling and synthetic biology. Drug Discovery Today: Disease Models, 5(4), 299–309. https://doi.org/10.1016/j.ddmod.2009.07.002
Balbás, P., & Lorence, A. (2004). Recombinant Gene Expression: Reviews And Protocols, 2/E (Second Edition). Balbas, Paulina, Lorence, Argelia (Eds.).
Young, C. L., Britton, Z. T., & Robinson, A. S. (2012). Recombinant protein expression and purification: a comprehensive review of affinity tags and microbial applications. Biotechnology journal, 7(5), 620–634. https://doi.org/10.1002/biot.201100155
UniProt. (2021). U1-theraphotoxin-Sp1a precursor - Selenotypus plumipes (Australian featherleg tarantula). UniProtKB - K7N5K9 (TX1_SELPU). Recuperado 18 de octubre de 2021, de https://www.uniprot.org/uniprot/K7N5K9
UniProt. (2021). Omega-hexatoxin-Hv2a - Hadronyche versuta (Blue mountains funnel-web spider). UniProtKB - P82852 (TOT2A_HADVE). Recuperado 18 de octubre de 2021, de https://www.uniprot.org/uniprot/P82852
Powell, M. E., Bradish, H. M., Cao, M., Makinson, R., Brown, A. P., Gatehouse, J. A., & Fitches, E. C. (2020). Demonstrating the potential of a novel spider venom-based biopesticide for target-specific control of the small hive beetle, a serious pest of the European honeybee. Journal of Pest Science, 93(1), 391–402. https://doi.org/10.1007/s10340-019-01143-3