Three levels of biosafety
Since Colourectal involves the administration of a genetically modified organism to humans, it is important to build in biosafety and biocontainment mechanisms. A multi-layered strategy encompassing different genetic safeguards makes of our living diagnostic a safe tool for humans and the environment. In the first place, Escherichia coli Nissle 1917 (EcN) should not be able to colonise parts of the body other than the colon, as this would represent an important health concern. To this end, our living diagnostic will be engineered for surviving only in the presence of mucin, a glycoprotein found in the lining of the colon. Second, our living diagnostic should not be able to survive outside of the human body to avoid its spread into the environment. To achieve this, a kill switch, which is activated at temperatures below that of the human body, has been implemented. In the third place, another inducible kill switch has been integrated, this time triggered by the fourth pill of the Colourectal self-test, which is administered to the patient in case of an adverse response, or once the diagnosis process finishes.
Colon biocontainment: a two-component chimeric system for mucin dependence
In order to mitigate risks associated to the spread of the living diagnostic from the colon to other parts of the body, a mucin auxotrophy has been implemented. Our molecular system is based on a chimeric two-component system (TCS) activated by mucin, the main family of proteins found in the mucus. Mucins are continuously produced by the cells lining the gastrointestinal tract [1,2] and therefore, always present in the colon. The TCS controls the expression of the guanylate kinase gene gmk, essential for cell for its role in nucleotide metabolism [3]. When mucin is present (permissive conditions), the essential gene is expressed allowing cell survival. By contrary, in absence of mucin (non-permissive conditions), gmk is not expressed, leading to cell death.
The TCS consists of a chimeric sensor formed by Dismed2, the extracellular domain of RetS, which is in turn a sensor protein from Pseudomonas aeruginosa known to sense mucin [4], and the transmitter domain of EnvZ, part of the popular E. coli-derived two-component system EnvZ-OmpR [5,6]. The EnvZ-OmpR TCS functions in such a way that EnvZ, when activated, phosphorylates the transcription factor OmpR, which subsequently activates the OmpC promoter. As indicated, in our system OmpC regulates the transcription of the essential gene gmk.
Figure 1: The chimeric two-component system (TCS) is formed by combining the extracellular Dismed2 domain of the RetS protein from Pseudomonas aeruginosa and intracellular HisKA and HATPase intracellular domains of EnvZ part of the EnvZ-OmpR TCS. In this system mucin activates the chimeric TCS, which in turns allow the expression of the essential gene gmk.
The chimeric sensor, named Dismed2-EnvZ was constructed by cloning the two domains under the control of the constitutive promoter (BBa_J23100) and strong RBS (BBa_J34801). The construct (BBa_K4244009) is inserted in a medium copy number plasmid to avoid any possible toxic effect in the host. After successful cloning of Dismed2-EnvZ as well as the OmpC promoter with the reporter gene (GFP) (BBa_K4244011), these constructs have been transformed into an E. coli from the Keio collection lacking the native gene for envZ [7] to test the mucin dependency. The results of our plate reader experiments with a fluorescent reporter show that increasing mucin concentrations result in high activation levels of gene expression (Figure 2).
Figure 2: Green fluorescent protein (GFP) fluorescence value of Escherichia coli strain lacking the EnvZ gene, after induction with varying concentrations of mucin in M9 medium at 14th hour of incubation. The first negative control is the naked strain, harbouring no plasmid. The second negative control have the Ompc:GFP plasmid (BBa_K4244011). While chimeric two-component system (TCS), contains both the OmpC:GFP and the chimeric Dismed2 EnvZ (BBa_K4244009). The graph shows that with 5 % of mucin the fluorescence of the chimeric TCS is significantly higher than with no mucin.
The results also suggest that the chimeric receptor shows leakiness, a natural phenomenon given that the EnvZ-OmpC is a native osmotic pressure responsive system in E. coli Nissle 1917. This is shown by the levels of fluorescence in the cells only containing the OmpC-GFP (control -), and the cells containing the complete chimeric TCS but without the presence of the ligand (mucin). Therefore, the main future goal of our system would be the reduction of this leakiness. As a potential solution, we propose the deletion of the two other components of the EnvZ-OmpR system: ompF and ompC. Adding to this, next step would be the cloning gmk under OmpC and knock-out this gene from EcN genome to perform cell death assay with following conditions: mucin and no mucin.
Biocontainment: temperature kill switch
When our living diagnostic leaves the user’s body through the faeces, it could potentially spread into the wild representing a risk for the environment. To prevent this, a temperature-dependent kill switch has been implemented as biocontainment strategy allowing cell survival only at the permissive condition of 37 °C, the body temperature.
Two temperature kill switches with temperature-dependent tunable promoters, which had previously been successfully demonstrated in literature [8–10] were tested in EcN as part of our project. In the work of Piraner and colleagues, the temperature kill switch was constructed using the transcriptional auto-repressor TIpA
(BBa_K2500004)
from the virulence plasmid of
Salmonella typhimurium [8]. This protein contains a coiled C-terminal domain that undergoes uncoiling between 37 °C and 45 °C and an N-terminal DNA-binding domain that acquires a dimeric state with lower temperatures and blocks transcription. If
tIpA and its promoter are upstream the antitoxin of a toxin-antitoxin system
(Ba_K4244019) (figure 3, left),
they will cause a decrease of antitoxin expression when temperature drops, leading to the microorganism’s death [10]. The second kill switch that was tested, was obtained from the work of Stirling and colleagues, and contains the temperature-sensitive regulatory region of the cold shock protein A (PcspA) [9]. The pcspA
(BBa_K4244021)
is a constitutive promoter that has a high rate of transcription but contains a long 5′ UTR of 159 bp that, at 37 °C, acquires an unstable secondary structure and leads to its degradation. On the other hand, at lower temperatures it forms a stable configuration, which allows translation. In this case, a toxin (ccdB) under the cspA promoter (Figure 3, right) is expressed but counteracted by the antitoxin ccdA
(BBa_K4244023),
which in turn is expressed by being placed after the constitutive LacUV5
(BBa_M36801) [10]. As indicated, at lower temperature the cspA stabilizes and ccdB overexpresses ccdA, which leads to cell death.
Figure 3: The promoter pTIpA, regulated by TlpA, is upstream of the ccdA antitoxin. When temperature reaches below 37 °C, this leads to a decreased expression of the antitoxin and the toxin ccdB leads to the microorganism’s death (left). The promoter pcspA is upstream the toxin (ccdB) while the antitoxin (ccdA) is placed after the constitutive LacUV5. When temperature decreases, the expression of the toxin increases, leaving to microorganism’s death (right). Adapted from [10].
Both systems containing the temperature-sensitive kill switches have been tested with a reporter gene in EcN. This was done to test if at different temperatures the results in our microorganism matched those of literature: increase of fluorescence with decreasing temperature for the pTlpA system and decrease of fluorescence with decrease of temperature in the pcspA system. Since both systems showed the correct behaviour with the fluorescent reporter, this gene was replaced with either a toxin or an antitoxin according to the two systems’ rationales in order to perform temperature sensitive survival assays. On the one hand, the pTlpA system did not show the expected results in our host even after our troubleshooting thanks to our partnership with NanoBuddy, due to growth could also be observed under non-permissive conditions. On the other hand, the pcspA system showed a dramatic decrease of CFU at temperatures that were lower than 37 °C as shown in Figure 4.
Figure 4: The temperature sensitive survival assay shows that the pcspA system with the toxin-antitoxin (ccdA-ccdB) no colonies are present in the plates incubated at temperatures below 37°C compared to the control, pcspA system with reporter gene.
Since the pcspA system was shown to work both with a reporter gene and with the toxin-antitoxin kill switch, the next step would be its integration in the EcN genome. This would allow both to have an antibiotic resistance-free system that is not susceptible to plasmid loss, and to decrease the risk of emergence of mutants that could survive also under non-permissive conditions. Having the construct in the chromosome of EcN increases the genetic stability of the kill switch, decreasing the possibility of escapers.
“Terminator” kill switch
What if our living diagnostic causes any discomfort to the person who ingests it? What if colorectal cancer has already been detected and it is no longer necessary to report its presence through coloured stool? And how do we make sure that any trace of the Colourectal microorganism abandons the user’s body after the self-test process?
The third biosafety mechanism of our project has been installed to address all these questions by means of another inducible kill switch with two clear states: ON in the presence of inducer, and OFF in its absence. Three components were identified as the most important for the performance of such a kill switch: an inducible promoter/regulator system, an effective killing mechanism, and a way to ensure zero basal levels of killing under the permissive conditions.
Inducible system
A great variety of inducible expression systems driven by different molecules can be found both in the literature and in the iGEM repository
(e.g., pBad/araC, BBa_I0500). However, since we aimed to introduce this kill switch circuit in our living diagnostic, which will perform its function in the human colon environment, the search for a suitable system became tricky. The inducer molecule of choice could not be part of the colon, be produced by tumour cells, or be a food component. Moreover, it could not be toxic to humans, the environment, or the colon microbiome.
All these requirements drastically reduced the number of possibilities, but the inducible expression system NahR/Psal, activated by acetylsalicylic acid (aspirin), initially seemed to be a possible solution. Aspirin is not present in food or in the colon, and it is considered safe as it is used for medical purposes. Since it is generally administered as a painkiller, we thought it would be possible to use it as the inducer molecule for the kill switch and advice users of Colourectal to avoid intake of aspirin during the self-test process. The part for this inducible expression system
(BBa_J61051) was obtained from the iGEM plates to start experimentally testing its activity. However, gastroenterologist Markus Gwingger strongly advised us to not use aspirin as our inducing molecule, since aspirin is often taken as blood thinner by our target group.
“You are using it in a population over 55 of which many will have heart disease and be on aspirin [...]. You definitely need to change that”. - Markus Gwiggner, gastroenterologist
Therefore, our design needed to be changed and a different inducible expression system had to be tested. Because we developed a three-level rhamnose inducible system in our project, we decided to include this kill switch in the third step of this induction system. As a proof of concept, we used a rhamnose-inducible system to test the killing mechanism.
Killing mechanism
The method of choice to cause cell death was DNA degradation, controlled by our inducible expression system and elicited by a type-IC Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) device, which belongs to the Class I CRISPR-Cas systems. Unlike the more frequently used Class II systems, that only consist of 1 effector protein (e.g., Cas9 and Cas12a), the effector complex of Class I encompasses a CRISPR RNA (crRNA) and a number of different Cas proteins, and commonly receives the name of Cascade. In type-IC, the Cascade complex has 3 subunits – Cas5, Cas7 and Cas8 – that are responsible for recognising the target DNA and recruiting Cas3, an enzyme with single-strand DNA helicase-nuclease activity able to cleave and degrade target DNA in a processive way (Figure 5) [11,12]. For the Cascade complex to recognise the target sequence, a short 2-4 nucleotide sequence known as Protospacer Adjacent Motif (PAM) must be found upstream the sequence. In the case of Type-IC CRISPR-Cas, the most commonly used PAM sequence is 5'-TTC-3' [11].
Figure 5. Schematic of Type-IC Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated (Cas) system. One Cas5, one Cas8 and seven Cas7 (yellow, blue, and green respectively), together with the CRISPR RNA (crRNA) (purple), form the Cascade complex, recognise the target DNA and recruit Cas3 (pink). Cas3, thanks to its dual helicase-nuclease activity, is able to cleave and degrade DNA [12].
Type-I CRISPR-Cas systems have proven to be very effective mechanisms to achieve cell death in previous studies. For instance, an article by Caliando and Voigt (2014) demonstrates the utilisation of a highly effective type-IE system for inducible cell killing. However, we decided to use the type-IC CRISPR system from P. aeruginosa since, although its mechanism is not that well-known, its mechanism is very similar and has a more compact size (3 proteins instead of 5 in the Cascade complex), which would reduce the burden in the cell [11]. The advantage of this system compared to the more frequently used Cas9 or Cas12a is that the helicase-nuclease activity of Cas3 allows for large genomic deletions, that are assumed to be harder for the cell to repair than just double stranded breaks, and hence more deadly [12].
A good spacer targeting EcN’s genome had to be designed to achieve a system with a high killing efficiency. With that purpose, we analysed different features in EcN’s genome and discovered the presence of Repetitive Extragenic Palindromic (REP) elements. REP elements are very conserved sequences of around 35-40 bp long that are found in around 500 positions of the E. coli genome, always in intergenic regions. Their role in the cell is not well known, but they are believed to be involved in translation regulation [14]. With that information, we designed a single spacer to target multiple points in the genome simultaneously (5’-TTGCCGGATGCGGCGTAAACGCCTTATCCGGCCT-3’), expecting to achieve a highly genotoxic response upon induction. The REP element we used as a target was repeated 20 times in the EcN’s genome with a 100 % accuracy, and in three of those occasions it was preceded by the necessary PAM sequence (TTC), suggesting that our spacer would target the genome at least at three different loci.
A plasmid, referred as pCas3cRh, containing the CRISPR array with the designed spacer and the cas genes (cas3, cas6, cas8 and cas7) under the inducible expression system RhaRS/PrhaB was employed to test the kill switch efficiency in EcN. We performed this test in M9 medium with 50 mM of glucose and different rhamnose concentrations (0, 0.1, 0.5, 1, 3 and 5 mM) (Figure 6).
As it can be observed, our inducer managed to hinder cell growth in a gradual manner when applied at different concentrations. By inducing our kill switch with high concentrations of rhamnose (3 and 5 mM), we managed to significantly impair the growth of EcN, which was effectively inhibited for 15 h. The emergence of escapers with the ability to circumvent the CRISPR-Cas mechanism [12] most likely resulted in the growth observed after that period of time.
• Figure 6: Plate reader growth experiment conducted during 24h. Escherichia coli Nissle 1917 (EcN) containing pCas3cRh with the designed spacer targeting a Repetitive Extragenic Palindromic (REP) sequence was tested on different rhamnose concentrations (0, 0.1, 0.5, 1, 3 and 5 mM). The positive control consisted of EcN wildtype in M9 medium with 50 mM of glucose.
Basal expression correction
Inducible expression systems are known to have some level of basal expression (leakiness) in the absence of an inducer molecule. In some cases, that does not represent a problem, but in our case even a very low basal expression could result in genotoxicity, leading to undesired cell death, and possible mutations in the Cas proteins, the spacer or the inducible expression system [12]. To overcome this obstacle, we are currently working on the implementation of a constitutively expressed antiCRISPR (Acr) protein to block any potential Cas3 basal expression levels [15]. Our aim is to increase the stability of our diagnostic tool under permissive conditions allowing Cas3 to carry out its activity only when the inducer is provided. Our efforts currently focus on testing AcrC1 and AcrE1, which we envision could prevent our Type-IC CRISPR-Cas system from cleaving the host’s genome when unwanted, bettering in the future the stability of the kill switch and the characteristics of our product [16,17].