We attempted to apply TFAM, a protein known to seal (protect) mitochondria, to DNA storage technology known for its volatility. By inserting the TFAM protein along with our DNA sample in E.coli bacteria, we tested the effect of TFAM protein by comparing it to its placebo, a DNA sample without TFAM protein. The nucleotides (A, C, G, and T) of our DNA sample were translated into numbers 1 and 0 (e.g., AC = 1), which would then be interpreted into a smiley face. We exposed the two samples to several stress factors, including UV light and H2O2, and found that TFAM protected our DNA sample: the smiley face was largely intact (see Results for more information). In conclusion, the finding supported our hypothesis that TFAM will preserve DNA information as it does in other bodies of cells.
This project can advance the scientific, environmental, and economic fields. As society is entering its fourth industrial revolution, humans are generating data at a soaring rate, and the conventional data storage method will not be able to accommodate such an exponential rise. Although the DNA data storage method had been discussed as the potential solution for the efficiency problem (enabling 1,075 million GB of data to be stored in just 5 g of DNA), such a method had not been stable enough for implementation. By providing DNA-based data storage with a method that could enhance its stability, we hope to help advance this technology to step closer to real-world application. In academics, our project could further the understanding of DNA storage in genetics, supporting many potential future studies.
The effects of climate change have been rapidly expanding since it started in the mid-1900s. It is a long-term phenomenon that has cumulative harmful effects, and it is often said that if we do not turn around now, we are never going back. Even though the primary cause of climate change is the greenhouse gasses emitted from transportation, electronic waste also contributes to this major phenomenon. When not appropriately recycled, electronic waste can release harmful chemicals that affect our natural resources. When burnt, the process emits greenhouse gasses and toxic chemicals into the water and land, affecting communities nearby.
By utilizing DNA-based data storage, it is possible to reduce the effects of data storage on the climate, hopefully contributing to delaying climate change. Moreover, current digital data storage methods demand large amounts of electricity, workforce, and space, causing inefficiency. Employing DNA-based data storage may provide a solution to these problems. DNA is a dense material to store data in, significantly reducing the space needed for data storage. Experts have shown that one gram of DNA could store 215 petabytes of digital data; that is, all information humans have acquired could be contained in a single room if all of them are stored inside DNA. DNA is also a sustainable and durable material, which will require less electricity and workers as well.
Implementation of this technology may also benefit the economy by increasing the employment spaces available by creating numerous companies and different subfields with experts. This will be mentioned later in the economics section.
Considering when the project is first used in the world, our proposed end users will be data-based companies with the financial ability to invest in such technology. More specifically, these companies will be those that have not yet accepted the existing data storage technology and seek possible future economic growth, freedom from spatial limitations, and environmental benefits.
The experts estimate more than 20 years for DNA-based data storage to be widely available and commercialized to the general population like USB does today. When such a method is commercialized, it would be an extremely efficient way of storing methods with their four nucleotides that have a far longer degradation rate. The model would become widely accessible by the general public rather than scientists. Scientists, in most cases, do not have to deal with large amounts of data, and therefore, they do not feel a strong need to utilize this model. However, the general public that needs to access and store data constantly in their daily lives would benefit.
Although DNA-based data storage has rising potential, there are still limitations in its active practicality and commercialization. While modern, conventional data storage methods utilize CPU, which allows for high-speed data storage and retrieval through electricity, DNA data storage entails considerable cost and time in DNA synthesis and sequencing. Although the costs associated with both have been getting exponentially lower in recent years, there remains a huge gap compared to conventional methods. In addition, in terms of speed, Microsoft has estimated the rates for encoding data must be at least 100MB per second, but our current speed is only 400 bytes per second, which is millions of times slower than conventional methods. The current technology synthesizing DNA is also a major obstacle to applying DNA-based data storage.
Synthesized DNA is crucial for storing binary data, but the current development of DNA synthesis is inadequate to implement DNA-based data storage immediately. The cost and time it takes to synthesize DNA is a major limitation of applying this model in real life.
While DNA data storage using TFAM has multiple benefits, there are also possible challenges and concerns regarding safety. First, there are concerns associated with security. DNA data storage utilizes the same principle as the current data storage methods with computers, but it is still prone to information leakage. When private or crucial information is released, this may lead to consequences that individuals and companies must face. As DNA-based data storage is also a recent and emerging field of study, no definite hacking prevention mechanism exists. While the current data storage centers all utilize unique hacking prevention programs, developing one for DNA-based data storage will take time. The possibility that the data will be hacked before a definite mechanism is developed cannot be ignored.
Another problem is safety issues regarding DNA disposal. DNA, especially recombinant or synthetic DNA, is considered biohazardous waste, meaning it must be disposed of properly and carefully. There is no current method for how data stored in DNA must be disposed of, therefore a system must be developed and put into place. Another challenge that must be addressed is errors in synthesizing DNA. While there has been a great improvement in reading DNA, the skills of synthesizing DNA are not that advanced. Therefore, the process is prone to errors, which may result in the wrong data being stored or not containing the original data.
Benefits: DNA-based data storage technology can be widely used when implemented, attracting a greater target of populations than data-based storage. It is predicted that there will be an increase in the number of companies who invest and build with a focus on this novel technology, which can create a butterfly effect in benefitting the economic world. These companies will develop in size and number, growing financially. As they expand, they will be employing many new individuals to work in this field, and accordingly, more people will be needed to educate these workers, creating more spaces for employment.
Detriments: Of course, there cannot always be benefits to a novel technology. Existing but relatively small data-based storage companies may experience bankruptcy as they need to compete against novel technology. Moreover, as DNA data storage is a relatively high-cost technology, requiring approximately $1,300 per GB, only a few companies will have enough money to invest in such technology. Because DNA data storage allows for almost an endless amount of data storage, only a few additional companies will be needed to expand the field, possibly causing the initial starters to monopolize the data storage field.
Implementation will primarily affect data storage companies, their current workers, and the population that lives near these centers. To ensure security is comparable to the previous data storage centers, we must consider how their security works and transfer as many security strategies and technologies as possible. To diminish the economic hardships resulting from a sudden change, we could publically open up the details of such technology to allow smaller companies to rebrand in a smooth and timely manner if desired. Moreover,access to expertise and education regarding DNA-based data storage management could be made financially available to prevent current workers from experiencing unemployment.
Although DNA data storage has its benefits regarding density, longevity, and safety, DNA storage still lacks the accessibility of what digital data storage can achieve. Without developing a new technology where individuals can easily convert DNA data to different forms, CPU and USBs for conventional data storage, DNA-data storage will most likely be used by a limited number of big data companies. With this issue, our interviewee (Director Yoon-Sung-Joon of Fortugabio) recommended us a “hybrid implementation” method. This method involves using both DNA storage and digital data storage which can increase the benefit of the system and technology while making up for the problems regarding the accessibility of the DNA storage.
Kim, J. W. (2019, May 20). Naver data center is urgent... Residents oppose, Yongin city responds “I don’t know.” Hankyung. Retrieved August 7, 2022, from https://www.hankyung.com/it/article/2019052057351
Malone, J., & Higgins, D. (n.d.). Data Centers: Balancing climate change and digital growth. AECOM. Retrieved August 7, 2022, from https://aecom.com/without-limits/article/data-centers-balancing-climate-change-and-digital-growth/#:~:text=Data%20centers%20are%20where%20the,and%20they%20need%20it%20fast.
Panda, D., Molla, K. A., Baig, M. J., Swain, A., Behera, D., & Dash, M. (2018, May). DNA as a digital information storage device: Hope or hype? 3 Biotech. Retrieved August 7, 2022, from https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5935598/
Thomasy, H. (2022, January 13). The world has a data storage problem. is DNA the answer? NEO.LIFE. Retrieved August 7, 2022, from https://neo.life/2021/07/the-world-has-a-data-storage-problem-is-dna-the-answer/
Jayaraman, R. (2017, August 10). Can DNA solve the data storage problem? . PreScouter. Retrieved August 11, 2022, from https://www.prescouter.com/2017/08/dna-solve-data-storage/