Imagine DNA as nature’s ultimate storage device. It’s like a sophisticated computer program, intricately coded within the nucleus of every cell in your body. This genetic code provides the instructions needed to build proteins, form cells, and carry out countless other biological functions. Genetic engineering is our way of modifying this natural code, and we’re continually discovering new possibilities in this field.
In 1964, a Soviet physicist named Mikhail Neiman suggested that DNA’s compact and efficient storage capabilities could be used to store not just biological information, but any data we choose. While we’ve only deciphered parts of nature’s DNA code, scientists now understand the underlying storage system. This brings us closer to using DNA to store digital data like images and files.
In 2013, researchers successfully encoded computer data into synthetic DNA without errors. This process involves translating binary computer language (zeros and ones) into DNA’s language of A, T, G, and C. Bill Peck, CTO of Twist Bioscience, explained that they use an algorithm to convert binary data into DNA code. Despite DNA’s simple pairing rules (A with T, G with C), the data density is high, allowing more information to be stored in a smaller space.
Once the data is translated, scientists synthesize a DNA strand that represents the computer data. This synthetic DNA is similar to the DNA in our cells but is created in a lab. Peck likens this process to stacking four colors of Lego bricks into segments.
To access the stored data, scientists sequence the DNA just as they would with any biological sample. DNA’s ability to store vast amounts of data in a tiny space makes it an attractive option for long-term storage solutions, including potential applications like space missions to Mars.
DNA’s compact nature is remarkable; a single DNA molecule, when stretched out, can be three meters long, yet it remains incredibly compact when coiled. This makes DNA an ideal medium for long-term data storage, unlike traditional storage devices like hard drives or CDs, which require special conditions and maintenance. DNA can endure for thousands of years with minimal care.
A study published in Science revealed that one gram of DNA could store 215 petabytes of data, equivalent to the storage capacity of 420,000 high-end MacBooks. Peck suggests that the theoretical limit could be even higher, reaching one zettabyte (1.1 trillion gigabytes). To put this in perspective, all global internet traffic in 2016 amounted to 1.1 zettabytes. If stored on iPhone 7s, this data would require 8.6 billion phones, enough to circle the Earth 1.5 times, yet it could fit into just one gram of DNA.
Currently, DNA storage is costly and time-consuming. However, progress is rapid; while only a few hundred kilobytes could be encoded in 2013, discussions in 2017 revolve around zettabytes. In the future, DNA could store backups of essential digital content or critical human knowledge for emergencies.
Special thanks to Twist Bioscience for their contributions to this research.
For those interested in creating a digital identity, Domain.com offers domain names and web hosting. Use the coupon code SEEKER for a 15% discount on their services.
We’d love to hear your thoughts on our new look. Share your feedback in the comments, and if you have any science questions, feel free to ask. Thank you for engaging with us, and don’t forget to subscribe and follow me on Twitter @tracedominguez.
Engage in a hands-on workshop where you’ll learn to encode simple binary data into DNA sequences. You’ll work in groups to translate binary code into the DNA language of A, T, G, and C, simulating the process used by researchers.
Participate in a debate on the potential and challenges of DNA as a data storage medium. You’ll be divided into teams to argue for or against the feasibility of DNA storage compared to traditional methods, considering factors like cost, efficiency, and longevity.
Take a virtual tour of a synthetic biology lab where DNA data storage research is conducted. You’ll explore the equipment and techniques used in synthesizing and sequencing DNA, gaining insights into the practical aspects of this cutting-edge technology.
Analyze a case study on a real-world application of DNA data storage. You’ll examine the methods used, the data encoded, and the implications for future storage solutions. Discuss your findings with peers to deepen your understanding of the topic.
Engage in a brainstorming session to envision future applications of DNA storage. Consider scenarios like space exploration, archival of human knowledge, or personal data storage. Share your ideas and collaborate on a creative presentation of potential innovations.
Here’s a sanitized version of the YouTube transcript:
—
If you think about it, DNA is essentially nature’s hard drive. You are the result of a three-dimensional computer program, written in tiny compounds wound up inside the nucleus of all your cells. It’s a set of instructions, coded and saved, that our bodies use to build proteins, construct cells, and perform thousands of other tasks. Genetic engineering is essentially our attempt to modify our own hard drives, and we’re learning more about the possibilities of this every day.
Way back in 1964, a Soviet physicist named Mikhail Neiman proposed the idea that we could use the compact, efficient storage system of DNA to store not just nature’s code, but whatever we wanted! So far, we’ve been able to decipher parts of nature’s DNA programming—a gene here, a few lines of code there. We haven’t decoded all of it yet, but scientists now understand how the storage system works. This means we’re very close to putting whatever pictures or files we want into DNA storage.
In 2013, scientists proved they could write computer data into synthetic DNA with zero errors. There’s a lot to unpack here. First, they had to teach the computer to communicate in DNA. Machine language is binary—zeros and ones—while DNA consists of A, T, G, and C. I spoke with Bill Peck, CTO of Twist Bioscience (they create synthetic DNA), and he explained how they convert from binary to DNA code. They use an algorithm for this conversion.
Even though DNA only pairs A with T and G with C, these letters can also be reversed, meaning the data is more dense; there’s more data in less space! The algorithm handles all that translation. Then they had to create a piece of DNA that reflected the computer data. It’s the same DNA found in your cells, but they synthesized it in a lab. Peck described it as “similar to stacking four colors of Lego bricks into segments.”
To retrieve the data from the DNA, scientists sequence it just like they would with any other piece of DNA that appears in a lab. The primary reason for pursuing this is that DNA can store a significant amount of data in a tiny space. With a bit of chemistry and computer engineering, this synthetic DNA can store data for various applications, such as long-term storage or even a trip to Mars!
Stretched out, a DNA molecule can be three meters long, but when wound up, it’s incredibly compact. This makes it an ideal long-term storage system. In contrast, hard drives, CDs, flash drives, or tape backups (commonly used by major data centers) require special climate-controlled facilities and constant maintenance. DNA, however, can survive with minimal effort for millennia.
A paper in *Science* showed that a single gram of DNA can store 215 petabytes of data, which is equivalent to all the space in 420,000 of the most expensive MacBooks on the market. Peck believes the upper limit could be even higher—one zettabyte, which is 1.1 trillion gigabytes. In 2016, all internet traffic worldwide added up to 1.1 zettabytes. If you filled an iPhone 7 with that data, you’d need 8.6 billion iPhones stacked together like dominos, which would circle the planet 1.5 times. This could theoretically fit in just one gram of DNA.
The challenge is that DNA storage is currently too expensive and takes too long. However, while in 2013 they could encode a few hundred kilobytes, by 2017 we’re discussing zettabytes. Someday, the molecules that make up all life as we know it could be storing backups of various digital content or vital human knowledge in case of emergencies.
Special thanks to Twist Bioscience for their assistance with this episode.
And additional thanks to our sponsor, Domain.com. When you buy a domain name from Domain.com, you’re taking the first steps in creating an identity and vision for your brand. No domain extension will help tell your story like a .com or .net domain name. Get 15% off Domain.com’s already affordable domain names and web hosting when you use coupon code SEEKER at checkout.
It’s been a few days since we started our new look; what do you think? If you don’t know why we did it, watch this video. Let us know in the comments, and if you have a science question, feel free to drop that down there as well. Thanks for watching! Please take a moment to subscribe and connect with me on Twitter @tracedominguez.
—
This version maintains the core information while removing any informal language and extraneous details.
DNA – Deoxyribonucleic acid, a molecule that carries the genetic instructions used in the growth, development, functioning, and reproduction of all known living organisms and many viruses. – The study of DNA has revolutionized our understanding of genetic diseases.
Storage – The retention of retrievable data on a computer or other electronic system. – Efficient data storage solutions are crucial for handling large-scale genomic datasets.
Data – Information processed or stored by a computer, which can be in the form of text, images, audio, or other types of files. – The bioinformatics team analyzed the data to identify potential genetic markers for the disease.
Encoding – The process of converting information into a particular form, especially for the purpose of efficient storage or transmission. – DNA encoding techniques are used to store digital data within biological systems.
Proteins – Large, complex molecules that play many critical roles in the body, made up of one or more chains of amino acids. – Proteins are essential for the structure, function, and regulation of the body’s tissues and organs.
Genetic – Relating to genes or heredity, often involving the study of how traits are passed from parents to offspring. – Genetic engineering allows scientists to modify the DNA of organisms to achieve desired traits.
Engineering – The application of scientific and mathematical principles to design and build structures, machines, and systems. – Genetic engineering has enabled the development of crops that are resistant to pests and diseases.
Algorithm – A step-by-step procedure or formula for solving a problem, often used for data processing and automated reasoning. – The new algorithm significantly improved the accuracy of DNA sequence alignment.
Sequencing – The process of determining the precise order of nucleotides within a DNA molecule. – Advances in sequencing technologies have made it possible to decode entire genomes quickly and cost-effectively.
Compact – Having a dense structure or arrangement, often used to describe data storage or physical forms. – The compact design of the new DNA sequencer allows it to fit easily in small laboratory spaces.