Skip to main content

Search Here

Technology Insights

DNA Data Storage in 2026: How Microsoft, Twist Bioscience, Catalog, Biomemory, and Iridia Are Archiving the World's Data Inside Synthetic DNA

DNA Data Storage in 2026: How Microsoft, Twist Bioscience, Catalog, Biomemory, and Iridia Are Archiving the World's Data Inside Synthetic DNA

  • Internet Pros Team
  • June 21, 2026
  • AI & Technology

Humanity now creates more data in a single day than it produced in entire centuries - and almost none of it has anywhere safe to live for the long haul. Hard drives wear out in years, magnetic tape degrades in decades, and every copy has to be migrated to new hardware again and again, forever. Nature solved this problem billions of years ago. The instructions to build a blue whale or a redwood are written in DNA, a molecule so dense and so durable that we still read it from bones tens of thousands of years old. In 2026, DNA data storage - encoding ordinary digital files into synthetic strands of DNA - has moved from astonishing lab demos toward the first real products for archiving the world's most precious information.

Why We Are Running Out of Places to Put Data

The total amount of data the world generates is now measured in zettabytes - trillions of gigabytes - and the curve keeps steepening as AI, video, sensors, and scientific instruments pour out information. The uncomfortable truth is that most of this never needs to be touched again but must never be lost: medical records, legal archives, scientific datasets, film masters, national libraries, financial ledgers. This is "cold" archival data, and the media we use for it are quietly failing us.

Tape and disk share three fatal flaws for century-scale storage: they are not dense enough, they do not last, and they demand endless migration as formats and hardware go obsolete. A data center full of spinning disks also burns power continuously just to keep bits alive. DNA flips every one of these problems on its head.

How You Store a File in DNA

Computers store everything as binary - ones and zeros. DNA stores information in a four-letter chemical alphabet: the bases A, C, G, and T. Converting between them is conceptually simple. A stream of bits is mapped onto a sequence of those four letters (so each base can carry two bits), then split into millions of short fragments, each tagged with an address so the file can be reassembled later. The full pipeline has four stages:

1. Encode

Software converts the digital file into A/C/G/T sequences, adding addresses and error-correcting redundancy so a few damaged strands never sink the file.

2. Write (Synthesis)

A DNA synthesizer manufactures those sequences chemically or enzymatically, building physical strands base by base. This is the slow, expensive step.

3. Store

The dried DNA is sealed in a tiny capsule. With no power and no moving parts, it sits stable for centuries to millennia.

4. Read (Sequencing)

When the data is needed, a DNA sequencer reads the strands back into A/C/G/T, and software decodes them into the original bits.

The Two Numbers That Make DNA Irresistible

Two properties explain why some of the biggest names in computing are chasing this. The first is density. DNA can theoretically pack on the order of an exabyte - a billion gigabytes - into a single gram of material. Put differently, every film, song, and book humanity has ever made could fit in a volume smaller than a sugar cube. The second is longevity. Where a hard drive is doing well to survive a decade, DNA kept cool and dry remains readable for thousands of years, with no electricity and no migration cycle.

"We have been encoding civilization in DNA for billions of years. The only new idea is choosing what the sequence means. A molecule that survives in a fossil will easily outlive any drive, tape, or cloud we have ever built."

A molecular storage researcher on why DNA is the natural archive

Who Is Building DNA Storage in 2026

What was a handful of academic papers a few years ago is now a small industry spanning hyperscalers, biotech firms, and startups:

  • Microsoft & the University of Washington - the Molecular Information Systems Lab pioneered fully automated, end-to-end DNA storage and random access, proving you can retrieve a specific file without reading the whole pool.
  • Twist Bioscience - a leader in high-throughput silicon-based DNA synthesis, supplying the dense strands that storage systems are written onto.
  • Catalog Technologies - sidesteps slow base-by-base writing by pre-making DNA building blocks and assembling them like movable type, with its Shannon platform aiming at scale and even computation on stored data.
  • Biomemory - the French startup selling consumer "DNA Cards" that store a small message for a claimed 150-plus years, an early sign of productization.
  • DNA Script - commercializes enzymatic DNA synthesis, using enzymes instead of harsh chemistry to write strands faster and greener.
  • Iridia - building a semiconductor-style chip to write and read DNA electronically, chasing the device that could make DNA storage a rack-mounted product.

DNA vs. Today's Archival Media

PropertyHard DiskMagnetic TapeSynthetic DNA
DensityLowModerateExtreme (exabytes per gram)
Lifespan~5 years~10-30 yearsCenturies to millennia
Power at restContinuousNoneNone
Migration neededConstantPeriodicNever (chemistry won't obsolete)
Access speedInstantMinutesHours (today)

The Honest Trade-offs

DNA storage is not about to replace your SSD, and the people building it are the first to say so. The hurdles are real:

  • Writing is slow and costly. Synthesizing DNA at data-center scale is still far more expensive per gigabyte than tape - the single biggest barrier. Bringing that cost down is the whole game.
  • Access is measured in hours, not milliseconds. Sequencing takes time, so DNA suits deep cold archives you rarely open - not active databases.
  • Errors must be engineered around. Synthesis and sequencing both make mistakes, so heavy error-correcting codes and redundancy are mandatory.
  • It needs standards. An archive meant to last 1,000 years needs agreed-upon encoding formats so future generations can still decode it - exactly the kind of work industry consortia are now starting.

"Nobody is putting their email inbox in DNA. The target is the data you must keep for a century but will almost never read - and for that, nothing else even comes close."

An archival storage architect on DNA's real use case

What This Means for Your Business

For most organizations, DNA storage is a technology to track, not to deploy this quarter. But the strategic signal matters: the cost of writing DNA is falling along a curve that looks a lot like early sequencing, which dropped faster than Moore's Law. Smart teams are already asking which of their datasets are truly permanent, separating "keep forever" data from the churn, and watching for the first cloud providers to offer DNA-backed cold tiers. The era of throwing away data because it costs too much to keep is ending - and the most durable, dense archive ever conceived turns out to be the same molecule that wrote you.

At Internet Pros, we help businesses design data architectures that are ready for what's next - from intelligent cloud storage tiers to long-term backup strategy. Get in touch to talk about future-proofing your data, or explore more technology insights on our blog.

Share:
Tags: AI & Technology Software Development Business

Related Articles