Download as pdf or txt
Download as pdf or txt
You are on page 1of 8

MEMORY AND STORAGE

The DNA Data


Storage Model
Dave Landsman , Western Digital Corporation
Karin Strauss , Microsoft Corporation

base. These fundamental capabili-


Reliably storing digital data in synthetic DNA ties make it possible to encode digital
data into a sequence of bases (adenine,
fits naturally into the layered Open Systems guanine, cytosine, and thymine, or
AGCT), write that sequence as a set of
Interconnect model and shares many parallels corresponding DNA molecules (syn-
with reliably storing data using existing storage thesis), store the molecules, prepare
them for reading (retrieval), read them
technologies and interfaces. back as a sequence of bases (sequenc-
ing), and finally, decode the original

D
digital data (Figure 1). To learn more
about this process, see Preserving Our
NA data storage, or using synthetic DNA as a Digital Legacy: An Introduction to DNA Data Storage.1
data storage medium, is being seriously con- Even though synthetic DNA as a data storage medium
sidered as an archival storage solution due to is similar to traditional storage media in many ways, it is
its volumetric data density potential, data re- worth highlighting some key differences.
tention characteristics, sustainability, and potential for First, in traditional storage, the media is premanufactured
dramatically lower total cost of ownership versus existing (e.g., SSDs use NAND cells, HDD, and tape use magnetic do-
storage technologies. mains on a platter or strip, respectively) and written by mod-
ifying the state of the media. For such devices, capacity and
INTRODUCTION throughput scaling require modifications to both media and
The biotechnology industry has made DNA data storage pos- write/read heads. In contrast, the most common DNA data
sible today due to decades of investment in molecular-level storage method does not employ a premanufactured media
technologies for medical and life sciences applications that substrate. (Note: Methods to attach DNA to a planar substrate,
now enable us to construct and read synthetic DNA, base by to serve as a “memory/storage cell”, are being considered; this
article does not cover these methods.) Instead, the storage
Digital Object Identifier 10.1109/MC.2023.3272188
media—DNA molecules—are manufactured during write op-
Date of current version: 26 June 2023 erations. In this case, DNA’s universal, fixed, and reader/writer

78 CO M PUTE R P U B LISHED BY THE IEEE COMP UTER SOCIE T Y 0018-9162/ 23©2023IEEE


EDITOR TOM COUGHLIN
Coughlin Associates; tom@tomcoughlin.com

independent physical structure enables 1 Electronic Molecular Electronic 1


throughput improvements without me- 0 0
dia changes, or data migration when 0
G
Storage
G
0
adopting new writer/reader generations. 0 Encoding Synthesis Retrieval Sequencing Decoding 0
C C
1 1
Second, because DNA media is de- A A
0 0
tached from any array-based substrate, C C
1 1
T T
protocol information such as object iden- 0 0
tifiers and segment indices must be em- 1 1
bedded within each DNA molecule in the 1 1
DNA archive, for locating objects, object FIGURE 1. The DNA data storage system. Encoding and decoding are performed in the
segmentation, etc. Despite this over- electronic domain and translate bits to bases and vice versa. Synthesis and sequencing are
head, DNA can achieve much higher vol- the interfaces to and from the molecular domain, creating and reading DNA sequences.
umetric density as detached media than
as array-based media.
Third, DNA has unique error char- DNA MECHANICS IN BRIEF from them, effectively copying the
acteristics as a medium for end-to-end We are most familiar with DNA as a “dou- cell’s genetic information. Both ssDNA
storage. For example, in addition to sub- ble-helix” (Figure 3), or dual-stranded and dsDNA are used in different parts
stitutions (akin to bit flips), insertions DNA (dsDNA), where the base adenosine and applications of the DNA data stor-
and deletions may also occur. Encoders on one strand has a chemical binding af- age pipeline.
try to avoid certain sequences2,3,4 to re- finity (complementarity) with the base Note that no organisms or cells are
duce interference with the writing and thymine on the other strand, and the used for DNA data storage: synthetic
recovery process. base cytosine has an affinity with the DNA for data storage is constructed
Despite the uniqueness of synthetic base guanine. Complementarity is used and manipulated through well-con-
DNA as a storage medium, there are in nearly all of the techniques employed trolled chemical processes, covered
many parallels with traditional data stor- in DNA data storage, in the process in the section “The DNA Data Storage
age and storage interface mechanisms. called hybridization (Figure 4). Physical Layer.”
In this article, we draw parallels between In organisms, cell division natu-
the DNA data storage model and the rally replicates the genetic code. Cel- APPLICATION,
Open System Interconnection (OSI) model lular mechanisms separate the two PRESENTATION, AND
(Figure 2). Before we begin describing strands of the original dsDNA into two SESSION LAYERS
the DNA data storage layer model, we de- single-stranded DNA strands (ssDNA) The upper three layers of the OSI
scribe a few things about DNA. and create two new dsDNA molecules model map to the DNA data storage

• Network Resource Access Protocols/Services


Application Application • Cataloguing, Archive Management, ...
• HTTP, FTP, SMTP, ...

• Translate, Encrypt, Compress, Serialize, ...


Presentation Presentation • Encryption/Compression
• Ensure Data in Usable Format for Transmission
Binary
Session • Establish, Manage and Terminate Sessions Session • Interface for DNA Archive Access (1100...)
• Application Programming Interfaces
• Reliable End-to-End Packet Delivery
Transport • Transformations
• Error Recovery
• Segmentation/Packetization
• Move Packets From Source to Destination DNA Channel • Error Detection/Correction
Network
• Internetworking/Routing (Codec SW)

• Define Format on the Wire; Bits to Frames, ... • Bits-to-Bases


Data Link • Optional Symbol Space Protocol
• Hop-to-Hop Delivery DNA
(AGCT...)
DNA Physical • Synthesis, Storage, Sequencing
Physical • Transmit Bits Over a Physical Medium (Chemistry) • Retrieval: Primers and Probes
(a) (b)
FIGURE 2. The layers of (a) the OSI model and (b) the DNA data storage model.

J U LY 2 0 2 3  79
MEMORY AND STORAGE

THE DNA DATA STORAGE while maintaining sufficient accu-


Sugar
Phosphate CHANNEL LAYER racy, is around 300 bases. Even if ev-
Backbone The DNA channel layer takes as ery base in a strand could be used for
Base Pair input a bitstream or other digital payload data, a typical strand would
object from the session layer and encode tens of bytes. Thus, like in
Adenine Thymine processes those bits to ensure that the traditional networking, where bit-
DNA sequences written and read by streams must be broken into smaller
the physical layer can be successfully pieces to fit limited-size packets, DNA
Guanine
Cytosine decoded, enabling recovery of the data storage requires breaking data
or ig i n a l d ig ital source data. The objects into many segments during
DNA channel layer is implemented as encoding to fit limited-size strands.
a software codec.2,3,4,5,6,7,11 Reassembling these segments in the
The DNA channel [Figure 5(b)] shares right order when sequencing requires
conceptual characteristics with a more adding indices to each segment before
FIGURE 3. The DNA double helix. (Source: traditional network/electrical channel synthesis. The need for segment indi-
National Human Genome Research Insti- [Figure 5(a)]. The DNA channel layer ces (and other protocol fields) intro-
tute; used with permission.) roughly incorporates the functionality duces a tradeoff between the number
of the transport through the data link of segments used to represent an ob-
layer in the OSI model. It receives a bit- ject in a DNA pool (that is, maximum
model effectively unchanged in func- stream from the session layer, preparing object size) and the number of bases
tion; at this level of abstraction, DNA it for handoff to the lower layers for the used for the segment index (that is, in-
is simply another storage medium. conceptual equivalent of “transmis- dex overhead).
The application layer is the inter- sion,” which, in the DNA case, means Error correction: In constructing
face closest to users. For example, writing, storing, recovering and read- segments for storage, the DNA codec
since DNA data storage is best matched ing DNA molecules. As there is no must add redundant information for
to long-term storage, the application physical “wire” in DNA data storage, error correction because random er-
layer is likely to define how data are the data link layer functionality is em- rors (base insertions, deletions, and
logically organized for archival pur- bedded in the DNA codec, instead of in substitutions) and erasures (missing
poses, including metadata describing the transmitter/receiver pair (Figure 5, sequences) may affect such segments
the data in the archive. dashed lines). through the DNA physical layer. Er-
The presentation layer maps natu- When the bitstream is presented ror correction can augment segments
rally to the preparation of bitstreams as to the DNA channel layer, the follow- with additional data (inner code) or
input to the lower layers. It may include ing types of operations are performed, add segments that contain additional
transformations such as encryption not necessarily in this order. data (outer code). After the DNA is se-
and compression. Like with other me- Packetization: In a network or fab- quenced, the DNA codec uses this re-
dia, such functions may be accelerated ric, the key function of the transport dundant information to recover lost
by special-purpose (silicon-based) hard- and network layers is packetization, sequences, correct remaining errors,
ware support. routing, and flow control across the and reliably deliver the original bit-
The session layer provides access link. Neither routing nor flow control stream back to the upper layers.
to the DNA data storage interface. are relevant here, but packetization is. Translation: To store digital data in
Various implementations are possible, The longest strand of synthetic DNA DNA, the data must be translated from
but the most commonly discussed is that can currently be constructed digital (0 or 1) to bases (A, T, C, or G).
object based. In this implementation, the base by base with standard chemistry, Although it is possible to simply map
interface provided by the ses- every two bits in a bitstream into
sion layer is basically an object one of the four possible bases,
store with a key-value schema. there are advantages to other
It offers basic primitives such mappings. For example, map-
as read/write of individual objects pings of longer bitstreams to
or all objects, as well as primitives longer base sequences may bet-
with more complex semantics, ter accommodate certain error
such as indexed search. These correction methods and be more
commands translate to logical space efficient.
FIGURE 4. Hybridization: A process in which comple-
and physical storage operations at Transformations: With DNA,
mentary bases bind to each other.
lower layers. pat ter n s of repeated ba ses

80 COMPUTER  W W W.CO M P U T E R .O R G /CO M P U T E R


Transport Layer Physical and Data Link Layer Transport Layer

Network Stack and Switches Transmitter Receiver Network Stack and Switches

• Packetization Protocol Payload ECC


Bitstream Line Coding (8b10b, etc.) Line Decoding
• Routing
111100.. • Flow Control

Protocol Payload ECC

• Depacketization
• Routing Bitstream
Symbols to Signals Signals to Symbols
• Flow Control 111100..
(D→A, Shaping, ...) (A→D, Shaping, ...)

Analog
(a) Electrical

DNA Channel Layer DNA Physical Layer DNA Channel Layer

DNA Codec (Encode) Synthesizer Storage Sequencer DNA Codec (Decode)

• Scramble • ECC Removal


Bitstream • Packetize • Depacketize Bitstream
111100.. • ECC Insertion • Descramble 111100..
Symbols to Molecules Molecular Molecules to Symbols
• Bits-to-Bases • Bases-to-Bits

Protocol Payload ECC Protocol Payload ECC

(b) DNA
FIGURE 5. Comparing (a) the electrical channel model with (b) the DNA channel model.

J U LY 2 0 2 3 
81
MEMORY AND STORAGE

(homopolymers), a high proportion of similar in nature to those used in exist- technology-based implementations that
Gs and Cs (high GC content), and some ing storage systems. In the next section, miniaturize the synthesis device and
other specific patterns can cause er- we discuss the DNA physical layer. can achieve higher write throughput by
rors. For example, some sequencing synthesizing more sequences in paral-
techniques exhibit errors with long THE DNA DATA STORAGE lel (thousands in well plates versus bil-
homopolymers, and high GC content PHYSICAL LAYER lions on chips). For example, chips akin
may affect sequencing preparation The DNA physical layer [Figure 2(b)] in- to static RAM arrays have been used to
protocols. The transformations in the volves the basic chemistry of DNA for synthesize sequences of DNA, one se-
DNA channel layer avoid transforming writing (synthesis), physically preserv- quence per array position8 (Figure 6).
digital bitstreams into such problem- ing (storage), and reading (sequencing) Like other storage technologies,
atic patterns of bases, thereby reducing DNA molecules. It also involves the scaling relies on miniaturization of
downstream errors and associated er- basic chemistry of DNA for retrieval, these positions so that more fit onto
a chip, but instead of increasing
capacity, more positions (synthesis
sites) on a DNA synthesis chip increase
write throughput, as more sequences
In summary, the DNA channel layer encodes an can be synthesized simultaneously.
input bitstream so that the DNA sequences that are
sent to the physical layer can be effectively and Sequencing
successfully decoded after “transmission.” The two main methods of DNA sequenc-
ing considered for DNA data storage to-
day are sequencing-by-synthesis (SBS)
ror correction overhead, and improving which is both a general preparation (Figure 7) and nanopore sequencing
overall data reliability. This is concep- step for sequencing and also an integral (Figure 8). SBS20,21 indirectly ide­ntifies
tually similar to the actions in a net- step in how systems implement DNA the bases in a source ssDNA strand by
work/electrical channel to mitigate an- storage operation requests from the up- reconstructing a dsDNA strand from
alog effects. For example, a serial link per layers. We will first briefly discuss that source. As each complementary
maintains close to an equal number of synthesis, storage, and sequencing and base is attached in building the dsDNA
ones and zeroes on the wire. then cover retrieval as it is central to strand, the attachment event enables
Optional DNA space protocol: In ad- the functioning of logical storage oper- identification of the original source base,
dition to the actions above, the codec ations for DNA data storage. usually by optical means. Nanopore se-
may insert additional base sequences quencing22,23 detects bases directly
into the already transformed digital Synthesis (no dsDNA construction) by measuring
data (process not shown in Figure 3). DNA synthesis has historically been disturbances in ionic current as a DNA
The purposes are use-case specific implemented with large machines, strand is guided through a tiny opening
but generally involve random access using hoses, valves, and plastic contain- (nanopore). Nanopores can be biological
to or search within the DNA archive. ers known in the life sciences as well or, in development, solid state. A main
An example is the addition of an ob- plates. Hoses and valves will not disap- tradeoff between the two methods is
ject ID, assigned by the session layer pear; however, advances in the semi- that SBS offers higher accuracy but typ-
to sequences belonging to the same conductor industry have enabled silicon ically requires a batched process that
object. The section “Retrieval” dis-
cusses this further.
After these steps, the resulting
DNA sequences are handed over to the
DNA physical layer.
In summary, the DNA channel layer
encodes an input bitstream so that the Detection
DNA sequences that are sent to the Event
physical layer can be efficiently and suc-
cessfully decoded after “transmission” 2 µm
Source
(synthesis, storage, retrieval, and se- Strand
quencing). In doing this, the DNA chan-
FIGURE 6. An electrochemical DNA syn- FIGURE 7. SBS sequencing. (Source:
nel uses transformations and process-
thesis array at 2 μm pitch.8 Illumina, Inc.; used with permission.)
ing steps that are well structured and

82 COMPUTER  W W W.CO M P U T E R .O R G /CO M P U T E R


results in high latency, while nanopore DNA pull-out with magnetic nanoparti- complement the target site of one or
sequencing offers lower latency at typi- cles15 (Figure 9). more objects, attach to the target mol-
cally lower accuracy. Both methods re- PCR requires the session layer to as- ecules (recall that A pairs with T, and C
quire physical retrieval and preparation sign a set of object IDs, which, at encod- with G) and, along with special enzymes
of the molecules for reading that may ing time, the channel layer appends as called polymerases, kickstart the PCR
add to the readout latency. a set of predefined DNA base sequences process. Over multiple PCR cycles, the
As with synthesis, further scal- to both ends of every DNA sequence be- attached probes (called primers when
ing in sequencing is needed to make longing to an object in the archive; these used in PCR), enable the polymerases
DNA data storage practical, but the base sequences are referred to as target to copy only the target DNA molecules,
fundamentals for both synthesis and sites. At retrieval time, probes, that is, making them more abundant than
sequencing are now in place on semi- short DNA sequences that perfectly molecules representing other objects.
conductor-based platforms, and scal-
ing such platforms is something for
which the technology industry has an
impressive track record.

Electrical Current (pA)


Storage
After being written and before being
read, the DNA molecules need to be
stored in an environment that prevents
Unwinding
them from degrading.9,10,11 While a Enzyme
wide variety of preservation methods Nanopore
are common in biotechnology broadly, Read Position
DNA used for data storage is typically
stored dry and in a chemically inert en-
vironment to increase storage density
and stability. Significantly, with such
Ionic Current
methods, DNA for data storage shows
potential for extremely long endur-
ance at room temperature, supporting
reliable long-term storage that is low
cost and sustainable. We expect that
FIGURE 8. Nanopore sequencing. (Source: National Human Genome Research Insti-
large pools of DNA molecules (say, tera-
tute, https://www.genome.gov; used with permission.)
bytes of storage) will be organized into
even larger libraries where pools will
be addressed using a physical coordi-
Target Site
nate system,12 like tape libraries today. Magnetic
Nanoparticle
Retrieval: Probes for Selected
Copied
storage operations Object A
Retrieval defines how DNA molecules are Ploymerase
Probe
extracted from a DNA archive and pre- Enzyme
pared for sequencing. The most basic
operation is to read the entire archive, Object B
Not Copied Not Selected
which is equivalent to reading all data in
a tape from beginning to end. However,
when a pool of DNA contains multiple ob-
(a) PCR (b) Magnetic Nanoparticle Pull-Out
jects, random access operations (for ex-
ample, “seeking” the location of an object
or searching for sets of objects) need to be
FIGURE 9. Two random access methods: (a) Probe attaches to the target site so that
performed on the pool.
only the DNA sequences with the Object ID are copied during PCR. (b) Probe attaches
There are a variety of methods to
to the target site of sequences with the Object ID, and magnetic nanoparticles used for
implement such operations, but two
molecular pull-out with a magnet.
popular methods are PCR 3,13,14 and

J U LY 2 0 2 3  83
MEMORY AND STORAGE

The result is a pool in which most of the methods in the OSI layered storage storage architecture,” Science, vol.
DNA molecules represent the object(s) model will help the nascent DNA data 355, no. 6328, pp. 950–954, Mar. 2017,
to be read. storage ecosystem evolve. doi: 10.1126/science.aaj2038.
DNA pull-out with magnetic nano­ 8. B. H. Nguyen et al., “Scaling DNA
particles uses the same principle ACKNOWLEDGMENT data storage with nanoscale elec-
of DNA attachment but a different We thank Kyle Tomek, John Hoff- trode wells,” Sci. Adv., vol. 7, no. 48,
method to make the molecules repre- man, Damien Le Moal, and Luis Ceze Nov. 2021, Art. no. eabi6714, doi:
senting the object of interest more for their valuable contributions and 10.1126/sciadv.abi6714.
numerous in a solution. With this feedback on this manuscript. We also 9. D. Coudy, M. Colotte, A. Luis, S. Tuffet,
process, the channel layer only needs thank our respective research teams and J. Bonnet, “Long term conservation
to append the object IDs as target sites and collaborators for their substantial of DNA at ambient temperature. Im-
at one end of every DNA sequence be- contributions to this field. Dave Lands- plications for DNA data storage,” PLoS
longing to an object. At retrieval time, man is the corresponding author. One, vol. 16, no. 11, Nov. 2021, Art.
the probes are attached to magnetic no. e0259868, doi: 10.1371/journal.
nanoparticles. As the probes bind REFERENCES pone.0259868.
to target DNA molecules, the target 1. “Preserving our digital legacy: 10. L. Organick et al., “An empirical com-
DNA molecules can then be sepa- An introduction to DNA data parison of preservation methods for syn-
rated from the rest of the DNA mole- storage,” DNA Data Storage Alliance. thetic DNA data storage,” Small Methods,
cules in the pool with a magnet. [Online]. Available: https:// vol. 5, no. 5, May 2021, Art. no. 2001094,
For either random access method, dnastoragealliance.org/dev/wp doi: 10.1002/smtd.202001094.
additional preparation steps such as -content/uploads/2021/06/DNA 11. R. N. Grass, R. Heckel, M. Puddu,
further PCR steps and DNA cleanup -Data-Storage-Alliance-An D. Paunescu, and W. J. Stark, “Robust
may be required, depending on the se- -Introduction-to-DNA-Data-Storage.pdf chemical preservation of digital infor-
quencer used for reading the informa- 2. N. Goldman et al., “Towards practical, mation on DNA in silica with error-cor-
tion out. high-capacity, low-maintenance infor- recting codes,” Angewandte Chemie,
Tradeoffs between different chemi- mation storage in synthesized DNA,” vol. 54, no. 8, pp. 2552–2555, Feb. 2015,
cal implementations of object random Nature, vol. 494, no. 7435, pp. 77–80, doi: 10.1002/anie.201411378.
access along dimensions such as reliabil- Feb. 2013, doi: 10.1038/nature11875. 12. S. Newman et al., “High density DNA
ity, preparation, and reading overheads 3. L. Organick et al., “Random access in data storage library via dehydration
are an active area of research for DNA large-scale DNA data storage,” Nature with digital microfluidic retrieval,”
data storage,16 as well further work on Biotechnol. vol. 36, no. 3, pp. 242–248, Nature Commun., vol. 10, no. 1,
implementing even more advanced op- Mar. 2018, doi: 10.1038/nbt.4079. Apr. 2019, Art. no. 1706, doi: 10.1038/
erations such as search,17 content sim- 4. S. R. Srinivasavaradhan, S. Gopi, H. s41467-019-09517-y.
ilarity search,18 file preview,19 etc. D. Pfister and S. Yekhanin, “Trellis 13. S. M. Tabatabaei Yazdi, Y. Yuan,
This discussion has shown how, for BMA: Coded trace reconstruction on J. Ma, H. Zhao, and O. Milenkovic,
the same read-object request from IDS channels for DNA storage,” in “A rewritable, random-access DNA-
the session layer, d i f ferent mecha- Proc. IEEE Int. Symp. Inf. Theory (ISIT), based storage system,” Scientific Rep.,
nisms can be used at the lower chan- 2021, pp. 2453–2458, doi: 10.1109/ vol. 5, Sep. 2015, Art. no. 14138,
nel and physical layers, similar to ISIT45174.2021.9517821. doi: 10.1038/srep14138.
how two NAND f lash storage devices 5. G. M. Church, Y. Gao, and S. Kosuri, 14. C. Winston, L. Organick, D. Ward,
are implemented differently despite “Next-generation digital information L. Ceze, K. Strauss, and Y. J. Chen,
using the same command interface. storage in DNA,” Science, vol. 337, no. “Combinatorial PCR method for
6102, p. 1628, Sep. 2012, doi: 10.1126/ efficient, selective oligo retrieval from

W
science.1226355. complex oligo pools,” ACS Synthetic
hile DNA as a storage me- 6. W. H. Press, J. A. Hawkins, S. K. Jones Biol., vol. 11, no. 5, pp. 1727–1734, Feb.
dium has fundamental Jr., J. M. Schaub, and I. J. Finkelstein, 2022, doi: 10.1021/acssynbio.1c00482.
differences from tradi- “HEDGES error-correcting code for 15. K. N. Lin, K. Volkel, J. M. Tuck, and
tional storage, many of the data trans- DNA storage corrects indels and A. J. Keung, “Dynamic and scalable
formations and error processing con- allows sequence constraints,” Proc. DNA-based information storage,”
siderations for DNA data storage have Nat. Acad. Sci. USA, vol. 117, no. 31, Nature Commun., vol. 11, no. 1, Jun.
analogies to transmitting data through pp. 18,489–18,496, Aug. 2020, doi: 2020, Art. no. 2981, doi: 10.1038/
“traditional” network/storage electri- 10.1073/pnas.2004821117. s41467-020-16797-2.
cal channels. It is our hope that fram- 7. Y. Erlich and D. Zielinski, “DNA Foun- 16. K. J. Tomek et al., “Driving the scal-
ing DNA data storage implementation tain enables a robust and efficient ability of DNA-based information

84 COMPUTER  W W W.CO M P U T E R .O R G /CO M P U T E R


storage systems,” ACS Synthetic storage,” Nature Commun., vol. 12, no. sequencing,” Nature Biotechnol., vol.
Biol., vol. 8, no. 6, pp. 1241–1248, 1, Jun. 2021, Art. no. 3518, doi: 10.1038/ 26, pp. 1146–1153, Oct. 2008, doi:
May 2019, doi: 10.1021/acssynbio. s41467-021-23669-w. 10.1038/nbt.1495.
9b00100. 20. C. Fuller et al., “The challenges of
17. J. L. Banal et al., “Random access DNA sequencing by synthesis,” Nature
memory using Boolean search in an Biotechnol., vol. 27, pp. 1013–1023, Nov.
archival file storage system,” Nature 2009, doi: 10.1038/nbt.1585. DAVE LANDSMAN is a distin-
Mater., vol. 20, no. 9, pp. 1272–1280, 21. S. Goodwin, J. McPherson, and W. guished engineer at Western
Sep. 2021, doi: 10.1038/s41563 McCombie, “Coming of age: Ten years of Digital Research, Milpitas, CA
-021-01021-3. next-generation sequencing technologies,” 95035 USA. Contact him at dave.
18. C. Bee et al., “Molecular-level Nature Rev. Genetics, vol. 17, pp. 333–351, landsman@wdc.com.
similarity search brings comput- Jun. 2016, doi: 10.1038/nrg.2016.49.
ing to DNA data storage,” Nature 22. M. MacKenzie and C. Argyropou- KARIN STRAUSS is a senior princi-
Commun., vol. 12, no. 1, Aug. los, “An introduction to nanopore pal research manager at Microsoft
2021, Art. no. 4764, doi: 10.1038/ sequencing: Past, present, and future Research, Redmond, WA 98052 USA
s41467-021-24991-z. considerations,” Micromachines, vol. 14, and an affiliate full professor at the
19. K. J. Tomek, K. Volkel, E. W. Inder- no. 2, 2023, Art. no. 459, doi: 10.3390/ University of Washington, Seattle,
maur, J. M. Tuck, and A. J. Keung, mi14020459. WA 98195 USA. Contact her at
“Promiscuous molecules for smarter 23. D. Branton et al., “The potential kstrauss@microsoft.com.
file operations in DNA-based data and challenges of nanopore

Over the Rainbow: 21st Century


Security & Privacy Podcast
Tune in with security leaders of academia,
industry, and government.

Lorrie Cranor

Bob Blakley

Subscribe Today
www.computer.org/over-the-rainbow-podcast Digital Object Identifier 10.1109/MC.2023.3277764

You might also like