1. Random access DNA memory using Boolean search in an archival file storage system
- Author
-
Miguel Reyes, Mark Bathe, Paul C. Blainey, Joseph Berleant, Hellen Huang, Cheri M. Ackerman, Tyson R. Shepherd, and James L. Banal
- Subjects
Path (computing) ,Computer science ,Information Storage and Retrieval ,02 engineering and technology ,010402 general chemistry ,01 natural sciences ,Polymerase Chain Reaction ,Fluorescence ,Data_FILES ,General Materials Science ,Selection (genetic algorithm) ,business.industry ,Archives ,Mechanical Engineering ,Sorting ,General Chemistry ,computer.file_format ,DNA ,021001 nanoscience & nanotechnology ,Condensed Matter Physics ,Silicon Dioxide ,0104 chemical sciences ,Metadata ,Mechanics of Materials ,Scalability ,Synthetic Biology ,Image file formats ,0210 nano-technology ,business ,File storage ,computer ,Random access ,Computer network ,Plasmids - Abstract
DNA is an ultrahigh-density storage medium that could meet exponentially growing worldwide demand for archival data storage if DNA synthesis costs declined sufficiently and if random access of files within exabyte-to-yottabyte-scale DNA data pools were feasible. Here, we demonstrate a path to overcome the second barrier by encapsulating data-encoding DNA file sequences within impervious silica capsules that are surface labelled with single-stranded DNA barcodes. Barcodes are chosen to represent file metadata, enabling selection of sets of files with Boolean logic directly, without use of amplification. We demonstrate random access of image files from a prototypical 2-kilobyte image database using fluorescence sorting with selection sensitivity of one in 106 files, which thereby enables one in 106N selection capability using N optical channels. Our strategy thereby offers a scalable concept for random access of archival files in large-scale molecular datasets.
- Published
- 2020