Abstract
DNA is an ultra-high-density storage medium that could meet exponentially growing worldwide data storage demand. However, accessing arbitrary data subsets within exabyte-scale DNA data pools is limited by the finite addressing space for individual DNA-based blocks of data. Here, we form files by encapsulating data-encoding DNA within silica capsules that are surface-labeled with multiple unique barcodes. Barcoding is performed with single-stranded DNA representing file metadata that enables Boolean logic selection on the entire pool of data. We demonstrate encapsulation and Boolean selection of sub-pools of image files using fluorescence-activated sorting, with selection sensitivity of 1 in 106 files per channel. Our strategy in principle enables retrieval of targeted data subsets from exabyte- and larger-scale data pools, thereby offering a random access file system for massive molecular data sets.