PT - JOURNAL ARTICLE AU - Zan, Xiangzhen AU - Xie, Ranze AU - Yao, Xiangyu AU - Xu, Peng AU - Liu, Wenbin TI - A super robust and efficient DNA storage architecture based on modulation encoding and decoding AID - 10.1101/2022.05.25.490755 DP - 2022 Jan 01 TA - bioRxiv PG - 2022.05.25.490755 4099 - http://biorxiv.org/content/early/2022/05/25/2022.05.25.490755.short 4100 - http://biorxiv.org/content/early/2022/05/25/2022.05.25.490755.full AB - Thanks to its high density and long durability, synthetic DNA has been widely considered as a promising solution to the data explosion problem. However, due to the large amount of random base insertion-deletion-substitution (IDSs) errors from sequencing, reliable data recovery remains a critical challenge, which hinders its large-scale application. Here, we propose a modulation-based DNA storage architecture. Experiments on simulation and real datasets demonstrate that it has two distinct advantages. First, modulation encoding provides a simple way to ensure the encoded DNA sequences comply with biological sequence constraints (i.e., GC balanced and no homopolymers); Second, modulation decoding is highly efficient and extremely robust for the detection of insertions and deletions, which can correct up to ∼40% errors. These two advantages pave the way for future high-throughput and low-cost techniques, and will kickstart the actualization of a viable, large-scale system for DNA data storage.Competing Interest StatementThe authors have declared no competing interest.