1/19/2024 0 Comments Jing xingBefore use, the codons in the DNA storage file should be screened by DNA sequence characteristics to avoid systematic errors. “Symbol-codon” indicates the mapping relationship between a symbol and a codon (Fig. 3a showed the mapping relationship between “0” and “(A)C/(C)G/(G)T/(T)A”. “Bit-base” indicates the mapping rule between bits and bases. To date, there are two types of codebooks: “bit-base” (Goldman et al. The codebook should follow certain rules to avoid systematic errors during DNA data storage, such as DNA secondary structure, high GC content, and homopolymer (Organick et al. The codebook reflects the encoding and decoding mapping rule between 0–1 binary codes and DNA A-T-C-G quaternary codes, which guides the conversion between digital bitstream and DNA storage file (Clelland et al. 2), which is commonly less than 250 nt due to the limitations of current synthesis and sequencing technologies (Kumar et al. Herein, the sequence structure of encoding and decoding DNAs generally includes three parts: address information, data payload and error-correcting code (Fig. As mentioned above, based on the same “codebook”, the encoding method of DNA storage is to convert 0–1 binary codes to A-T-C-G DNA quaternary codes, while the decoding method is to convert quaternary sequencing data to 0–1 binary files. In this review, we mainly introduce some mainstream encoding and decoding methods of DNA storage technology. Furthermore, with rapid development of biotechnology and information technology (BT&IT), DNA storage is expected to fundamentally change the pattern of data storage and transmission, further leading to revolutionary changes in various important areas of the national economy such as manufacturing, internet industry, and national security (The DNA data storage alliance 2021). Development of efficient and accurate DNA storage encoding and decoding methods will play a very important and even decisive role in the transition of DNA storage from the laboratory to practical application, which may fundamentally change the information industry in the future.ĭNA storage breaks through the limitation of existing storage medium of silica-based materials (such as hard disk, optical disk, and removable magnetic disk): compared with the existing digital data storage technologies, DNA storage technology has the advantages of high data density, long storage time, low energy consumption, convenience for carrying, concealed transportation, and multiple encryptions (De Silva and Ganegoda 2016). This review provides important references and improved understanding of DNA storage methods. Their advantages and disadvantages are also reviewed: direct mapping method is easy and efficient, but has high error rate and low logical density fountain code can achieve higher storage density without random access inner and outer code has error-correction design to realize random access at the expense of logic density. The first three encoding/decoding methods belong to in vitro DNA storage, representing the mainstream research and application in DNA storage. In this review, we mainly summarize the recent research advances of four main encoding and decoding methods of DNA storage technology: direct mapping method between 0 and 1 binary and A-T-C-G quaternary codes in early-stage, fountain code for higher logical storage density, inner and outer codes for random access DNA storage data, and CRISPR mediated in vivo DNA storage method. DNA storage is a new digital data storage technology based on specific encoding and decoding methods between 0 and 1 binary codes of digital data and A-T-C-G quaternary codes of DNAs, which and is expected to develop into a major data storage form in the future due to its advantages (such as high data density, long storage time, low energy consumption, convenience for carrying, concealed transportation and multiple encryptions).
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |