| CPC G16B 50/00 (2019.02) [G16B 30/20 (2019.02); H03M 7/40 (2013.01)] | 7 Claims |
|
1. An encoding method, comprising:
transcoding information to be encoded into a first binary code sequence and a second binary code sequence, using a processor;
encoding the first binary code sequence and the second binary code sequence, wherein the first and second binary code sequence have the same number of bits, into an encoded sequence of the information to be encoded, using the processor, the encoded sequence composed of multiple of four different kinds of symbols, wherein the four different kinds of symbols are four kinds of deoxyribonucleotides of adenine (A), cytosine (C), guanine (G), and thymine (T) and the encoded sequence is a nucleic acid sequence composed of the four kinds of deoxyribonucleotides;
splitting the nucleic acid sequence into a plurality of sequence fragments;
adding an index identifier to each of the plurality of sequence fragments, the index identifier of a corresponding sequence fragment comprising position information of the corresponding sequence fragment, wherein the index identifier of the corresponding sequence fragment is a DNA sequence; and
synthesizing the sequence fragments into nucleic acid fragments as an encoding result,
wherein the encoding comprises:
determining a current bit of the encoded sequence based on a current bit of the first binary code sequence, a current bit of the second binary code sequence, and a previous bit of the encoded sequence, the current bit of the encoded sequence being a bit other than a first bit of the encoded sequence;
wherein determining the current bit of the encoded sequence comprises:
determining a first candidate symbol group of the current bit of the encoded sequence based on the current first bit of the first binary code sequence according to a first mapping relationship, the first candidate symbol group comprising two of the four different kinds of symbols;
determining a second candidate symbol group of the current bit of the encoded sequence based on the current bit of the second binary code sequence and the previous bit of the encoded sequence according to a second mapping relationship, the second candidate symbol group comprising two of the four different kinds of symbols, wherein the first mapping relationship and the second mapping relationship are configured to ensure that the first candidate symbol group and the second candidate symbol group comprise one identical symbol; and
determining the identical symbol as the current bit of the encoded sequence.
|