Date post: | 25-Jan-2017 |
Category: |
Documents |
Upload: | pavani-de-silva |
View: | 1,226 times |
Download: | 2 times |
Digital Data Storage in DNA
P. Y. De Silva124037U
Background• Rapid growth of information generated all
around the world.• Need for information to be stored for
extended periods.
Identified failures of existing storage media
• Prone to damage from high temperature, moisture and magnetic fields.
• Function only within limited temperature range.
• Subject to decay over long periods.• Non bio degradable.• Release high amounts of heat energy.• Uses more energy for functioning.
Importance of DNA as storage medium• High capacity• High data storage density• Withstand extreme environmental conditions• High memory space• Secure as invisible to human eye• Effective power usage• Can store data for long periods by protecting
from water and oxygen
Structure of DNA • DNA consisting of Adenine, Guanine,
Cytosine and Thymine. (A,G, C and T)• Paired into nucleotide base pairs A-T
and G-C• Single nucleotide can represent 2 bits of
information.• High memory space due to 3D structure
Encoding Models• Microvenus project• Genesis project• PCR based encoding• Alignment based encoding• Church and Goldman model• Rewritable RAM• Next generation sequencing model
Codes for encoding• The Huffman Code• The Comma Code• The Alternate Code• Comma Free Code• Improved Huffman Coding • Perfect Genetic Code
Basic approaches for designing DNA codons
• Template map strategy• De Bruijin Construction• Stochastic Method
Data Storage Styles
• Surface based approach• Soluble approach
DNA secret writing algorithms
• Steganography technique using DNA hybridization
• Chromosomes DNA indexing• DNA XOR OTP with tiles
DNA secret writing algorithms• Steganography technique using DNA
hybridization
DNA secret writing algorithms• Chromosomes DNA indexing
Problems associated with DNA secret writing
• Lack of knowledge• High cost • Difficulty in understanding• Inappropriateness to be used by general public
due to the biological tests and trials which has to be performed in highly technology equipped laboratories.
DNA memory applications
• National security for information hidingpurposes and for data stenography.
• Deinococcus bacteria to preserve data at nuclear catastrophe
• DNA watermark in the seeds track their sales andpreserve their copyrighted products against illegitimateplanting.
DNA memory applications
• Capture pollutants that will contaminate with the ecological resources
• Preserve safely the personal information of a person such as medical information and family history in their own bodies
Big data storage in DNA and DNA Computing
• Volume of data thatcan be stored on an electronic computer
• The speed thresholds• DNA computing consumes significantly less energy than
the electronic computers• The storage space needed to store information is lesser
DNA Computing• Hamilton Path Problem• 20 Variable 3-SAT Problem
Discussion• Microvenus project was inaccurate and was not distinctively decodable.
Genesis and microvenus were inaccurate and inefficient.
• PCR based secure as recipient should be aware of the encryption key and primer but the limitation was scalability of data encoded in the limited size of microdots as only 136 bits data were able to be encode.
• Bancroft et al proposed of information which consists of single poly primer key (PPK), forward and reverse primer common 5-6 bases to indicate the stored information.
• Hitches associated with PCR methods are need of PCR, knowledge of primers, widespread experimental obstacles and practical problems.
• Data retrieval by PCR based amplification is prone to breakage of either side of DNA annealing sites, which are crucial to read even partial of encoded data
Discussion• Greater speed and lower cost of reading DNA data will be acquired
through Alignment based approach. Disadvantage of this approach is multiplication of cassettes lead to redundant volumes. This approach ensures higher durability and data inheritance
• Next generation sequencing includes advantages of employment of one bit representation per base, avoiding the constructing of long DNA which is challenging to assemble in reading the information. Synthesizing, storing and sequencing multiple copies of olingo is done with the intention of evading sequence verifying constructs and cloning. Each copy has the capability to correct the errors in other copy as the errors are almost never coextensive. In this approach the cost incurred is
DiscussionComparison of codes used for encoding
Discussion • I have analyzed the encoding models used used for storing data
in DNA and have critically analyzed how each method has overcome the drawbacks associated with the prior methods
• Comparison of the codes for encoding data and how the researchers have identified the gaps in-between and how they have addressed the drawbacks have been analyzed in this article.
• Data storage styles and approaches in designing DNA codons have been identified.
• DNA secret writing techniques have been studied in detailed and the advantages of each method have been critically analyzed. Problems associated with secret writing in DNA have also been identified.
Discussion • Application of DNA as a memory device is discussed in
detail while approaches of big data storage and DNA computing has also been studied.
• Outcome of this research is a review article on ‘Digital data storage in DNA’
Future Directions• Analyzing the environmental
conditions under which the experiments are carried out
• Preservation of DNA for long term storage.
Thank you !!!!