+ All Categories
Home > Documents > Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Date post: 29-Jan-2016
Category:
Upload: toby-houston
View: 216 times
Download: 0 times
Share this document with a friend
Popular Tags:
63
Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing
Transcript
Page 1: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Yuki Juan2003.5.5

Automatic DNA and Genome Sequencing

Page 2: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Genetic Mapping

Page 3: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Automated DNA Sequencing

Principle of Sanger SequencingHigh-Throughput SequencingReading Sequence TracesContig AssemblyEmerging Sequencing Methods

Page 4: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

http://www.mun.ca/biology/scarr/4241chaptertwo/Biology4241chaptertwo/

Chapter2GenomeSequencingandAnnotation.htm#sanger

Page 5: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.
Page 6: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

The First Cycle in PCR

Page 7: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

The Second Cycle in PCR

Page 8: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

The Third Cycle in PCR

Page 9: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Basic chain terination method developed in 1974 by Frederick Sanger

The Principle of Dideoxy (Sanger) Sequencing

Page 10: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Strategy of the Chain-termination Method for Sequencing DNA

Page 11: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Strategy of the Chain-termination Method for Sequencing DNA

Page 12: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Fluorescence Detection

Page 13: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

High-Throughput Sequencing

The new techniques and equipment include:

Four-color fluorescent dyes have replaced the radioactive label.Automatical trace readingImprovement in the chemistry of template purification and the sequencing reaction.Capillary electrophresis

Page 14: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Automated Sequencing Method

Page 15: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Automated Sequencing Method

Page 16: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

ABI PRISM® 3700 DNA Sequencer

Page 17: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

ABI PRISM® 3700 DNA Sequencer

Price: $65,50A fully automated, multi-capillary electrophoresis instrument designedAutomatically analyze multiple runs of 96 samples

Page 18: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

MegaBACE 1000 DNA Sequencer

Page 19: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

MegaBACE 1000 DNA Sequencer

An automated machine capable of high-throughput DNA analysis, processing 96 samples in just a few short hours.Applications :

DNA sequencinggenotyping fragment analysis.

6 arrays of 16 capillaries with an interior diameter of about 100 µm.The system uses high-pressure nitrogen gas to inject the capillaries with Linear Polyacrylamide, a denaturing gel.

Page 20: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Base-callingUsing automated softwarePhred program developed at the University of Washington.

Reading Sequence Traces

Page 21: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Phred Programuse algorithms to convert trace files into base sequences and assign quality values to

each base call in the sequence

Page 22: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

The Phred Base-calling Algorithm

Page 23: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

SNP: Single nucleotide polymorphism

Automated Sequence Chromatograms

Page 24: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Phred Quality Value Distributions

Dark blue: Bases 100-400 in each sequenceLight blue: All basesThe predicted error rate increases for longer fragments

Page 25: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Contig: A contiguous (touching; adjoining) stretch of cloned DNAThe finishing step in sequencing a multi-clone stretch of DNA, and involves alignment, editing, and error correction. Sequence editing software( from the University of Washington)

phrap assemblerconsed graphic editor

Contig Assembly

Page 26: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Phrap assemblerhttp://www.mrc-lmb.cam.ac.uk/pubseq/manual/gap4_unix_90.ht

ml

Page 27: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

An aligned reads window in Consed

Page 28: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Alignment algorithms

The Needleman-Wunsch method (1970) was the first computationally feasible algorithm for sequence alignment.

Page 29: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.
Page 30: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Alignments based on these algorithms may vary due to differences in the weighting of their default parameters.

weighting of the effects of indels relative to single base mismatchesweighting attached to quality scores of bases from contributing sequencweighting attached to frequency of mismatches

Page 31: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Sequencing by Hybridization (SBH) Mass Spectrophotometric Sequences Direct Visualization of Single DNA Molecules by Atomic Force Microscopy (AFM) Single Molecule Sequencing Techniques Single Nucleotide Cutting

Emerging Sequencing Methods

Page 32: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Uses the complementarity of the two strands of DNA molecules to determine if a match to an oligonucleotides is present in the DNA.Possible for short sequences

Sequencing by Hybridization (SBH)

Page 33: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

fragmented oligonucleotides can be identified by time of flight through a vacuum chamberUseful for fragmented DNA molecules under 50 bases longLikely possible to determine full sequence of molecule divided into all possible oligonucleotidesMethods fast, and should become cheap

Mass Spectrophotometric Sequences

Page 34: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Can observe bumps in ssDNA, but not resolve basesPossibly hybridize molecule to Oligonucleotides with bulky modified side groups

Direct Visualization of Single DNA Molecules by AFM

Page 35: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Extremely fast and relatively cheapCan accommodate long DNA fragmentsNanopore sequencing

Single Molecule Sequencing Technique

Page 36: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Single-molecule Nanopore Sequencing

Page 37: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Protein pore channel in electrically polarized membraneSingle DNA molecule pulled through by electrophoresesNucleotides transiently block ion movement, resulting in drop in current resolutioIf slowed to about 1 base per millisecond, could sequence 1Kb per second, three orders of magnitude faster than capillary sequencers

Nanopore Sequencing

Page 38: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Can suspend long strand of DNA in a vacuum by molecular tweezersExonuclease molecule cuts off single nucleotides to be read by fluorescent signal or imprinting on grid

Single Nucleotide Cutting

Page 39: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Hierarchical SequencingShotgun SequencingSequence Verification

Genome Sequencing

Page 40: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Hierarchical versus Shotgun Sequencing

Page 41: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Hierarchical versus Shotgun Sequencing

Both processes involve fragmenting the genome and aligning fragments due to overlapping sequences.Both aim for 5-10x redundancy in sequence representation.Main difference is that hierarchical sequencing attempts to align large cloned fragments (~100kb) into a tiling path.shotgun sequencing omits this step. The entire genome is fragmented into small pieces which are then aligned using computer algorithms.Hierarchical sequencing was the basis of the publicly funded Human Genome Project.Shotgun sequencing was the basis of the privately funded Human Genome Project

Page 42: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Also known as top down,  map based, or clone by clone sequencing Steps involved:

Shear DNA into manageable units (50 - 200 kb)

* This is accomplished by sonication* Amplification (PCR)* Clone into vector of choice (BAC'S usually)Create DNA library

* aim for 5-10x redundancySelection of a tiling path

Hierarchical Sequencing

Page 43: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Cloning Vectors Using in Genome Sequencing

Page 44: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Hierarchical Assembly of a Sequence-contig Scaffold

Page 45: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

The Tiling Path

Cab be assembled using a combination of three methods

HybridizationFingerprintingEnd-sequencing

Page 46: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Create probes for specific sequencesOften uses robots to replicate plate clones that show probe hybridizationThe genome can be probed for many different sequences, leading to islands of overlapping clones that will be joined later in the process.Chromosome walking - use the end sequence of a clone to create a probe for an adjacent clone.

Hybridization

Page 47: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Use restriction digest profile to determine sequence overlapDone by complex computer algorithms

Fingerprinting

Page 48: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Alignment BAC clones by Hybridization and Fingerprinting

Page 49: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

End-sequencing

Sequence the end of BAC clonesCreate a probe for that end sequence, and hope that it hybridizes near the middle of another clone

Page 50: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

3 steps: Filtering

Removal of contaminating fragmentsThey may be bacterial in origin, or clones that show evidence of recombination.

Assembling the Layout generating and ordering each BAC contigPosition of each contig can be confirmed by alignment with previously characterized Sequence Tagged Sites (STS)

Merging Aligning BAC contigs that are known to be adjacent to each other

Assembly of The Draft Genome

Page 51: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Computer algorithms are used to assemble contigs from thousands of overlapping sequences

Shotgun Sequencing

Page 52: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

ScreenerOverlapperUnitiggerScaffolder

Tasks performed by Computational Algorithms

Page 53: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Masks (marks & hide) sequences that contain repetitive DNA. e.g. Microsatellites, ALU repeats, ribosomal DNAThese sequences are not taken into consideration when determining overlap

Screener

Page 54: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Compares every unscreened read against every other unscreened readIs essentially the same as performing a BLAST searchSearches for overlap of a predetermined length (40 bp for Human Genome Project)

Overlapper

Page 55: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Blast Output

Page 56: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

A local Alignment

Page 57: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Unitigger

Unitig: a contig formed from a series of overlapping unambigously unique sequences

Page 58: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

U-unitigs and Repeat Resolution

Page 59: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Scaffolder

Uses mate-pair information to link U-unitigs into scaffold contigsMost of the remaining gaps at this point are due to repeat elements, and can be resolved by the following method:

Unitigs that were not classified as U-unitigs are placed in the gaps.

These are often referred to as overcollapsed unitigsIf their placement is supported by two or more mate-pairs, it is referred to as a ROCK.If their placement is supported by one mate-pair, it is referred to as a STONE.Small gaps can be filled in by chromosome walking

Page 60: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Assembly of a Mapped Scaffold

Page 61: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Proportion of Fly and Human Genomes in Large Scaffolds

Page 62: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

CompletenessAccuracyValidity of assembly

Sequence Verification

Page 63: Yuki Juan 2003.5.5 Automatic DNA and Genome Sequencing.

Alignment of Two Draft Human Genome Assemblies


Recommended