Date post: | 30-Mar-2015 |
Category: |
Documents |
Upload: | lukas-woolverton |
View: | 212 times |
Download: | 0 times |
Standardized Workflows (II)
Carlos Oscar SorzanoTechn. Director I2PC
Natl. Center Biotechnology (CSIC)
Interchange Points needan interchange standard
Specific proposal in the discussion
Some interchange points• CTF estimation of micrographs:
• List of Micrograph, Voltage, DefocusU, DefocusV, AngleUX, Cs• Particle picking:
• List of Micrograph, micrographXcoor, micrographYcoor• Particle extraction:
• List of images• 2D Alignment/Classification:• 3D Alignment/Classification:
• List of image/volume [2D or 3D alignment]• [A list of class representatives]• [Class representative assignment]
Interchange Proposal• Interchange information:– Data: images, volumes and stacks– MetaData: list of …
• Data structure– Data: Array of real values with X varying faster– MetaData: Table with specific column names
• Data format– Data: MRC file– MetaData: STAR file with specific block names
Data: Images, volumes and stacks.mrc MRC 2D image.mrc MRC 3D image (can be distinguished from 2D by header).mrcs MRC Stack of 2D images
Heymann, J. B.; Chagoyen, M. & Belnap, D. M.J. Structural Biology, 2005, 151, 196-207
Header Val0 Val1 ValN
ValN
Val0 Val1
x
y
(0,0)x
y
(0,0)
MetaData: Interchange format
MetaData: Components
• UID: Unique Identifier• Micrograph (one or many motifs)• Image (motif of interest)• Volume• CTF• Coordinates• 2D Alignment• 3D Alignment• Comment
MetaData: UID, Micrograph, CommentAny entry in the metadata file has a unique entry number
listOfMicrographs.3dem:# 3DEM_STAR_1 General file comment # data_block_1loop_ _UID _micrographLocator _comment1 InputData/micrograph1.mrc “Comment 1” 2 InputData/micrograph2.mrc “Comment 2”data_block_2loop_ _UID _micrographLocator3 InputData/micrographA.mrc 4 InputData/micrographB.mrc
MetaData: CTFAny entry in the metadata file has a unique entry number
listOCTFs.3dem:# 3DEM_STAR_1 *# data_block_1loop_ _UID _Voltage _DefocusU _DefocusV _AngleUX _Cs1 200 1.50 1.54 30.0 2.0
(in kV)(in μm)(in μm)
(in degrees)(in mm.) X
Y
UV
MetaData: Images and VolumesImages can be in individual files (mrc) or stacks (mrcs).Volumes are in individual files (mrc).
listOfImages.3dem:# 3DEM_STAR_1 * # data_blockloop_ _UID _imageLocator1 image00001.mrc 2 image00002.mrc3 [email protected] [email protected]
listOfVolumes.3dem:# 3DEM_STAR_1 * # data_blockloop_ _UID _volumeLocator1 volume00001.mrc 2 volume00002.mrc
MetaData: Coordinates
Heymann, J. B.; Chagoyen, M. & Belnap, D. M.J. Structural Biology, 2005, 151, 196-207
x
y
(0,0)
listOfCoordinates.3dem:# 3DEM_STAR_1 * # data_blockloop_ _UID _micrographXcoor _micrographYcoor1 500 10002 750 2500
MetaData: 2D Alignment
x
y Δx=+5
Δy=-15
Δψ=60°
[email protected] [email protected] [email protected]
listOfAlignedImages.3dem:# 3DEM_STAR_1 * # data_blockloop_ _UID _imageLocator _imageXOff _imageYOff _imagePsiOff1 [email protected] 0 0 02 [email protected] 5 -15 03 [email protected] 5 -15 60
MetaData: 2D Alignment
x
y
listOfAlignedImages.3dem:# 3DEM_STAR_1 * # data_alignedImageList_Aloop_ _UID _imageLocator _imageXOff _imageYOff _imagePsiOff _imageFlip1 [email protected] 0 0 0 02 [email protected] 5 -15 60 1
Δx=+5
Δy=-15
Δψ=60°
flip
MetaData: 2D Alignment
x
y
Δx=+5
Δy=-15
Δψ=60°
flip
14 1( ) ( )I I Rr r
1
x
y
r
cos sin
sin cos
0 0 1
x
R m m m y
1,1m
MetaData: 3D AlignmentList of 3D aligned images.3dem:# 3DEM_STAR_1 * # data_blockloop_ _UID _imageLocator _homogeneousEulerMatrixI1 mage00001.mrc [a11 a12 a13 Xoff a21 a22 a23 Yoff a31 a32 a33 0]
Heymann, J. B.; Chagoyen, M. & Belnap, D. M.J. Structural Biology, 2005, 151, 196-207 List of 3D aligned volumes.3dem:
# 3DEM_STAR_1 * # data_blockloop_ _UID _volumeLocator _homogeneousEulerMatrix1 volume00001.mrc [a11 a12 a13 Xoff a21 a22 a23 Yoff a31 a32 a33 Zoff]
MetaData: RelationshipslistOfImagesAndDownsampledImages.3dem:# 3DEM_STAR_1 * # data_fullSize_imagesloop_ _UID _imageLocator1 [email protected] [email protected]_downsampledSize_imagesloop_ _UID _imageLocator3 [email protected] [email protected]_correspondingDownsampleImageloop_ _UID _UID1 _UID25 1 36 2 4
1 2RUID UID
MetaData: Interchange format
• There can be any number of blocks within a Star file.• Each block can combine any number of components• Examples:
• Micrograph and locations:micrographLocator, micrographXcoor, micrographYcoor
• Micrograph and CTF:micrographLocator, Voltage, DefocusU, DefocusV, AngleUX, Cs
• Micrograph, image, location and CTF:micrographLocator, imageLocator,
micrographXcoor, micrographYcoor , Voltage, DefocusU, DefocusV, AngleUX, Cs
Some interchange points• CTF estimation of micrographs:
• Block name: micrographCTFs• micrographLocator, Voltage, DefocusU, DefocusV, AngleUX, Cs
• Particle picking:• Block name: particlePicking• micrographLocator, micrographXcoor, micrographYcoor
• Particle extraction:• Block name: imageList• imageLocator
• 2D Alignment/Classification:• 3D Alignment/Classification:
• A block per class: class_00001, class_00002, …• Class blocks: imageLocator [2D or 3D alignment]
• [A block of class representatives]• Blockname: class_representatives• imageLocator
• [Class representative assignment]• Blockname: class_representative_assignment• UID1, UID2
How to check?
Automatic check of results
Syntactically and Semantically
Discussion
• Is this the correct strategy? (interchange points + interchange format)
• Which interchange points?• Which interchange format?• How to reach a standard?
Interchange objectsCTF determination
Micrograph screening
Particle picking
Particle extraction and screening
2D alignment and classification
Initial volume construction
3D Model refinement
Micrograph phase correction
3D Model amplitude correction
• What do we want to interchange?• What is the minimum amount of information
needed?• How are we going to store it?