+ All Categories
Home > Documents > NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical...

NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical...

Date post: 28-Dec-2015
Category:
Upload: harriet-harper
View: 215 times
Download: 1 times
Share this document with a friend
Popular Tags:
35
9 -11 January 2006 – Catania, Italy 9 -11 January 2006 – Catania, Italy NA4 Generic Applications NA4 Generic Applications Meeting Meeting GRID node for biomedical applications: GRID node for biomedical applications: distributed image analysis for early distributed image analysis for early diagnosis of Alzheimer Disease diagnosis of Alzheimer Disease Livia Torterolo Livia Torterolo
Transcript
Page 1: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

9 -11 January 2006 – Catania, Italy9 -11 January 2006 – Catania, Italy

NA4 Generic Applications Meeting NA4 Generic Applications Meeting

GRID node for biomedical applications: GRID node for biomedical applications: distributed image analysis for early distributed image analysis for early

diagnosis of Alzheimer Diseasediagnosis of Alzheimer Disease

Livia TorteroloLivia Torterolo

Page 2: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ContentsContents

• Introduction to SPM analysis for Introduction to SPM analysis for diagnosis in ADdiagnosis in AD

• Development of SPM service through Development of SPM service through a web portala web portal

• GRID implementation of SPM serviceGRID implementation of SPM service

• Current Status and PlansCurrent Status and Plans

Page 3: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

PartnersPartners

GRID node has been installed at BioLab, GRID node has been installed at BioLab, University of Genoa.University of Genoa.

The first part of work has been done in The first part of work has been done in collaboration with:collaboration with:

San Raffaele of MilanoSan Raffaele of Milano University of Milano – BicoccaUniversity of Milano – Bicocca

GRID implementation has carried out by GRID implementation has carried out by Bio-Lab team.Bio-Lab team.

Page 4: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ContentsContents

• Introduction to SPM analysis for Introduction to SPM analysis for diagnosis in ADdiagnosis in AD

• Development of SPM service through Development of SPM service through a web portala web portal

• GRID implementation of SPM serviceGRID implementation of SPM service

• Current Status and PlansCurrent Status and Plans

Page 5: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Introduction to Alzheimer DiseaseIntroduction to Alzheimer Disease

• AD is the most common form of dementia, AD is the most common form of dementia, accounting for more than half of all accounting for more than half of all dementias in elderly peopledementias in elderly people

• Clinically, AD is characterized by a Clinically, AD is characterized by a progressive loss of cognitive abilitiesprogressive loss of cognitive abilities

• Memory loss is typically the earliest sign Memory loss is typically the earliest sign of ADof AD

Page 6: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Analysis of PET imagesAnalysis of PET images

Conventional Reporting: qualitative analysis of flux imagesConventional Reporting: qualitative analysis of flux images

Qualitative analysis of images doesn’t Qualitative analysis of images doesn’t reveal anomalies in the distribution of the reveal anomalies in the distribution of the tracer in cortical, subcortical and cerebellar tracer in cortical, subcortical and cerebellar regions. regions.

The metabolic study is in the normal range.The metabolic study is in the normal range.

Qualitative analysis of images reveals a Qualitative analysis of images reveals a reduction in the accumulation of the tracer in reduction in the accumulation of the tracer in correspondence of the inferior parietal lobe, correspondence of the inferior parietal lobe, bilaterally. bilaterally.

Hypometabolism in these regions.Hypometabolism in these regions.

Qualitative analysis of images doesn’t Qualitative analysis of images doesn’t reveal anomalies in the distribution of the reveal anomalies in the distribution of the tracer in cortical, subcortical and cerebellar tracer in cortical, subcortical and cerebellar regions. regions.

The metabolic study is in the normal range.The metabolic study is in the normal range.

F18-FDGF18-FDG Tc99-ECDTc99-ECD

Qualitative analysis of images reveals a Qualitative analysis of images reveals a reduction in the accumulation of the tracer in reduction in the accumulation of the tracer in correspondence of the inferior parietal lobe, correspondence of the inferior parietal lobe, bilaterally. bilaterally.

Parietal hypoperfusion.Parietal hypoperfusion.

Page 7: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

SPM (Statistical Parametric Mapping )SPM (Statistical Parametric Mapping )

Functionality:Functionality:• Image displayImage display• Image segmentationImage segmentation• Image realignment and co-registrationImage realignment and co-registration• Nonlinear spatial normalizationNonlinear spatial normalization• SmoothingSmoothing• Statistical analysis – parameter estimation {GLM}Statistical analysis – parameter estimation {GLM}• Statistical analysis – interface {TGF}Statistical analysis – interface {TGF}• Results display (Graphical, tabular and image format)Results display (Graphical, tabular and image format)

by Institute of Neurology at University College Londonby Institute of Neurology at University College London

Page 8: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.
Page 9: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.
Page 10: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.
Page 11: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.
Page 12: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ContentsContents

• Introduction to SPM analysis for Introduction to SPM analysis for diagnosis in ADdiagnosis in AD

• Development of SPM service through Development of SPM service through a web portala web portal

• GRID implementation of SPM serviceGRID implementation of SPM service

• Current Status and PlansCurrent Status and Plans

Page 13: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Why a portal?Why a portal?

SPM requirements:SPM requirements:• the knowledge of functions used in the image analysis in the knowledge of functions used in the image analysis in

order to provide the correct values of parameters and in order to provide the correct values of parameters and in order to understand the resultsorder to understand the results

• a large set of images of normal patients to be used for a large set of images of normal patients to be used for comparison during the statistical parametric mapping. comparison during the statistical parametric mapping. The accuracy of ipoperfusion maps is strictly related to The accuracy of ipoperfusion maps is strictly related to the number of normal studiesthe number of normal studies

www.neuroinf.itwww.neuroinf.it

Page 14: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

SET of CONTROLS 1(PET, SPECT IMAGES)

STATISTICALSTATISTICALTOOL (SPM)TOOL (SPM)

SET of CONTROLS 2(PET, SPECT IMAGES)

SET of CONTROLS 3(PET, SPECT IMAGES)

SET of CONTROLS n(PET, SPECT IMAGES)

IMAGE of PATHOLOGICAL

SUBJECT(PET or SPECT IMAGE)

STATISTICALANALYSIS OF

THE UPLOADEDIMAGE

www.neuroinf.it

Web-based applicationWeb-based application

Page 15: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Design of the application’s structureDesign of the application’s structure

Image Acquisition

Image Transfer

Parameter fitting for the

statistical analysis

Maps Visualization

Management Node

User Node

1

2

2

3 3

4

Porting on GRID

Query to data catalogueJob submission

Statistical Analysis

Results transfer

Repository Node

Extraction of image information

Results transfer

Extraction of image information

Results transfer

2

Repository Node

1

Page 16: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ContentsContents

• Introduction to SPM analysis for Introduction to SPM analysis for diagnosis in ADdiagnosis in AD

• Development of SPM service through Development of SPM service through a web portala web portal

• GRID implementation of SPM serviceGRID implementation of SPM service

• Current Status and PlansCurrent Status and Plans

Page 17: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

SPM: Why GRID?SPM: Why GRID?

• Doctors would have an easy access to normal Doctors would have an easy access to normal PET/SPECT databases without moving images PET/SPECT databases without moving images from hospitals from hospitals

• Waiting time for statistical parametric mapping Waiting time for statistical parametric mapping depends on the number of normal images stored depends on the number of normal images stored in every database in every database It is very important to It is very important to split this application into independent jobs and to split this application into independent jobs and to run them at the same time to increase software run them at the same time to increase software performancesperformances

Page 18: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

GILDA testbed GILDA testbed

GILDA testbed: https://gilda.ct.infn.it/GILDA testbed: https://gilda.ct.infn.it/

Page 19: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

LCG InstallationLCG Installation

• Installation and Set up of the LCG Grid Installation and Set up of the LCG Grid Environment v. 2.4.0.Environment v. 2.4.0.

6 machines are dedicated to LCG tests:6 machines are dedicated to LCG tests:

1 CE, 1 SE, 3 WNs, 1 UI, 1 LCG install server 1 CE, 1 SE, 3 WNs, 1 UI, 1 LCG install server

12 CPUs, 7Giga (RAM), ≈ 400Giga of 12 CPUs, 7Giga (RAM), ≈ 400Giga of storage storage

• BioLab site BioLab site is now on maintainance to is now on maintainance to upgrade from upgrade from LCG 2.4.0 to LCG 2.6.0LCG 2.4.0 to LCG 2.6.0

Page 20: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

DIST – GENOVA siteDIST – GENOVA site

Page 21: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

LCG node: LCG node: TechnicalTechnical characteristics (I)characteristics (I)

• is powered by an autonomous power line and a is powered by an autonomous power line and a uninterruptible power supply with a capacity of uninterruptible power supply with a capacity of 3000 VA is provided3000 VA is provided

• the UPS is big enough to maintain online the the UPS is big enough to maintain online the node for about 30 - 45 ‘node for about 30 - 45 ‘

• storage element has a redundant power supply storage element has a redundant power supply and redundant hot-swap cooling system (fan)and redundant hot-swap cooling system (fan)

• storage is provided by 4 U320 scsi 10.000 rpm storage is provided by 4 U320 scsi 10.000 rpm disks (about 400 GByte) and a raid 5 controller disks (about 400 GByte) and a raid 5 controller with 128 mb r/w onboard cachewith 128 mb r/w onboard cache

Page 22: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

LCG node: LCG node: TechnicalTechnical characteristics(II)characteristics(II)

• network is provided by a 24 port Gigabit switch network is provided by a 24 port Gigabit switch and each node has a dual gigabit network and each node has a dual gigabit network adapter (only one channel is used by now)adapter (only one channel is used by now)

• node is connected to Internet through the local node is connected to Internet through the local department LANdepartment LAN

• 1Gbit/sec connection to the GARR POP in 1Gbit/sec connection to the GARR POP in GenoaGenoa

• the University of Genoa is connected to the the University of Genoa is connected to the Milan Internet Exchange (Internet backbone Milan Internet Exchange (Internet backbone access point) with a 155 Mbps line access point) with a 155 Mbps line

Page 23: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Installation phaseInstallation phase

• with this type of hardware (brand HP with this type of hardware (brand HP Proliant DL 140 and DL 380 server) we Proliant DL 140 and DL 380 server) we didn't have any troubles didn't have any troubles

• node’s installation has been performed node’s installation has been performed without problemswithout problems

• very good system administration support very good system administration support from staff in Catania from staff in Catania

Page 24: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Development phase:Development phase:Migration of the application to LCGMigration of the application to LCG

Main objectives:Main objectives:

• to distribute PET/SPECT images on different to distribute PET/SPECT images on different Storage Elements available on the GRID and Storage Elements available on the GRID and register data - with related metadata - on LCG register data - with related metadata - on LCG File Catalog (LFC). File Catalog (LFC).

• to access images from User Interface using to access images from User Interface using logical file names (LFN) without copying them on logical file names (LFN) without copying them on Worker Nodes. Worker Nodes.

Page 25: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Data Management and File AccessData Management and File Access

Page 26: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

LCG_utilsLCG_utils

• CLI: CLI: lcg-* commands as lcg-cp, lcg-cr, lcg-del, lcg-la, lcg-lr, ecc….

• C API:C API: int int lcg_cplcg_cp (char * (char *src_filesrc_file, char *, char *dest_filedest_file, char *, char *vovo, int , int

nbstreamsnbstreams, char * , char * conf_fileconf_file, int , int insecureinsecure, int , int insecureinsecure);); int int lcg_dellcg_del (char * (char *filefile, int , int aflagaflag, char *, char *sese, char *, char *vovo, char , char

**conf_fileconf_file, int , int insecureinsecure, int , int verboseverbose););…………

Used to store application’s data and images on distributed Used to store application’s data and images on distributed storage elements and register them in LFC catalogstorage elements and register them in LFC catalog

Page 27: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Grid File Access Library (GFAL)Grid File Access Library (GFAL)

GFAL provides calls for catalog interaction, storage management and file access and can be very handy when an application requires access to some part of a big Grid file but does not want to copy the whole file locally

we used GFAL API to access to distributed images without copying them locally

Page 28: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

GFAL: File I/O API GFAL: File I/O API

Examples:Examples: • int gfal_opengfal_open (const char * filename, int flags, mode_t mode);

• ssize_t gfal_readgfal_read (int fd, void *buf, size_t size);• int gfal_closegfal_close (int fd);

……....

GFAL accepts GUIDs, LFNs, SURLs and TURLs as GFAL accepts GUIDs, LFNs, SURLs and TURLs as file names, and, in the first two cases, it tries to find the file names, and, in the first two cases, it tries to find the closest replica of the file.closest replica of the file.

the application code has been modified to access the application code has been modified to access images using their logical file namesimages using their logical file names

Page 29: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

A GFAL callA GFAL call

Page 30: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

LCG File Catalog (LFC)LCG File Catalog (LFC)

Functionalities:Functionalities:

• Use of lcg_utilsUse of lcg_utils• Use of GFAL callsUse of GFAL calls• Use of GSI Use of GSI

certificationcertification

• Access to grid file in SEs from “anywhere”Access to grid file in SEs from “anywhere”• Several replicas of files in different sitesSeveral replicas of files in different sites• Copy of data from/to local file system to GRIDCopy of data from/to local file system to GRID

Page 31: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Re-Design of the applicationRe-Design of the application

With reference to different LCG tools mentioned above, With reference to different LCG tools mentioned above, application code has been modified and structured in the application code has been modified and structured in the following way:following way:

Registration and storage of data files (PET/SPECT images) on SEs Registration and storage of data files (PET/SPECT images) on SEs available using lcg_utils.available using lcg_utils.

Development of a C program with GFAL C API in order to access Development of a C program with GFAL C API in order to access distributed images using their LFNs and to extract some information distributed images using their LFNs and to extract some information necessary to SPM analysis without copying them locally.necessary to SPM analysis without copying them locally.

Job Submission: creation of a JDL file to submit the executable (and Job Submission: creation of a JDL file to submit the executable (and not the images) with GFAL call to the GRID.not the images) with GFAL call to the GRID.

Statistical Analysis: running of SPM analysis from results obtained Statistical Analysis: running of SPM analysis from results obtained from job submission. Statistical analysis is performed outside GRID from job submission. Statistical analysis is performed outside GRID environment.environment.

Page 32: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Final GRID structureFinal GRID structure

Page 33: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ProblemsProblems

• If you want to access a file on a classic SE using GFAL If you want to access a file on a classic SE using GFAL from your UI, you won't be able to do this from the UI from your UI, you won't be able to do this from the UI (because of insecure RFIO that needs uid (because of insecure RFIO that needs uid correspondance), but you should be able to do it from a correspondance), but you should be able to do it from a WN in the same site (there is indeed such WN in the same site (there is indeed such correspondance between uids in WNs and in SEs).correspondance between uids in WNs and in SEs).So the options are:So the options are:    1. Copy the file with gridftp to your UI, then 1. Copy the file with gridftp to your UI, then access it locallyaccess it locally

2. Send your application (that contains the GFAL 2. Send your application (that contains the GFAL calls) in a job to a CE in the same site as the SE where calls) in a job to a CE in the same site as the SE where the file you want to access sits.the file you want to access sits.

• Support for Metadata using LFC catalog: Support for Metadata using LFC catalog: There is an only field for metadataThere is an only field for metadata No way to do metadata queriesNo way to do metadata queries

Page 34: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

ContentsContents

• Introduction to SPM analysis for Introduction to SPM analysis for diagnosis in ADdiagnosis in AD

• Development of SPM service through Development of SPM service through a web portala web portal

• GRID implementation of SPM serviceGRID implementation of SPM service

• Current Status and PlansCurrent Status and Plans

Page 35: NA4 Generic Applications Meeting 9 -11 January 2006 – Catania, Italy GRID node for biomedical applications: distributed image analysis for early diagnosis.

Current Status and PlansCurrent Status and Plans

• Integration of GRID application with Integration of GRID application with neuroinformatics web interfaceneuroinformatics web interface

• To find a better way to manage metadataTo find a better way to manage metadata• Integrate the application in GENIUSIntegrate the application in GENIUS• To value the possibility of parallelize the To value the possibility of parallelize the

statistical analysisstatistical analysis

Thanks for your Thanks for your attention!attention!


Recommended