
Biopackages

Page 1: Biopackages

Biopackages.net

Operating System Packages for Bioinformatics
Allen Day
2005.05.17

Page 2: Biopackages

What is a package?

Software, config files, documentation, and/or data encapsulated in a single file
Metadata describing:
    Version, license, package “category”
    Dependencies
    What the package provides
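
The metadata fields listed above can be pictured as a small record. The Python sketch below is illustrative only: the class `PackageMetadata`, the helper `unmet_requirements`, and every field value are hypothetical, and this is not the RPM file format itself. Only the package names are taken from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PackageMetadata:
    """Illustrative record of the metadata fields named on the slide."""
    name: str
    version: str
    license: str
    category: str
    requires: tuple = ()   # dependencies: capabilities this package needs
    provides: tuple = ()   # capabilities this package offers to others


def unmet_requirements(pkg, installed):
    """Return the requirements of pkg that no installed package satisfies."""
    available = {cap for p in installed for cap in (p.name, *p.provides)}
    return [req for req in pkg.requires if req not in available]


# Hypothetical values; versions, licenses and dependency edges are invented.
bioperl = PackageMetadata("perl-bioperl", "0.0", "unspecified", "Libraries",
                          provides=("perl(Bio::Seq)",))
gbrowse = PackageMetadata("gbrowse", "0.0", "unspecified", "Applications",
                          requires=("perl-bioperl", "postgresql-server"))

print(unmet_requirements(gbrowse, [bioperl]))  # ['postgresql-server']
```

With only the hypothetical `perl-bioperl` record installed, the check reports `postgresql-server` as the one requirement still missing.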

Page 3: Biopackages

GMOD target audience

Small MODs (model organism databases)

Page 4: Biopackages

Package Dependency Graph

[Figure: package dependency graph. Labels: Dependencies; What the package provides. Nodes: chado, chado-Hsa, genome-Hsa-nib, ucsc-blat, genome-Hsa-annotation-affymetrix, genome-Hsa-annotation-gene, postgresql-AffxSeq, postgresql-server, perl-bioperl, obo-core, perl-go-perl]

Page 5: Biopackages

Dependencies

Build Dependency
Installation Dependency
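
The two kinds of dependency answer different questions: what must be present to build a package from source, and what must be present to install and run it. RPM spec files express this split with the `BuildRequires` and `Requires` tags. A minimal Python sketch follows, assuming invented dependency tables. `postgresql-devel` and every edge shown are hypothetical and are not taken from the Biopackages repository.

```python
# Hypothetical dependency tables; the edges are invented for illustration.
INSTALL_REQUIRES = {
    "chado": ["postgresql-server", "perl-bioperl"],
    "perl-bioperl": [],
    "postgresql-server": [],
    "postgresql-devel": ["postgresql-server"],
}
BUILD_REQUIRES = {
    "chado": ["postgresql-devel"],
}


def install_closure(names):
    """All packages needed at install time for `names`, including themselves."""
    needed, stack = set(), list(names)
    while stack:
        name = stack.pop()
        if name not in needed:
            needed.add(name)
            stack.extend(INSTALL_REQUIRES.get(name, []))
    return needed


def build_environment(name):
    """Packages that must be installed before `name` can be built from source."""
    return install_closure(BUILD_REQUIRES.get(name, []))


print(sorted(build_environment("chado")))
print(sorted(install_closure(["chado"])))
```

In this sketch the build environment is the install-time closure of the build dependencies, so a build host and a deployment host can end up with different package sets.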

Page 6: Biopackages

What is a Package Manager?

Tools to manage installation, upgrade, uninstallation of packages
Verify package integrity (checksums)
Maintain system integrity
    Transactional
    Allow rollbacks
Dependency checking
    Dependency graph recursion
Allow software customization (patches)
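
Three of the duties listed above (checksum verification, transactional installs with rollback, and recursion over the dependency graph) can be sketched together in a few lines of Python. This is a toy model under stated assumptions, not how rpm or yum are implemented: the function names are hypothetical, the dependency edges in `REQUIRES` are invented, and "installing" just adds a name to a set.

```python
import hashlib

# Hypothetical dependency edges between package names taken from the slides.
REQUIRES = {
    "chado-Hsa": ["chado", "genome-Hsa-nib"],
    "chado": ["postgresql-server", "perl-bioperl"],
    "genome-Hsa-nib": [],
    "postgresql-server": [],
    "perl-bioperl": [],
}


def install_order(target, requires, _done=None, _visiting=None):
    """Recurse through the dependency graph; dependencies come first."""
    done = [] if _done is None else _done
    visiting = set() if _visiting is None else _visiting
    if target in done:
        return done
    if target in visiting:
        raise ValueError(f"dependency cycle at {target}")
    visiting.add(target)
    for dep in requires.get(target, []):
        install_order(dep, requires, done, visiting)
    visiting.discard(target)
    done.append(target)
    return done


def checksum_ok(payload: bytes, expected_sha256: str) -> bool:
    """Verify package integrity by comparing SHA-256 digests."""
    return hashlib.sha256(payload).hexdigest() == expected_sha256


def install_transaction(order, payloads, checksums, installed):
    """Install every package in order, or restore the previous state."""
    snapshot = set(installed)
    try:
        for name in order:
            if not checksum_ok(payloads[name], checksums[name]):
                raise RuntimeError(f"checksum mismatch for {name}")
            installed.add(name)
    except Exception:
        installed.clear()
        installed.update(snapshot)  # rollback to the state before the transaction
        raise
    return installed


order = install_order("chado-Hsa", REQUIRES)
print(order)
```

The post-order traversal guarantees that a package is appended only after everything it requires, and a failed checksum anywhere in the list leaves the installed set exactly as it was before the transaction started.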

Page 7: Biopackages

Why bioinformatics packages?

Consistency of installation process
    Bioinformatics package installs vary wildly, and commonly lack documentation
Automatic dependency installation
    Perl modules especially bad: bioperl has 60+ modules in its dependency tree
Integrity/Auditing of system state
    Know an installed package works, which version, how to replicate system setup
Tighter integration with operating system
    Daemons, config & log file locations, etc.

Page 8: Biopackages

What’s available?

RPM packages only right now
Primary focus on Fedora Core 2
Some RPMs also available for
    Fedora Core 3
    Red Hat 9
    Cygwin

Page 9: Biopackages

What’s available?

Three primary foci
    Applications
    Libraries
    Data sets

Page 10: Biopackages

Applications

    Gbrowse
    Textpresso
    BLAT daemon
    NCBI Toolkit (BLAST, etc.)
    HMMer

Page 11: Biopackages

What’s available?

Libraries
    Bioperl
    R & Bioconductor
    Squid
    EMBOSS

Page 12: Biopackages

What’s available?

Data sets
    Genome & protein sequence
    Sequence features
    Ontologies
    All installed using a common directory structure

Page 13: Biopackages

What’s available?

UCSC tools (utilities, BLAT system service, CGI scripts)
Bioperl
R / Bioconductor
GMOD apps (Gbrowse, Textpresso, …)
Data packages
    Genome sequence (fa, nib, blastdb)
    Genome features (Affy probeset alignments, mRNA, etc.)

Page 14: Biopackages

GMOD Components Available

[Figure: component diagram. Nodes: chado-Hsa, gbrowse, textpresso, gmod-web-Hsa, turnkey, chado, das2-Hsa, apollo-Hsa, cmap-Hsa, ucsc-BLAT, genome-Hsa-nib]

‘Hsa’ can be replaced with the code for your organism.
Currently built for ‘Cel’, ‘Hsa’, ‘Sce’.
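
The organism-code convention lends itself to simple name templating. The Python sketch below derives per-organism package names from the ‘Hsa’ names shown on this slide. The function `packages_for` is hypothetical, and whether every one of these packages was actually built for each of the three organisms is an assumption.

```python
# Organism codes the slide lists as currently built.
ORGANISMS = ("Cel", "Hsa", "Sce")

# Package names from the slide, with 'Hsa' replaced by a placeholder.
TEMPLATES = (
    "chado-{org}",
    "gmod-web-{org}",
    "das2-{org}",
    "apollo-{org}",
    "cmap-{org}",
    "genome-{org}-nib",
)


def packages_for(organism):
    """Expand the name templates for one organism code."""
    if organism not in ORGANISMS:
        raise ValueError(f"no packages built for organism code {organism!r}")
    return [template.format(org=organism) for template in TEMPLATES]


print(packages_for("Sce"))
```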

Page 15: Biopackages

More details…

[Figure: dependency graph detail. Nodes: chado, chado-Hsa, genome-Hsa-nib, ucsc-blat, perl-go-perl, genome-Hsa-annotation-affymetrix, genome-Hsa-annotation-gene, postgresql-AffxSeq, postgresql-server, perl-bioperl, …]

Page 16: Biopackages

Gene Expression Components

[Figure: component diagram. Nodes: chado-Hsa, Bioconductor, R, Quant/Norm Pipeline, chado-GEC, DAS/2 for Genotyping, GeneChip]

Page 17: Biopackages

Resources

http://www.biopackages.net
~1000 RPMs for Fedora Core 2, 3
Available via yum
    See site for a configuration example.

Page 18: Biopackages

TODO

Support more architectures
    Build for Cygwin & OS X. RPM has been ported to both
Automate package build process
    Build farm of multiple architectures, controllable via scheduler (GridEngine)
    Automate (if possible) inclusion of new software / data releases

Page 19: Biopackages

TODO

Build community interest and involvement
Keep adding more packages!
Keep existing packages current!

Page 20: Biopackages

Acknowledgements

Patrick Alger, Jared Fox, Brian O’Connor, Todd Harris, Lincoln Stein, Stanley Nelson