bioperl project update

17
BioPerl Project Update Jason Stajich (on behalf of BioPerl developers)

Upload: bosc

Post on 11-May-2015

1.877 views

Category:

Technology


5 download

DESCRIPTION

Title: BioPerl Project UpdateAuthor: Jason Stajich

TRANSCRIPT

Page 1: BioPerl Project Update

BioPerl Project UpdateJason Stajich

(on behalf of BioPerl developers)

Page 2: BioPerl Project Update

BioPerl

Since ~1995

10s of developers over the years

2002 Publication cited ~500 times

Page 3: BioPerl Project Update

DeObfuscator

Interface to search available methods in objects

Dave Messina and Laura Kavanaugh

http://bioperl.org/cgi-bin/deob_interface.cgi

Page 4: BioPerl Project Update

Search class names by string or Perl regex (examples: Bio::SeqIO, seq, fasta$)

Submit Query

OR select a class from the list:

sort by methodsort by method

Welcome to the BioPerl Deobfuscator [ bioperl-live ] what is it?

Bio::Align::AlignI An interface for describing sequence alignments.

Bio::Align::DNAStatistics Calculate some statistics for a DNA alignment

Bio::Align::PairwiseStatistics Base statistic object for Pairwise Alignments

Bio::Align::ProteinStatistics Calculate Protein Alignment statistics (mostly distances)

Bio::Align::StatisticsI Calculate some statistics for an alignment

Bio::Align::Utilities A collection of utilities regarding converting and manipulating alignment objects

Bio::AlignIO Handler for AlignIO Formats

Bio::AlignIO::arp ARP MSA Sequence input/output stream

Bio::AlignIO::bl2seq bl2seq sequence input/output stream

Bio::AlignIO::clustalw clustalw sequence input/output stream

Bio::AlignIO::emboss Parse EMBOSS alignment output (from applications water and needle)

methods for Bio::SimpleAlign

Method Class Returns Usage

accession Bio::SimpleAlign An acc string $myalign->accession("PF00244")

add_seq Bio::SimpleAlign nothing$myalign->add_seq($newseq); it - just adds it to the hashes.

add_SeqFeature Bio::SimpleAlign nothing$feat->add_SeqFeature($subfeat); $feat->add_SeqFeature($subfeat,'EXPAND')

add_tag_value Bio::AnnotatableI not documented See add_Annotation

addSeq Bio::SimpleAlign not documented not documented

all_gap_line Bio::SimpleAlign string $line = $align->all_gap_line()

annotation Bio::SimpleAlign Bio::AnnotationCollectionI object $ann = $aln->annotation or $aln->annotation($ann)

average_percentage_identity Bio::SimpleAlignThe average percentage identity of the alignment

$id = $align->average_percentage_identity

Page 5: BioPerl Project Update

Gratuitous Statistics

975 modules in core, 375 in run

600,000+ lines of code/POD

Lots of emails go through the list!

Page 6: BioPerl Project Update

Phylo-HackathonHosted at National Evolutionary Synthesis Center (www.nescent.org)

Several BioPerl participants focusing on interfacing with phylogenetics and molecular evolution software

Generated more documentation for existing BioPerl related evolutionary tools

Page 7: BioPerl Project Update

Evolutionary ToolsPopulation Genetics tests and processing code

McDonald-Kreitman, Tajima’s D, LD

Phylogenetics

Tree parsing and manipulation

Simple NJ tree building

Tree Drawing

Page 8: BioPerl Project Update

More Evolutionary Interfaces

Parsing and running of PAML for Ka, Ks rates and model testing

PhastCons & PhyloFast interfaces

NJTree (TreeFam software) running and parsing of NHX results

Interface to HyPhy for relative rates, model testing, and Ka/Ks testing

Page 9: BioPerl Project Update

BioSQL support

Had been for storing sequence, features, annotations

Postgresql and mysql schemas

New support for phylogenetic trees and topological queries

Perl modules in bioperl-db package

Page 10: BioPerl Project Update

GFF3 Support

Bio::DB::SeqFeature is interface for RDBMS for GFF3 storage

Backend for Gbrowse

Postgresql, Mysql, and Flatfile (BDB) implementations

Page 11: BioPerl Project Update

Documentation

Two years since we started the BioPerl Wiki and reorganized documentation

Have expanded HOWTOs and user guides

Tracking project needs and progress to wiki

Page 12: BioPerl Project Update

Education

BioPerl instruction through courses at NESCent (Phyloinformatics)

Programming for Biology course at CSHL

Several workshops at universities

Page 13: BioPerl Project Update

Release schedule

1.4 was released in Dec 2004!

1.5.1 and 1.5.2 have come out and are in CPAN as ‘unstable’

Plans afoot for 1.6 release

In late 2007

Page 14: BioPerl Project Update

1.6 release needs

Remove broken code for auto feature to string generation

Full Bio::FeatureIO support or removal

Tidy GFF/GFF3 and Genbank parsing

Page 15: BioPerl Project Update

Other Admin news

Plan to migrate from CVS to SVN in August (George Hartzell)

Do we reorg to different logical components (Core w/ sequences)

Page 16: BioPerl Project Update

Major Kudos

Project Management & Leadership

Sendu Bala - Cambridge Univ

Chris Fields - Univ of Illinois, Urbana-Champaign

Page 17: BioPerl Project Update

More Kudos

Nathan Haigh - Sheffield Univ

Hilmar Lapp - NESCent

Torsten Seeman - Monash University

Dave Messina - Washington Univ St Louis

Mauricio Herrera Cuadra - Facultad de Estudios Superiores Iztacala, UNAM