data retrieval

25
Data Retrieval

Upload: malla-reddy-college-of-pharmacy

Post on 17-Jul-2015

127 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Data retrieval

Data Retrieval

Page 2: Data retrieval

Access to Distributed dataBiological data is widely distributed over the WWW.

Data can be retrieved by, 1. Search engines

2. Data retrieval tools

Page 3: Data retrieval

Search EnginesExamples for Search Engines, Google Yahoo! Search LeapFish

Bing Using Search engines 1. Can find relevant web pages 2. It is difficult to find desired information 3. Difficult to find specific information.

Page 4: Data retrieval

Leapfish

Page 5: Data retrieval

bing

Page 6: Data retrieval

Data retrieval toolsDedicated to access information for molecular biologists.Most widely used are, 1. Entrez 2. DBGET 3. SRS Each of these allows, - Text based searching of a no. of linked DBs. - Sequence searching.They differ in, - The DBs they cover - How the retrieved information is accessed and presented.

Page 7: Data retrieval

Entrez- WWW-based data retrieval system.

- Developed by NCBI (National Centre for Biotechnology Information).

- Integrates information held in different DBs.

Page 8: Data retrieval

EntrezData bases covered by Entrez are,Nucleic acid - GenBank, RefSeq, PDB.Protein seqs - SWISS-PROT, PIR.3D structures – MMDBGenomes – Many sourcesPopSet – From GenBankOMIM – OMIMTaxonomy – NCBI taxonomy databaseBooks- BookshelfProbeSet – GEO (Gene Expression Omnibus)Literature - PubMed

Page 9: Data retrieval

Entrez

Page 10: Data retrieval

Entrez

Page 11: Data retrieval

DBGETAn integrated data retrieval system developed and maintained by, - The Institute for Chemical Research (Kyoto University) - The Human Genome Center (University of Tokyo)Data bases covered are, Nucleic acid Seqs – GenBank, EMBL Protein Seqs – SWISS-PROT, PIR 3D structures – PDB Seq motifs – PROSITE Enzyme reactions – LIGAND Literature – LITDB Medline etc.,

Page 12: Data retrieval

DBGET

Page 13: Data retrieval

SRSSRS - Sequence Retrieval System - Data retrieval tool developed by EBI - Integrates 80 molecular biology DBs - An Open source software (Can be installed locally)

SRS has an associated scripting language called Icarus

Page 14: Data retrieval

SRSSRS - Sequence Retrieval System - Data retrieval tool developed by EBI - Integrates 80 molecular biology DBs - An Open source software (Can be installed locally)

SRS has an associated scripting language called Icarus

Page 15: Data retrieval

Genomics

Page 16: Data retrieval

GenomicsWhat is Genomics? The study of genomes.

In addition to the coding regions (genes), genomics comprise: Control elements Introns and exons Gene clusters Elements common to all chromosomes Episomal elements

Page 17: Data retrieval

GenomicsBenefits of Genomics:Genome sequencing helps in, - Identifying new genes (Gene discovery) - Looking at chromosome organization and structure - Finding gene regulatory seqs - Comparative genomicsThese in turn lead to advances in, - Medicine - Agriculture - Animal husbandry - Biotech - Evolution

Page 18: Data retrieval

GenomicsBranches of Genomics,1. Structural Genomics – Building genomic maps, 3D structures.2. Functional Genomics – Transcriptomics, Proteomics, Metabolimomics, Enzymes3. Comparative Genomics – Population distribution and Phenotypic associations4. Evolutionary Genomics – Phylogenetic relationships5. Pharmacogenomics – Interaction of drugs with genomes, Drug discovery

Page 19: Data retrieval

GenomicsTools required for Genomics,Robotics- SequencingStatistics- SoftwareHigh throughput assays- MicroarraysHigh speed computing- Database workBioinformatics- Algorithms, Graphics

Page 20: Data retrieval

Proteomics Proteome is the protein complement of the genome Proteomics is the study of proteomes Human genome = 30,000 to 60,000 genes Human proteome = 300,000 to 12,00,000 Reasons for Proteome>Genome: - Multiple ORFs - PTM - Internal peptide products

Page 21: Data retrieval

ProteomicsGoal: Identify all the proteins expressed by a cell or tissue.

Why to study proteomics? Analysis of mRNA does not always correlate with expressed

proteins Some samples – Serum, Urine – can't be used for mRNA

studies. PTM can not be detected from mRNA Location of proteins can not be known from mRNA

Page 22: Data retrieval

ProteomicsSpecialized proteomics1. Expression Proteomics2. Cell Map Proteomics3. PTM4. Protein- Protein interactions5. Protein- Ligand Interactions6. Protein structure

Page 23: Data retrieval

ProteomicsProteomics approach, Separation of proteins using 2D electrophoresis. Stain gel Excise spots of interest Digest with trypsin Characterize peptides by MS/MALDI TOF Compare peptide seqs with database of seqs. Identify the class of proteins

Page 24: Data retrieval

ProteomicsMethods to study Protein-Protein interactions1. Yeast 2 Hybrid2. AP-MS (Affinity purification-MS)

Protein Microarrays can use immobilized- Proteins- Peptides- Carbohydrates- Antibodies- Small molecules to study other interactions.

Page 25: Data retrieval

Proteomics

Applications:1. Protein mining2. Differential expression profiling3. Network mapping4. Study protein modifications