copyright openhelix. no use or reproduction without express written consent1
TRANSCRIPT
Copyright OpenHelix. No use or reproduction without express written consent 1
dbGaP
The Database of Genotypes and Phenotypes
Materials prepared by:
Cynthia Perreault-Micale, Ph.D.
Dorothy S. Reilly, Ph.D.
www.openhelix.com
Updated: Q4 2010Version 2
Copyright OpenHelix. No use or reproduction without express written consent 3
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
http://www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 4
Many Contribute dbGaP Studies
Launched Dec. 2006 Major contributors include: National Eye Institute’s Age Related Eye Disease Study, The Genetic Association Information Network (GAIN), & many more!
Copyright OpenHelix. No use or reproduction without express written consent 5
Data Content & Organization
Many types of studies are included
2 types of access - open & controlled
Open access: studies, documents, variables, analyses
Controlled access: de-identified phenotypes & genotypes for individual study subjects
Specific authorization process for controlled access requests
Copyright OpenHelix. No use or reproduction without express written consent 6
Multiple Data Types - Stable Identifiers
phs = Studies
phd = study Documents
phv = phenotypic Variables
pha = genotype-phenotype Analyses
pht = dataseTs
Study number phs000007.v1.p1
“v” data version “p” participant
set version
Unique identifiers assigned
These numbers increase whendata changes
Copyright OpenHelix. No use or reproduction without express written consent 7
Citing a Study in dbGaPBMC Medical Genetics 8:S7, 2007
http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=17903306
URL with uniquestudy ID number
Copyright OpenHelix. No use or reproduction without express written consent 8
dbGaP Homepage Resources
http://www.ncbi.nlm.nih.gov/gap
ImportantLinks
GettingStarted
Access to dbGaP Data
Tutorial, Overview, FAQs & Submission instructions Apply for Controlled Access, FTP site, Browse Studies RSS Feed, Conduct Codes, Security Procedures & Contact Us
Start Searches& Help
Copyright OpenHelix. No use or reproduction without express written consent 9
dbGaP Homepage Additional Resources
List ofLatest Studies
GettingStarted
Contact
Categorized list of NCBIresources
Popularresources
Featuredresources
NCBIinformation
Additionalaccess
Genetics &Medicine
Copyright OpenHelix. No use or reproduction without express written consent 10
NCBI Homepage - The Interlinked Entrez Network
http://www.ncbi.nlm.nih.gov
dbGaP
All Databases
dbGaP results
An Entrez Global Query - results in all databases
Copyright OpenHelix. No use or reproduction without express written consent 11
Results Utilize Many Handy Entrez Features
Manyhelpful
tools & links
Search box & menu
Display Settings Categorized results
For more information about Entrez search functions:• OpenHelix Entrez Overview tutorial (www.openhelix.com) • NCBI Entrez documentation (http://www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=helpentrez&part=EntrezHelp; http://www.ncbi.nlm.nih.gov/Entrez/tutor.html)
Copyright OpenHelix. No use or reproduction without express written consent 12
dbGaP Credits
Nature Genetics 39:1181, 2007http://www.nature.com/ng/journal/v39/n10/full/ng1007-1181.html
Nature Genetics 39:1045, 2007http://www.nature.com/ng/journal/v39/n9/full/ng2127.html
Many additional sources of information
Click FAQ link on homepage
Copyright OpenHelix. No use or reproduction without express written consent 13
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 14
Accessing Studies from dbGaP Homepage
http://www.ncbi.nlm.nih.gov/gap
List AllStudies
Browse AllStudies
Copyright OpenHelix. No use or reproduction without express written consent 15
Browsing Studies from dbGaP Homepage
Resulttabs
Choose how many items
per page
Click to returnto homepage
Next page of results
Click Apply
Copyright OpenHelix. No use or reproduction without express written consent 16
Browsing Studies - Study Information
Colored icons indicateVariable, Document &
Analysis Reports available
Copyright OpenHelix. No use or reproduction without express written consent 17
Browsing Studies - More Study Information
Many typesof studies
A variety of study types: Exome Sequencing,
Family, Single Patient, Longitudinal, and more.
Parent-Offspring trios: Data are collected from parent-
parent-offspring sets.
Case-Control Studies: Identify individuals with a disease of interest and a control group without the disease. The frequency or levels of an attribute [eg,
specific genotype] are compared between these
groups.
Copyright OpenHelix. No use or reproduction without express written consent 18
Browsing Studies - Right Side Options
Filtersinactive
Find related data
Recent activity
HandyLinks
Copyright OpenHelix. No use or reproduction without express written consent 19
Browsing Other Record Types
http://www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 20
Basic Searching from Homepage
prostate cancer
Our basic search example: prostate cancer
Copyright OpenHelix. No use or reproduction without express written consent 21
Basic Search Results
Number of results & access by record type
Search term
Copyright OpenHelix. No use or reproduction without express written consent 22
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 23
Study Report Access
phs000130
phs = Studies
Search Results:
Enter Study ID
phs000130
Click here
Copyright OpenHelix. No use or reproduction without express written consent 24
Study Report
NIDDK IBD Genetics Consortium Crohn’s Disease Genome-Wide Association Study
Accession: phs000130.v1.p1
We will enlarge each section next
auth. data access req.
study name, #,navigation tabs
Search within study,
Associated Substudies, if available
description
auth. access data
public data (FTP)
criteria, molecular
history, publications
diseases (MeSH)
study attribution
Copyright OpenHelix. No use or reproduction without express written consent 25
Study Report: Description
Linked Logos
May also see studyversion history
Navigation tabs
Study type &number of participants
Copyright OpenHelix. No use or reproduction without express written consent 26
Study Report: Data Access
Authorized access data- rules for this study
Public data - FTP
Copyright OpenHelix. No use or reproduction without express written consent 27
Study Report: Criteria & Molecular
Copyright OpenHelix. No use or reproduction without express written consent 28
Study Report: Study History & Publications
Copyright OpenHelix. No use or reproduction without express written consent 29
Study Report:Diseases, Attribution, Access
Research Use Statements
Copyright OpenHelix. No use or reproduction without express written consent 30
Sample Variable Report phv = phenotypic Variables
Access
Click here
Copyright OpenHelix. No use or reproduction without express written consent 31
Sample Document Report phd = study Documents
Navigation
Document Name& Accession
Search
Study Name& Accession
Copyright OpenHelix. No use or reproduction without express written consent 32
Associated Datasets
Name & Accession
Similar organization
pht = dataseTs
Description
Dataset type
Dataset Summary
Copyright OpenHelix. No use or reproduction without express written consent 33
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Analysis Methods
Copyright OpenHelix. No use or reproduction without express written consent 34
Sample Analysis Report
Analysis Plots
http://gmed.bu.edu/about/index.html“Please note that all associations. . . are considered "candidate associations" pending replication in additional studies.”
Details
Publications
pha001340.1
Enter ID into homepage search box
pha = genotype-phenotype Analyses
Study version history
Browser
Copyright OpenHelix. No use or reproduction without express written consent 35
Analysis Report & Genome BrowserdbSNP homepage:http://www.ncbi.nlm.nih.gov/projects/SNP/
Click here
Copyright OpenHelix. No use or reproduction without express written consent 36
Analysis Browser
zoomscroll
Controls
NCBI Service Utility menu: SNP Genotype, Entrez SNP,
Sequence Viewer,Map Viewer
HTML view to maximize, print or save
Toggleexpanded
or compacted
Close that data track
Highlight top ten SNP hits
Copyright OpenHelix. No use or reproduction without express written consent 37
Analysis Browser - More Features
Other buttons allow you to close particular windows Mouse over many items for tool tips & links
Copyright OpenHelix. No use or reproduction without express written consent 38
Browse SNPs
sample size, P-value rankallele freq., call rateLink to dbSNP record
Scroll over
Mouse over bin to see SNP record
highlightedGWAS catalog page# in bin = # markers
tested, + for 10 or more
Copyright OpenHelix. No use or reproduction without express written consent 39
Analysis Browser - Customizing Your Display
We will remove the GWAS Catalog Chromosome 3 SNP Bins reference track & highlight the top ten SNPs on the Sequence Viewer
Clickhere
One referencetrack removed
Top ten SNPshighlighted
Copyright OpenHelix. No use or reproduction without express written consent 40
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 41
Advanced Searching TIPS
Use the Boolean operators AND, OR and NOT. They must be capitalized.
Words can be truncated with * (asterisk): fibro*
Add quotes, as in “fibroblast growth factor”, for complete phrase, exact word order
(Use with caution, can be TOO restrictive)
http://www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 42
Advanced Search Example
heart rate AND exercise
Our advanced search example: heart rate AND exercise
Copyright OpenHelix. No use or reproduction without express written consent 43
Search Results: Heart Rate AND Exercise
Limits: exclude or include specific types of records
Advanced search: similar functions, & several more
Click to open the dbGaP Advanced Search form
Copyright OpenHelix. No use or reproduction without express written consent 44
dbGaP Advanced Search Form
Limits, SearchDetails, Help
Back to homepage
Search History
Search Builder
Help
Help
Multiple easy options in a single query form
Copyright OpenHelix. No use or reproduction without express written consent 45
dbGaP Advanced Search - Search Builder
Click Searchor Preview
Term added toSearch box
Click here
heart rate
Choose Boolean
Repeat steps as many times as needed to build query We will add the term “AND exercise[Variable]”
Search Builder
Copyright OpenHelix. No use or reproduction without express written consent 46
dbGaP Advanced Search - Search Builder Results
Query: (heart rate[Variable]) AND exercise[Variable]
Copyright OpenHelix. No use or reproduction without express written consent 47
dbGaP Advanced Search - Search History
Search History
Most RecentQueries
Click to access
Combine, view, delete or save previous searches
#1 AND #2
Click Searchor Preview
Click here to learn more about My NCBICopyright OpenHelix. No use or reproduction without express written consent 48
Saving Searches in My NCBI
Register & log in to My NCBI
Click here to save search in My NCBI
About My NCBISave
Copyright OpenHelix. No use or reproduction without express written consent 49
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 50
Summary of NCBI’s dbGaP
Many groups contribute data
http://www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 51
Data Organized into Uniform Report Types
Stable identifiers assigned
Genome browserprovided
Copyright OpenHelix. No use or reproduction without express written consent 52
NCBI dbGaP Agenda
Introduction & Credits Basic Searching & Browsing Report Types Analysis Reports Advanced Searching Summary Exercises
dbGaP: www.ncbi.nlm.nih.gov/gap
Copyright OpenHelix. No use or reproduction without express written consent 53