in-spire training - dpcpsi · opa_t#1192_dec-13-2016 office of portfolio analysis sharing search...
TRANSCRIPT
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
IN-SPIRE Training
Office of Portfolio AnalysisDivision of Program Coordination, Planning, and
Strategic InitiativesNational Institutes of Health
Todays Instructors: Patricia Forcinito and Matt Perkins
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Mission of the Office of Portfolio Analysis
Consult &Collaborate
Disseminate Best PracticesClassroom Training (Custom Classes Available)
Online TrainingWeb Resources (Case Studies, FAQs)
Office HoursSymposia
Develop New Methods
BuildTools
Facilitate DataCleaning & Analysis
Supportdata-driven
decision-making
OPA Director:Dr. George Santangelo
https://dpcpsi.nih.gov/opa/index
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
iSearchiSearch is NIH’s next-generation portfolio analysis platform, providing comprehensive, easy-to-use access to a carefully curated, extensively linked data set of global grants, patents, publications, clinical trials, and drugs.
https://itools.od.nih.gov/dashboard/#login
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Part 1• Understand how IN-SPIRE works and what it might
be used for.
• Learn how to import a dataset, do some visualizations and export results.
Part 2
• Learn how to use the following features:• Searches
• Search Networks
• Time Tool
• Correlation
• Facets
Course Objectives
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
• Developed by Pacific Northwest National Laboratory(PNNL) to help analyze huge datasets
About IN-SPIRE
• Can help understand data in two different ways:
– Convergent
• Focusing on a particular set of information or relationships.
• Add efficiency to the process of analyzing large datasets.
– Divergent
• Remaining open to potentially interesting information or relationships.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Examples of IN-SPIRE use in portfolio analysis
1 23
4
5
6
15
16
171819
20
7
89
14
13 12
10 11
IC (b)
IC (a)
Is there overlap between agencies/ICs/divisions?
IN-SPIRE Galaxy View
Applicable for other comparisons, eg:• between NIH & non-NIH grants• Different NIH POs working in the
same research area etc.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Evolution of Portfolios: Stem Cell ResearchYear 2009
Searched QVR for “Stem Cell” in Title and Abstract
291 Projects
IN-SPIRE ThemeView Classic
brain, nscs, nsc
Human, gene, embryonic
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
193 Projects
Year 2013Evolution of Portfolios: Stem Cell Research
IN-SPIRE ThemeView Classic
Neurons, brain, neural stem cells
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Identify training activities in T34 grants
• Identify Portfolio.• Use Search Networks to
combine range of Search Terms and group to create categories.
• Develop understanding of training activities.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Understanding datasets / Manual Coding
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
How does IN-SPIRE create a visualization with my documents?
Today’s example: Excel Spreadsheet with grants belonging to the transplantation portfolio
Source Dataseti.e. grants, pubs, etc.
Text to be mined by IN-SPIRE
Title Abstract Specific Aims
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
How does IN-SPIRE create a visualization with my documents?
Determinesdistinguishing words called
“Major Terms”
scan
Identifies the termsthat are least
randomly distributed
Uses thesedistinguishing
words to create asignature for each
document
Signature generation
clustering
IN-SPIRE groupsthe documents intoclusters based on
their signaturesimilarity.
projection
Clusters are arranged in a
high-dimensional
space and thenflattened downto a 2D graph
Source Dataseti.e. grants, pubs, etc.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
IN-SPIRE VisualizationGalaxy View
• Each document is represented by a dot or “star” in the “sky”
• Documents are cluster together according to their content similarity
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
IN-SPIRE VisualizationThemeView Classic
3-D representation of the clusters
• By default the peak height represents number of documents• Peak height can be adjusted to represent for example: total awarded $, etc.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Demonstration followed by hands on exercise
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
• Learn:
• Searching
• Groups
• Search networks
• Time tool: documents, groups, flows
• Correlation tool
• Facets tool
Part 2: Objectives
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Search Networks
• Similar to searches / groups.• Two advantages:
• Useful for combining a number of searches and visualizing results.
• Search networks can be saved and copied to other projects or shared between users
• Can use searches created in search function (and groups) or create new ones.
• Links between searches are easy to create and remove.
• Values update when new data is loaded.• OPA guidance here: https://opa-
trainingportal.od.nih.gov/?page_id=961
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Sharing Search Networks• Once you’ve created a Search Network, it will appear in the project folder, and
can be dragged to other projects to use on other datasets.
• To transfer Search Networks to another user:• Copy to Home folder• Log out of IN-SPIRE.• Use the IN-SPIRE Meta Data Sharing tool (likely to be at):
• C:\Users\perkinsmj\INSPIRE\bin\Meta Data Sharing• Follow the process in reverse to import.
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016 http://in-spire.pnnl.gov/government.stm
How do I get access to the tool?
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
Leaveempty
Office of Portfolio AnalysisOPA_T#1192_Dec-13-2016
During today’s class we covered:
• How IN-SPIRE works and what it might be used for
• How to import a dataset (differences within computation, categorical, date, and numeric fields)
• How to get IN-SPIRE main visualizations (Galaxy View and Theme View Classic)
• Used:
• Search feature
• Created groups using multiple methods
• Search networks
• Time tool
• Correlation tool
• Facets tool
QUESTIONS?