introduction to text mining

Post on 12-Jan-2015

Introduction to text mining

Lars Juhl Jensen

>10 km

exponential growth

~45 seconds per paper

text mining

information retrieval

find the relevant papers

user-specified query

“yeast AND cell cycle”
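A minimal sketch of how a Boolean query such as the one above could be evaluated, assuming abstracts are held in a plain dictionary. The function names `matches_all` and `search` and the three example abstracts are invented for illustration; real retrieval systems use indexes and ranking rather than a linear scan.

```python
import re

def matches_all(text, terms):
    """Return True if every term (word or phrase) occurs in the text, ignoring case."""
    lowered = text.lower()
    return all(
        re.search(r"\b" + re.escape(term.lower()) + r"\b", lowered)
        for term in terms
    )

def search(abstracts, query):
    """Evaluate a query of the form 'term AND term ...' over a dict of id -> text."""
    terms = [term.strip() for term in query.split(" AND ")]
    return [doc_id for doc_id, text in abstracts.items() if matches_all(text, terms)]

# Made-up abstracts, for illustration only.
abstracts = {
    "doc1": "Regulation of the cell cycle in budding yeast.",
    "doc2": "Yeast metabolism under nitrogen starvation.",
    "doc3": "Cell cycle checkpoints in human cells.",
}

hits = search(abstracts, "yeast AND cell cycle")
print(hits)
```

Only the abstract that mentions both "yeast" and the phrase "cell cycle" is returned; the other two each satisfy just one side of the AND.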

entity recognition

identify the concepts

comprehensive lexicon

orthographic variation

“black list”
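The three ingredients above (lexicon, orthographic variation, black list) can be sketched as a small dictionary-based tagger. Everything concrete here is an assumption made for illustration: the lexicon entries, the invented identifiers, the choice of "SDS" as a black-listed name, and the single variation rule (case-insensitive matching with an optional hyphen or space between letters and digits).

```python
import re

# Hypothetical mini-lexicon: name -> identifier (identifiers are invented).
LEXICON = {"CDC28": "gene:0001", "HAP1": "gene:0002", "SDS": "gene:0003"}

# Hypothetical black list: names that usually mean something else in running text.
BLACK_LIST = {"SDS"}

def variant_pattern(name):
    """Build a regex that tolerates case changes and an optional hyphen or
    space between the letter and digit parts of a name."""
    parts = re.findall(r"[A-Za-z]+|\d+", name)
    body = r"[-\s]?".join(re.escape(part) for part in parts)
    return re.compile(r"\b" + body + r"\b", re.IGNORECASE)

def tag_entities(text):
    """Return (matched string, identifier) pairs for lexicon names not on the black list."""
    found = []
    for name, identifier in LEXICON.items():
        if name in BLACK_LIST:
            continue
        for match in variant_pattern(name).finditer(text):
            found.append((match.group(0), identifier))
    return found

tags = tag_entities("Cdc-28 and hap1 were analysed by SDS gel electrophoresis.")
print(tags)
```

The variants "Cdc-28" and "hap1" are both mapped to their lexicon entries, while "SDS" is skipped because it is black-listed.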

Reflect

augmented browsing

Pafilis, O’Donoghue, Jensen et al., Nature Biotechnology, 2009

used by publishers

information extraction

formalize the facts

co-mentioning
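Co-mentioning can be illustrated with a simple count of how often two entities are tagged in the same document. This is a sketch under stated assumptions: the input is a list of already-recognized entity names per document, the example data are made up, and raw counts stand in for whatever scoring scheme a real system would apply.

```python
from collections import Counter
from itertools import combinations

def comention_counts(docs):
    """Count, for each unordered entity pair, the number of documents mentioning both."""
    counts = Counter()
    for entities in docs:
        # Deduplicate within a document and sort so each pair has one canonical order.
        for pair in combinations(sorted(set(entities)), 2):
            counts[pair] += 1
    return counts

# Made-up entity lists, one per document.
docs = [
    ["HAP1", "CYC1", "CYC7"],
    ["HAP1", "CYC1"],
    ["CYC7"],
]

counts = comention_counts(docs)
print(counts.most_common())
```

Pairs that co-occur in more documents receive higher counts, which is the raw signal a co-mention approach builds on.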

NLP: Natural Language Processing

Gene and protein names

Cue words for entity recognition

Verbs for relation extraction

[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]] is controlled by [nxpg HAP1]
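To show how a chunked sentence like the one above could be turned into formal facts, here is a rough sketch with a single hand-written rule. The chunk labels (`nxexpr`, `nxgene`, `nxpg`) are taken from the slide; the regular expression, the function name `extract_regulation`, and the relation label are assumptions for illustration and do not reproduce the actual grammar behind the slide.

```python
import re

# A chunk of the form [nxpg NAME ...] with no nested brackets inside.
NXPG = r"\[nxpg ([^\[\]]+)\]"

# One hypothetical rule: "[nxexpr ... [nxpg TARGETS] ...] is controlled by [nxpg REGULATOR]".
PATTERN = re.compile(r"\[nxexpr .*" + NXPG + r"\]*\s*is controlled by\s*" + NXPG)

def extract_regulation(chunked):
    """Return (regulator, relation, target) triples found by the single rule above."""
    match = PATTERN.search(chunked)
    if match is None:
        return []
    # Split a coordinated target list such as "CYC1 and CYC7" into separate names.
    targets = re.split(r"\s*(?:,|\band\b)\s*", match.group(1).strip())
    regulator = match.group(2).strip()
    return [(regulator, "controls expression of", target) for target in targets if target]

sentence = (
    "[nxexpr The expression of [nxgene the cytochrome genes "
    "[nxpg CYC1 and CYC7]]] is controlled by [nxpg HAP1]"
)

relations = extract_regulation(sentence)
print(relations)
```

The rule yields one triple per target gene, with HAP1 as the regulator of both CYC1 and CYC7.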

molecular networks

information on side effects

Campillos & Kuhn et al., Science, 2008

Acknowledgments

Sean O’Donoghue

Sune Frankild

Heiko Horn

Evangelos Pafilis

Michael Kuhn

Reinhard Schneider
