searching for the best engine 2007. 12. 07 presented by gong gi hyun, ids lab., seoul national...

9
Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Upload: quentin-shields

Post on 17-Jan-2018

215 views

Category:

Documents


0 download

DESCRIPTION

Copyright  2007 by CEBT 5 Problems with Today's Web Search  Too many search results and too many irrelevant search results. After spending time on the first few pages of the search results, you don't have time or patience to go beyond those pages.  No ability to manage the results by defining context or meaning. It is not easy to build an advanced search query. Lack of hints for search.  No user-friendly visual management with mouse click.  Documents are ranked by a search engine according to an algorithm. specific to a search engine, and not specific to your interests.  Over-ranked commercial and under-ranked non-commercial search results. IDS Lab. Seminar - 3Center for E-Business Technology

TRANSCRIPT

Page 1: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Searching for the Best Engine

2007. 12. 07Presented by Gong GI Hyun, IDS Lab., Seoul National University

Page 2: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

Search Engine 1st generation (X)

1995, Digital Equipment Corp figured out how to store the words on Web pages as an index that lent itself to lightning-fast searches.

2nd generation (△) Google's innovation was to further rank a Web page by the other

pages that link to it, on the somewhat shaky assumption that if a page is much linked-to.

"The Google results just had too much stuff I wasn‘t looking for.” 3rd generation (?)

Semantic Web? Personalization? Social network?

IDS Lab. Seminar - 2

Page 3: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

5 Problems with Today's Web Search Too many search results and too many irrelevant search

results. After spending time on the first few pages of the search results, you don't have time or patience to go beyond those pages.

No ability to manage the results by defining context or meaning. It is not easy to build an advanced search query. Lack of hints for search.

No user-friendly visual management with mouse click. Documents are ranked by a search engine according to an algo-

rithm. specific to a search engine, and not specific to your interests.

Over-ranked commercial and under-ranked non-commercial search results.

IDS Lab. Seminar - 3

Page 4: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

Semantic Search We need algorithms that match the meaning of concepts

and emulate "understanding“ The sense of a word as a factor in its ranking algorithm. One of the first impacts of semantic search engine will

be on the handling of long-tail queries. Popularity algorithms fail at the long-tail queries, be-

cause there is never enough statistical sampling. The idea of "personalized search" actually requires seman-

tic capabilities without the need for tracking the user's be-havior.

NLP based “Semantic Search Engine” approach : HAKIA

IDS Lab. Seminar - 4

Page 5: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

Long Tail Query

IDS Lab. Seminar - 5

Page 6: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

Is Semantic Technology the An-swer? Peter Norvig : “They don't want the burden of having to express

it as a full sentence.” Google AdSense? NLP processing?

NLP may well be web 4.0 and semantic web.

IDS Lab. Seminar - 6

Page 7: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

What is the “Perfect Search”? What do we expect when we enter a term into a search

box? Get the Perfect Answer.

Can we explain what we want perfectly? Can we expect the "perfect" answer all the time? Interaction is needed (Google does not do that)

Excuse me, what do you mean? Did you mean to look for ~~?

IDS Lab. Seminar - 7

Page 8: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

The Alternative Search Engine of the Year, 2007!

IDS Lab. Seminar - 8

Page 9: Searching for the Best Engine 2007. 12. 07 Presented by Gong GI Hyun, IDS Lab., Seoul National University

Copyright 2007 by CEBTCenter for E-Business Technology

Quintura : Semantic Map As you click on words, they get added to your query,

causing the words in your map to update and restrict the focus of your search, allowing you to quickly and graphically structure very specific queries.

By clicking through a semantic map will allow you To spend less time sifting through irrelevant results. To refine the search when you're not exactly sure of the

query you should be using.

IDS Lab. Seminar - 9