proximity within paragraph: a measure to enhance document retrieval performance

2

PROXIMITY WITHIN PARAGRAPH: A MEASURE TO ENHANCE DOCUMENT RETRIEVAL PERFORMANCE Srisupa Palakvangsa-Na- Ayudhya John A. Keane School of Informatics, The University of Manchester, UK QUERY = Informatics and Manchester and Course How to calculate a score? Informat ics Manchest er Course Block 1 = 3 Informat ics Block 2 = 1/3 Informat ics Course Block 3 = 2/3 44 . 0 3 3 3 3 / 2 3 / 1 3 Score List Returned 1.Doc R: 0.85 2.Doc M: 0.71 3.Doc L: 0.69 4.Doc A: 0.56 1) Assign weights to each queried word 2) Split a document into blocks 3) Calculate a similarity score 4) Order documents by their scores

Upload: erica-hardin

Post on 31-Dec-2015

21 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

3) Calculate a similarity score. 1) Assign weights to each queried word. 4) Order documents by their scores. 2) Split a document into blocks. Block 1. Informatics Manchester Course. = 3. Block 2. Informatics. = 1/3. Block 3. Informatics Course. = 2/3. John A. Keane. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: PROXIMITY WITHIN PARAGRAPH: A MEASURE TO ENHANCE DOCUMENT RETRIEVAL PERFORMANCE

PROXIMITY WITHIN PARAGRAPH: A MEASURE TO ENHANCE DOCUMENT

RETRIEVAL PERFORMANCESrisupa Palakvangsa-Na-Ayudhya John A. Keane

School of Informatics, The University of Manchester, UK

QUERY = Informatics and Manchester and Course

How to calculate a score?

Informatics Manchester

Course

Block 1

= 3

Informatics

Block 2

= 1/3

Informatics Course

Block 3

= 2/3

44.0333

3/23/13Score

List Returned

1.Doc R: 0.85

2.Doc M: 0.71

3.Doc L: 0.69

4.Doc A: 0.56

1) Assign weights to each queried word

2) Split a document into blocks

3) Calculate a similarity score

4) Order documents by their scores

Page 2: PROXIMITY WITHIN PARAGRAPH: A MEASURE TO ENHANCE DOCUMENT RETRIEVAL PERFORMANCE

This work was supported by a PhD studentship from the Royal Thai Government

Experimental Setup Compared with Minimum Distance Between Queried Pair (MQP) Employed three test sets based on WT10G data set and 50 queries (Topics 501-550) Used two criteria: the average interpolated precision-recall (AvgPrec) and the average Discounted Cumulative Gain (AvgDCG).

Results

Test Set 1

0

2

4

6

8

10

12

Rank

Av

gD

CG

.

PWP

MQP

Test Set 2

0

2

4

6

8

10

12

Rank

Av

gD

CG

.

PWP

MQP

Test Set 3

0123456789

Rank

Av

gD

CG

.

PWP

MQP

Test Set 1

0

10

20

30

40

50

60

70

0 10 20 30 40 50 60 70 80 90 100

Recall

Av

gP

rec

PWP

MQP

Test Set 2

0

10

20

30

40

50

60

0 10 20 30 40 50 60 70 80 90 100

Recall

Av

gP

rec

PWP

MQP

Test Set 3

0

10

20

30

40

50

60

70

0 10 20 30 40 50 60 70 80 90 100

Recall

Av

gP

rec

PWP

MQP

Discussion

AdvantagesPWP MQP

tends to obtain higher AvgDCG values considers all logical blocks more fine-grained

Disadvantages

PWP MQP

considers only the number of

non-queried words between unique

queried words pairs difficult to rank documents as the same

score is given to many documents more coarse-grained

multiple-topics issue likely to obtain lower DCG values than

MQP, due to the use of multiple logical

blocks

Writing an Essay. Structure of an Essay Introductory Paragraph Supporting Paragraph 1 Supporting Paragraph 2 Supporting Paragraph 3 Concluding Paragraph

Proximity Mobile Payments - Finald3nrwezfchbhhm.cloudfront.net/lib/Proximity_Mobile_Payments_200709.pdf · Proximity Mobile Payments: ... proximity mobile payment data is linked directly

Match the main idea to each paragraph Paragraph 1 Paragraph 2 Paragraph 3 Paragraph 4 A. dream B. personal information C. character( 性格品质 )

Introduction Body Paragraph 1 Body Paragraph 2 Body Paragraph 3 Conclusion

Proximity Sensor/Proximity Switch/Proximity Sensors/Proximity Switches

Paragraph Construction - Liberty University · Paragraph Transitions A paragraph might begin with a reference to the preceding paragraph. These are called paragraph transitions and

Proximity, Proximity, Proximity Long

The Essay Introduction First body paragraph Second body paragraph Third body paragraph Conclusion

A Proximity-aware Interest-clustered P2P File Sharing …hs6ms/publishedPaper/Journal/2014/A Proximity... · A Proximity-aware Interest-clustered P2P File ... Proximity-aware clustering

Multimedia Retrieval. Outline Audio Retrieval Spoken information Music Document Image Analysis and Retrieval Video Retrieval

Informative Essay Outline Paragraph # 1 Introduction Paragraph # 2 Main idea/body 1 Paragraph # 3 Main idea/body 2 Paragraph # 4 Main idea/body 3 Paragraph

3. paragraph structure and hub paragraph

Flowchart and Paragraph ProofsFlowchart and Paragraph Proofs

Power Writing and Beyond. 5-Paragraph Essay Format Introduction Body Paragraph 1 Body Paragraph 2 Body Paragraph 3 Conclusion

Retrieval Model Overview Boolean Retrieval Retrieval INFO 4300 / CS 4300 ! Retrieval models – Older models » Boolean retrieval » Vector Space model – Probabilistic Models »

Proximity as principles: Directness, community norms and ... · Proximity as principles: Directness, community norms and the ... the proximity and duty of care ... view that proximity

Modern information retrieval...•Unit may be character, word, paragraph, etc. •Examples: •united within 5 words of american: Finds documents that discuss “United Airlines and

Proximity Operations for the Robotic Boulder Capture ...asteroid, now referred to as Option A2. The proximity operations design for the Option B includes identification and retrieval

Introduction to Information Retrieval Introduction to Information Retrieval CS276 Information Retrieval…

Ranking and Retrieval of Image Sequences From Multiple … · 2015-05-26 · Ranking and Retrieval of Image Sequences from Multiple Paragraph Queries Gunhee Kim Seoul National University

Design Principles. Proximity “ Proximity, or closeness, implies a relationship.”

Capacitive Proximity Sensors Theory of Operation...54 Capacitive Proximity Sensors Theory of Operation Capacitive proximity sensors are similar to inductive proximity sensors. The

1 Proximity-Based Opinion Retrieval Mark CarmanFabio CrestaniShima Gerani

Weighted Proximity Best-Joins for Information …rvt/icde09-multikw-final.pdfWeighted Proximity Best-Joins for Information Retrieval † Risi Thonangi, 1Hao He,2 AnHai Doan,3 Haixun

PROXIMITY-BASED GRAPH EMBEDDINGS FOR MULTI-LABEL ... · document collections for text categorization based on the “bag of words” model. 1 INTRODUCTION In information retrieval

Lecture 4: Information Retrieval Techniques/Model and ... · –1) Basic Retrieval Techniques –Boolean Searching –Truncation Search –Proximity search/Operators –2) Advanced

Writing a Paragraph - Wikispaces5thgradereadingresourcesmcboe.wikispaces.com/file/view/Paragraph... · Writing a Paragraph HAMBURGER ... will never forget this special day. ... paragraph

Information Retrieval: Introduction · Information Retrieval: Introduction De nition of Information Retrieval De nition of Information Retrieval Information Retrieval (IR) is about

IM Series Proximity Switch - Electrónica Embajadores · 2020-03-13 · Proximity Switch IM Series!" 01!"#$!"!!!"! PROXIMITY SENSOR TYPE Proximity sensor The use of a magnetic field

Proximity Measuring Systems using TQ 4xx Proximity

Media Retrieval Information Retrieval Image Retrieval Video Retrieval Audio Retrieval Information Retrieval Image Retrieval Video Retrieval Audio Retrieval

Proximity Sensors Selection Guide PROXIMITY SENSOR … · Proximity Sensors Selection Guide Proximity Sensor Selection Guide Cables and Accessories Cables and accessories can be found

Ranking and Retrieval of Image Sequences from Multiple ...lsigal/Publications/cvpr2015_text2pic_abstract.pdf · Ranking and Retrieval of Image Sequences from Multiple Paragraph Queries

Leveraging Conceptual Lexicon ： Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima

Weighted Proximity Best-Joins for Information Retrieval