dependency hashing for n-best ccg parsing

15

1 Dependency Hashing for n-best CCG Parsing Dominick Ng and James R. Curran Presented by Yun Huang

Upload: holleb

Post on 05-Feb-2016

55 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

Dependency Hashing for n-best CCG Parsing. Dominick Ng and James R. Curran Presented by Yun Huang. CCG derivation Dependency Evaluation All components of a dep. structure must match golden standard Prec./Recall/F-score. Background: CCG. Background: CCGbank. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Dependency Hashing for n-best CCG Parsing

1

Dependency Hashing for n-best CCG Parsing

Dominick Ng and James R. Curran

Presented by Yun Huang

Page 2: Dependency Hashing for n-best CCG Parsing

2

Background: CCG

• CCG derivation• Dependency

• Evaluation– All components of a de

p. structure must match golden standard

– Prec./Recall/F-score

Page 3: Dependency Hashing for n-best CCG Parsing

3

Background: CCGbank

• CCGbank was created by converting the phrase-structure trees in the PTB into normal-form CCG derivations. (99.44% covered)

Page 4: Dependency Hashing for n-best CCG Parsing

4

Background: C&C parser

• Supertagger: assign possible lexical categories to word (eg. S\NP, (S\NP)/PP for swim)– Tag dictionary extracted from training data– Adaptive supertagging: β and k

• C&C parser: log-linear model parser– POS tags and lexical categories as input.– CKY chart parsing– N-best reranking

Page 5: Dependency Hashing for n-best CCG Parsing

5

Ambiguity in n-best CCG parsing

• Spurious ambiguity– Norm-form (usually right branching)

• Absorption ambiguity

• Diversity problem: n-best CCG derivations, but with duplicated dependencies

Page 6: Dependency Hashing for n-best CCG Parsing

6

Dependency Hashing (1)

• Constraint: any n-best candidate must not have the same dependencies as any candidate already in the list.– Similar in SMT: remove duplicated strings– Delete which: later inserted? lower score?

Page 7: Dependency Hashing for n-best CCG Parsing

7

Dependency Hashing (2)

• Implementation:– 32-bit hash value for each dependency

– Bit-wise XOR to combine sub-derivations– Only hash value, no hash table

• Collision: miss some useful dependencies

Page 8: Dependency Hashing for n-best CCG Parsing

8

Diversity experiments

• Dependency

• Grammatical relation

Page 9: Dependency Hashing for n-best CCG Parsing

9

Parsing Results

• Oracle– Reranking u

pper bound

• Reranking

Gap

Page 10: Dependency Hashing for n-best CCG Parsing

10

Three types of error

• Grammar error– Only a subset of CCGbank rules are used– Seen rule constraint

• Supertagger error– Restricted categories by frequency cutoff – Probability threshold βand cutoff k

• Model error– Suboptimal parse

Page 11: Dependency Hashing for n-best CCG Parsing

11

Grammar Error

• Given gold-standard categories, the parser F-score is 99.49%, with 95.61% coverage

• Grammar error accounts about 0.5% of overall parser errors, and 4.4% drop in coverage

Page 12: Dependency Hashing for n-best CCG Parsing

12

Supertagger and model error

• Supertagger error : differ from oracle• Model error : differ from baseline

Page 13: Dependency Hashing for n-best CCG Parsing

13

More experiments

• Tradeoff of speed and accuracy

• Gold/automatic

POS tags

Page 14: Dependency Hashing for n-best CCG Parsing

14

Conclusion

• Dependency hashing for n-best CCG– Avoid derivations with same dependency– Increase diversity in n-best list

• Comprehensive error analysis– Grammar error: 0.5%– Supertagger error: 5%– Model error: 7.5%

Page 15: Dependency Hashing for n-best CCG Parsing

15

Thank you

Q & A

Weighted Parsing, Probabilistic Parsing

Combinatory Categorial Grammar Teil 2: Semantik in der CCGmujdricz/referate/... · 2014. 2. 10. · CCG (2), 29.04.2008 41 Parsing Beispiel für die Ausgabe des Parsers:The school-board

The Care Act January 2015 Norwich CCG South Norfolk CCG North Norfolk CCG West Norfolk CCG Gt Yarmouth & Waveney CCG

14. Hashing - lec.inf.ethz.chlec.inf.ethz.ch/DA/2019/slides/daLecture9.handout.2x2.pdf · 14. Hashing Hashtabellen, Pre-Hashing, Hashing, Kollisionsauösung durch Verketten, Einfaches

Hashing, Hashing Tables Chapter 8. Class Hierarchy

Hashing - Introduction - McMaster Universitycarette/CS1MD3/2005/slides/hashing23.pdf · Hashing - Introduction ... Hashing. Universal Hashing ... idea: extendible hash tables? 21

Hashing 1 Hashing. Hashing 2 Hashing … * Again, a (dynamic) set of elements in which we do ‘search’, ‘insert’, and ‘delete’ n Linear ones: lists, stacks,

Hashing - ibr.cs.tu-bs.de12.pdf · 4/33 Hashing Hashfunktionen Kollisionen Ausblick Uberblick¨ Aufgabe Realisierung Hashing Hashing - Menge U potentieller Schl¨ussel sehr groß,

High Dimensional Search Min-Hashing Locality Sensitive Hashing

Chapter 8 Hashing Concept of Hashing Static Hashing ...chun/DS(II)-Ch08-Hashing.pdf · 1 C-C Tsai P.1 Chapter 8 Hashing Concept of Hashing Static Hashing Dynamic Hashing In CS, a

CS1622 - University of Pittsburghpeople.cs.pitt.edu/~mock/cs1622/lectures/lecture15.pdf · Lexical analysis ... Parsing Detects inputs with ill-formed parse trees ... • using hashing,

Chart Parsing and Probabilistic Parsing

5 Hashing Hashtabellen Hashing with Chaining Hashing with ... · Hashing Ubersicht¨ 5 Hashing Hashtabellen Hashing with Chaining Universelles Hashing Hashing with Linear Probing

Lecture 11 oct 6 Goals: hashing hash functions chaining closed hashing application of hashing

Department of Computer Science and Technology | - Clark ...Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 ccg Grammar18 ccg Lexical Categories Atomic categories:

An incremental algorithm for transition-based CCG parsing

File Organizations Jan. 2008Yangjun Chen ACS-39021 Outline: Hashing (5.9, 5.10, 3 rd. ed.; 13.8, 4 th ed.) external hashing static hashing & dynamic hashing

Lecture XI HASHINGyap/wiki/pm/uploads/Algo/l11_BASE.pdf · basic hashing framework, including universal hashing, perfect hashing, extendible hashing, and cuckoo hashing. Hash is one

New Static hashing schemes - GitHub Pagesyljh21328.github.io/blog/pdf/EDHashing.pdf · 2014. 7. 13. · Dynamic Hashing - 2 • Dynamic hashing schemes - Extendible hashing - Dynamic

€¦ · Web viewNHS Vale Royal CCG NHS West Lancashire CCG NHS Wigan Borough CCG NHS Fylde & Wyre CCG NHS Airedale, Wharfedale and Craven CCG NHS Barnsley CCG NHS Bassetlaw CCG

Pre-Viva Talk: Parsing Jazz: Harmonic Analysis of Music ...jazzparser.granroth-wilding.co.uk/attachments/PreVivaTalk/slides.pdf · Introduction Harmonic Analysis Harmonic CCG Statistical

CCG Parsing - cs.utexas.edu

Chart Parsing and Probabilistic Parsing - SourceForgenltk.sourceforge.net/doc/en/advanced-parsing.pdfChart Parsing and Probabilistic Parsing 9.1 Introduction ... Furthermore, it is

On the Complexity of CCG ParsingKuhlmann, Satta, and Jonsson On the Complexity of CCG Parsing to the empty string. Such “empty categories” are ruled out by one of the fundamental

Chapter 1thanhtung/downloads/dbms/Chapter_1.pdf · hashing, extendible hashing, and linear hashing. Both dynamic and extendible hashing use the binary representation of the hash value

Wide-coverage efficient statistical parsing with CCG and log-linear

Hash-Funktionen Hashing mit Verkettung Offenes Hashing ...roefer/pi2-04/06.pdf · Universität Bremen Hashing Thomas Röfer Hash-Funktionen Hashing mit Verkettung Offenes Hashing

NLP4ADS - NASA Astrophysics Data Systemads.harvard.edu/adsug/2018/06-3_ADS_2025.pdfAutomatic speech recognition CCG supertagging Chunking Common sense Constituency parsing Coreference

1 B+-tree and Hash Indexes B+-trees Bulk loading Static Hashing Extendible Hashing Linear Hashing

Parsing: Top-Down vs. Bottom-Up Parsing Algorithms Partial Parsing

Summer School on Hashing’14 Locality Sensitive Hashing

hashing & indexing

Wide-Coverage CCG Parsing with Quantifier Scope

Frontier Pruning for Shift-Reduce ccg Parsing and testing occur on CCGbank, a corpus of 40,000 annotated sentences (Hockenmaier and Steedman, 2007)

10/17/2002CSE 202 - Hashing CSE 202 - Algorithms Hashing Universal Hash Functions Extendible Hashing