stanford pos tagger 17 th february 2011. system requirement java 1.5+ –

12
Stanford POS tagger 17 th February 2011

Upload: emmeline-pierce

Post on 21-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Stanford POS tagger

17th February 2011

Page 2: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

System requirement

• Java 1.5+– http://www.java.com/en/download/index.jsp

Page 3: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Part-of-speech name abbreviations

The Penn Treebank English POS tag set1. CC Coordinating conjunction 25. TO to2. CD Cardinal number 26. UH Interjection3. DT Determiner 27. VB Verb,base form 4. EX Existential there 28. VBD Verb, past tense 5. FW Foreignword 29. VBG Verb,gerund/present

6. INPreposition/subordinating participle conjunction 30. VBN Verb, past participle

7. JJ Adjective 31. VBP Verb, non-3rd ps. sing. present 8. JJR Adjective,comparative 32. VBZ Verb,3rd ps. sing. present 9. JJS Adjective, superlative 33. WDT wh-determiner 10. LS Listitem marker 34. WP wh-pronoun 11. MD Modal 35. WP$ Possessive wh-pronoun 12. NN Noun, singular or mass 36. WRB wh-adverb13. NNS Noun, plural 37 #. Pound sign14. NNP Proper noun, singular 38. $ Dollar sign15. NNPS Proper noun, plural 39. . Sentence-finalpunctuation 16. PDT Predeterminer 4 0 . , Comma 17. POS Possessive ending 41. : Colon, semi-colon18. PRP Personal pronoun 42. ( Left bracket character 19. PP$ Possessive pronoun )43 Right bracket character 20. RB Adverb )44 Straight double quote 21. RBR Adverb, comparative 45. ' Leftopen single quote 22. RBS Adverb, superlative 46. " Leftopen double quote 23. RP Particle 47. ' Right close single quote 24. SYM Symbol (mathematical or scientific) 48. " Right closedouble quote

Page 4: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Download

• http://nlp.stanford.edu/software/stanford-postagger-2010-05-26.tgz

Page 5: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

GUI

Page 6: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

GUI

Page 7: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Command

1. generate a default properties file.2. Tag file.

Page 8: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

generate a default properties filecommand

• java -classpath stanford-postagger.jar edu.stanford.nlp.tagger.maxent.MaxentTagger -genprops > myPropsFile.prop

Page 9: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Tag file command

• java -mx300m -classpath stanford-postagger.jar edu.stanford.nlp.tagger.maxent.MaxentTagger -model models/bidirectional-distsim-wsj-0-18.tagger -textFile sample-input.txt > sample-tagged.txt

Page 10: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Command

Page 11: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Result

• sample-input.txt The School of Liberal Arts was originally the Department of Language and Social Studies under the Faculty of Industrial Education.

•sample-tagged.txtThe_DT School_NN of_IN Liberal_JJ Arts_NNS was_VBD originally_RB the_DT Department_NNP of_IN Language_NNP and_CC Social_NNP Studies_NNP under_IN the_DT Faculty_NNP of_IN Industrial_NNP Education_NNP ._.

Page 12: Stanford POS tagger 17 th February 2011. System requirement Java 1.5+ –

Q&A