natural language processing guangyan song. what is nlp natural language processing (nlp) is a field...

12
Natural Language Processing Guangyan Song

Upload: donald-hood

Post on 31-Dec-2015

227 views

Category:

Documents


0 download

TRANSCRIPT

Natural Language Processing

Guangyan Song

What is NLP

Natural Language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages.

GoalNatural Language UnderstandingNatural Language Generation

Example Applications

Automatic summarization Machine Translation Information Retrieval Question Answering system Foreign language written aid

Problems

Natural Languages are very complexMany words have various meaning The number of relevant dependencies is

much too large and those dependencies are too complex

Major Approaches

Rule based NLPHandcrafted linguistic rulesVery labour-intensive and difficult to scale up

Example based NLPSearch for similar examples from training

data

Statistical based NLPLearn from training data and generate natural

language

Machine Translation

Microsoft Bing TranslatorEarly used Rule based technology

MorphologyLexicalSyntactic

Machine Translation Now using Statistical based approach

Information Retrieval

Stop-Words Removal Stemming

Information Retrieval

Language Model RetrievalSimilar as Statistical based Machine

translation approach

NLP technologies are not widely used in web search

Foreign Language Writing aid

Microsoft Grammar checker English Second Language (ESL)

AssistantExample based approach

Information extraction

Email2DB Get stock information from emails and stored in the

database AddressDoctor

Analyze unstructured or partly structured addresses and divide them into individual elements

Recognize countries (by Name, ISO codes, major cities, etc.)

Format addresses according to the postal rules of all licensed countries

Standardize address elements (i.e. avenue -> ave, street -> st or vice versa)

Mainly rule based approach