slide#1 slide#2 why language important is? …200.652/prosem/files/halberda,languageprosem.pdf ·...

22
1 slide#1 Language: Words and Syntax Cognitive Proseminar By: Len Taing slide#2 Why language important is? “When we study human language, we are approaching what some might call the ‘human essence’, the distinctive qualities of mind that are, so far as we know, unique to man.” -Noam Chomsky slide#3 Advantages of words: – large number of concepts (> 60,000) – rapid computation recognize spoken word: 1/5 second retrieve word for production: 1/4 second Disadvantages of words: – finite number of predetermined concepts – everyone must have memorized them The Lexicon slide#4 Advantages of syntax: - content neutral rules allow us to combine any noun with any verb “Colorless green ideas sleep furiously” - rules allow new ideas to be expressed - the number of possible combinations grows exponentially with length of sentence Disadvantages of syntax: – rules require symbols, and perhaps the mind is not a symbol manipulating machine (behaviorism & connectionism) The Syntax

Upload: phamtuyen

Post on 01-Aug-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

1

slide#1

Language:Words and Syntax

Cognitive ProseminarBy: Len Taing

slide#2

Why language important is?• “When we study human

language, we areapproaching what somemight call the ‘humanessence’, the distinctivequalities of mind that are,so far as we know, uniqueto man.” -Noam Chomsky

slide#3

• Advantages of words:– large number of concepts (> 60,000)– rapid computation

• recognize spoken word: 1/5 second• retrieve word for production: 1/4 second

• Disadvantages of words:– finite number of predetermined concepts– everyone must have memorized them

The Lexicon

slide#4

• Advantages of syntax:- content neutral rules allow us to combineany noun with any verb

“Colorless green ideas sleep furiously”- rules allow new ideas to be expressed- the number of possible combinations growsexponentially with length of sentence

• Disadvantages of syntax:– rules require symbols, and perhaps the

mind is not a symbol manipulating machine(behaviorism & connectionism)

The Syntax

2

slide#5

The Lexicon• Children rapidly learn new words (yesterday)• Is this ability uniquely human?

No (e.g. Ai the chimp and other animals)• Are words symbols for both child and animal?

Behaviorists: No, all words are justassociations (Skinner etc)

Cognitivist: Yes for child, maybe foranimal (Chomsky, Pinker, Markman, Bloom etc)

slide#6

Design of Human Language

• Words for common entities– (mechanism = memory)

• Grammatical rules for novel combinationsof entities– (mechanism = on-line computation)

slide#7

The Syntax

Combinatorial grammar (Humboldt: “theinfinite use of finite media”)

• Two types of rules of grammar:--morphological rules, build new words

from parts--syntactic rules, build sentences from

words

slide#8

Syntax and morphology• Syntax/Grammar

– Sentences are thehighest level oflinguistic meaning

– Words combine in toform sentences

• Consider:• “Toad ate the cookie”• * “Cookie toad ate the”

– Sense ofgrammaticality andorder

• Morphology– Morphemes are the

lowest level oflanguage that conveymeaning

– Morphemes aremultiple phonemesthat combine to formwords

• Consider:• “re-gift-er”• *“gift-er-re”

3

slide#9

Where does the linguistic ordercome from?

elephant• Behaviorists (Skinner):

Learned through Associations

• Cognitivists (Chomsky):Innate syntax rules

slide#10

Language: The associativist story• Skinner, behaviorism, and

language: (stimulus response)– No innate knowledge– Children learn language b/c

adults/environment teach it– Parents provide corrective

feedback

• Grammar:– Word chain devices

• Morphology:– *morpheme chain device

• Language as statisticalprobability

slide#11

Associativist GrammarWord1 Word2 Word3 Word4Tall dogs sleep continuouslyFast wounds walk viciouslyGreedy farmers yell openlyPretty foam kicks genuinely

• P(Talldogs) = 0.95• P(sleep viciously) = 0.001• P(“Grammar is a complicated word chain device”)= 0.15• Chain the transitional probabilities to get a

sentence

slide#12

Associativist MorphologyMorph1 Morph2 Morph3Darwin ian ismKick er edTalk istEr kick

• P(Darwin + ian + ism) = 0.89• P(Kick + er) = 0.98• *P(er + Kick) = 0.0001

4

slide#13

Problem with word-chain devices(1)

• Separability of semanticsand syntax

• Chomsky: “Review ofverbal behavior” (1959)– S = “Colorless green ideas

sleep furiously”– P(S) = 0– Grammaticality: YES!

slide#14

Problem with word-chaindevices(2)

• Stimulus does not predict response: Validutterances.

-What a beautiful juggular.-Remember that one timeIn transelvannia?-I forgot today was daylightssavings time.

slide#15

Problems with word-chaindevices(3)

• Poverty of the stimulus• Consider all of the English sentences under 20

words.– e.g. “You have a finite set of rules that help you

generate and validate an infinite number ofgrammatical sentences.” (19 words)

• There are 10^30 such sentences. Even at a rateof memorizing 100 sentences a minute, after 1billion trillion years, you’ll only learn 1% of it!– But, kids acquire language within 5 years! They must

not be using associations

slide#16

Poverty of the Stimulus• How fast do children learn the rules?

≈ 3 years

5

slide#17

Poverty of the Stimulus• How fast do children learn the rules?

≈ 3 yearsThe Plural Rule

slide#18

Poverty of the Stimulus• How fast do children learn the rules?

≈ 3 yearsThe Plural Rule (children succeed at 2.5 years)

This is a wug.

Now there are two of them. There are two...Jean Berko (1959)

slide#19

Poverty of the Stimulus• How fast do children learn the rules?

≈ 3 yearsThe Past-tense Rule (children succeed at 3 years)

Pinker (1980)

• Every day we like to flibble.• Yesterday, we ….?

slide#20

6

slide#21

Typical dialog• Child: My teacher holded the baby rabbits and we patted

them.• Parent: Did you say your teacher held the baby rabbits• Child: Yes• Parent: What did you say she did• Child: She holded the baby rabbits and we patted them.• Parent: Did you say she held them tightly• Child: No, she holded them loosely.

slide#22

Poverty of the Stimulus• How fast do children learn the rules?

The 2-Word StageWord Order (2 year olds)

Child says: “Dog big” (the dog is big)“Big Dog” (that is a big dog)“He big, Johnny big,” (he is big)BUT NEVER SAYS:“*Big he,” (*that is a big he*)

Paul Bloom (1990)

slide#23

Poverty of the Stimulus• Do parents provide corrective feedback?

No

• An Actual Parent-Child Dialog (sad parenting)• Child: Nobody don’t like me.• Parent: No, it’s nobody likes me.• Child: Nobody don’t like me.• 2 more iterations• Parent: Say, nobody likes me.• Child: Oh, Nobody don’t likes me.• Parents rarely give corrective feedback and when

they do it doesn’t work

slide#24

Lexicon and Syntax arelocalized in the human brain

7

slide#25

A simple theory:• Irregular forms are words:

bring sound: “brĭŋ” meaning:

brought sound: “brŏt” meaning: past tense

• Regular forms can be generated by a rule (combinatorial operation):

walk

V past tense

-ed

slide#26

-20

-15

-10

-5

0

5

10

15

20

1 2 3 4

Pro

po

rti

on

of

Prim

ing

Normal

Patient D.E.

Patient J.G.

Patient T.S.

RegularPast Tense

IrregularPast Tense

SemanticRelations

Double Dissociation Of Regulars and IrregularsMarslen-Wilson et al

slide#27

An Agrammatic Aphasic Patientslide#28Alternative

Artificial neural networks (connectionism)• There are no rules; just analogies from memory• Pattern associator memory: links sounds to sounds

instead of words to words

8

slide#29

Speech SegmentationHow do we find the Words?

Maybe there are spaces between them.

Where are the silences between words?

-J. Saffran

Statistical- what sounds usually go together

“pretty baby” NOT “pre tyba by”

slide#30Familiarization Preference Procedure

-Familiarize infants to an auditory stimulus(i.e. play it over and over until baby gets sick of it)

-Test if infants have learned anything about the stimulus(i.e. let the baby control what she hears)

90° 90°

Red ligh t Red ligh t

Yellow light

Video Camera Lens

Chair on which

parent and child sit

slide#31

Speech SegmentationStatistical cues:

from Saffran, Aslin, and Newport (1996)

-Familiarize 8-mnth-olds to a proto-language for 2 mintibudopabikudaropigolatupabikutibudogolatudaropidaropitibudopabikugolatu

-Testtibudopabiku vs pigolatudaro

7.5 sec 8.5 sec

Probability = .8 Probability = .2

slide#32

Language• Vast expressive power• The sources of this power:

cow sound: “kow” meaning:

Lexicon:Memorized words

(Saussure: the “arbitrarysign”)

Syntax:Combinatorial grammar(Humboldt: “the infinite

use of finite media”)

“Justin teaches class”

NP VP

Noun Verb Direct Object“Justin teaches class”

9

slide#33

Languages of New Zealand (as well as Latin)

-The boy kicked the ball.

Grammatical Structure

Subj. Verb Dir. Obj.

English

-The ball the boy kicked.-The boy the ball kicked.-Kicked the boy the ball.

-*Kicked the ball the boy.

slide#34

Tall dogs sleep continuouslyFast wounds talk viciouslyGreedy farmers yell openlyPretty foam kick genuinely

Rule based grammar

Adj. Noun Adv.Verb

• Grammar is built up from syntactic rules:– S NP VP S(NP(tall dogs) VP(sleep openly))– NP N NP(N(dogs))– NP adj. NP NP(adj(tall) NP(N(dogs)))– VP V adv. VP(V(sleep) adv(openly))

Learning grammar = Learning the orderly relations amongclasses of words.

slide#35Grammatical Structure

Use familiarization preference procedure

vs

Relations among classes of words

-Familiarize 8-mnth-olds to a proto-language for 2 min

from Marcus, Vijayan, Bandi Rao, and Vishton (1999)

AAB

-Test: new words

AABSame

strucure

ABBNovel

structure

6 sec 8 sec

slide#36

Paul Bloom; Word Order

Child says: “Dog big”Could mean “That dog is big”“Big Dog” (NP Adj and Adj N)Could mean “That is a big dog”“He big, Johnny big,” (NP N)Could mean “He is big, Johnny is big”BUT NEVER SAYS:“*Big he, *Big Johnny.” (*Adj NP)“*Big is he, Big is Johnny”

At the very beginning of the child making 2-word utterances

10

slide#37

Privileges of occurrenceChild says: “Dog big”“Big Dog” (NP Adj and Adj N)“He big, Johnny big,” (NP N)BUT NEVER SAYS:“*Big he, *Big Johnny.” (*Adj NP)

slide#38

Two-word Utterance Stage

• “Mommy sock”• “Billy up” “Sock foot” “Me happy.”Does the child distinguish N and NP at this

stage? Are “Mommy, Billy, me”syntactically distinguished from “sock?”“Sock” could be short for “the sock.”Could be a NP or a N. “Mommy, Billy, me”can only be N. (In adult grammar).

Privileges of occurrence

Child says: “Dog big”

“Big Dog” (NP Adj and Adj N)

“He big, Johnny big,” (NP N)

BUT NEVER SAYS:

“*Big he, *Big Johnny.” (*Adj NP)

slide#39

Do Animals have a syntaxNoBut some people are still looking

slide#40

Animal Lexicons

11

slide#41

Other Evidence For Innate Grammar:discussed in Pinker

•Specific Language Impairment:Genetic link

•Creolization in sign language andspoken language

•Differential loss of grammar followingstroke

slide#42

Language: the structure of meaningSyntax

Morphology

slide#43Two sets of rules:

Morphology Syntax

“Justin teaches class”

NP VP

Noun Verb Direct Object“Justin teaches class”

slide#44

What is a rule?

• Input-output function.• Operates over abstract variables. (E.g.,

N(noun), Nstem, Ninflection.)• N --> Nstem Ninflection“A noun can consist of a noun stem followed

by a noun inflection”

12

slide#45

Rule based morphology• Form Semantics Rule

-er (N) “someone who” V + erkicker, xeroxer

-ism (N) “ideal or belief” N + ismdarwinism, kickerism*

-ness (N) “state of being” Adj+nesblackness, ugliness

-’s (Det.) “of N” N + ‘sLen’s, kicker’s, ugliness’s

slide#46

Readings roadmapLanguage

SyntaxPinker: “How Language

Works”

MorphologyPinker: “Words, Words, Words”

Past Tense Debates:Pinker, Ullman: Rule-based view

McClellan, Patterson: Associativistor Connectionist view

Word LearningSaffrin, Aslin, Newport:“Statistical learning by8-month-old infants”

Marcus: “Rule learning byseven-month-old

Infants”

slide#47

Further prima facie evidence forrules

• Overregularization in childhood.• Both past tense and plurals:• Period of production of correct irregular

forms. (E.g., “we swam, we brought it)• Followed by period of overregularization

(E.g., “we swimmed, we bringed it.”

Marcus, G., Pinker, S., Ullman,M., Hollander, M., Rosen, T. J., &Xu, F. (1992) Monographs of the

Society for Research in ChildDevelopment 57(4).

Regular & Irregular Inflection

• Regular verbs: walked, jogged, patted, kissed• Predictable: verb + -ed• Open-ended:

– faxed– snarfed, moshed, spammed, dissed

• Productive in children:– wugged– bringed

Marcus, G., Pinker, S., Ullman, M., Hollander, M., Rosen, T. J., & Xu, F. (1992) Monographs of theSociety for Research in Child Development 57(4).

13

slide#49

• Irregular verbs: brought, hit, went, sang,slept, made, rang, flew

• Unpredictable:– sink-sank– cling-clung– think-thought– blink-blinked

• Closed class: 165 irregulars; no recent newones

slide#50

A Developmental Story

• Children overregularize when they learnthe rule!

• Problem--why do they stopoverregularizing?

• Why don’t adults apply the rule to “bring-bringed.”

slide#51

Blocking

• Retrieved irregular inflection blocksapplication of the rule.

• Do children have to learn blocking?

NO NEGATIVE EVIDENCE. CHILDRENARE NOT CORRECTED FORGRAMMATICAL ERRORS

slide#52

Corrections of child language

• Where is that big piece of paper I gaveyou yesterday?

• Child: Remember I writed on it.• Oh that’s right. Don’t you have any paper

down here buddy?

14

slide#53

A Simple Explanation forChildren’s Errors

• Children’s memory retrieval is less reliablethan adults’.

• If the child fails to retrieve an irregular, andthe child has the rule, the child applies therule to the irregular.

Marcus, G., Pinker, S., Ullman,M., Hollander, M., Rosen, T. J. &Xu, F. (1992) Monographs of the

Society for Research in ChildDevelopment, 57.

Evidence:

• Overregularizations are rare (< 10 %overall)

• Memory is weak:– The less often a parent uses an irregular, the

more often the child errs on it (19/19 children,p < .001)

• Rule has been acquired:– Errors with irregulars don’t appear right away,

don’t appear until children are reliablyproducing regulars

Marcus, G., Pinker, S., Ullman, M., Hollander, M., Rosen, T. J., & Xu, F. (1992) Monographs of theSociety for Research in Child Development (57)4.

slide#55

Conclusions

• Blocking is innate.-- If irregular is retrieved, it’s used.--No negative evidence

slide#56

A Complication:Patterns in the Irregular Verbs

• Irregular verbs display patterns:– keep-kept, sleep-slept, feel-felt, dream-dreamt– wear-wore, bear-bore, tear-tore, swear-swore– string-strung, swing-swung, sting-stung, fling-flung

• Patterns are occasionally generalized:– Children: brang, bote, wope– History: quit, caught, snuck– Dialects: clumb, swole– Sophomores: spling-splung

Xu, F. & Pinker, S. (1995) Journal of Child Language, 22, 531-556.Prasada, S. & Pinker, S. (1993) Language and Cognitive Processes, 8, 1-56.

15

slide#57

Alternative

• Both “dog” and “dogs” are memorizedwords. “Dogs” marked for plural.

• Both “walk” and “walked” are memorizedwords. “Walked marked for past test.

“Child,” “children” work this way.“Bring” and “brought” work this way.

slide#58

Alternative: Rule following onlyapparent

• Swimmed by analogy to “rimmed,”• Walked by analogy to “talked”,…. Sheer frequency of regulars leads them

to have more analogies.Explains occasional overgeneralization of

irregulars.

slide#59

Arguments for Pinker’s wordsand rules theory

• 1) Linguistic phenomena in derivationalmorphology

• 2) Evidence for innate knowledge of levelordering.

slide#60

Peter Gordon’s Level OrderingStudy

The x-eater. A monster.

This is a child. These are -----.The monster likes to eat -------.He’s a ….. “children-eater,”Or “child-eater.

This is a rat. These are ----The monster likes to eat ------He’s a “rat-eater,” not *“rats-eater.”

16

slide#61

Poverty of Stimulus

• In input, there were 0 instances of adultsusing irregular plurals in compounds.

• Conclusion: distinction between regularand irregular inflection and level orderingrules innate.

slide#62

A third problem:

• Pattern associators generalize apattern only when it is frequent in input(word types); people can generalize aregular pattern even when it is rare:– Studies of regularization in children– Studies of regularization across languages

slide#63

Crosslinguistic Studies

• Regular verbs are the majority in English.• Perhaps the regular pattern is generalized

most freely simply because it is mostfrequent.

• Needed:– A language where a regular (default) rule

applies to a minority of forms.

Marcus, G., Brinkmann, U.,Clahsen, H., Wiese, R., & Pinker,S. (1995) Cognitive Psychology

29, 189-256.

The Exception that Proves theRule

English –s (99.6%) German –s (7%)• weird Ns (ploamphs) •weird Ns (Plaupfs)

• names (the Julia Childs) •names (die Thomas Manns)

Marcus, G., Brinkmann, U., Clahsen, H., Wiese, R. & Pinker, S. (1995) Cognitive Psychology, 29, 189-256.

17

Marcus, G., Brinkmann, U.,Clahsen, H., Wiese, R., & Pinker,S. (1995) Cognitive Psychology

29, 189-256.

The Exception that Proves theRule

English –s (99.6%) German –s (7%)• weird Ns (ploamphs) •weird Ns (Plaupfs)

• names (the Julia Childs) •names (die Thomas Manns)

• eponyms (Batmans) •eponyms (Fausts)

• regularization errors (mans)

•regularization errors (Manns)

Marcus, G., Brinkmann, U., Clahsen, H., Wiese, R. & Pinker, S. (1995) Cognitive Psychology, 29, 189-256.

slide#66

Words and Rules in the Brain

• Neural substrate of word memory shouldbe tied do irregulars

• Neural substrate of grammaticalcomputation should be tied to regulars

slide#67

• A. Anomia: Impaired word-finding;relatively unimpaired grammar– Irregulars (63%) harder than regulars

(85%)–Regularization errors (25%)–Could do wug-test (80%)

• replicated by Tyler et al., 2001, on HSEpatients w/ temporal lobe damage

Ullman, M., Corkin, S., Coppola, M., Hickok, G., Growdon, J., Koroshetz, W. , & Pinker, S. (1997)Jounral of Cognitive Neuroscience, 9, 289-299.

slide#68

• Agrammatism: Impaired grammar; lessimpaired word-finding– Irregulars (69%) easier than regulars

(20%)–Few regularization errors (0%)–Could not do wug-test

–A DOUBLE DISSOCIATION

Ullman, M., Corkin, S., Coppola, M., Hickok, G., Growdon, J., Koroshetz, W. , & Pinker, S. (1997)Jounral of Cognitive Neuroscience, 9, 289-299.

18

slide#69

B. Alzheimer’s Disease: Impaired memory;relatively unimpaired grammar.

• Irregulars harder than regulars (60% v.89%)

• Regularization errors (27%)

• Can do wug-test (84%)

Ullman, M., Corkin, S., Coppola, M., Hickok, G., Growdon, J., Koroshetz, W. , & Pinker, S. (1997)Jounral of Cognitive Neuroscience, 9, 289-299.

slide#70

Parkinsons’ Disease: Agrammatic symptoms;less impaired word-finding.

• Irregulars easier than regulars (80% v.88%)

• Few or no regularization errors (0%)

• Difficulty with wug-test (65%)

Ullman, M., Corkin, S., Coppola, M., Hickok, G., Growdon, J., Koroshetz, W. , & Pinker, S. (1997)Jounral of Cognitive Neuroscience, 9, 289-299.

slide#71

Conclusion:

• Expressive power of language comesfrom an interaction between the mentallexicon (memory) and the mentalgrammar (computation)

• Irregular and regular inflection contrastthese processes, holding meaningconstant

slide#72

• Regular inflection is applied whenever memoryfails:– difficult-to-analogize words (ploamphed, frilged)– words with inaccessible roots (low-lifes, flied out)– words poorly recalled by children (breaked, holded)– words poorly recalled by patients with disorders of

word retrieval (anomia, Alzheimer’s)

19

slide#73

Word learning problem

• Parsing individual words from continuousspeech– Misconception: adults make the boundaries

clear for the child– People don’t speak like Frankenstein

• Foreign language speech (demo)

slide#74

Word parsing summary• Statistical boundary

– Infant use statisticalpatterns betweenphonemes todetermine wordboundaries:

– P(“ca” + “t”) = 0.75 (noa word boundary)

– P(“ph” + “z”) = 0.001(a word boundary)

• Purely statistical, noneed for rules

• Rule based boundary– Infants can learn rule

based patterns inwords

– e.g. AAB phonemepattern “dodoba”

– *important: can detectviolations of that rule

• Rules and statisticallearning are two toolsin infants’ toolbox

slide#75

Saffran, Aslin, Newport (1996)

• Speech stream (demo)– Continuous– Computer generated (no intonation)

• Result: infants use statistical patterns todetermine word boundaries

• Aside: Hauser has conducted a similarstudy with tamarin monkeys– Result: tamarin monkeys also are statistical

learners

slide#76

Marcus (1999)

• Statistical learning is not the only toolchildren can use in language acquisition

• There is also a rule based system thatthey use to learn.– E.g. AAB rule (demo)

• Read the paper to see how he establishesthis.

20

slide#77

Readings roadmapLanguage

SyntaxPinker: “How Language

Works”

MorphologyPinker: “Words, Words, Words”

Past Tense Debates:Pinker, Ullman: Rule-based view

McClellan, Patterson: Associativistor Connectionist view

Word LearningSaffrin, Aslin, Newport:“Statistical learning by8-month-old infants”

Marcus: “Rule learning byseven-month-old

Infants”

Domain general learning ability and syntax

slide#78

Readings roadmapLanguage

SyntaxPinker: “How Language

Works”

MorphologyPinker: “Words, Words, Words”

Past Tense Debates:Pinker, Ullman: Rule-based view

McClellan, Patterson: Associativistor Connectionist view

Word LearningSaffrin, Aslin, Newport:“Statistical learning by8-month-old infants”

Marcus: “Rule learning byseven-month-old

Infants”

slide#79

English past tense debate: Theproblem

• “He likes to… “Yesterday he…”gorpdancetringgofo

• English has 180 irregular verbs-– Small enough to memorize?– Amenable to rules?

slide#80

English past tense debates: Theproblem(2)

• “He likes to…” “Yesterday he…”gorp gorpeddance dancedtring tringed/tranggo wentfo foed

• Regulars: (rule-governed)– Add –ed: e.g. dance danced

• Quasi-regulars: (rule-governed irregulars)– Singsang, ringrang, sleepslept, keepkept

• Whacky irregulars:– Go went. Note: no generalization for “fo”

Systematic

Arbitrary

21

slide#81

Past tense: a rule based story• Pinker, Ullman: WR theory

– Whacky irregulars: gowent (memory)– Two components:

• rule based system(default): V + ed• lexicon (i.e. associative memory).• When there is a trace in lexicon, it pre-empts the rule-based

default• When memory fails mistake in speech: gogoed

– Middle-road: has a component that explainssystematicity and arbitrariness of past tense forms

• Explains the quasi-regular memory with a connectionist story– It’s a story of how short-term memory works

slide#82

Past tense: an associative learningtheory

• Rummelhart, McClelland– Connectionist model– Past tense domain– *Phoneme to phoneme associations

• Nothing else, i.e. no rules, just statistics

• State of machine (connection weights,input, and connections) determines theoutput

slide#83

Connectionism and neural nets: anintroduction

• Neural networks:– Nodes (computing centers/neuronal bodies)– Connections (links between nodes/axon-dendritic connections)– Nodes sum up all of the input from the connections, and based

on an internal function will either fire or not– Connections can be weighted according to strength

• Positive: excitatory• Negative: inhibitory

– Back-propagation algorithm: learning of connection weights thathelp minimize error in the experience set

• More powerful than word chain devices b/c ofgeneralizability– i.e. connection weights + back-prop learning might lead to

“expected” output for novel input

slide#84

Connectionism: AND-net

• Demonstration of a neural network thatcomputes the AND function

• Logical AND: 0 = false, 1 = trueA B Result0 0 01 0 00 1 01 1 1

22

slide#85

Connectionism and neural nets: anintroduction(2)

rin sin ing ng#

#sa sin ing ng#

#dr #ra #ri #sai

#dr #ra #ri ang

dri ing ink#si

#si dri ink rin

Input layer representing base form (input to computation: phoneme)

Output layer representing past tense form (result of computation:phoneme)

Connection (layer)

slide#86

Connectionism: its just ComputerScience

• These models get bigger and more complicated– To point, where trust statistical invariants

• Can be made to perfectly match the training set• But the test is on novel input• Model has the ability to learn, adjust connection

weights– Back-prop is just on such learning algorithm– Hebbian

• Models can also have fixed connection weights

slide#87

Take home message

• You will note that the tension between the view thatlanguage is rule based, and that language comes fromassociative learning is rooted in the fact that language isboth systematic and arbitrary

• Connectionist models are statistical learners• Saffran, Marcus:

– Statistical and rule based learning a domain general capacitythat, in this case, was co-opted for language acquisition?

• Pinker, Ullman vs. McClelland, Paterson– Is the English past tense best explained by rule-based or

associative system?– Is there a middle ground?

slide#88

Language and lobster dancing arefun! Questions?

Hi little lady, do youjitterbug?