Human simulations of vocabulary learning

Human simulations of vocabulary learning Gillette, Gleitman, Gleitman, Lederer Présentation Interface Syntaxe-Psycholinguistique Y-Lan BOUREAU

Outline • Some background • The problem to be solved • Facts : nouns’ acquisition precedes verbs’ acquisition • Existing theory • Gillette et al.’s hypothesis • Simulation experiment • Learning from observation • Learning from linguistic hints • Discussion

The problem of language learning • Children learn language from scratch • Traditional hypothesis : • Children hear adults speak • They spot that « cat » is uttered most frequently when there is a cat around • They infer that « cat » means ‘cat’ • But babies’ vocabulary does not reflect input frequencies • much more nouns than verbs in babies vocabulary

Nouns are learnt earlier • A conceptual hypothesis : • Verbs are conceptually more difficult • So cannot be learnt until babies display adequate conceptual knowledge • Alternative hypothesis : information requirements • Verbs require some syntax to be already acquired (e.g. : I know that Mommy is coming)

Pairing word to world • Three sources of information : • Nonlinguistic evidence (e.g. Mommy says « cat » when the cat is there) • Linguistic evidence : • Co-occurrence of semantically related words in sentences (e.g. food names usually appear with verbs like « eat ») • Syntactic structures in which words occur (e.g. a verb with one subject and two complements is likely to be of the « give » kind)

Hypothesis • Hypothesis : the baby • (1) acquires a small stock of nouns by word-to-world pairing • (2) uses that stock of nouns as a scaffold for constructing representations of the linguistic input that will support a more efficient learning procedure • Support : correlation of changes in vocabulary size with appearance of multiword speech

A simulation experiment • Principle : • Adult learners • (no conceptual issues any more) • Trying to guess : most frequently used nouns or verbs • Observational clues : video clips • Linguistic clues : co-occurring words, syntactic frame

First experiment • Only videoclips • Adults trying to guess 24 nouns and 24 verbs

Results, Part I : Nouns win • Nouns are guessed with much better results than verbs :

Imageability rules • Provided clues are exclusively visual • Nouns of the set (e.g. elephant, plane, bag) are a lot more « imageable » than verbs (e.g. think, know, wait)

Results, part II Nouns Nouns Verbs Verbs

Conclusion of experiment I • The one relevant factor seems to be imageability • Not that surprising : from a video, you learn imageable things ; a thing that is not imageable would be hard to picture !!

Linguistic clues vs. Observational clues • All nouns removed. • 6 conditions : • 1 : videoclips (but with a bip for the verb) • 2 : alphabetical lists of nouns • 3 : 1+2 (videoclips + alphabetical lists) • 4 : syntactic frames with all nonsense words • 5 : sentences with only the verb as nonsense • 6 : 1+5 (videoclips + sentences)

Linguistic clues vs. Observational clues

Linguistic clues vs. Observational clues • 6 conditions : • 1 : videoclips (but with a bip for the verb) • 2 : alphabetical lists of nouns • 3 : 1+2 (videoclips + alphabetical lists) • 4 : syntactic frames with all nonsense words • 5 : sentences with only the verb as nonsense • 6 : 1+5 (videoclips + sentences)

Results Nouns reintroduced No nouns provided !! No more visual information !! Visuals reintroduced

Linguistic clues vs. Observational clues • Remarkably : leap between 3 and 4, whereas the reverse could have been expected ! • Interestingly, those verbs that were best learnt in the observational learnings show a decrease between 3 and 4

Discussion • Verbs : complementary distributions (12 never learnt with visual clues = 12 best learnt with linguistic clues) • This distribution corresponds to the « imageability » criterion : • Quite logically, you can learn visually only what is visually representable • Verbs that use higher level linguistic representations have to wait until those can be constructed

Discussion : general scheme • First, imageable words are learnt on a word-to-world pairing basis • Those imageable words are mostly nouns • That would explain why nouns make up most of young infants’ vocabulary • Second, this first set of words allows learning of new words on a sentence-to-world pairing • Thus conceptual words can be learnt as well

Some reservations • The argument structure is the same across languages (logical requirements), but : • Adults already know the words, so they could try to guess the verbs by exhaustive search with all the information given (e.g. : the best performance is for « look », and it is probably due to the use of « look » with « at » )

The End

Human simulations of vocabulary learning

Human simulations of vocabulary learning

Presentation Transcript

Learning vocabulary from lists

Simulations as Learning Tools

Study and learning vocabulary

Vocabulary teaching and learning

HUMAN RESOURCES VOCABULARY

HUMAN LEARNING AND LEARNING

SIMULATIONS, EXPERIENTIAL LEARNING, RELATIONAL AUTONOMY

Learning Vocabulary

The Ways of Learning Vocabulary

LEARNING ACADEMIC VOCABULARY

Vocabulary Learning

Learning Vocabulary

Simulations and Discovery Based Learning

Teaching/learning Vocabulary

Vocabulary Learning Strategies

Human Learning

LEARNING GOAL 5 VOCABULARY

Learning in Simulations

vocabulary learning

Learning Spanish / French Vocabulary

Human Body Vocabulary

Learning a Visual Vocabulary