Professional Documents
Culture Documents
2023 - Who S Afraid of ChatGPT - Chesi UNISI
2023 - Who S Afraid of ChatGPT - Chesi UNISI
Cristiano Chesi
Director of NeTS
Neurolinguistics, Computational Linguistics, and Theoretical Syntax
The success of OpenAI
una
Io ieri
Sentence generation: a «simple» game
una
ora dal visto
presto
una bella sei
cosa
una mai trono
una
hai corre
quando
una legge
Sentence generation: a «simple» game
C. Chesi 12
Well, this is not true:
the poverty of
stimulus argument!
Noam
Chomsky
(1959)
The Poverty of Stimulus argument
Possible rules
Move the third word in front of the sentence
Move the first auxiliary in front of the sentence
Swap the matrix auxiliary with the matrix subject
Sanity check:
For more data: 1. Prompt: Ask me if the boy who was holding the flower is smiling
Warstadt & Bowman (2022) What artificial
neural networks can tell us about human ChatGPT: Was the boy who was holding the flower smiling?
language acquisition. In Algebraic Structures
in Natural Language, 17–60. CRC Press. 2. Prompt: Ask me if the man who could come to the restaurant did not come
ChatGPT: Did the man who was supposed to come to the restaurant fail to show up?
3. Prompt: Ask me if the granpa who is at home can see the moon
ChatGPT: Can the grandpa who is at home see the moon?
4. Prompt: Ask me if the granpa who can see the moon is at home
ChatGPT: Is the grandpa who is able to see the moon currently at home?
«Colorless green
ideas sleep
Semantics
furiously»
is
important!
Jackendoff
(1976)
Grammaticality
vs. meaning
ChatGPT May 3 Version
Grammaticality
vs. meaning
(sanity check)
ChatGPT May 3 Version
Grammaticality
vs. meaning
(sanity check)
ChatGPT May 3 Version
Usage-
Genius! based
Noam
Chomsky The dog runs away models!
(1957)
The dog bites the cat
that runs away
Right!
The dog bites the cat
that chases the mouse
that runs away Tomasello
(2005)
Skinner
C. Chesi (1950)
Tail recursion
ChatGPT May 3 Version
Tail recursion
ChatGPT May 3 Version
Center
embedding
(sanity check)
ChatGPT May 3 Version
A 10-y.o. child vs. ChatGPT
The training data
A 10 y.o. child
ChatGPT
Bayesian approximation:
P(wn|w0 ... wn-1) ≈ P(wn|wn-1)
Take few billions tokens corpus
Then count!
The probability of a word (w) given his “history” (h),
Language namely the sequence of preceding words, is
Models (LM) expressed as:
How a LM is created
P(w|h) e.g. P(door|please, close the)
Count(please, close the door)
Count(please, close the)
According to Google:
1.400.000 / 7.170.000 = 0,19
x…V hN yV
xkV
Simple Simple Recurrent Neural Networks
(recurrent) This is a bad … day
Artificial day
Neural
Networks ot
(Elman 1990)
ot
ht ht
contextt
it
it
bad
Standard RNN
Long Short
Term Memory o0 o1 on
(LSTM) h0 h1 h1
networks
(Hochreiter & Schmidhuber 1997)
i0 i1 in
?
LSTM:
o0 o1 on
+ + +
x x x x x x x x x
h0 h1 hn
c0 c1 cn
i0 i1 in
The age of ransformers:
“Attention is all you need” Vaswani et al. (2017)
Seq to Seq Machine Translation example rossa
(99%)
the red ball
word word word
embedding embedding embedding
e e e
h1 h2 h3
d h4 d h5 d
Transformers
Transformers Transformer
Transformer Transformer
Transformer
Transformers 48x Transformer
Transformer 96x Transformer
Transformer
12 x Transformer
Transformer
Transformer
Transformer Transformer
Transformer
Transformer Transformer
GPT-3 GPT-4
175.000.000.000 100.000.000.000.000
How big was GPT-3
…
How does it «cost» GPT-3
(forced choice)
A1. D-N
Il giorno. *I giorno.
the.SG.M day.SG.M the.PL.M day.SG.M
A1. D-N 47
A3. Subj-Verb
A4. Attraction
A5. Past-Participle
Minimal pairs in
COnVERSA B2. Auxiliary selection
B. Thematic roles Il gatto ha giocato. *Il gatto è giocato.
the cat has played the cat is played
B1. Argumental structure
Minimal pairs in
COnVERSA C2. Reflexives
Il ragazzo scivola. *Il ragazzo si scivola.
C. Pronouns The boy slips. The boy himself slips.
C3. Clitics
C3. Clitics
La nonna disegna un albero e lo colora.
*La nonna disegna un albero e gli colora.
The granma draws a tree and it/to_it paints.
Il nonno vede la bambina e le compra un gelato.
* Il nonno vede la bambina e la compra un gelato.
The granpa sees the child and her/to_her buys a ice cream.
D1. Questions on modifiers/adjuncts
Dove dorme il ragazzo?
Where does the child sleep?
In camera. *Di notte.
In the beedroom At night
Minimal pairs in
COnVERSA
D2. Questions on arguments
Chi mangia?
Who eats?
D. Questions La mamma. *La pasta.
Mom Pasta
D1. Questions on modifiers/adjuncts Cosa mangia?
What (does (s)he) eat?
D2. Questions on arguments La pasta. *La mamma.
Pasta Mom
Detailed Report
Hearing Child
53
Comparing reports
Report
ChatGPT
Control Values
Working Memory < 6 y.o.
(L2 generally perform better, DC
worse)
Featural sensitivity < 6 y.o.
(both L2 and DC generally perform
better)
Lexicon = 6 y.o.
(L2 generally perform better, DC
worse)
Comparing reports
Acceptability, complexity and coherence
Chesi, Vespignani, Zamparelli (SLI 2023)
Preliminary results:
Thanks
On Grammar Adequacy
a) Algorithmic Independence
IS and RS use different algorithms to explore the problem space generated from G;
b) Algorithmic Equivalence
IS and RS use the same algorithm A, but RS is affected by complexity (e.g. longer derivations are
harder to be computed)
ii. If ARS(G) and AIS(G) diverge, G is adequate if only if there is ARS such that ARS(G)
generates nothing but any possible sentence generable by RS.