Professional Documents
Culture Documents
2016 Rashi
2016 Rashi
2016 Rashi
Commerce
Original Research Paper Linguistic
An Understanding of Statistical Study of Language
Dynamics
Yash Mohan
Student, SGTB Khalsa College, University of Delhi
Sharma
Sushil Kumar
Assistant Professor, SGTB Khalsa College, University of Delhi
Singh
ABSTRACT Language is an integral part of human thought process and has been instrumental in the development of human
race. But often two or more languages have competed for speakers for various reasons. It is also true that many of the
languages of the world are on the verge of extinction with no speakers. In this work we discuss the effect of language
diffusion and the ensuing snowball effect initiated by economic integration just after the political independence in one of the Indian states.
various issues arising out of the fitted model. Finally, Thus, it actually comes out that the population
we give our conclusions in last section. dynamics of word diffusion is non-linear and can be
written as
THE DIFFUSION MODEL
𝑑𝑑
𝑥𝑥 = 𝑅𝑅𝑖𝑖 𝑥𝑥𝑖𝑖 (1 − 𝑥𝑥𝑖𝑖 ) − 𝛽𝛽𝑖𝑖 𝑥𝑥𝑖𝑖
The first model to consider would be the 𝑑𝑑𝑑𝑑 𝑖𝑖
𝑅𝑅𝑖𝑖 𝑥𝑥𝑖𝑖
lexical diffusion in populations. It accounts for the = (𝑅𝑅𝑖𝑖 − 𝛽𝛽𝑖𝑖 )𝑥𝑥𝑖𝑖 {1 − }
(𝑅𝑅𝑖𝑖 − 𝛽𝛽𝑖𝑖 )
spread of words of a given language as a consequence
of learning processes [6, 7, 8, 10]. In this model it is Let's denote 𝑥𝑥𝑖𝑖∗ = 1 −
𝛽𝛽𝑖𝑖
then
𝑅𝑅𝑖𝑖
assumed that each word of a given language is
incorporated independently [9, 10]. 𝑑𝑑 𝑥𝑥𝑖𝑖
𝑥𝑥𝑖𝑖 = (𝑅𝑅𝑖𝑖 − 𝛽𝛽𝑖𝑖 )𝑥𝑥𝑖𝑖 (1 − ∗ )
𝑑𝑑𝑑𝑑 𝑥𝑥𝑖𝑖
Let 𝑥𝑥𝑖𝑖 be the fraction of the population
knowing the word 𝑤𝑤𝑖𝑖 . It is evident that the rest of the The equation itself gives a telltale indication that there
population fraction 𝑦𝑦𝑖𝑖 = 1 − 𝑥𝑥𝑖𝑖 are yet to learn the are two equilibrium population fractions (for which the
word. Now two events dominate the population rate is null). The first is 𝑥𝑥𝑖𝑖 = 0 indicative of death of
dynamics of such a word. First is how fast the the word. The other is 𝑥𝑥𝑖𝑖 = 𝑥𝑥𝑖𝑖∗ indicative of the
𝛽𝛽
population fraction 𝑦𝑦𝑖𝑖 unaware of the word learns the survival of the word [7]. Since 𝑥𝑥𝑖𝑖∗ = 1 − 𝑖𝑖, it is also
𝑅𝑅𝑖𝑖
word
notable that the a fraction of population learn the word
𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑛𝑛𝑛𝑛 𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟 = 𝛼𝛼𝑖𝑖 𝑦𝑦𝑖𝑖 = 𝛼𝛼𝑖𝑖 (1 − 𝑥𝑥𝑖𝑖 ) if the learning rate 𝑅𝑅𝑖𝑖 is higher. It is also apparent that,
for a word to be maintained in the population lexicon,
where αi is the learning coefficient. Second is how fast the following inequality must be fulfilled 𝑅𝑅𝑖𝑖 > 𝛽𝛽𝑖𝑖 [7].
the population fraction 𝑥𝑥𝑖𝑖 unlearns the word (gives up The time evolution of word propagation would then be
its use). determined by the most general solution to above rate
equation
𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟 = 𝛽𝛽𝑖𝑖 𝑥𝑥𝑖𝑖
𝑥𝑥(0)𝑒𝑒 (𝑅𝑅𝑖𝑖−𝛽𝛽𝑖𝑖)𝑡𝑡
where 𝛽𝛽𝑖𝑖 is the extinction coefficient. The net 𝑥𝑥𝑖𝑖 = 𝑥𝑥 ∗
{𝑥𝑥 ∗ + 𝑥𝑥(0)[𝑒𝑒 (𝑅𝑅𝑖𝑖−𝛽𝛽𝑖𝑖)𝑡𝑡 − 1]}
diffusion of the word through the population would
then be
THE DATA FITTING
𝑑𝑑
𝑥𝑥 = 𝛼𝛼𝑖𝑖 (1 − 𝑥𝑥𝑖𝑖 ) − 𝛽𝛽𝑖𝑖 𝑥𝑥𝑖𝑖
𝑑𝑑𝑑𝑑 𝑖𝑖
We adopt the diffusion model as reflective of
However, as learning is a process dependent on the the shift of a whole collection of words and sentences.
interaction between individuals who already know the This way we apply a modified diffusion model for
word with those who don't, the learning coefficient language shift
involves a linear dependence on population fraction 𝑥𝑥𝑖𝑖
∗
∗
𝑥𝑥(0)𝑒𝑒 (𝑅𝑅−1)(𝑡𝑡−𝑇𝑇 )
𝑥𝑥𝑖𝑖 = 𝑒𝑒 + 𝑥𝑥 ∗
𝛼𝛼𝑖𝑖 = 𝑅𝑅𝑖𝑖 𝑥𝑥𝑖𝑖 {𝑥𝑥 + 𝑥𝑥(0)[𝑒𝑒 (𝑅𝑅−1)(𝑡𝑡−𝑇𝑇∗) − 1]}
TABLE – 1 DISCUSSION
HINDI SPEAKING FRACTION FOR THE
The word diffusion model curve has been studied
STATE OF RAJASTHAN
and has been shown that the word usage by fraction of
Census Year Population Fraction
population increases exponentially at low population
𝑇𝑇 𝑥𝑥
values, describing a scenario where words rapidly
1911 0.1145
propagate, followed by a slow down as the number of
1921 0.1694
potential learners’ decay. The accelerated, exponential
1931 0.1574
growth has been dubbed as the snowball effect [6] and
1941 -
such curves have been fitted to available data [7]. In
1951 0.1981
this case the economic integration just after
1961 0.0466
independence of India signifies the transition time
1971 0.9128
describing a scenario where language rapidly
1981 0.8766
propagates in the state of Rajasthan. The pattern of
1991 0.8956
Hindi language propagation is somewhat similar to the
2001 0.9097 snowball effect. Such studies hold a lot of potential in
TABLE – 2 a country as large as India having a vast treasure of
THE FITTING PARAMETERS languages. We strongly believe that a single model
2
(The adjusted-𝑅𝑅 value of the fit is 0.9934.) may not be enough to describe the language
𝑅𝑅 𝑇𝑇 ∗ propagation in all states of India. Therefore, there exist
a lot of scope for furthering such analysis for different
4.19 1965.75
languages and different parts of India.
pattern of spread of Hindi language in the state of [11] Govt. of India (1961), Census of India-1961, Volume-I India, Part II C(ii), Language Table,
Statement 1.
Rajasthan during the last 100 year. Before the [12] Govt. of India (1961), Census of India-1961, Volume-I India, Part II C(ii), Language Table,
Statement 1.
transition time few other major languages such as [13] Govt. of India (1981), Census of India-1981, Series-I India, Part IV B(i), Table C-7.
[14] Govt. of India (1991), Census of India-1991, Series-I India, Paper 1 of 1997, Language,
Dhundhari, Marwari and Mewari were very well India & States, Table C-7.
[15] Govt. of India (2001), Census of India-2001, Series-I India, Series-I India, Language, In-
competing with Hindi [11-15] but after the transition dia, States & UT, Table C-16.
ACKNOWLEGEMENTS
REFERENCES:
[1] Krauss M. (1992), “The world’s languages in crisis.”, Language 68, 4–10.
[2] Nowak M.A., Komarova N.L. & Niyogi P. (2002), “Computational and evolutionary as-
pects of language.”, Nature 417, 611–617.
[3] Hawkins J.A. & Gell-Mann M. (1992), “The evolution of human languages”, Addi-
son-Wesley, Reading, Massachusetts.
[4] Niyogi P. & Berwick R.C. (1997), “Evolutionary consequences of language learning”, Ling.
Phil. 20, 697–719.
[5] Wang W.S.-Y. (1969), “Competing change as a cause of residue.”, Language 45, 9–25.
[6] Wang W.S.-Y. & Minett J.W. (2005), “The invasion of language: emergence, change and
death.”, Trends Ecol. Evol. 20, 263–269.
[7] Ricard V.S., Bernat C.M. & Jordi F. (2010), “Diversity, competition, extinction: the eco-
physics of language change”, J. R. Soc. Interface 7, 1647–1664.
[8] Wang, W. S.-Y., Ke, J. & Minett, J. W. (2004), “Computational studies of language evolu-