Skip to main content
Intended for healthcare professionals
Restricted access
Research article
First published online June 23, 2021

The speakers of minority languages are more multilingual

Abstract

Aims and objectives/purpose/research questions:

The paper tests the hypothesis that the larger the population of language speakers, the smaller the number of second languages mastered by these speakers.

Design/methodology/approach:

We match the size of the population of 29 Dagestanian languages and the number of second languages spoken by the speakers of these languages from 54 villages, and run a Poisson mixed effects regression model that predicts the average number of second languages spoken by speakers from first-language communities of different size.

Data and analysis:

Data for this study comes from two sources. The information on the population of Dagestanian languages is based on the digitalized census of 1926. The information on the number of second languages in which the residents of Dagestan are proficient is taken from the database on multilingualism in Dagestan (4032 people).

Findings/conclusions:

The study supports the hypothesis that the size of language population is negatively correlated with the multilingualism of the language community.

Originality:

The paper is the first to test the correlation between the size of language population and the level of multilingualism of its speakers using statistical methods and a large body of empirical data.

Significance and implications:

Population size is a factor that could have influenced patterns of language evolution. The population is interrelated with other factors, one of which is long-standing multilingualism. The methodological lesson of this research is that there is a difference in the level of multilingualism within a range of populations where the largest was about 120,000 people.

Limitations:

The data is limited to one multilingual region. The revealed correlation probably does not hold for areas where language communities do not interact with their neighbors and even speakers of minority languages can be monolingual, or for the territories where many people migrated and the area where a language is spoken was discontinuous.

Get full access to this article

View all access and purchase options for this article.

References

Arnold J. B. (2019). ggthemes: Extra themes, scales and geoms for ‘ggplot2’. R package version 4.2.0. https://CRAN.R-project.org/package=ggthemes
Baldini R. (2015). Revisiting the effect of population size on cumulative cultural evolution. Journal of Cognition and Culture, 15, 320–336. https://doi.org/10.1163/15685373-12342153
Bates D., Maechler M., Bolker B., Walker S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01
Bentz C., Winter B. (2013). Languages with more second language learners tend to lose the nominal case. Quantifying Language Dynamics, 3, 1–27. https://doi.org/10.1163/22105832-13030105
Bowern C. (2010). Correlates of language change in hunter-gatherer and other ‘small’ languages. Language and Linguistics Compass, 4, 665–679. https://doi.org/10.1111/j.1749-818X.2010.00220.x
Bromham L., Hua X., Fitzpatrick T. G., Greenhill S. J. (2015). Rate of language evolution is affected by population size. Proceedings of the National Academy of Sciences, 112, 2097–2102. https://doi.org/10.1073/pnas.1419704112
Chambers J. K. (2002). Patterns of variation including change. In Chambers J. K., Trudgill P., Schilling N. (Eds.), The handbook of language variation and change (pp. 349–372). Blackwell Publishing.
Chirikba V. A. (2008). The problem of the Caucasian Sprachbund. In Muysken P. (Ed.), From linguistic areas to areal linguistics (pp. 25–94). John Benjamins Publishing.
Clarke E., Sherrill-Mix S. (2017). ggbeeswarm: Categorical scatter (violin point) plots. R package version 0.6.0. https://CRAN.R-project.org/package=ggbeeswarm
Collard M., Ruttle A., Buchanan B., O’Brien M. J. (2013). Population size and cultural evolution in nonindustrial food-producing societies. PLoS One, 8(9), e72628. https://doi.org/10.1371/journal.pone.0072628
Comrie B. (2008). Linguistic diversity in the Caucasus. Annual Review of Anthropology, 37, 131–143.
Daniel M., Koshevoy A., Schurov I., Dobrushina N. (2021). Can recall data be trusted? Field Methods, 34(2), 2022.
Dobrushina N. (2013). How to study multilingualism of the past: Investigating traditional contact situations in Daghestan. Journal of Sociolinguistics, 17(3), 269–411.
Dobrushina N., Kozhukhar A., Moroz G. (2019). Gendered multilingualism in highland Daghestan: Story of a loss. Journal of Multilingual and Multicultural Development, 40(2), 115–132.
Dobrushina N., Kultepina O. (2021). The rise of a lingua franca: The case of Russian in Dagestan. International Journal of Bilingualism, 25(1), 338–358.
Dobrushina N., Staferova D., Belokon A. (Eds.). (2017). Atlas of multilingualism in dagestan online. Linguistic Convergence Laboratory, HSE. https://multidagestan.com
Donohue M., Nichols J. (2011). Does phoneme inventory size correlate with population size? Linguistic Typology, 15(2), 161–170.
Evans N. (2017). Did language evolve in multilingual settings? Biology & Philosophy, 32(6), 905–933. https://doi.org/10.1007/s10539%E2%80%93018%E2%80%939609%E2%80%933
Greenhill S. J. (2014). Demographic correlates of language diversity. In Bowern C., Evans B. (Eds.), Handbook of historical linguistics (pp. 555–578). Routledge.
Greenhill S. J., Hua X., Welsh C. F., Schneemann H., Bromham L. (2018). Population size and the rate of language evolution: a test across Indo-European, Austronesian, and Bantu languages. Frontiers in Psychology, 9, 576.
Gries S. T. (2013). Statistics for linguistics with R: A practical introduction (Vol. 208). Walter de Gruyter.
Hay J., Bauer L. (2007). Phoneme inventory size and population size. Language, 83, 388–400. https://doi.org/10.1353/lan.2007.0071
Karafet T. M., Bulayeva K. B., Nichols J., Bulayev O. A., Gurgenova F., Omarova J., Yepiskoposyan L., Savina O. V., Rodrigue B. H., Hammer M. F. (2016). Coevolution of genes and languages and high levels of population structure among the highland populations of Daghestan. Journal of Human Genetics, 61(3), 181–191.
Khachaturyan M., Konoshenko M. (2021). Assessing (a)symmetry in multilingualism: The case of Mano and Kpelle in Guinea. International Journal of Bilingualism, 25(4), 979–998.
Khanina O. (2021). Language practices and ideologies at multilingual reaches of Lower Yenisei (Siberia). International Journal of Bilingualism, 25(4), 1059–1080.
Koplenig A. (2019). Language structure is influenced by the number of speakers but seemingly not by the proportion of non-native speakers. Royal Society Open Science, 6, 181274. https://doi.org/10.1098/rsos.181274
Koryakov Y. (2002). Atlas of the Caucasian languages. Institute of Linguistics RAS.
Koryakov Y., Sumbatova N. (2017). BRÈ, tom 8, Moskva: Naučnoe izd-vo ‘Bolʹšaja rossijskaja ènciklopedija.
Koryakov Y., Staferova D., Belokon A., Dobrushina N. (2019). Population of Dagestan from data of different census years. Linguistic Convergence Laboratory, HSE University. https://multidagestan.com/census
Kuznetsova A., Brockhoff P. B., Christensen R. H. B. (2017). lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1–26. https://doi.org/10.18637/jss.v082.i13
Li Y. (2021). The interaction of age, L2, types of code alternation and multilingualism in the Zauzou community. International Journal of Bilingualism, 25(4), 1040–1058.
Lüdecke D., Makowski D., Waggoner P. (2019). Performance: Assessment of regression models performance. R package version 0.3.0. https://CRAN.R-project.org/package=performance
Lupyan G., Dale R. (2010). Language structure is partly determined by social structure. PLoS One, 5(1), e8559.
Moran S., McCloy D., Wright R. (2012). Revisiting population size vs. phoneme inventory size. Language, 88, 877–893. https://doi.org/10.1353/lan.2012.0087
Nettle D. (2012). Social scale and structural complexity in human languages. Philosophical transactions of the Royal Society of London. Series B, 367, 1829–1836.
Pupynina M., Aralova N. (2021). Lower Kolyma multilingualism: Historical setting and sociolinguistic trends. International Journal of Bilingualism, 25(4), 1081–1101.
R Core Team. (2019). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Trudgill P. (2011). Sociolinguistic typology: Social determinants of linguistic complexity. Oxford University Press.
Van den Berg H. (2005). The East Caucasian language family. Lingua, 115(1), 147–190.
Walworth M., Dewar A., Ennever T., Takau L., Rodriguez I. (2021). Multilingualism in Vanuatu: Four case studies. International Journal of Bilingualism, 25(4), 1120–1141.
Wichmann S., Rama T., Holman E. W. (2011). Phonological diversity, word length, and population sizes across languages: The ASJP evidence. Linguistic Typology, 15, 177–197. https://doi.org/10.1515/lity.20
Wickham H. (2016). ggplot2: Elegant graphics for data analysis. Springer-Verlag.
Wixman R. (1980). Language aspects of ethnic patterns and processes in the north Caucasus. University of Chicago.

Biographies

Nina Dobrushina is the head of the Linguistic Convergence Laboratory and a professor at the School of Linguistics at the Higher School of Economics (HSE University) in Moscow. She works on languages of Dagestan, multilingualism, typology of mood and modality, and does extensive fieldwork on the languages of Dagestan.
George Moroz is a junior research fellow in the Linguistic Convergence Laboratory and senior lecturer at the School of Linguistics at the Higher School of Economics (HSE University) in Moscow. He works on phonetics and quantitative approaches to linguistic data. He does extensive fieldwork on East and West Caucasian languages, developed several linguistic databases and R packages, including lingtypology and phonfieldwork.