Advertisement

A Family of Languages

English is part of the large Indo-European language family, which includes Celtic, Germanic, Italic, Balto-Slavic, and Indo-Iranian languages. The origin of this family is hotly debated: one hypothesis places the origin north of the Caspian Sea in the Pontic steppes, from where it was disseminated by Kurgan semi-nomadic pastoralists; a second suggests that Anatolia, in modern-day Turkey, is the source, and the language radiated with the spread of agriculture. Bouckaert et al. (p. 957) used phylogenetic methods and modeling to assess the geographical spread of the Indo-European language group. The findings support the suggestion that the origin of the language family was indeed Anatolia 7 to 10 thousand years ago—contemporaneous with the spread of agriculture.

Abstract

There are two competing hypotheses for the origin of the Indo-European language family. The conventional view places the homeland in the Pontic steppes about 6000 years ago. An alternative hypothesis claims that the languages spread from Anatolia with the expansion of farming 8000 to 9500 years ago. We used Bayesian phylogeographic approaches, together with basic vocabulary data from 103 ancient and contemporary Indo-European languages, to explicitly model the expansion of the family and test these hypotheses. We found decisive support for an Anatolian origin over a steppe origin. Both the inferred timing and root location of the Indo-European language trees fit with an agricultural expansion from Anatolia beginning 8000 to 9500 years ago. These results highlight the critical role that phylogeographic inference can play in resolving debates about human prehistory.

Get full access to this article

View all available purchase options and get full access to this article.

Supplementary Material

Summary

Materials and Methods
Figs. S1 to S12
Tables S1 to S5
References (3162)
Movie S1
BEAST input file
NEXUS tree file

Resources

File (1219669indoeuropean_2mcctrees_annotated.zip)
File (1219669indoeuropean_beast.zip)
File (1219669s1.mov)
File (957.mp3)
File (bouckaert.sm.pdf)
File (bouckaert.sm.revised.pdf)

References and Notes

1
Gray R. D., Atkinson Q. D., Language-tree divergence times support the Anatolian theory of Indo-European origin. Nature 426, 435 (2003).
2
Gray R. D., Drummond A. J., Greenhill S. J., Language phylogenies reveal expansion pulses and pauses in Pacific settlement. Science 323, 479 (2009).
3
Kitchen A., Ehret C., Assefa S., Mulligan C. J., Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East. Proc. R. Soc. B 276, 2703 (2009).
4
Diamond J., Bellwood P., Farmers and their languages: The first expansions. Science 300, 597 (2003).
5
J. P. Mallory, D. Q. Adams, The Oxford Introduction to Proto Indo European and the Proto Indo European World (Oxford Univ. Press, New York, 2006).
6
M. Gimbutas, in Indo-European and Indo-Europeans, G. Cardona, H. M. Hoenigswald, A. Senn, Eds. (Univ. of Pennsylvania Press, Philadelphia, 1970), pp. 155–197.
7
Gimbutas M., J. Indo-Europ. Stud. 5, 277 (1977).
8
J. Koivulehto, in Early Contacts between Uralic and Indo-European: Linguistic and Archaeological Considerations, C. Carpelan, A. Parpola, P. Koskikallio, Eds. (Suomalais-Ugrilaisen Seuran Toimituksia, Helsinki, 2001), pp. 235–263.
9
D. W. Anthony, The Horse, the Wheel, and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World (Princeton Univ. Press, Princeton, NJ, 2007).
10
P. Heggarty, in Phylogenetic Methods and the Prehistory of Languages, P. Forster, C. Renfrew, Eds. (McDonald Institute for Archaeological Research, Cambridge, 2006), pp. 183–194.
11
C. Renfrew, Archaeology and Language: The Puzzle of Indo-European Origins (Cape, London, 1987).
12
Balter M., Early date for the birth of Indo-European languages. Science 302, 1490 (2003).
13
Lemey P., Rambaut A., Drummond A. J., Suchard M. A., Bayesian phylogeography finds its roots. PLOS Comput. Biol. 5, e1000520 (2009).
14
Lemey P., Rambaut A., Welch J. J., Suchard M. A., Phylogeography takes a relaxed random walk in continuous space and time. Mol. Biol. Evol. 27, 1877 (2010).
15
See supplementary materials on Science Online.
16
Ringe D., Warnow T., Taylor A., Indo-European and computational cladistics. Trans. Philol. Soc. 100, 59 (2002).
17
Dyen I., Kruskal J. B., Black P., An Indoeuropean classification: A lexicostatistical experiment. Trans. Am. Philos. Soc. 82, 1 (1992).
18
G. K. Nicholls, R. D. Gray, in Phylogenetic Methods and the Prehistory of Languages, P. Forster, C. Renfrew, Eds. (McDonald Institute for Archaeological Research, Cambridge, 2006), pp. 161–172.
19
Alekseyenko A. V., Lee C. J., Suchard M. A., Wagner and Dollo: A stochastic duet by composing two parsimonious solos. Syst. Biol. 57, 772 (2008).
20
Drummond A. J., Suchard M. A., Xie D., Rambaut A., Mol. Biol. Evol. 29, 1969 (2012).
21
Suchard M. A., Weiss R. E., Sinsheimer J. S., Bayesian selection of continuous-time Markov chain evolutionary models. Mol. Biol. Evol. 18, 1001 (2001).
22
Gkiasta M., Russell T., Shennan S., Steele J., Antiquity 77, 45 (2003).
23
Chikhi L., Update to Chikhi et al.’s “Clinal variation in the nuclear DNA of Europeans” (1998): Genetic data and storytelling—from archaeogenetics to astrologenetics? Hum. Biol. 81, 639 (2009).
24
Haak W., et al., Ancient DNA from European early neolithic farmers reveals their near eastern affinities. PLoS Biol. 8, e1000536 (2010).
25
Lacan M., et al., Ancient DNA reveals male diffusion through the Neolithic Mediterranean route. Proc. Natl. Acad. Sci. U.S.A. 108, 9788 (2011).
26
von Cramon-Taubadel N., Pinhasi R., Craniometric data support a mosaic model of demic and cultural Neolithic diffusion to outlying regions of Europe. Proc. R. Soc. B 278, 2874 (2011).
27
I. Glover, C. Higham, in The Origins and Spread of Agriculture and Pastoralism in Eurasia, D. Harris, Ed. (Blackwell, Cambridge, 1996), pp. 413–441.
28
Holden C. J., Bantu language trees reflect the spread of farming across sub-Saharan Africa: A maximum-parsimony analysis. Proc. R. Soc. B 269, 793 (2002).
29
Currie T. E., Greenhill S. J., Gray R. D., Hasegawa T., Mace R., Rise and fall of political complexity in island South-East Asia and the Pacific. Nature 467, 801 (2010).
30
Kass R. E., Raftery A. E., Bayes factors. J. Am. Stat. Assoc. 90, 773 (1995).
31
P. M. Lewis, Ethnologue: Languages of the World (SIL International, Dallas, ed. 16, 2009).
32
Q. D. Atkinson, R. D. Gray, in Phylogenetic Methods and the Prehistory of Languages (McDonald Institute for Archaeological Research, Cambridge, 2006), pp. 91–109.
33
S. J. Greenhill, R. D. Gray, in Austronesian Historical Linguistics and Culture History: A Festschrift for Robert Blust (Pacific Linguistics, Canberra, 2009).
34
Felsenstein J., The number of evolutionary trees. Syst. Zool. 27, 27 (1978).
35
Huelsenbeck J. P., Ronquist F., Nielsen R., Bollback J. P., Bayesian inference of phylogeny and its impact on evolutionary biology. Science 294, 2310 (2001).
36
Atkinson Q. D., Nicholls G. K., Welch D., Gray R. D., From words to dates: Water into wine, mathemagic or phylogenetic inference? Trans. Philol. Soc. 103, 193 (2005).
37
Greenhill S. J., Currie T. E., Gray R. D., Does horizontal transmission invalidate cultural phylogenies? Proc. Biol. Sci. 276, 2299 (2009).
38
Felsenstein J., Evolutionary trees from DNA sequences: A maximum likelihood approach. J. Mol. Evol. 17, 368 (1981).
39
Tuffley C., Steel M. A., Modeling the covarion hypothesis of nucleotide substitution. Math. Biosci. 147, 63 (1998).
40
Penny D., McComish B. J., Charleston M. A., Hendy M. D., Mathematical elegance with biochemical realism: The covarion model of molecular evolution. J. Mol. Evol. 53, 711 (2001).
41
Nicholls G., Gray R., Dated ancestral trees from binary trait data and their application to the diversification of languages. J. R. Stat. Soc., B 70, 545 (2008).
42
Hasegawa M., Kishino H., Yano T., Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 22, 160 (1985).
43
Gray R. R., et al., Testing spatiotemporal hypothesis of bacterial evolution using methicillin-resistant Staphylococcus aureus ST239 genome-wide data within a Bayesian framework. Mol. Biol. Evol. 28, 1593 (2011).
44
Rambaut A., Estimating the rate of molecular evolution: Incorporating non-contemporaneous sequences into maximum likelihood phylogenies. Bioinformatics 16, 395 (2000).
45
Shapiro B., et al., A Bayesian phylogenetic method to estimate unknown sequence ages. Mol. Biol. Evol. 28, 879 (2011).
46
Drummond A. J., Ho S. Y. W., Phillips M. J., Rambaut A., Relaxed phylogenetics and dating with confidence. PLoS Biol. 4, e88 (2006).
47
Drummond A. J., Rambaut A., Shapiro B., Pybus O. G., Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 22, 1185 (2005).
48
Suchard M. A., Kitchen C. M. R., Sinsheimer J. S., Weiss R. E., Hierarchical phylogenetic models for analyzing multipartite sequence data. Syst. Biol. 52, 649 (2003).
49
Redelings B. D., Suchard M. A., Joint Bayesian estimation of alignment and phylogeny. Syst. Biol. 54, 401 (2005).
50
Nelson-Sathi S., et al., Networks uncover hidden lexical borrowing in Indo-European language evolution. Proc. R. Soc. Ser. B 278, 1794 (2011).
51
Tadmor U., Haspelmath M., Taylor B., Borrowability and the notion of basic vocabulary. Diachronica 27, 226 (2010).
52
Atkinson Q. D., Meade A., Venditti C., Greenhill S. J., Pagel M., Languages evolve in punctuational bursts. Science 319, 588 (2008).
53
Pagel M., Atkinson Q. D., Meade A., Frequency of word-use predicts rates of lexical evolution throughout Indo-European history. Nature 449, 717 (2007).
54
Greenhill S. J., Atkinson Q. D., Meade A., Gray R. D., The shape and tempo of language evolution. Proc. R. Soc. Ser. B 277, 2443 (2010).
55
B. W. Fortson, Indo-European Language and Culture: An Introduction (Wiley-Blackwell, Malden, MA, 2010).
56
Bartumeus F., da Luz M. G. E., Viswanathan G. M., Catalan J., Animal search strategies: A quantitative random-walk analysis. Ecology 86, 3078 (2005).
57
P. Turchin, Quantitative Analysis of Movement: Measuring and Modeling Population Redistribution in Plants and Animals (Sinauer, Sunderland, MA, 1998).
58
Wu H.-i., Li B.-L., Springer T. A., Neill W. H., Modelling animal movement as a persistent random walk in two dimensions: Expected magnitude of net displacement. Ecol. Modell. 132, 115 (2000).
59
Gelfand A., Hills S., Racine-Poon A., Smith A., Illustration of Bayesian inference in normal data models using Gibbs sampling. J. Am. Stat. Assoc. 85, 972 (1990).
60
Pagel M., Meade A., Barker D., Bayesian estimation of ancestral character states on phylogenies. Syst. Biol. 53, 673 (2004).
61
Snyder W. V., Algorithm 531: Contour plotting [J6]. ACM Trans. Math. Softw. 4, 290 (1978).
62
B. W. Silverman, Density Estimation (Chapman and Hall, London, 1986).

(0)eLetters

eLetters is a forum for ongoing peer review. eLetters are not edited, proofread, or indexed, but they are screened. eLetters should provide substantive and scholarly commentary on the article. Embedded figures cannot be submitted, and we discourage the use of figures within eLetters in general. If a figure is essential, please include a link to the figure within the text of the eLetter. Please read our Terms of Service before submitting an eLetter.

Log In to Submit a Response

No eLetters have been published for this article yet.

Information & Authors

Information

Published In

Science
Volume 337 | Issue 6097
24 August 2012

Submission history

Received: 26 January 2012
Accepted: 1 June 2012
Published in print: 24 August 2012

Permissions

Request permissions for this article.

Acknowledgments

We thank the New Zealand Phylogenetics Meeting and the National Evolutionary Synthesis Center (NESCent) NSF grant EF-0423641, for fostering collaboration on this project. Supported by the Marsden Fund (R.B., R.D.G., S.J.G., and A.J.D.), Rutherford Discovery Fellowships (Q.D.A., A.J.D.), administered by the Royal Society of New Zealand, and by NIH grants R01 GM086887 and R01 HG006139 (M.A.S.). The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement 278433-PREDEMICS and European Research Council (ERC) grant agreement 260864.

Authors

Affiliations

Remco Bouckaert
Department of Computer Science, University of Auckland, Auckland 1142, New Zealand.
Philippe Lemey
Department of Microbiology and Immunology, Rega Institute, KU Leuven, 3000 Leuven, Belgium.
Michael Dunn
Max Planck Institute for Psycholinguistics, Post Office Box 310, 6500 AH Nijmegen, Netherlands.
Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, Kapittelweg 29, 6525 EN Nijmegen, Netherlands.
Simon J. Greenhill
Department of Psychology, University of Auckland, Auckland 1142, New Zealand.
School of Culture, History & Language and College of Asia & the Pacific, Australian National University, 0200 Canberra, ACT, Australia.
Alexander V. Alekseyenko
Center for Health Informatics and Bioinformatics, New York University School of Medicine, New York, NY 10016, USA.
Alexei J. Drummond
Department of Computer Science, University of Auckland, Auckland 1142, New Zealand.
Allan Wilson Centre for Molecular Ecology and Evolution, University of Auckland, Auckland 1142, New Zealand.
Russell D. Gray
Department of Psychology, University of Auckland, Auckland 1142, New Zealand.
Department of Philosophy, Research School of the Social Sciences, Australian National University, 0200 Canberra, ACT, Australia.
Marc A. Suchard
Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA.
Department of Biostatistics, School of Public Health, University of California, Los Angeles, CA 90095, USA.
Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA.
Quentin D. Atkinson* [email protected]
Department of Psychology, University of Auckland, Auckland 1142, New Zealand.
Institute of Cognitive and Evolutionary Anthropology, University of Oxford, Oxford OX2 6PN, UK.

Notes

*
To whom correspondence should be addressed. E-mail: [email protected]

Metrics & Citations

Metrics

Article Usage

Altmetrics

Citations

Cite as

Export citation

Select the format you want to export the citation of this publication.

Cited by

  1. Luigi Luca Cavalli Sforza and the history of human languages: A linguist’s point of view, Human Population Genetics and Genomics, (1-15), (2023).https://doi.org/10.47248/hpgg2303010002
    Crossref
  2. Language trees with sampled ancestors support a hybrid model for the origin of Indo-European languages, Science, 381, 6656, (2023)./doi/10.1126/science.abg0818
    Abstract
  3. Spatial evolution of human cultures inferred through Bayesian phylogenetic analysis, Journal of The Royal Society Interface, 20, 198, (2023).https://doi.org/10.1098/rsif.2022.0543
    Crossref
  4. Life of p: A consonant older than speech, BioEssays, 45, 4, (2023).https://doi.org/10.1002/bies.202200246
    Crossref
  5. The role of Hazarewal populations in the peopling of South Asia: A dental morphology investigation, American Journal of Biological Anthropology, 180, 4, (673-702), (2023).https://doi.org/10.1002/ajpa.24682
    Crossref
  6. Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-Modular Coupling, International Journal of Molecular Sciences, 23, 3, (1520), (2022).https://doi.org/10.3390/ijms23031520
    Crossref
  7. Historical Linguistics of Sign Languages: Progress and Problems, Frontiers in Psychology, 13, (2022).https://doi.org/10.3389/fpsyg.2022.818753
    Crossref
  8. Structural Variability Shows Power-Law Based Organization of Vowel Systems, Frontiers in Psychology, 13, (2022).https://doi.org/10.3389/fpsyg.2022.801908
    Crossref
  9. Effects of Linguistic Distance on Cognitive Skills, Health, and Social Outcomes in Canadian Immigrants, Frontiers in Political Science, 4, (2022).https://doi.org/10.3389/fpos.2022.874195
    Crossref
  10. A quantitative global test of the complexity trade-off hypothesis: the case of nominal and verbal grammatical marking, Linguistics Vanguard, 0, 0, (2022).https://doi.org/10.1515/lingvan-2021-0011
    Crossref
  11. See more
Loading...

View Options

Check Access

Log in to view the full text

AAAS ID LOGIN

AAAS login provides access to Science for AAAS Members, and access to other journals in the Science family to users who have purchased individual subscriptions.

Log in via OpenAthens.
Log in via Shibboleth.

More options

Register for free to read this article

As a service to the community, this article is available for free. Login or register for free to read this article.

Purchase this issue in print

Buy a single issue of Science for just $15 USD.

View options

PDF format

Download this article as a PDF file

Download PDF

Full Text

FULL TEXT

Media

Figures

Multimedia

Tables

Share

Share

Share article link

Share on social media