Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists

Shinichi Nakagawa,

Corresponding Author

Shinichi Nakagawa

Department of Zoology, University of Otago, 340 Great King Street, Dunedin, 9054, New Zealand (E-mail: [email protected] )

Tel: +44-114-222-0113; Fax: +44-114-222-0002; E-mail: [email protected]Search for more papers by this author

Holger Schielzeth,

Holger Schielzeth

Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, Eberhard-Gwinner-Str. 5, D-82319 Seewiesen, Germany

Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36, Uppsala, Sweden (E-mail: [email protected] )

Search for more papers by this author

Shinichi Nakagawa,

Corresponding Author

Shinichi Nakagawa

Department of Zoology, University of Otago, 340 Great King Street, Dunedin, 9054, New Zealand (E-mail: [email protected] )

Tel: +44-114-222-0113; Fax: +44-114-222-0002; E-mail: [email protected]Search for more papers by this author

Holger Schielzeth,

Holger Schielzeth

Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, Eberhard-Gwinner-Str. 5, D-82319 Seewiesen, Germany

Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36, Uppsala, Sweden (E-mail: [email protected] )

Search for more papers by this author

First published: 21 June 2010

https://doi.org/10.1111/j.1469-185X.2010.00141.x

Citations: 1,217

Read the full text

About

PDF

Tools

Share a link

Email
Facebook
Twitter
LinkedIn
Reddit
Wechat

Abstract

Repeatability (more precisely the common measure of repeatability, the intra-class correlation coefficient, ICC) is an important index for quantifying the accuracy of measurements and the constancy of phenotypes. It is the proportion of phenotypic variation that can be attributed to between-subject (or between-group) variation. As a consequence, the non-repeatable fraction of phenotypic variation is the sum of measurement error and phenotypic flexibility. There are several ways to estimate repeatability for Gaussian data, but there are no formal agreements on how repeatability should be calculated for non-Gaussian data (e.g. binary, proportion and count data). In addition to point estimates, appropriate uncertainty estimates (standard errors and confidence intervals) and statistical significance for repeatability estimates are required regardless of the types of data. We review the methods for calculating repeatability and the associated statistics for Gaussian and non-Gaussian data. For Gaussian data, we present three common approaches for estimating repeatability: correlation-based, analysis of variance (ANOVA)-based and linear mixed-effects model (LMM)-based methods, while for non-Gaussian data, we focus on generalised linear mixed-effects models (GLMM) that allow the estimation of repeatability on the original and on the underlying latent scale. We also address a number of methods for calculating standard errors, confidence intervals and statistical significance; the most accurate and recommended methods are parametric bootstrapping, randomisation tests and Bayesian approaches. We advocate the use of LMM- and GLMM-based approaches mainly because of the ease with which confounding variables can be controlled for. Furthermore, we compare two types of repeatability (ordinary repeatability and extrapolated repeatability) in relation to narrow-sense heritability. This review serves as a collection of guidelines and recommendations for biologists to calculate repeatability and heritability from both Gaussian and non-Gaussian data.

XI. REFERENCES

Bakker, T. C. M. (1999). The study of intersexual selection using quantitative genetics. Behaviour 136, 1237–1266.
10.1163/156853999501748
Web of Science®Google Scholar
Becker, W. A. (1992). A manual of quantitative genetics , 5th edition. Academic Enterprises, Pullman, WA.

Google Scholar
Bell, A. M., Hankison, S. J. & Laskowski, K. L. (2009). The repeatability of behaviour: a meta-analysis. Animal Behaviour 77, 771–783.
10.1016/j.anbehav.2008.12.022
PubMedWeb of Science®Google Scholar
Berteaux, D., Thomas, D. W., Bergeron, J. M. & Lapierre, H. (1996). Repeatability of daily field metabolic rate in female meadow voles (Microtus pennsylvanicus). Functional Ecology 10, 751–759.
10.2307/2390510
Web of Science®Google Scholar
Biro, P. A., Beckmann, C. & Stamps, J. A. (2010). Small within-day increases in temparature affects boldness and alters personality in coral reef fish. Proceedings of the Royal Society B-Biological Sciences 277, 71–77.

Web of Science®Google Scholar
Boake, C. R. B. (1989). Repeatability: Its role in evolutionary studies of mating behavior. Evolutionary Ecology 3, 173–182.
10.1007/BF02270919
Web of Science®Google Scholar
Bolker, B. M., Brooks, M. E., Clark, C. J., Geange, S. W., Poulsen, J. R., Stevens, M. H. H. & White, J.-S. S. (2009). Generalized linear mixed models: a practical guide for ecology and evolution. Trends in Ecology & Evolution 24, 127–135.
10.1016/j.tree.2008.10.008
PubMedWeb of Science®Google Scholar
Bolund, E., Schielzeth, H. & Forstmeier, W. (2007). Intrasexual competition in zebra finches, the role of beak colour and body size. Animal Behaviour 74, 715–724.
10.1016/j.anbehav.2006.10.032
Web of Science®Google Scholar
Bolund, E., Schielzeth, H. & Forstmeier, W. (2009). Compensatory investment in zebra finches: females lay larger eggs when paired to sexually unattractive males. Proceedings of the Royal Society B-Biological Sciences 276, 707–715.
10.1098/rspb.2008.1251
PubMedWeb of Science®Google Scholar
Brommer, J. E., Rattiste, K. & Wilson, A. J. (2008). Exploring plasticity in the wild: laying date-temperature reaction norms in the common gull Larus canus. Proceedings of the Royal Society B-Biological Sciences 275, 687–693.
10.1098/rspb.2007.0951
PubMedWeb of Science®Google Scholar
Browne, W. J., Subramanian, S. V., Jones, K. & Goldstein, H. (2005). Variance partitioning in multilevel logistic models that exhibit overdispersion. Journal of the Royal Statistical Society Series A-Statistics in Society 168, 599–613.
10.1111/j.1467-985X.2004.00365.x
Web of Science®Google Scholar
Carrasco, J. L. (2009).A generalized concordance correlation coefficient based on the variance components generalized linear mixed models with application to overdispersed count data. Biometrics, in press, DOI: 10.1111/j.1541-0420

Google Scholar
Carrasco, J. L. & Jover, L. (2003). Estimating the generalized concordance correlation coefficient through variance components. Biometrics 59, 849–858.
10.1111/j.0006-341X.2003.00099.x
PubMedWeb of Science®Google Scholar
Carrasco, J. L. & Jover, L. (2005). Concordance correlation coefficient applied to discrete data. Statistics in Medicine 24, 4021–4034.
10.1002/sim.2397
PubMedWeb of Science®Google Scholar
Carrasco, J. L., Jover, L., King, T. S. & Chinchilli, V. M. (2007). Comparison of concordance correlation coefficient estimating approaches with skewed data. Journal of Biopharmaceutical Statistics 17, 673–684.
10.1080/10543400701329463
PubMedWeb of Science®Google Scholar
Carrasco, J. L., King, T. S. & Chinchilli, V. M. (2009). The concordance correlation coefficient for repeated measures estimated by variance components. Journal of Biopharmaceutical Statistics 19, 90–105.
10.1080/10543400802527890
PubMedWeb of Science®Google Scholar
Clark, J. S. (2005). Why environmental scientists are becoming Bayesians. Ecology Letters 8, 2–14.
10.1111/j.1461-0248.2004.00702.x
Web of Science®Google Scholar
DeWitt, T. J. & Scheiner, S. M. (2004). Phenotypic plasticity: functional and conceptual approaches. Oxford University Press, Oxford.

Google Scholar
Dingemanse, N. J., Both, C., Drent, P. J., Van Oers, K. & Van Noordwijk, A. J. (2002). Repeatability and heritability of exploratory behaviour in great tits from the wild. Animal Behaviour 64, 929–938.
10.1006/anbe.2002.2006
Web of Science®Google Scholar
Dingemanse, N. J., Kazem, A. J. N., Reale, D. & Wright, J. (2009). Behavioural reaction norms: animal personality meets individual plasticity. Trends in Ecology & Evolution 25, 82–89.

Web of Science®Google Scholar
Dohm, M. R. (2002). Repeatability estimates do not always set an upper limit to heritability. Functional Ecology 16, 273–280.
10.1046/j.1365-2435.2002.00621.x
Web of Science®Google Scholar
Donner, A. (1986). A review of inference procedures for the intraclass correlation coefficient in the one-way random effects model. International Statistical Review 54, 67–82.
10.2307/1403259
Web of Science®Google Scholar
Falconer, D. S. & Mackay, T. F. C. (1996). Introduction to quantitative genetics, 4th edition. Prentice Hall, Harlow, U.K.
10.1046/j.1365-2656.2000.00401.x
PubMedGoogle Scholar
Faraway, J. J. (2006). Extending the linear model with R. Chapman & Hall/CRC, Boca Raton, FL.

Google Scholar
Fleiss, J. L. & Cuzick, J. (1979). The reliablility of dichotomous judgments: unequal numbers of judges per subject. Applied Psychological Measurement 3, 537–542.
10.1177/014662167900300410
Google Scholar
Forstmeier, W. & Birkhead, T. R. (2004). Repeatability of mate choice in the zebra finch: consistency within and between females. Animal Behaviour 68, 1017–1028.
10.1016/j.anbehav.2004.02.007
Web of Science®Google Scholar
Garamszegi, L. Z., Calhim, S., Dochtermann, N., Hegyi, G., Hurd, P. L., Jèrgensen, C., Kutsukake, N., Lajeunesse, M. J., Pollard, K. A., Schielzeth, H., Symonds, M. R. E. & Nakagawa, S. (2009). Changing philosophies and tools for statistical inferences in behavioral ecology. Behavioral Ecology 20, 1363–1375.
10.1093/beheco/arp137
Web of Science®Google Scholar
Garamszegi, L. Z., Hegyi, G., Heylen, D., Ninni, P., De Lope, F., Eens, M. & Mèller, A. P. (2006). The design of complex sexual traits in male barn swallows: associations between signal attributes. Journal of Evolutionary Biology 19, 2052–2066.
10.1111/j.1420-9101.2006.01135.x
CASPubMedWeb of Science®Google Scholar
Gelman, A. & Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, Cambridge, U.K.

Google Scholar
Gill, J. (2007). Bayesian methods: a social and behavioral sciences approach. CRC, Boca Raton, FL.

Google Scholar
Goldstein, H., Browne, W. & Rasbash, J. (2002). Partitioning variation in multilevel models. Understanding Statistics 1, 223–231.
10.1207/S15328031US0104_02
Google Scholar
Hadfield, J. D. (2010). MCMC methods for multi-response Generalised Linear Mixed Models: the MCMCglmm R package. Journal of Statistical Software 33, 1–22.

Google Scholar
Hadfield, J. D. & Nakagawa, S. (2010). General quantitative genetic methods for comparative biology: phylogenies, taxonomies and multi-trait models for continuous and categorical characters. Journal of Evolutionary Biology 23, 494–508.
10.1111/j.1420-9101.2009.01915.x
CASPubMedWeb of Science®Google Scholar
Isberg, S. R., Thomson, P. C., Nicholas, F. W., Barker, S. G. & Moran, C. (2005). Quantitative analysis of production traits in saltwater crocodiles (Crocodylus porosus): I. reproduction traits. Journal of Animal Breeding and Genetics 122, 361–369.
10.1111/j.1439-0388.2005.00548.x
CASPubMedWeb of Science®Google Scholar
Lee, Y., Nelder, J. A. & Pawitan, Y. (2006). Generalized linear models with random effects: unified analysis via H-likelihood. Chapman & Hall/CRC, Boca Raton, FL.

Google Scholar
Lessells, C. M. & Boag, P. T. (1987). Unrepeatable repeatabilities: a common mistake. Auk 104, 116–121.
10.2307/4087240
Web of Science®Google Scholar
Littell, R. C., Milliken, G. A., Stroup, W. W., Wolfinger, R. D. & Schabenberger, O. (2006). SAS ^® for Mixed Models. SAS Institue Inc., Cary, NC.

Google Scholar
Lynch, M. & Walsh, B. (1998). Genetics and analysis of quantitative traits. Sinauer, Sunderland, MA.

Google Scholar
Manly, B. R. J. (2006). Randomization, Bootstrap and Monte carlo Methods in Biology, 3rd edition. Chapman & Hall/CRC ,Boca Raton, FL.
10.1111/j.1365-2745.2005.01082.x
Google Scholar
McCarthy, M. A. (2007). Bayesian methods for ecology. Cambridge University Press, Cambridge.

Google Scholar
McCulloch, C. E. & Searle, S. R. (2002). Generalized, linear and mixed models. Wiley, Chichester.

Google Scholar
McGraw, K. O. & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods 1, 30–46.
10.1037/1082-989X.1.1.30
Web of Science®Google Scholar
Merilä, J. & Sheldon, B. (2000). Avian quantitative genetics. Current Ornithology 9, 179–255.

Google Scholar
Nakagawa, S. & Cuthill, I. C. (2007). Effect size, confidence interval and statistical significance: a practical guide for biologists. Biological Reviews 82, 591–605.
10.1111/j.1469-185X.2007.00027.x
CASPubMedWeb of Science®Google Scholar
Nakagawa, S., Gillespie, D. O. S., Hatchwell, B. J. & Burke, T. (2007a). Predictable males and unpredictable females: sex difference in repeatability of parental care in a wild bird population. Journal of Evolutionary Biology 20, 1674–1681.
10.1111/j.1420-9101.2007.01403.x
CASPubMedWeb of Science®Google Scholar
Nakagawa, S., Ockendon, N., Gillespie, D. O. S., Hatchwell, B. J. & Burke, T. (2007b). Does the badge of status influence parental care and investment in house sparrows? An experimental test. Oecologia 153, 749–760.
10.1007/s00442-007-0765-4
PubMedWeb of Science®Google Scholar
Namboodiri, K. K., Green, P. P., Kaplan, E. B., Morrison, J. A., Chase, G. A., Elston, R. C., Owen, A. R. G., Rifkind, B. M., Glueck, C. J. & Tyroler, H. A. (1984). The collaborative lipid research clinics program family study: IV. Familial associations of plasma-lipids and lipoproteins. American Journal of Epidemiology 119, 975–996.
10.1093/oxfordjournals.aje.a113818
CASPubMedWeb of Science®Google Scholar
Nelder, J. A. (1954). The interpretation of negative components of variance. Biometrika 41, 544–548.
10.1093/biomet/41.3-4.544
Web of Science®Google Scholar
Nussey, D. H., Wilson, A. J. & Brommer, J. E. (2007). The evolutionary ecology of individual phenotypic plasticity in wild populations. Journal of Evolutionary Biology 20, 831–844.
10.1111/j.1420-9101.2007.01300.x
CASPubMedWeb of Science®Google Scholar
O’Hara, R. B. (2009). How to make models add up - a primer on GLMMs. Annales Zoologici Fennici 46, 124–137.
10.5735/086.046.0205
Web of Science®Google Scholar
Pigliucci, M. (2001). Phenotypic plasticity: beyond nature and nurture. The Johns Hopkins University Press, Baltimore, Maryland.

Google Scholar
R Development CoreTeam. (2009). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.

Google Scholar
Réale, D., Reader, S. M., Sol, D., McDougall, P. T. & Dingemanse, N. J. (2007). Integrating animal temperament within ecology and evolution. Biological Reviews 82, 291–318.
10.1111/j.1469-185X.2007.00010.x
PubMedWeb of Science®Google Scholar
Richards, S. A. (2008). Dealing with overdispersed count data in applied ecology. Journal of Applied Ecology 45, 218–227.
10.1111/j.1365-2664.2007.01377.x
Web of Science®Google Scholar
Schielzeth, H. (2010). Simple means to improve the interpretability of regression coefficients. Methods in Ecology and Evolution 1, 103–113.
10.1111/j.2041-210X.2010.00012.x
Web of Science®Google Scholar
Schielzeth, H. & Bolund, E. (2010). Patterns of conspecific brood parasitism in zebra finches. Animal Behaviour. 79, 1329–1337.
10.1016/j.anbehav.2010.03.006
Web of Science®Google Scholar
Schielzeth, H., Bolund, E. & Forstmeier, W. (2010). Heritability of and early-environmental effects on variation in mating preferences. Evolution 64, 998–1006.
10.1111/j.1558-5646.2009.00890.x
PubMedWeb of Science®Google Scholar
Schielzeth, H. & Forstmeier, W. (2009). Conclusions beyond support: overconfident estimates in mixed models. Behavioral Ecology 20, 416–420.
10.1093/beheco/arn145
PubMedWeb of Science®Google Scholar
Shrout, P. E. & Fleiss, J. L. (1979). Intraclass correlations: Uses in assessing rater reliability. Psychological Bulletin 86, 420–428.
10.1037/0033-2909.86.2.420
CASPubMedWeb of Science®Google Scholar
Sih, A., Bell, A. & Johnson, J. C. (2004a). Behavioral syndromes: an ecological and evolutionary overview. Trends in Ecology & Evolution 19, 372–378.
10.1016/j.tree.2004.04.009
PubMedWeb of Science®Google Scholar
Sih, A., Bell, A. M., Johnson, J. C. & Ziemba, R. E. (2004b). Behavioral syndromes: an integrative overview. Quarterly Review of Biology 79, 241–277.
10.1086/422893
PubMedWeb of Science®Google Scholar
Snijders, T. A. B. & Bosker, R. (1999). Multilevel analysis: an introduction to basic and advanced multilevel modeling. Sage Publications, London, U.K.

Google Scholar
Sokal, R. R. & Rohlf, F. J. (1995). Biometry: The principles and practice of statistics in biological research, 3rd edition. W.H. Freeman and Company, New York.
10.1073/pnas.94.2.549
CASWeb of Science®Google Scholar
Solomon, P. J. & Taylor, J. M. G. (1999). Orthogonality and transformations in variance components models. Biometrika 86, 289–300.
10.1093/biomet/86.2.289
Web of Science®Google Scholar
Stamps, J. & Groothuis, T. G. G. (2010). The development of animal personality: relevance, concepts and perspetives. Biological Reviews 85, 301–325.
10.1111/j.1469-185X.2009.00103.x
PubMedWeb of Science®Google Scholar
Stamps, J. A. (2007). Growth-mortality tradeoffs and ‘personality traits' in animals. Ecology Letters 10, 355–363.
10.1111/j.1461-0248.2007.01034.x
PubMedWeb of Science®Google Scholar
Van de Pol, M. V. & Wright, J.(2009). A simple method for distinguishing within- versus between-subject effects using mixed models. Animal Behaviour 77, 753–758..
10.1016/j.anbehav.2008.11.006
Web of Science®Google Scholar
Venables, W. N. & Ripley, B. D. (2002). Modern applied statistics with S, 4th edition. Springer, New York.

Google Scholar
Venzon, D. J. & Moolgavkar, S. H. (1988). A Method for Computing Profile-Likelihood-Based Confidence-Intervals. Applied Statistics-Journal of the Royal Statistical Society Series C 37, 87–94.
10.2307/2347496
Web of Science®Google Scholar
Verbeke, G. & Molenberghs, G. (2001). Linear Mixed Models for Longitudinal Data. Springer, New York.

Google Scholar
Visscher, P. M., Hill, W. G. & Wray, N. R. (2008). Heritability in the genomics era - concepts and misconceptions. Nature Reviews Genetics 9, 255–266.
10.1038/nrg2322
CASPubMedWeb of Science®Google Scholar
Whittingham, L. A., Dunn, P. O. & Stapleton, M. K. (2006). Repeatability of extra-pair mating in tree swallows. Molecular Ecology 15, 841–849.
10.1111/j.1365-294X.2006.02808.x
PubMedWeb of Science®Google Scholar
Wilson, A. J. (2008). Why h² does not always equal V_A/V_P? Journal of Evolutionary Biology 21, 647–650.
10.1111/j.1420-9101.2008.01500.x
CASPubMedWeb of Science®Google Scholar
Wilson, A. J., Pemberton, J. M., Pilkington, J. G., Clutton-Brock, T. H., Coltman, D. W. & Kruuk, L. E. B. (2007). Quantitative genetics of growth and cryptic evolution of body size in an island population. Evolutionary Ecology 21, 337–356.
10.1007/s10682-006-9106-z
Web of Science®Google Scholar
Zou, G. Y. & Donner, A. (2004). Confidence interval estimation of the intraclass correlation coefficient for binary outcome data. Biometrics 60, 807–811.
10.1111/j.0006-341X.2004.00232.x
PubMedWeb of Science®Google Scholar

Citing Literature

Volume85, Issue4

November 2010

Pages 935-956

Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists

Abstract

XI. REFERENCES

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Repeatability for Gaussian and non-Gaussian data: a practical guide for biologists

Abstract

XI. REFERENCES

Citing Literature

References

Related

Information