On the inference of spatial structure from population genetics data

Gilles Guillot

doi:10.1093/bioinformatics/btp267

On the inference of spatial structure from population genetics data
Artikel i vetenskaplig tidskrift, 2009

Motivation: In a series of recent papers, Tess, a computer program based on the concept of hidden Markov random field, has been proposed to infer the number and locations of panmictic population units from the genotypes and spatial locations of these individuals. The method seems to be of broad appeal as it is conceptually much simpler than other competing methods and it has been reported by its authors to be fast and accurate. However, this methodology is not grounded in a formal statistical inference method and seems to rely to a large extent on arbitrary choices regarding the parameters used. The present article is an investigation of the accuracy of this method and an attempt to assess whether recent results reported on the basis of this method are genuine features of the genetic process or artefacts of the method. Method: I analyse simulated data consisting of populations at Hardy-Weinberg and linkage equilibrium and also data simulated under a scenario of isolation-by-distance at mutation-migration-drift equilibrium. Arabidopsis thaliana data previously analysed with this method are also reconsidered. Results: Using the Tess program under the no-admixture model to analyse data consisting of several genuine HWLE populations with individuals of pure ancestries leads to highly inaccurate results; Using the Tess program under the admixture model to analyse data consisting of a continuous isolation-by-distance population leads to the inference of spurious HWLE populations whose number and features depend on the parameters used. Results previously reported about the A. thaliana using Tess seem to a large extent to be artefacts of the statistical methodology used. The findings go beyond population clustering models and can be an help to design more efficient algorithms based on graphs.

software

reversible jump

unknown number

bayesian-analysis

distance

components

normalizing constants

model

differentiation

computer-program

Författare

Gilles Guillot

Chalmers, Matematiska vetenskaper

Göteborgs universitet

Forskning Andra publikationer

Bioinformatics

1367-4803 (ISSN) 13674811 (eISSN)

Vol. 25 14 1796-1801

Ämneskategorier (SSIF 2011)

Beräkningsmatematik

DOI

10.1093/bioinformatics/btp267

Publikationsdata kopplat till DOI

Mer information

Skapat

2017-10-07

On the inference of spatial structure from population genetics data Artikel i vetenskaplig tidskrift, 2009