Correcting for ascertainment bias in the inference of population structure
Artikel i vetenskaplig tidskrift, 2009

Background: The ascertainment process of molecular markers amounts to disregard loci carrying alleles with low frequencies. This can result in strong biases in inferences under population genetics models if not properly taken into account by the inference algorithm. Attempting to model this censoring process in view of making inference of population structure (i.e. identifying clusters of individuals) brings up challenging numerical difficulties. Method: These difficulties are related to the presence of intractable normalizing constants in Metropolis-Hastings acceptance ratios. This can be solved via an Markov chain Monte Carlo (MCMC) algorithm known as single variable exchange algorithm (SVEA). Result: We show how this general solution can be implemented for a class of clustering models of broad interest in population genetics that includes the models underlying the computer programs STRUCTURE, GENELAND and GESTE. We also implement the method proposed for a simple example and show that it allows us to reduce the bias substantially.

markers

loci

correlated allele frequencies

polymorphism

differentiation

model

Författare

Gilles Guillot

Chalmers, Matematiska vetenskaper

Göteborgs universitet

M. Foll

Swiss Institute of Bioinformatics

Universitat Bern

Bioinformatics

1367-4803 (ISSN) 1367-4811 (eISSN)

Vol. 25 552-554

Ämneskategorier

Beräkningsmatematik

DOI

10.1093/bioinformatics/btn665