Estimating Effective Population Size or Mutation Rate with Microsatellites

Hongyan Xu, Yun Xin Fu

Research output: Contribution to journalArticle

45 Citations (Scopus)

Abstract

Microsatellites are short tandem repeats that are widely dispersed among eukaryotic genomes. Many of them are highly polymorphic; they have been used widely in genetic studies. Statistical properties of all measures of genetic variation at microsatellites critically depend upon the composite parameter θ = 4Nμ, where N is the effective population size and μ is mutation rate per locus per generation. Since mutation leads to expansion or contraction of a repeat number in a stepwise fashion, the stepwise mutation model has been widely used to study the dynamics of these loci. We developed an estimator of θ, θF, on the basis of sample homozygosity under the single-step stepwise mutation model. The estimator is unbiased and is much more efficient than the variance-based estimator under the single-step stepwise mutation model. It also has smaller bias and mean square error (MSE) than the variance-based estimator when the mutation follows the multistep generalized stepwise mutation model. Compared with the maximum-likelihood estimator θL by NIELSEN (1997), θF has less bias and smaller MSE in general. θL has a slight advantage when θ is small, but in such a situation the bias in θL may be more of a concern.

Original languageEnglish (US)
Pages (from-to)555-563
Number of pages9
JournalGenetics
Volume166
Issue number1
DOIs
StatePublished - Jan 1 2004
Externally publishedYes

Fingerprint

Mutation Rate
Population Density
Microsatellite Repeats
Mutation
Genome

ASJC Scopus subject areas

  • Genetics

Cite this

Estimating Effective Population Size or Mutation Rate with Microsatellites. / Xu, Hongyan; Fu, Yun Xin.

In: Genetics, Vol. 166, No. 1, 01.01.2004, p. 555-563.

Research output: Contribution to journalArticle

Xu, Hongyan ; Fu, Yun Xin. / Estimating Effective Population Size or Mutation Rate with Microsatellites. In: Genetics. 2004 ; Vol. 166, No. 1. pp. 555-563.
@article{a73bf5ad41fb4ce183aa0f87c9ae3144,
title = "Estimating Effective Population Size or Mutation Rate with Microsatellites",
abstract = "Microsatellites are short tandem repeats that are widely dispersed among eukaryotic genomes. Many of them are highly polymorphic; they have been used widely in genetic studies. Statistical properties of all measures of genetic variation at microsatellites critically depend upon the composite parameter θ = 4Nμ, where N is the effective population size and μ is mutation rate per locus per generation. Since mutation leads to expansion or contraction of a repeat number in a stepwise fashion, the stepwise mutation model has been widely used to study the dynamics of these loci. We developed an estimator of θ, θF, on the basis of sample homozygosity under the single-step stepwise mutation model. The estimator is unbiased and is much more efficient than the variance-based estimator under the single-step stepwise mutation model. It also has smaller bias and mean square error (MSE) than the variance-based estimator when the mutation follows the multistep generalized stepwise mutation model. Compared with the maximum-likelihood estimator θL by NIELSEN (1997), θF has less bias and smaller MSE in general. θL has a slight advantage when θ is small, but in such a situation the bias in θL may be more of a concern.",
author = "Hongyan Xu and Fu, {Yun Xin}",
year = "2004",
month = "1",
day = "1",
doi = "10.1534/genetics.166.1.555",
language = "English (US)",
volume = "166",
pages = "555--563",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "1",

}

TY - JOUR

T1 - Estimating Effective Population Size or Mutation Rate with Microsatellites

AU - Xu, Hongyan

AU - Fu, Yun Xin

PY - 2004/1/1

Y1 - 2004/1/1

N2 - Microsatellites are short tandem repeats that are widely dispersed among eukaryotic genomes. Many of them are highly polymorphic; they have been used widely in genetic studies. Statistical properties of all measures of genetic variation at microsatellites critically depend upon the composite parameter θ = 4Nμ, where N is the effective population size and μ is mutation rate per locus per generation. Since mutation leads to expansion or contraction of a repeat number in a stepwise fashion, the stepwise mutation model has been widely used to study the dynamics of these loci. We developed an estimator of θ, θF, on the basis of sample homozygosity under the single-step stepwise mutation model. The estimator is unbiased and is much more efficient than the variance-based estimator under the single-step stepwise mutation model. It also has smaller bias and mean square error (MSE) than the variance-based estimator when the mutation follows the multistep generalized stepwise mutation model. Compared with the maximum-likelihood estimator θL by NIELSEN (1997), θF has less bias and smaller MSE in general. θL has a slight advantage when θ is small, but in such a situation the bias in θL may be more of a concern.

AB - Microsatellites are short tandem repeats that are widely dispersed among eukaryotic genomes. Many of them are highly polymorphic; they have been used widely in genetic studies. Statistical properties of all measures of genetic variation at microsatellites critically depend upon the composite parameter θ = 4Nμ, where N is the effective population size and μ is mutation rate per locus per generation. Since mutation leads to expansion or contraction of a repeat number in a stepwise fashion, the stepwise mutation model has been widely used to study the dynamics of these loci. We developed an estimator of θ, θF, on the basis of sample homozygosity under the single-step stepwise mutation model. The estimator is unbiased and is much more efficient than the variance-based estimator under the single-step stepwise mutation model. It also has smaller bias and mean square error (MSE) than the variance-based estimator when the mutation follows the multistep generalized stepwise mutation model. Compared with the maximum-likelihood estimator θL by NIELSEN (1997), θF has less bias and smaller MSE in general. θL has a slight advantage when θ is small, but in such a situation the bias in θL may be more of a concern.

UR - http://www.scopus.com/inward/record.url?scp=1642337411&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=1642337411&partnerID=8YFLogxK

U2 - 10.1534/genetics.166.1.555

DO - 10.1534/genetics.166.1.555

M3 - Article

VL - 166

SP - 555

EP - 563

JO - Genetics

JF - Genetics

SN - 0016-6731

IS - 1

ER -