Allele frequency

Allele frequency is the proportion of all copies of a gene that is made up of a particular gene variant (allele). In other words, it is the number of copies of a particular allele divided by the number of copies of all alleles at the genetic place (locus) in a population. It can be expressed for example as a percentage. In population genetics, allele frequencies are used to depict the amount of genetic diversity at the individual, population, and species level. It is also the relative proportion of all alleles of a gene that are of a designed type.

Given the following:

a particular locus on a chromosome and the gene occupying that locus
a population of N individuals carrying n loci in each of their somatic cells (e.g. two loci in the cells of diploid species, which contain two sets of chromosomes)
different alleles of the gene exist
one allele exists in a copies

then the allele frequency is the fraction or percentage of all the occurrences of that locus that is occupied by a given allele and the frequency of one of the alleles is a/(n*N).

For example, if the frequency of an allele is 20% in a given population, then among population members, one in five chromosomes will carry that allele. Four out of five will be occupied by other variant(s) of the gene. Note that for diploid genes the fraction of individuals that carry this allele may be nearly two in five. If the allele distributes randomly, then the binomial theorem will apply: 32% of the population will be heterozygous for the allele (i.e. carry one copy of that allele and one copy of another in each somatic cell) and 4% will be homozygous (carrying two copies of the allele). Together, this means that 36% of diploid individuals would be expected to carry an allele that has a frequency of 20%. However, alleles distribute randomly only under certain assumptions, including the absence of selection. When these conditions apply, a population is said to be in Hardy'Weinberg equilibrium.

The frequencies of all the alleles of a given gene often are graphed together as an allele frequency distribution histogram, or allele frequency spectrum. Population genetics studies the different "forces" that might lead to changes in the distribution and frequencies of alleles'in other words, to evolution. Besides selection, these forces include genetic drift, mutation and migration.

1 Calculation of allele frequencies from genotype frequencies
2 An example population
3 The effect of mutation
4 See also
5 External links

[edit] Calculation of allele frequencies from genotype frequencies

If $f (A A)$ , $f (A a)$ , and $f (a a)$ are the frequencies of the three genotypes at a locus with two alleles, then the frequency p of the A-allele and the frequency q of the a-allele are obtained by counting alleles. Because each homozygote AA consists only of A-alleles, and because half of the alleles of each heterozygote Aa are A-alleles, the total frequency p of A-alleles in the population is calculated as

$p=f(\mathbf{AA})+ \frac{1}{2}f(\mathbf{Aa})= \mbox{frequency of A}$

Similarly, the frequency q of the a allele is given by

$q=f(\mathbf{aa})+ \frac{1}{2}f(\mathbf{Aa})= \mbox{frequency of a}$

It would be expected that p and q sum to 1, since they are the frequencies of the only two alleles present. Indeed they do:

$p+q=f(\mathbf{AA})+f(\mathbf{aa})+f(\mathbf{Aa})=1$

and from this we get:

q = 1 ' p

and

p = 1 ' q

If there are more than two different allelic forms, the frequency for each allele is simply the frequency of its homozygote plus half the sum of the frequencies for all the heterozygotes in which it appears. Allele frequency can always be calculated from genotype frequency, whereas the reverse requires that the Hardy'Weinberg conditions of random mating apply. This is partly due to the three genotype frequencies and the two allele frequencies. It is easier to reduce from three to two.

[edit] An example population

Consider a population of ten individuals and a given locus with two possible alleles, A and a. Suppose that the genotypes of the individuals are as follows:

AA, Aa, AA, aa, Aa, AA, AA, Aa, Aa, and AA

Then the allele frequencies of allele A and allele a are:

$p=prob_A=\frac{2+1+2+0+1+2+2+1+1+2}{20}=0.7$

$q=prob_a=\frac{0+1+0+2+1+0+0+1+1+0}{20}=0.3$

so if an individual is chosen at random there is a 70% chance it will carry the A allele, and a 30% chance it will have the a allele.

[edit] The effect of mutation

Let ú be the mutation rate from allele A to some other allele a (the probability that a copy of gene A will become a during the DNA replication preceding meiosis). If $p t$ is the frequency of the A allele in generation t, then $q t = 1 ' p t$ is the frequency of the a allele in generation t, and if there are no other causes of gene frequency change (no natural selection, for example), then the change in allele frequency in one generation is

$\Delta p=p_t-p_{t-1}=\left(p_{t-1}-\acute{u}p_{t-1}\right)-p_{t-1}=-\acute{u}p_{t-1}$

where $p t' 1$ is the frequency of the preceding generation. This tells us that the frequency of A decreases (and the frequency of a increases) by an amount that is proportional to the mutation rate ú and to the proportion p of all the genes that are still available to mutate. Thus $î� p$ gets smaller as the frequency of p itself decreases, because there are fewer and fewer A alleles to mutate into a alleles. We can make an approximation that, after n generations of mutation,

$p_n=p_0e^{-n\acute{u}}$

[edit] See also

Single-nucleotide polymorphism

[edit] External links

Cheung, KH; Osier MV, Kidd JR, Pakstis AJ, Miller PL, Kidd KK (2000). "ALFRED: an allele frequency database for diverse populations and DNA polymorphisms". Nucleic Acids Research 28 (1): 361'3. doi:10.1093/nar/28.1.361. PMID 10592274.

Middleton, D; Menchaca L, Rood H, Komerofsky R (2002). "New allele frequency database: http://www.allelefrequencies.net". Tissue Antigens 61 (5): 403'7. doi:10.1034/j.1399-0039.2003.00062.x. PMID 12753660.

v ' d ' e Topics in molecular evolution

Natural selection	Balancing selection – Directional selection – Disruptive selection – Negative selection – Stabilizing selection – Selective sweep

Models	Models of DNA evolution – Models of nucleotide substitution – Allele frequency – Ka/Ks ratio – Tajima's D

Molecular processes	Gene conversion – Gene duplication – Silent mutation – Synonymous substitution

Related Articles & Resources

Sources Subject Index - Experts, Sources, Spokespersons

Sources Select Resources Articles

This article is based on one or more articles in Wikipedia, with modifications and additional content by SOURCES editors. This article is covered by a Creative Commons Attribution-Sharealike 3.0 License (CC-BY-SA) and the GNU Free Documentation License (GFDL). The remainder of the content of this website, except where otherwise indicated, is copyright SOURCES and may not be reproduced without written permission. (For information use the Contact form.)

SOURCES.COM is an online portal and directory for journalists, news media, researchers and anyone seeking experts, spokespersons, and reliable information resources. Use SOURCES.COM to find experts, media contacts, news releases, background information, scientists, officials, speakers, newsmakers, spokespeople, talk show guests, story ideas, research studies, databases, universities, associations and NGOs, businesses, government spokespeople. Indexing and search applications by Ulli Diemer and Chris DeFreitas.

For information about being included in SOURCES as a expert or spokesperson see the FAQ . For partnerships, content and applications, and domain name opportunities contact us.

Sources home page