
Lecture notes for ZOO 4400/5400 Population Ecology

Lecture 36 (24-Apr-06)


Go to Excel spreadsheet for calculating gene frequencies and local inbreeding coefficient from genotypic counts

Population Genetics (continued)

Last time I introduced the topic of hierarchical F-statistics. We build those up by calculating three kinds of heterozygosities, H_I, H_S, and H_T. Let's look at the general formulae for these heterozygosities, and how they contribute to the calculation of hierarchical F-statistics, and then I will work through an example.

We begin with H_I, the observed heterozygosity in individuals, calculated as a weighted average across the subpopulations.

$$H_I = \frac{\sum_{s=1}^{n} H_{\mathrm{obs},s}\, N_s}{\sum_{s=1}^{n} N_s} \qquad \text{(Eqn 36.1)}$$

where the subscript s refers to the s^th of n subpopulations, H_obs,s is the observed heterozygosity in subpopulation s, and N_s is its population size. That is, first we multiply each subpopulation's observed heterozygosity by its population size. Then we sum those weighted heterozygosities. Finally, we divide by the sum of all the subpopulation sizes. For a worked calculation of a specific case, see the F_ST example page.
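The three steps just described (weight, sum, divide) can be sketched in a few lines of Python. This is only an illustration: the function name and the three-subpopulation data are hypothetical and are not taken from the F_ST example page.

```python
def weighted_mean_heterozygosity(h_values, sizes):
    """Size-weighted average of per-subpopulation heterozygosities."""
    if len(h_values) != len(sizes):
        raise ValueError("need exactly one heterozygosity per subpopulation")
    # Steps 1 and 2: weight each heterozygosity by its subpopulation size, then sum.
    weighted_sum = sum(h * n for h, n in zip(h_values, sizes))
    # Step 3: divide by the total number of individuals.
    return weighted_sum / sum(sizes)


# Hypothetical data: observed heterozygosity and size of three subpopulations.
h_obs = [0.40, 0.50, 0.20]
sizes = [100, 50, 50]

h_i = weighted_mean_heterozygosity(h_obs, sizes)
print(round(h_i, 4))  # 0.375
```

Note that the large subpopulation pulls H_I toward its own value; an unweighted mean of the same three heterozygosities would be about 0.367.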

Next we calculate H_S as the global weighted average of the expected heterozygosities across all the subpopulations:

$$H_S = \frac{\sum_{s=1}^{n} H_{\mathrm{exp},s}\, N_s}{\sum_{s=1}^{n} N_s} \qquad \text{(Eqn 36.2)}$$

The formula differs from that of Eqn 36.1 only in that we now use H_exp (calculated from each subpopulation's gene frequencies by Eqn 37.1) in place of H_obs.
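As a rough sketch of the H_S calculation, the Python below first computes each subpopulation's expected heterozygosity and then takes the size-weighted average. It assumes the usual Hardy-Weinberg form H_exp = 1 - sum(p_i^2) for the expected heterozygosity; the function names and gene frequencies are hypothetical.

```python
def expected_heterozygosity(freqs):
    """Assumed form of H_exp: one minus the sum of squared gene frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def h_s(freqs_by_subpop, sizes):
    """Size-weighted average of the subpopulations' expected heterozygosities."""
    h_exp = [expected_heterozygosity(f) for f in freqs_by_subpop]
    return sum(h * n for h, n in zip(h_exp, sizes)) / sum(sizes)


# Hypothetical data: two alleles in three subpopulations.
freqs = [[0.5, 0.5], [0.8, 0.2], [0.2, 0.8]]
sizes = [100, 50, 50]

hs_value = h_s(freqs, sizes)
print(round(hs_value, 4))  # 0.41
```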

Finally we use the global mean gene frequencies to calculate H_T, the global expected heterozygosity. In general this will not give us the same answer as the weighted average of the separate subpopulation values for expected heterozygosity (H_S); the two coincide only when every subpopulation has the same gene frequencies. The formula is:

$$H_T = 1 - \sum_{i} \bar{p}_i^{\,2} \qquad \text{(Eqn 36.3)}$$

The only difference between this formula and that of Eqn 37.1 is that here we specify the global mean (p_i-bar) for the gene frequencies over all the subpopulations, rather than the subpopulation-by-subpopulation values.
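The sketch below illustrates why H_T and the averaged subpopulation values can differ. It makes two assumptions: that expected heterozygosity is 1 - sum(p_i^2), and that the global mean gene frequencies are weighted by subpopulation size. The function names and data are hypothetical.

```python
def global_mean_freqs(freqs_by_subpop, sizes):
    """Size-weighted mean frequency of each allele (weighting is an assumption)."""
    total = sum(sizes)
    n_alleles = len(freqs_by_subpop[0])
    return [
        sum(f[i] * n for f, n in zip(freqs_by_subpop, sizes)) / total
        for i in range(n_alleles)
    ]


def h_t(freqs_by_subpop, sizes):
    """Expected heterozygosity computed from the global mean gene frequencies."""
    p_bar = global_mean_freqs(freqs_by_subpop, sizes)
    return 1.0 - sum(p * p for p in p_bar)


# Hypothetical data: two alleles in three subpopulations.
freqs = [[0.5, 0.5], [0.8, 0.2], [0.2, 0.8]]
sizes = [100, 50, 50]

ht_value = h_t(freqs, sizes)

# For comparison: the size-weighted average of the per-subpopulation values.
mean_h_exp = sum(
    (1.0 - sum(p * p for p in f)) * n for f, n in zip(freqs, sizes)
) / sum(sizes)

print(round(ht_value, 4), round(mean_h_exp, 4))  # 0.5 0.41
```

With these frequencies the pooled value (0.5) exceeds the averaged value (0.41), because two of the subpopulations have diverged in opposite directions from the global mean.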

With H_I, H_S, and H_T in hand we are ready to calculate our hierarchical F-statistics. First, F_IS:

$$F_{IS} = \frac{H_S - H_I}{H_S} \qquad \text{(Eqn 36.4)}$$

You will often see this written in the mathematically equivalent form:

$$F_{IS} = 1 - \frac{H_I}{H_S} \qquad \text{(Eqn 36.5)}$$
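The equivalence of the two forms follows in one step, taking F_IS to be the ratio of (H_S - H_I) to H_S and splitting the fraction:

```latex
\[
F_{IS} \;=\; \frac{H_S - H_I}{H_S}
       \;=\; \frac{H_S}{H_S} - \frac{H_I}{H_S}
       \;=\; 1 - \frac{H_I}{H_S}
\]
```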

This first "global" F-statistic is the ratio of the difference between the global-average expected and observed heterozygosities in subpopulations (H_S - H_I) to the global-average expected heterozygosity (H_S). It gives us a view of the average inbreeding over the entire set of subpopulations (that is, it very closely resembles the local F or F_s of Step 5 in the F_ST example page).
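A small Python sketch shows how the sign of this ratio behaves. The function name and input values are hypothetical: a positive result means fewer heterozygotes were observed than expected, and a negative result means more.

```python
def f_is(h_i, h_s):
    """Ratio of (H_S - H_I) to H_S, as described in the text."""
    if h_s == 0:
        raise ValueError("F_IS is undefined when H_S is zero")
    return (h_s - h_i) / h_s


# Hypothetical values: observed heterozygosity below expectation...
deficit = f_is(0.375, 0.41)
# ...and observed heterozygosity above expectation.
excess = f_is(0.45, 0.41)

print(round(deficit, 4), round(excess, 4))
```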

Next we calculate the F-statistic that tells us the most about the degree of genetic difference among the subpopulations -- F_ST. It is calculated as

$$F_{ST} = \frac{H_T - H_S}{H_T} \qquad \text{(Eqn 36.6)}$$

Here we assess the difference between the expected heterozygosities in the subpopulations and the expected heterozygosity based on the global gene frequencies.
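As a minimal sketch, F_ST can be computed from the two expected heterozygosities as the ratio of (H_T - H_S) to H_T. The function name and input values below are hypothetical.

```python
def f_st(h_s, h_t):
    """Proportional reduction of subpopulation heterozygosity relative to H_T."""
    if h_t == 0:
        raise ValueError("F_ST is undefined when H_T is zero")
    return (h_t - h_s) / h_t


# Hypothetical values: H_S = 0.41 and H_T = 0.50.
fst_value = f_st(0.41, 0.50)
print(round(fst_value, 4))  # 0.18
```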

Let's consider two extreme examples that will illustrate how F_ST can vary between zero and one. Consider a system with three alleles where we have three subpopulations.

Case 1: Maximal F_ST. If each subpopulation is fixed for a different allele, then H_S will be zero (a subpopulation with only one allele is not expected to contain any heterozygotes). In that case, Eqn 36.6 simplifies to H_T / H_T = 1.

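Case 2: Minimal F_ST. If the gene frequencies are the same in each of the subpopulations, every subpopulation will have the same expected heterozygosity, so H_S will equal H_T. In that case, the numerator of Eqn 36.6 goes to zero and F_ST is zero. Why can't F_ST be negative? You cannot arrange a set of populations to have H_S > H_T: pooling subpopulations whose gene frequencies differ can only raise the expected heterozygosity above the subpopulation average, never lower it.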
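Both extreme cases can be checked numerically. The sketch below assumes expected heterozygosity of the form 1 - sum(p_i^2), size-weighted global mean gene frequencies, and three equal-sized subpopulations. The helper names and the Case 2 frequencies are hypothetical.

```python
def h_exp(freqs):
    """Assumed form of expected heterozygosity: 1 - sum of squared frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def f_st_from_freqs(freqs_by_subpop, sizes):
    """F_ST = (H_T - H_S) / H_T, computed from subpopulation gene frequencies."""
    total = sum(sizes)
    # H_S: size-weighted mean of the per-subpopulation expected heterozygosities.
    h_s = sum(h_exp(f) * n for f, n in zip(freqs_by_subpop, sizes)) / total
    # H_T: expected heterozygosity at the size-weighted global mean frequencies.
    n_alleles = len(freqs_by_subpop[0])
    p_bar = [
        sum(f[i] * n for f, n in zip(freqs_by_subpop, sizes)) / total
        for i in range(n_alleles)
    ]
    h_t = h_exp(p_bar)
    return (h_t - h_s) / h_t


sizes = [10, 10, 10]

# Case 1: each subpopulation fixed for a different one of the three alleles.
case1 = f_st_from_freqs([[1, 0, 0], [0, 1, 0], [0, 0, 1]], sizes)

# Case 2: identical gene frequencies in every subpopulation.
case2 = f_st_from_freqs([[0.5, 0.3, 0.2]] * 3, sizes)

print(round(case1, 6), round(abs(case2), 6))
```

In Case 1 every subpopulation contributes zero expected heterozygosity, so the ratio reduces to H_T / H_T. In Case 2 the result is zero up to floating-point rounding.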

The final (and least often used) global F-statistic is F_IT, given by the formula:

$$F_{IT} = \frac{H_T - H_I}{H_T} \qquad \text{(Eqn 36.7)}$$

F_IT is relatively little used for two reasons. First, it is often quite similar to F_IS, thereby providing little new information. Second, when it does differ from F_IS it may even be somewhat misleading. It is possible to construct scenarios in which F_IT produces an "overall" picture that differs from the picture in any particular subpopulation. The reasonable context for an individual (observed heterozygosity) is its own subpopulation. Juxtaposing the individual against the total population is less intuitively meaningful.
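One way to see why F_IT often tracks F_IS is the standard identity (1 - F_IT) = (1 - F_IS)(1 - F_ST), which follows from writing each factor as a ratio of heterozygosities. When F_ST is small, F_IT is therefore close to F_IS. The sketch below checks the identity numerically. It takes F_IT as the ratio of (H_T - H_I) to H_T, the usual definition, and the function name and input values are hypothetical.

```python
def f_statistics(h_i, h_s, h_t):
    """Return (F_IS, F_ST, F_IT) from the three heterozygosities."""
    f_is = (h_s - h_i) / h_s
    f_st = (h_t - h_s) / h_t
    f_it = (h_t - h_i) / h_t  # usual definition, assumed here
    return f_is, f_st, f_it


# Hypothetical values: H_I = 0.375, H_S = 0.41, H_T = 0.50.
f_is, f_st, f_it = f_statistics(0.375, 0.41, 0.50)

# Both sides of the identity (1 - F_IT) = (1 - F_IS)(1 - F_ST).
lhs = 1 - f_it
rhs = (1 - f_is) * (1 - f_st)

print(round(f_is, 4), round(f_st, 4), round(f_it, 4))
print(abs(lhs - rhs) < 1e-12)
```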

Go to worked F_ST calculation web page.
