Professional Documents
Culture Documents
GenoCline User Manual v1.3
GenoCline User Manual v1.3
GenoCline v1.3
May, 2020
Jose A. Peña, Miguel A. Alfonso‐Sánchez, Luis Gómez‐Pérez
Dpto. Genética, Antropología Física y Fisiología Animal
Facultad de Ciencia y Tecnología
Universidad del País Vasco (UPV/EHU)
Aptdo. 644, 48080 Bilbao
SPAIN
E‐mail: genoclinesoftware@gmail.com
http://www.didac.ehu.es/genocline
http://genocline.sourceforge.net
Table of contents
Features ............................................................................................................................... 3
Download ............................................................................................................................ 3
Requirements ...................................................................................................................... 3
Menu options ...................................................................................................................... 4
Frequencies ............................................................................................................... 4
Data input ....................................................................................................... 4
Read a xls file .................................................................................................. 5
Genetic clines ‐ linear ..................................................................................... 6
Genetic cline for an allele ‐ linear ................................................................... 7
Genetic clines – angular transformation ........................................................ 7
Genetic cline for an allele ‐ angular transformation ...................................... 8
Genetic clines ‐ sigmoid .................................................................................. 8
Genetic cline for an allele ‐ sigmoid ............................................................... 8
GW data .................................................................................................................... 9
Read Eigensoft files ......................................................................................... 9
Read Eigenvectors and Coordinates ............................................................... 9
Genetic clines ‐ linear ..................................................................................... 9
Genetic cline for an eigenvector ‐ linear ........................................................ 9
Genetic clines – angular transformation ...................................................... 10
Genetic cline for an eigenvector ‐ angular transformation .......................... 10
Genetic clines ‐ sigmoid ................................................................................ 10
Genetic cline for an eigenvector ‐ sigmoid ................................................... 10
GeoGraphics ............................................................................................................ 11
Spatial autocorrelation ................................................................................. 11
Isolation by distance ..................................................................................... 12
Isolation by distance ‐ kinship ...................................................................... 13
Isolation by distance ‐ Fst ............................................................................. 14
Isolation by distance – Fst log transformation ............................................. 14
Heterozygosity in a cline ............................................................................... 15
Centroid method .......................................................................................... 16
Distances ................................................................................................................. 17
MDS ‐ Reynolds Fst ....................................................................................... 17
Sammon’s projection .................................................................................... 18
Geographical ................................................................................................. 18
Fst ................................................................................................................. 18
Export to Arlequin / Phylip / Past / Poptree ........................................................... 18
Output ..................................................................................................................... 19
Save results ................................................................................................... 19
Graphical representation of results .................................................................................. 19
Errors ................................................................................................................................. 19
Bibliography ...................................................................................................................... 20
Fig. 1. Example of the arrangement of Sheet 1.
Fig. 2. Example of the arrangement of Sheet 2.
Read a xls file
The Read a xls file option allows for the selection of the xls file containing the raw data. Once
the data have been read, the program creates a Results folder. This folder will be labeled using
the date and hour when the database was eventually opened (yyyymmddhhmmss). At the
same time, two windows appear on the screen to check if all the information is accurate: one
window with the allele frequency data and another with the geographical coordinates (Fig. 3).
If an error is detected, it should be corrected in the xls file.
Fig. 3. Screenshot after Read data option.
Fig. 4. Absolute frequency of detected genetic clines, grouped in 10‐degrees circumference arcs
Fig. 5. Regression line with 95% confidence limits (dotted lines), coefficient of determination (r2), Pearson’s r
correlation coefficient with statistical significance level and orientation of an allele frequency cline obtained with
the GenoCline software. Data from Moreira et al (2015)
Genetic clines – angular transformation
The same as the previous option, the program computes the Pearson's r correlation coefficient
between the frequency of each allele and the populations' coordinates regarding an axis that
rotates 360 degrees in consecutive iterations of one degree each.
In this option, singularity lies in the possibility of carrying out the angular transformation arcsin
√p (where p is the allele frequency) to separate the variance of the estimate from the
where,
xi, xj are the frequencies of the allele x for the populations i and j, respectively,
is the mean frequency,
wij is the spatial weight between populations i and j, and
d is the distance.
Moran's index values can be represented against the geographic distance in a graph
known as autocorrelogram. According to the "isolation‐by‐distance" model, it should be
expected a decreasing trend of the Moran's index as geographic distance increases. With
GenoCline we can also obtain the statistical significance of the autocorrelation values.
Logically, the statistical robustness of the autocorrelogram is greater when most of the
Moran's index values are statistically significant.
Based on the geographic coordinates of the populations, this program calculates the
distances between all population pairs and distributes them among seven (7) distance
categories or classes. Subsequently, GenoCline performs spatial autocorrelation analysis and
calculates the Moran's index for each distance class. Variations in the Moran's index owing to
the distance among populations are shown, along with the corresponding 95% confidence
limits (Fig. 6). The resulting image is saved as Autocorrelation.eps.
where rij refers to conditional kinship,
pik represents the frequency of the allele k in the population i and,
is the mean allele frequency of allele k.
Another method to examine the isolation by distance is based on the analysis of the
relationship between the quotient
1
Fig. 7. Intercept (a) and slope (b) estimates in Malécot’s isolation‐by‐distance model (Malécot, 1973).
Fig. 8. Intercept (a) and slope (b) estimates for the relationship between the quotient Fst/(1‐Fst) and the
geographic distance.
Isolation by distance ‐ Fst log transformation
The output of the analysis of the relationship between FST and geographic distance is saved
as IsolationByDistanceFl.eps. Such an analysis is performed when the model requires a
logarithmic transformation of the distance to linearize the function..
Fig. 10. Plot of heterozygosity (difference between expected heterozigosity‐predicted heterozigosity) versus
rotated coordinates in a group of populations.
Fig. 11. The cline of reference for Fig. 10.
where pik is the frequency of allele k in population i and,
is the mean frequency of allele k.
In the graph drawn by the centroid method, those subpopulations plotting above the
line (which represents the expected heterozygosity values) are assumed to have experienced
a higher impact of gene flow. On the contrary, plotting below the line indicates a deficit of
gene flow (see Fig. 9). An example of this issue can be consulted in Gómez‐Pérez et al. (2011a).
With GenoCline you can run analyses based on the centroid method. The program yields
population values for both the heterozygosity and the genetic variance, as well as a line
segment with the expected mean values of heterozygosity. The resulting image of this analysis
is saved as Centroid.eps.
Fig. 9. Plot of heterozygosity versus distance from genetic centroid in a group of 17 populations. Data from
Gómez‐Pérez et al. (2011a).
Fig. 12. An MDS analysis.
Fig. 13. The Sammon projection.
Geographical
GenoCline generates a geographic distance matrix that can be used as input file in other
statistical packages.
Fst
Likewise, the program also allows exporting of the genetic distance matrix obtained (FST
genetic distance, Reynolds et al., 1983).
Export to Arlequin / Phylip / Past / Poptree
Exporting data to other statistical programs (Arlequin, Phylip, Past, or Poptree) to carry out
complementary analyses is very easy with GenoCline since no change or specific modification
of data format is required.
Fig. 14. An error message.