Professional Documents
Culture Documents
2016 - Analysis of Y-Chromosome STRs in Chile Confirms An Extensive Introgression of European Male Lineages in Urban Populations
2016 - Analysis of Y-Chromosome STRs in Chile Confirms An Extensive Introgression of European Male Lineages in Urban Populations
Short communication
A R T I C L E I N F O A B S T R A C T
Article history: We analyzed the Y chromosome haplotypes (Yfiler) of 978 non-related Chilean males grouped in five
Received 12 May 2015 sampling regions (Iquique, Santiago de Chile, Concepción, Temuco and Punta Arenas) covering main geo-
Received in revised form 5 December 2015 political regions. Overall, 803 different haplotypes and 688 singletons were observed. Molecular diversity
Accepted 9 December 2015
was moderately lower than in other neighboring countries (e.g. Argentina); and AMOVA analysis on Y-
Available online 11 December 2015
STR haplotypes showed that among variation within Chile accounted for only 0.25% of the total variation.
Punta Arenas, in the southern cone, showed the lowest haplotype diversity, and discrimination capacity,
Keywords:
and also the highest matching probability of the five Chilean samples, probably reflecting its more
Y-filer
Y-STR
marked geographic isolation compared to the other regions. Multidimensional scaling (MDS) analysis
Chile based on RST genetic distances suggested a close proximity of Chilean Y-chromosome profiles to European
Forensic genetics ones. Consistently, haplogroups inferred from Y-STR profiles revealed that the Native American
YHRD component constituted only 8% of all the haplotypes, and this component ranged from 5% in the Centre of
the country to 9–10% in the South and 13% in the North, which is in good agreement with the distribution
of Native American communities in these regions. AMOVA computed on inferred haplogroups confirmed
the very low among variation observed in Chilean populations. The present project provides the first
Chilean dataset to the international Y-chromosome STR Haplotype Reference Database (YHRD) and it is
also the first reference database for Y-chromosome forensic casework of the country.
ã 2015 Elsevier Ireland Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.fsigen.2015.12.005
1872-4973/ ã 2015 Elsevier Ireland Ltd. All rights reserved.
U. Toscanini et al. / Forensic Science International: Genetics 21 (2016) 76–80 77
Table 1
Molecular diversity indices on Y-chromosome profiles from Chile and other reference populations. Values in brackets refer to standard errors. Diversity values from Argentina
were included in this table for reference; the data was obtained from Toscanini et al. [10].
N K UH HD M DC MP
Population MHT Yfiler MHT Yfiler MHT Yfiler MHT Yfiler MHT Yfiler MHT Yfiler
Chile 978 54% 82% 39% 70% 0.9939 (0.0008) 0.9994 (0.0001) 4.89 (2.39) 10.16 (4.64) 0.540 0.821 0.007126 0.0015933
Iquique 196 73% 95% 63% 91% 0.9901 (0.0029) 0.9995 (0.0006) 4.75 (2.33) 9.98 (4.58) 0.735 0.954 0.0149417 0.0056227
Santiago 196 75% 96% 65% 93% 0.9931 (0.0020) 0.9996 (0.0006) 4.93 (2.41) 10.23 (4.70) 0.750 0.964 0.0119221 0.0054665
Concepción 198 74% 92% 62% 84% 0.9942 (0.0015) 0.9991 (0.0006) 4.96 (2.42) 10.21 (4.68) 0.742 0.919 0.0108152 0.0059178
Temuco 194 80% 97% 68% 94% 0.9963 (0.0012) 0.9997 (0.0006) 4.98 (2.43) 10.32 (4.73) 0.799 0.969 0.0088213 0.0054735
Punta Arenas 194 65% 81% 50% 70% 0.9904 (0.0023) 0.9967 (0.0011) 4.84 (2.37) 10.04 (4.61) 0.649 0.809 0.0147199 0.0083962
Argentina 621 73% 95% 60% 90% 0.9972 (0.0006) 0.9998 (0.0001) 5.10 (2.48) 10.51 (4.80) 0.729 0.947 0.0044420 0.0018022
Argentina-North 180 84% 94% 69% 88% 0.9980 (0.0009) 0.9993 (0.0007) 5.35 (2.59) 10.86 (4.96) 0.839 0.939 0.0075926 0.0062963
Argentina-Centre 311 79% 96% 69% 92% 0.9971 (0.0008) 0.9997 (0.0003) 4.99 (2.43) 10.43 (4.77) 0.794 0.958 0.0061517 0.0035256
Argentina-South 130 86% 98% 78% 97% 0.9951 (0.0026) 0.9998 (0.0010) 4.94 (2.42) 10.13 (4.66) 0.862 0.985 0.0125444 0.0079290
Europe 2168 58% 94% 45% 89% 0.9956 (0.0005) 0.9999 (0.0000) 4.98 (2.43) 10.36 (4.73) 0.579 0.938 0.0048789 0.0005408
South America 1108 69% 94% 56% 89% 0.9974 (0.0004) 0.9999 (0.0001) 5.21 (2.52) 10.62 (4.84) 0.693 0.938 0.0034716 0.0010475
MHT: minimal haplotype; K = number of different haplotypes; UH = singletons; HD = haplotype diversity; M = mean number of pairwise differences; DC = discrimination
capacity; MP = match probability.
preliminary results were announced in the Forensic Genetics among the different samples. Issues regarding DYS385ab marker
13 Proceedings of the 23rd International ISFG Congress [12], were considered as in Purps et al. [9] for these analyses.
and the data were neither released to public resources nor to the RST genetic distances were also computed between sample sets
YHRD. from Chile (present study), Argentina [10], Europe, and Africa [9].
The Toba from Argentina [11] was also added to the analysis in order
2.3. Statistical analyses to attract the Native American component of the Chileans and other
reference populations; the Toba was selected because it has only a
Molecular diversity indices (Haplotype Diversity [HD], number minor proportion of non Native American Y-chromosome ancestry
of different [K] and unique haplotypes [UH]) and parameters of [11] compare to other Native American populations in South
forensic interest (Discrimination Capacity [DC] and Matching America. A Kruskal’s Non-metric Multidimensional Scaling analysis
Probability [MP]), were estimated as described in Purps et al. [9]. (MDS) analysis based on the RST distances was built using the
The Arlequin software v3.5.1.2 [13] was used for most of the isomds function as implemented in MASS package (https://stat.
analyses. All the indices were calculated for both, the full Yfiler ethz.ch/R-manual/R-devel/library/MASS/html/isoMDS.html).
haplotypes and their corresponding 9-Y-STR MHT.
Computation of Mean Number of Pairwise Differences, Analysis 2.4. Haplogroup inference
of Molecular Variance (AMOVA) and estimation of RST genetic
distances (as implemented in Arlequin software v3.5.1.2) were The full Yfiler haplotypes excluding DYS385ab were used to
performed in order to assess the genetic structure and variation allocate them to their most likely haplogroup using the Athey’s
Fig. 2. MDS plot on RST distances. The inset is a zoom-in of the European pole of the main MDS plot. The European pole is represented by various Spanish populations; while
the African and the Native American poles are represented by people from Ibadan (Nigeria and Zimbabwe) and the Toba (Argentina), respectively. See Section 2 for more
information.
U. Toscanini et al. / Forensic Science International: Genetics 21 (2016) 76–80 79
Table 2 related to Europeans and urban Argentineans (see also Ref. [10]),
Haplogroup frequencies inferred from Y-STR in the different regions sampled in
then suggesting a high European Y-chromosome ancestry in Chile.
Chile.
The MDS analysis did not suggest the presence of a sub-Saharan
Chile Iquique Santiago Concepción Temuco Punta Arenas African component in Chileans, as it was clearly observed in other
E1b1a 0.002 0.005 0.005 South American populations i.e. [5,15,16].
E1b1b 0.101 0.071 0.082 0.152 0.119 0.082
G2a 0.044 0.036 0.071 0.051 0.026 0.036
3.3. Haplogroup frequency patterns
H 0.002 0.005 0.005
I1 0.020 0.005 0.005 0.030 0.041 0.021
I2a (xI2a1) 0.011 0.010 0.010 0.010 0.026 Haplogroups were inferred from Y-STR profiles (Table 2). These
I2a1 0.021 0.015 0.020 0.025 0.031 0.015 analyses showed that the Native American lineages (i.e. P + Q
I2b (xI2b1) 0.003 0.010 0.005 haplogroups) fairly represent an 8.3% of the genetic profiles of the
I2b1 0.011 0.005 0.020 0.015 0.005 0.010
global Chilean database; 93% of this indigenous component
J1 0.041 0.046 0.046 0.030 0.036 0.046
J2a1 x J2a1- 0.015 0.015 0.020 0.030 0.005 0.005 belonged to haplogroup Q (the remaining was accounted by
bh haplogroup P). The Native American component was lower in the
J2a1b 0.042 0.036 0.056 0.051 0.036 0.031 Centre of the country (5% in Santiago and Concepción), reached 9%
J2a1h 0.008 0.010 0.005 0.015 0.010
in the South (Temuco: 10%) and in the southernmost region of Chile
J2b 0.022 0.031 0.036 0.005 0.010 0.031
L 0.007 0.005 0.010 0.010 0.010
(Punta Arenas; 9%), and a maximum peak in the North (Iquique:
N 0.001 0.005 13%).
Q 0.083 0.128 0.061 0.051 0.098 0.077 The rest of the haplogroups in Chile (91.7%) were most likely of
R1a 0.016 0.010 0.010 0.015 0.015 0.031 Eurasian origin (Table 2). Their frequencies were relatively
R1b 0.525 0.556 0.515 0.485 0.510 0.557
homogeneous across the country. The most prevalent haplogroup
T 0.022 0.005 0.020 0.040 0.031 0.015
was R1b, which accounted for 51% in Chile; its frequency varied
slightly from 49% in Concepción to 54% in Punta Arenas. The second
most frequent haplogroup was E (10% on average), and it showed a
Haplogroup Predictor tool (http://www.hprg.com/hapest5/ more variable distribution: it reached 7.1% in Iquique and reached a
hapest5a/hapest5.htm?order=num). Petrej9 cíková et al. [14] maximum in Concepción (15.7%).
showed that this tool predicts correctly the haplogroup in
98.19% of the cases using only 12 Y-STRs. In addition, we inferred 4. Discussion
the haplogroup status using the Urasin’s Predictor software v.1.5.0
(http://predictor.ydna.ru). Both haplogroup predictors yielded As previously proven in other populations, Yfiler haplotypes
virtually the same frequency results for the root haplogroups showed a significantly higher molecular diversity and discrimina-
(the phylogenetic resolution used in the present study). tion power than MHT from the same individuals in Chile,
strengthening the convenience of using this Y-STR set in forensic
3. Results genetic instead of the MHT panel.
Various analyses were carried out using Argentina as a
3.1. Molecular diversity reference population given its similar demography to Chile,
historically and contemporarily. Effectively, the data showed some
The full Yfiler haplotypes for the 978 individuals sampled are parallelism in term of molecular diversity and haplogroup
reported in Supplementary Table S1. Neither null alleles nor composition between these two countries. In general, Chile has
duplications were observed. Intermediate alleles were observed in slightly lower values of molecular diversity (reflected on summary
50 individuals, of which 47 were found in DYS458. statistical indices). This is somehow expected if we take into
Diversity and forensic indices are shown in Table 1. HD was account that the impact of European immigrants arriving to South
always higher than 0.999 except for Punta Arenas (0.9967), America was much higher in the countries facing the Atlantic
indicating a decrease in the variation at the southernmost region of shores of the continent than in those countries exposed exclusively
the country. Compared to other populations, the global HD for to the Pacific Ocean (like Chile). Thus, Chile was traditionally a
Chile appeared to be slightly lower than that observed in provinces remote place for migrants because it was far from Europe and quite
from Argentina, Europe (Spain and Italy) and pooled populations inaccessible historically; this fact was already recorded in the
from South America; in part, this is mainly due to the low value census of 1907 indicating that the highest percentage of citizens
observed in Punta Arenas. A similar situation was observed for the born in Europe versus the total population of Chile was 2.2% (INE).
proportion of different haplotypes and of unique haplotypes when Although the arrival of Europeans into Chilean territories was
comparing the whole Chilean sample with those populations probably lower than in Argentina in historical times, the results of
mentioned above (Table 1). the present study indicate a massive introgression of European Y-
Finally, AMOVA, when carried out on the five Chilean regions chromosome haplotypes (about 92%) into the present-day Chilean
analyzed, shows that most of the variation (99.75%) occurred urban population. MDS analysis based on RST distances corrobo-
within populations. rated the presence of this component by allocating all the Chilean
sub-samples very closely related to Europeans.
3.2. Multidimensional scaling analysis The results from AMOVA indicated a very low proportion of
among population Y-chromosome variation in the country.
MDS analysis was performed based on RST distances in order to AMOVA however not always mirrors population differences that
visualize population sample relationships (Fig. 2). A set of could be relevant in forensic genetics [17–20]. Instead, phyloge-
neighboring Argentinean populations was also added for compari- netic analysis can sometimes show patterns of population sub-
son, including the Native American Toba as a reference population structure that could pass unnoticed when using FST-based
of Native American ancestry [11]. The MDS plot displays the three measures. In our study, however, analysis of inferred haplogroups
reference populations in the vertex of a quasi-equilateral triangle; does not reveal the existence of pronounce population stratifica-
each vertex indicating the European, the African, and the Native tion either. Thus, the European haplogroup frequencies observed in
American poles. Genetic distances showed Chileans to be closely Chile were relatively homogeneous all over the country, while the
80 U. Toscanini et al. / Forensic Science International: Genetics 21 (2016) 76–80
scenario was difference in Argentina, where different provinces [3] L. Cifuentes, R. Morales, D. Sepulveda, H. Jorquera, M. Acuna, DYS19 and
showed differential impact of Eurasian source populations. In DYS199 loci in a Chilean population of mixed ancestry, Am. J. Phys. Anthropol.
125 (2004) 85–89.
addition, the Native American component in Chile, although lower [4] A. Salas, J.C. Jaime, V. Álvarez-Iglesias, Á. Carracedo, Gender bias in the
than in Argentina (and much lower than other South American multiethnic genetic composition of central Argentina, J. Hum. Genet. 53 (2008)
populations [5,16]), showed only discrete variation in different 662–674.
[5] J.M. Cárdenas, T. Heinz, J. Pardo-Seco, V. Álvarez-Iglesias, P. Taboada-Echalar, P.
regions from Chile. The highest value of this Native component was Sánchez-Diz, Á. Carracedo, A. Salas, The multiethnic ancestry of Bolivians as
found in the North. Not surprisingly, this is the region that hosts revealed by the analysis of Y-chromosome markers, Forensic Sci. Int. Genet. 14
one of the highest Native American communities of the country (2014) 210–218.
[6] J. Söchtig, V. Álvarez-Iglesias, A. Mosquera-Miguel, M. Gelabert-Besada, A.
(i.e. Aymara). The southern region of the country showed also a Gómez-Carballa, A. Salas, Genomic insights on the ethno-history of the Maya
higher proportion of Native American Y-chromosomes than the and the ‘Ladinos’ from Guatemala, BMC Genomics 16 (2015) 131.
centre; and this is also in good agreement with the presence of the [7] M.C. Lardone, A. Marengo, A. Parada-Bustamante, L. Cifuentes, A. Piottante, M.
Ebensperger, R. Valdevenito, A. Castro, Greater prevalence of Y chromosome
most important Native American community at this latitude (i.e.
Q1a3a haplogroup in Y-microdeleted Chilean men: a case-control study, J.
Mapuches). Assist. Reprod. Genet. 30 (2013) 531–538.
In summary, the present study is one of the very few population [8] U. Toscanini, F. Moreno, J.A. Pantoja-Astudillo, E.A. Morales, P. Bustos, A. Salas,
genetic studies carried out in Chile; and the results provide new A reference frequency database of 15 autosomal STRs in Chile, Forensic Sci. Int.
Genet. 19 (2015) 35–36.
insights into the Y-chromosome genetic characteristics of the [9] J. Purps, S. Siegert, S. Willuweit, M. Nagy, C. Alves, R. Salazar, S.M. Angustia, L.H.
country. Last but not least, the data generated in the present Santos, K. Anslinger, B. Bayer, et al., A global analysis of Y-chromosomal
project represents the first Y-STR database that is available in the haplotype diversity for 23 STR loci, Forensic Sci. Int. Genet. 12C (2014) 12–23.
[10] U. Toscanini, C. Vullo, G. Berardi, C. Llull, A. Borosky, A. Gómez, J. Pardo-Seco, A.
literature for use in forensic casework carried out in Chile, and it is Salas, A comprehensive Y-STR portrait of Argentinean populations, Forensic
the first dataset from this country that is included in the YHRD. Sci. Int. Genet. (2015) . http://dx.doi.org/10.1016/j.fsigen.2015.09.002.
This study is also of special value in prosecuting human rights [11] U. Toscanini, L. Gusmão, G. Berardi, V. Gomes, A. Amorim, A. Salas, E. Raimondi,
Male lineages in South American Native groups: evidence of M19 traveling
violations occurring during the military government of the south, Am. J. Phys. Anthropol. 146 (2011) 188–196.
dictatorship suffered in Chile since 1973 until 1990. [12] R.K. Roby, S.D. González, N.R. Phillips, J.V. Planz, J.L. Thomas, J.A.P. Astudillo, J.
Ge, E.A. Morales, A.J. Eisenberg, R. Chakraborty, et al., Autosomal STR allele
frequencies and Y-STR and mtDNA haplotypes in Chilean sample populations,
Conflict of interest
Forensic Sci. Int. Genet. 2 (Suppl. 1) (2009) 532–533.
[13] L. Excoffier, G. Laval, S. Schneider, Arlequin ver. 3.0: an integrated software
None package for population genetics data analysis, Evol. Bioinf. Online 1 (2005)
47–50.
[14] E. Petrej9
cíková, J. Carnogurská, D. Hronská, J. Bernasovská, I. Boron ová, D.
Acknowledgements Gabriková, A. Bôžiková, S. Ma9 ceková, Y-SNP analysis versus Y-haplogroup
predictor in the Slovak population, Anthropol. Anz. 71 (2014) 275–285.
The authors declare no conflicts of interest. AS received funding [15] A. Salas, Á. Acosta, V. Álvarez-Iglesias, M. Cerezo, C. Phillips, M.V. Lareu, Á.
Carracedo, The mtDNA ancestry of admixed Colombian populations, Am. J.
from the “Ministerio de Ciencia e Innovación” (SAF2011-26983), Hum. Biol. 20 (2008) 584–591.
the Plan Galego IDT (EM 2012/045) and the grant from the Sistema [16] T. Heinz, V. Álvarez-Iglesias, J. Pardo-Seco, P. Taboada-Echalar, A. Gómez-
Universitario Gallego- Modalidad REDES (2012-PG226) from the Carballa, A. Torres-Balanza, O. Rocabado, Á. Carracedo, C. Vullo, A. Salas,
Ancestry analysis reveals a predominant Native American component with
Xunta de Galicia (A.S.). moderate European admixture in Bolivians, Forensic Sci. Int. Genet. 7 (2013)
537–542.
Appendix A. Supplementary data [17] A. Gómez-Carballa, F. Moreno, V. Álvarez-Iglesias, F. Martinón-Torres, M.
García-Magariños, J.A. Pantoja-Astudillo, E. Aguirre-Morales, P. Bustos, A.
Salas, Revealing latitudinal patterns of mitochondrial DNA diversity in
Supplementary data associated with this article can be Chileans, Forensic Sci. Int. Genet. 20 (2015) 81–88.
found, in the online version, at http://dx.doi.org/10.1016/j. [18] U. Toscanini, A. Salas, M. García-Magariños, L. Gusmão, E. Raimondi,
Population stratification in Argentina strongly influences likelihood ratio
fsigen.2015.12.005. estimates in paternity testing as revealed by a simulation-based approach, Int.
J. Legal Med. 124 (2010) 63–69.
References [19] U. Toscanini, A. Salas, Á. Carracedo, G. Berardi, Á. Amorim, L. Gusmão, E.
Raimondi, A simulation-based approach to evaluate population stratification
in Argentina, Forensic Sci. Int. Genet. 1 (1) (2008) 662–663.
[1] L. Thayer-Ojeda, Elementos étnicos que han intervenido en la población de
[20] U. Toscanini, L. Gusmão, G. Berardi, Á. Amorim, Á. Carracedo, A. Salas, E.
Chile, Imprenta La Ilustración, Santiago de Chile, 1919.
Raimondi, Testing for genetic structure in different urban Argentinian
[2] L. Cifuentes, C. Valenzuela, R. Cruz-Coke, L. Armanet, C. Lyng, Z. Harb,
populations, Forensic Sci. Int. 165 (2007) 35–40.
Caracterización genética de la población hospitalaria de Santiago, Rev. Med.
Chile 116 (1988) 28–33.