Thursday, November 25, 2010
The Sardinian samples from the HGDP are always, as far as I know, classified as entirely of West Eurasian origin via clustering algorithms like ADMIXTURE and STRUCTURE. In other words, these Sardinians completely fit into clusters that peak north of the Sahara and west of Central Asia. So it would appear that gene flow to Sardinia from neighboring Africa has been minimal, or even non-existent.
But that's not what I found when I took a closer look at their genomes, as well as those of over 250 other Europeans with apparently no extra-European ancestry, as shown by my own ADMIXTURE analyses. I picked one of my favorite "local admixture" programs for the job, called RHHcounter (see here for more details), setting the rare genotype detection level at 0.01%.
Quite a few of the individuals showed tiny clusters of 3-4 genotypes that were only common outside of Europe, usually in Africa, East Asia or the Americas. These were often too small to investigate further. However, I spotted two segments that were large and clear enough to warrant more detailed analyses. Surprisingly, these belonged to two of the HGDP Sardinians - HGDP00672 and HGDP00673. Below are their Chromosome Mosaics, courtesy of RHHmapper, along with MDS plots based on all the SNPs from the aforementioned segments (marked by arrows). The MDS plots include samples from Europe, North Africa and Sub-Saharan Africa.
As per above, the MDS plots were produced using all the genotypes contained within the relevant segments (over 300 and 2000 SNPs respectively), and not just those that were detected by RHHcouter in the analysis. Obviously, what this shows is that only a fraction of the extra-European genotypes were flagged, while the rest nearby remained undetected at this threshold.
I can't see any explanation for these results other than relatively recent gene flow from Sub-Saharan Africa to Sardinia. What this means, of course, is that there must be a reason why model-based algorithms can't pick up such admixtures in certain samples. As suggested by the authors of RHHcounter, perhaps the segments are too small and/or contain too few SNPs to have an impact on overall ancestry estimation? However, I also suspect that because Sardinia is something of a Southern European genetic isolate, the Sardinians are too easily classified as Europeans by ADMIXTURE, STRUCTURE etc., which might mask at least some of their minority admixtures.
Tuesday, November 9, 2010
These results from an LBK burial site in Germany (5,500-4,900 BC) look surprisingly Near Eastern. So the question is, where were the ancestors of modern Central Europeans at this time?
Interestingly, we do not find the most common Y chromosome hgs in modern Europe (e.g., R1b, R1a, I, and E1b1), which parallels the low frequency of the very common modern European mtDNA hg H (now at 20%–50% across Western Eurasia) in the Neolithic samples. Also, while both Neolithic Y chromosome hgs G2a3 and F* are rather rare in modern-day Europe, they have slightly higher frequencies in populations of the Near East, and the highest frequency of hg G2a is seen in the Caucasus today . The few published ancient Y chromosome results from Central Europe come from late Neolithic sites and were exclusively hg R1a . While speculative, we suggest this supports the idea that R1a may have spread with late Neolithic cultures from the east .
Haak W, Balanovsky O, Sanchez JJ, Koshel S, Zaporozhchenko V, et al. (2010) Ancient DNA from European Early Neolithic farmers reveals their Near Eastern affinities. PLoS Biol 8(11): e1000536. doi:10.1371/journal.pbio.1000536