Magnus Ducatus Lituaniae Project: Behind the Curtains: MDLP World 22 showcase

Saturday, September 15, 2012

Behind the Curtains: MDLP World 22 showcase

Preliminary remarks

As you all may know, the MDLP blog hasn't been updated since February 2012.

Half of year ago i promised myself that i would stop writing new posts on the MDLP blog before i'll finally get my scientific report on blog written. Since I had to prioritize the completion of a scientific paper over the routine of blog posting,I was unable to continue updating the blog on a regular basis due to a lack of time, and had to make a change in how I conducted my research. So i decided to abstain from posting on the MDLP blog for a couple of month, being focused on more important matters. Despite of all limitations, i kept working secretly on the MDLP project, collecting necessary data and performing different 'genomic' experiments in order to achieve my final goal (publishing of paper). The results of secret experiments with new genomic samples and tools eventually leaked to the curious public, spawning immense interest in my project. After releasing a new version of my own modification of DIYDodecad calculator on Gedmatch.com, i was literally flooded by emails from Gedmatch.com users asking me questions they wanted me to answer.

I understood the strategical mistake of releasing poorly documented data/analysis on Internet and felt obliged to explain details. Obviously, i will start new series of the blog posts by covering the project feature the people most interested in, i.e the MDLP World22 calculator.

The population dataset of MDLP World22 calculator.

The reference population dataset of the calculator was assembled in PLINK by intersecting and thinning the samples from different data sources: HapMap 3 (the filtered dataset CEU,YRI,JPT,CHB), 1000genomes, Rasmussen et al. (2010), HGDP (Stanford) (all populations), Metspalu et al. (2011),Yunusbayev et al. (2011), Chaubey et al. (2010) etc. Furthermore i handpicked random 10 individuals from each European country panel in POPRES dataset, or the maximum number of individuals available otherwise, to select the POPRES European individuals to be included in our study. Finally, in order to evaluate the correlation between the modern and the ancient genetic diversity, i have also included ancient DNA genomic samples of Ötzi,(Keller et al.(2012)) Swedish Neolithic samples Gök4, Ajv52, Ajv70, Ire8, Ste7 (Skoglund et al. (2012)) and 2 La Braña individuals from the Mesolithic sites of the Iberian Peninsula (Sánchez-Quinto et al.(2012)). Then i added 90 samples of individuals-participants of our MDLP project. After merging the aforementioned datasets and thinning the SNP set with PLINK command to exclude SNPs with missing rates greater than 1% and minor alleles, i filtered out duplicates, the individuals with high pairwise IBD-sharing (estimated in Plink as as the average fraction of alleles shared between two individuals over all loci) and the individuals with kinship coefficient suggesting relatedness (kinship coefficients were estimated in KING software). Also i had to filter out individuals with more more tham 3 standard deviations from the population averages. Since kinship coefficient is robustly estimated by HWE (Hary-Weinberg expectations) among SNPs with the same underlying allele frequencies, SNPs showing strong deviation (p < 5.5 x10⁻⁸) from Hardy-Weinberg expectations were removed from the merged and filtered dataset. After that I filtered to keep the list of common SNPs present in Illumina/Affymetrix chips and performed linkage disequilibrium based pruning using a window size of 50, a step of 5 and r^2 threshold of 0.3.

This complex sequence of consequent operations with the initial reference and project datasets yielded a final dataset which included 80751 SNPs in 2516 individuals from 225 populations.

ADMIXTURE analysis

As always, the final dataset in PLINK linked format was further processed in ADMIXTURE software. Sketching the plan for the design of ADMIXTURE test, i had to face the difficult problem: as it has been shown in (Patterson et al.2006) the number of markers needed to resolve populations in ADMIXTURE analysis is inversely proportional to the genetic distance (Fst ) betweeen the populations. According to ADMIXTURE best practice, it is believed that 10,000 markers are suffice to perform GWAS correction for continentally separated populations (for example, African, Asian, and European populations FST > .05) while more like 100,000 markers are necessary when the populations are within a continent (Europe, for instance, FST < 0.01).

To increase the accuracy of ADMIXTURE results i decided to use a method proposed by Dienekes' for converting allele frequencies into 'synthetic individuals'(see also Zack's example). The idea is fairly simple: run an unsupervised ADMIXTURE analysis once to generate allele frequencies for your K ancestral components; then generate zombie populations using these allele frequencies; whenever you want to estimate admixture proportions in new samples run supervised ADMIXTURE analysis using the zombie populations. Like any genome blogger engaged in the task of evaluating admixtures in samples, i must grapple with obvious question of the reliability of this approach. Although i am aware of methodological controversies in using simulated individuals, i would rather concur with Dienekes who considered "synthetic individuals" the best abstract proxies for the ancient ancestral populations. But my purpose is served if i can use the approach used by Dienekes and Zack to obtain meaningful results. To begin with, i routinely ran unsupervised ADMIXTURE K=22 analysis (assuming 22 ancestral populations) which yielded the admixture proportions of individuals from these K populations, as well as the allele frequencies for all SNPs for each of 22 ancestral populations (below are conventional names for each of inferred components in order of appearance):

Pygmy
West-Asian
North-European-Mesolithic
Tibetan
Mesomerican
Arctic-Amerind
South-America_Amerind
Indian
North-Siberean
Atlantic_Mediterranean_Neolithic
Samoedic
Proto-Indo-Iranian
East-Siberean
North-East-European
South-African
North-Amerind
Sub-Saharian
East-South-Asian
Near_East
Melanesian
Paleo-Siberean
Austronesian

Therefore i took the allele frequencies which were computed earlier in unsupervised Admixture K=22 for the merged dataset, pooled them into PLINK and generated 10 "synthetic individuals per ancestral component) using PLINK command --simulate. When the simulation had been finished, i visualized the distance between simulated individuals using multi-dimensional scaling:

As a next step,i included simulated individuals you have as part of a new reference population (including 220 simulated individuals in 22 simulated populations).Then, I ran ADMIXTURE anew, this time in “supervised” mode for K = 22 (with simulated individuals being 'reference' individuals). The Admixture K=22 converged in 31 iterations (37773.1 sec) with final loglikelihood:-188032005.430318 (below are Fst divergences between estimated 'ancestral' populations):

The Fst distance/divergence matrix was used for inferring a most probable NJ-based topology of component distance tree (outgroup: South-African):

The individual 'supervised' ADMIXTURE results (in Excel spreadsheet) for the project participants have been uploaded to GoogleDocs (please note that the average results for reference populations is also available on special request).

MDLP World22 DIYcalculator

The output files of Admixture K=22 supervised run (average values of admixture coefficients in reference populations and FsT values) were used for designing a new version of the MDLP DIYcalculator, which is better known by its codename "World22" (online version is available in AdMix-Utilities section of Gedmatch under MDLP project). MDLP DIYcalculator itself is based on the code of Dodecad DIY calculator (c)ourtesy of Dienekes Pontikos and was developed as part of the Dodecad Ancestry Project. In its Gedmatch implementation MDLP 'World22' DIYcalculator is paired by MDLP 'World22' Oracle, also based on Dienekes' and Zack's code (Harappa/DodecadOracle). The 'Oracle' is designed to find in a single population mode your closest (closest in terms of similarity) population from MDLP ''Word22' admixture results. In a mixed mode, Oracle considers all pairs of populations, and for each one of them calculates the minimum Fst-weighted distance to the sample in consideration, and the admixture proportions that produce it.

Please notice: 'ancestral' populations (i.e 'simulated populations' from the previous step - see above) are labeled in Oracle results as (anc), while the 'real world' modern and ancient populations are marked as "derived".

If you have troubles with understanding/interpreting the results of Oracle and DIYcalculcator, please consult the corresponding topics on Dodecad and HarappaWorld blogs. It is not of avail to repeat in this blog everything they wrote in their own blogs.

What the heck are MDLP Word-22 components?

One of those questions that i usually keep getting in emails is what do the various reference populations and ancestral components for my World K=12 and World-22 analyses mean. I've already provided hints to the answer earlier, but - as old Chinese proverb says - one picture is worth ten thousand words. That's why i decided to display the admixture coefficients spatially on the globe surface. Following Francois Olivier, who proposed to use the graphical library of the statistical software R to display spatial interpolates of the admixture coefficients (Q matrix) in two dimensions (where spatial coordinates are recorded as longitude and latitude), i created 2 contour maps per component.

Pygmy (modal in Biaka and Mbuti population)

West-Asian (bimodal component with peaks in Caucasian populations and south-western part of Iran, equal to Dienekes' Caucasian/Gedrosia component)

North-European-Mesolithic (local component with peaks in European Mesolithic samples of La_Brana and modern North-European Saami population).

Tibetan (Indo-Burmese) component (Himalay, Tibet)

Mesomerican (major genetic component in Native Americans from Mesoamerica)

North-Amerind (the 'native' component in North American Natives)

South-Amerind (the 'native' component in South American Natives)

Atlantic-Mediterranean-Neolithic (the main genetic component in Western and South-Western Europe)

The rest contour maps for all components could be downloaded here.

34 comments:

pconroySeptember 15, 2012 at 8:24 PM
Vadim,

Fantastic work - I can't wait to see my family's results!!!

Are you only providing the World22 and Oracle on Gedmatch, or is there a link to download them also?

I have one account on Gedmatch - my own - but have 9 other accounts, many of them children. I would like to be able to download the files and run them directly in R, is that possible?
ReplyDelete
Replies
pconroySeptember 15, 2012 at 9:13 PM
I'm Irish - all 8 G-Grandparents are Irish, and here's what I get:

Admix Results (sorted):

# Population Percent
1 North-East-European 48.89
2 Atlantic_Mediterranean_Neolithic 33.55
3 West-Asian 7.29
4 North-European-Mesolithic 6.41
5 Indo-Iranian 2.74
6 Indian 0.55
7 South-America_Amerind 0.27
8 South-African 0.14
9 Indo-Tibetan 0.09
10 Pygmy 0.04
11 Near_East 0.02

Single Population Sharing:

# Population (source) Distance
1 CEU_V (derived) 3.39
2 German_V (derived) 3.67
3 Welsh (derived) 4.41
4 CEU (derived) 4.92
5 Austrian (derived) 5.16
6 Norwegian_V (derived) 5.23
7 British (derived) 5.92
8 Swedish (derived) 6.11
9 German-North (derived) 6.34
10 Orcadian (derived) 6.84
11 Hungarian (derived) 7.03
12 German (derived) 7.26
13 Slovenian (derived) 7.96
14 Swedish_V (derived) 8.61
15 German-South (derived) 8.61
16 Croatian (derived) 9.19
17 Bosnian (derived) 9.99
18 Serbian (derived) 10.43
19 Czech (derived) 10.76
20 Croatian_V (derived) 12.2

Mixed Mode Population Sharing:

# Primary Population (source) Secondary Population (source) Distance
1 61% German_V (derived) + 39% Norwegian_V (derived) @ 1.98
2 81% CEU (derived) + 19% Latvian_V (derived) @ 2.19
3 61.4% CEU (derived) + 38.6% German (derived) @ 2.38
4 55.9% British (derived) + 44.1% German (derived) @ 2.41
5 81.5% CEU (derived) + 18.5% Ukrainian-Center (derived) @ 2.43
6 81.1% CEU (derived) + 18.9% Ukrainian_V (derived) @ 2.47
7 77.7% British (derived) + 22.3% Latvian_V (derived) @ 2.5
8 84.6% CEU (derived) + 15.4% Mordovian (derived) @ 2.51
9 93.7% Welsh (derived) + 6.3% Avar (derived) @ 2.52
10 96% CEU_V (derived) + 4% Lak (derived) @ 2.58
11 96% CEU_V (derived) + 4% Tabassaran (derived) @ 2.6
12 82.3% CEU (derived) + 17.7% Mordovian_V (derived) @ 2.6
13 96.3% CEU_V (derived) + 3.7% Pashtun (derived) @ 2.6
14 84% CEU (derived) + 16% Ukrainian-East (derived) @ 2.6
15 93.6% Welsh (derived) + 6.4% Tabassaran (derived) @ 2.6
16 96.3% CEU_V (derived) + 3.7% Ossetian (derived) @ 2.61
17 97.8% CEU_V (derived) + 2.2% West-Asian (ancestral) @ 2.63
18 68.1% Norwegian_V (derived) + 31.9% Bosnian (derived) @ 2.63
19 96.2% CEU_V (derived) + 3.8% Avar (derived) @ 2.63
20 81.4% British (derived) + 18.6% Mordovian (derived) @ 2.63

Question: Do you have Irish, Scottish or French Reference populations?
ReplyDelete
Replies
pconroySeptember 15, 2012 at 9:50 PM
One other thing, it would be interesting to see on what Chromosome segments I am:

North-European-Mesolithic 6.41

Also what are the rank order of North-European-Mesolithic in terms of various populations? I guess based on your map, Saami would be #1, and maybe Finns, Estonians and Latvians would be #2, and then maybe Scandinavians in general as #3 ??
ReplyDelete
Replies
Project "Magnus Ducatus Lituaniae"September 19, 2012 at 2:11 PM
@pconroy

Thank you for your comments.

As of today, i am going to release the off-line Oracle version for the wide audience.
So you'll be able to use it for your other accounts.

As far as it concerns your question regarding Irish, Scottish or French Reference populations - no i don't have any Irish, Scottish references in my dataset. On other hand,i do have a huge set of French populations.

North-European-Mesolithic - is a local "component" centered in ancient Mesolithic La Brana samples, Saami, Baltic and Finnic populations. That's why it has a double label "North-European-Mesolithic"
ReplyDelete
Replies
pconroySeptember 27, 2012 at 1:34 PM
Is there a URL that I can download the MDLP World-22 DIY Oracle from?

Thanks
ReplyDelete
Replies
Father NatureJanuary 23, 2013 at 1:51 AM
how come yemen jews are only 2.4% SSA according to your spreadsheet??i expected them to be much more SSA (they have the darkest skin tone among the jews in israel)

ReplyDelete
Replies
SakiusaMarch 2, 2013 at 11:09 PM
What is CEU_V?
ReplyDelete
Replies
ChrisRJune 21, 2013 at 1:11 PM
As Gedmatch has suspended the uploading of new raw data until August I'm also searching for the calculator files of World-22. Thank you in advance for sharing them.
ReplyDelete
Replies
UnknownApril 17, 2014 at 2:32 PM
What does this mean??? Please help!!!

# Population Percent
1 North-East-European 23.65
2 Atlantic_Mediterranean_Neolithic 22.23
3 Mesoamerican 17.48
4 North-Amerind 12.22
5 West-Asian 7.41
6 South-America_Amerind 5.58
7 Near_East 4.38
8 Sub-Saharian 2.89
9 Indo-Iranian 1.45
10 Samoedic 0.79
11 North-Siberean 0.62
12 Austronesian 0.34
13 Pygmy 0.3
14 Indian 0.23
15 East-Siberean 0.18
16 North-European-Mesolithic 0.18
17 Melanesian 0.09

Single Population Sharing:

# Population (source) Distance
1 Miwok (derived) 7.1
2 Mexican (derived) 12.72
3 Serrano (derived) 21.27
4 Puerto-Rican (derived) 21.91
5 Cochimi (derived) 22.73
6 Colville (derived) 24.25
7 Costanoan (derived) 24.3
8 Tsimsian (derived) 24.57
9 Romania (derived) 28
10 Ashkenazim_V (derived) 28.47
11 Gagauz (derived) 28.84
12 Bulgarian (derived) 28.87
13 Macedonian (derived) 29.57
14 Greek_South (derived) 29.7
15 Swiss (derived) 29.95
16 Greek_North (derived) 30.01
17 Montenegrin (derived) 30.09
18 Tatar_Crim (derived) 30.22
19 Aleut (derived) 30.48
20 Italian_North (derived) 30.6

Mixed Mode Population Sharing:

# Primary Population (source) Secondary Population (source) Distance
1 86% Miwok (derived) + 14% Latvian_V (derived) @ 3.49
2 86.7% Miwok (derived) + 13.3% Mordovian_V (derived) @ 3.77
3 87.6% Miwok (derived) + 12.4% Mordovian (derived) @ 3.88
4 85.1% Miwok (derived) + 14.9% Tatar_Kryashen (derived) @ 3.97
5 88.4% Miwok (derived) + 11.6% Russian_South (derived) @ 3.97
6 85.9% Miwok (derived) + 14.1% Tartar_Mishar (derived) @ 3.97
7 88.3% Miwok (derived) + 11.7% Ukrainian (derived) @ 3.98
8 87.4% Miwok (derived) + 12.6% Ukrainian_V (derived) @ 3.99
9 88.3% Miwok (derived) + 11.7% Ukrainian-East (derived) @ 4
10 87.7% Miwok (derived) + 12.3% Ukrainian-Center (derived) @ 4.04
11 89.9% Miwok (derived) + 10.1% Belarusian (derived) @ 4.04
12 88.3% Miwok (derived) + 11.7% Russian_cossack (derived) @ 4.05
13 87.3% Miwok (derived) + 12.7% Ukrainian-West (derived) @ 4.07
14 89.5% Miwok (derived) + 10.5% Russian (derived) @ 4.1
15 88.3% Miwok (derived) + 11.7% Russian_V (derived) @ 4.11
16 90.7% Miwok (derived) + 9.3% Lithuanian (derived) @ 4.13
17 92.7% Miwok (derived) + 7.3% North-East-European (ancestral) @ 4.13
18 88.4% Miwok (derived) + 11.6% Moldavian (derived) @ 4.14
19 89% Miwok (derived) + 11% Russian_Center (derived) @ 4.14
20 84.5% Miwok (derived) + 15.5% Tatar (derived) @ 4.16
ReplyDelete
Replies
rinkgMay 12, 2014 at 9:00 AM
Hi, I just ran the Oracle results from your tool and this is quite interesting: my whole family is French from Poitou but from the Ftdna results my dad and mum's origins are a little different and here is what I get when I ran the 3 raw data from my family with MDLP World 22:

My dad:

Admix Results (sorted):

# Population Percent
1 Atlantic_Mediterranean_Neolithic 43.11
2 North-East-European 40.26
3 Near_East 10.23
4 West-Asian 2.82
5 North-European-Mesolithic 1.55
6 Mesoamerican 0.78
7 East-South-Asian 0.44
8 Indo-Iranian 0.36
9 Pygmy 0.23
10 Austronesian 0.1
11 Paleo-Siberian 0.07
12 Sub-Saharian 0.04

and the 1st five to make it shorter:

Single Population Sharing:

# Population (source) Distance
1 French (derived) 7.85
2 Swiss (derived) 7.87
3 Provancestralal (derived) 8.53
4 German-South (derived) 9.28
5 Italian_North (derived) 9.46

Mixed Mode Population Sharing:

# Primary Population (source) Secondary Population (source) Distance
1 52.7% Sardinian (derived) + 47.3% Russian_Center (derived) @ 1.4
2 50.5% Sardinian (derived) + 49.5% Ukrainian-East (derived) @ 1.43
3 50.4% Sardinian (derived) + 49.6% Russian_cossack (derived) @ 1.46
4 50.5% Sardinian (derived) + 49.5% Ukrainian (derived) @ 1.62
5 51.1% Sardinian (derived) + 48.9% Russian_South (derived) @ 1.63

My mum:

Admix Results (sorted):

# Population Percent
1 North-East-European 41.25
2 Atlantic_Mediterranean_Neolithic 39.35
3 Near_East 7.85
4 West-Asian 7.47
5 North-European-Mesolithic 2.38
6 Arctic-Amerind 0.67
7 Paleo-Siberian 0.59
8 Sub-Saharian 0.22
9 Indo-Iranian 0.14
10 Melanesian 0.03
11 South-America_Amerind 0.03
12 Samoedic 0.02

Single Population Sharing:

# Population (source) Distance
1 German-South (derived) 4.48
2 Swiss (derived) 5.16
3 French (derived) 6.6
4 Montenegrin (derived) 6.72
5 Provancestralal (derived) 7.44

Mixed Mode Population Sharing:

# Primary Population (source) Secondary Population (source) Distance
1 50.2% Spaniard (derived) + 49.8% Bosnian (derived) @ 1.71
2 66.6% Spaniard (derived) + 33.4% Latvian_V (derived) @ 1.73
3 74.4% Hungarian (derived) + 25.6% Sardinian (derived) @ 1.8
4 51.3% Spaniard (derived) + 48.7% Croatian (derived) @ 1.85
5 50.3% Iberian (derived) + 49.7% Croatian (derived) @ 1.95

Looking at those results I thought that between those 2 my 1st population might be Swiss then.

Me:

Admix Results (sorted):

# Population Percent
1 Atlantic_Mediterranean_Neolithic 40.75
2 North-East-European 39.59
3 Near_East 10.89
4 West-Asian 4.75
5 North-European-Mesolithic 1.8
6 South-America_Amerind 0.67
7 Arctic-Amerind 0.51
8 Austronesian 0.29
9 Indo-Iranian 0.29
10 Paleo-Siberian 0.2
11 Sub-Saharian 0.14
12 Pygmy 0.12

Single Population Sharing:

# Population (source) Distance
1 Swiss (derived) 6.37
2 Provancestralal (derived) 7.63
3 Italian_North (derived) 7.93
4 French (derived) 8.04
5 German-South (derived) 8.33

Mixed Mode Population Sharing:

# Primary Population (source) Secondary Population (source) Distance
1 62.8% Croatian_V (derived) + 37.2% Sardinian (derived) @ 1.72
2 54.3% Ukrainian_V (derived) + 45.7% Sardinian (derived) @ 2.14
3 68.1% Slovenian (derived) + 31.9% Otzi (derived) @ 2.22
4 66% Iberian (derived) + 34% Croatian_V (derived) @ 2.27
5 86.5% Portugese (derived) + 13.5% North-East-European (ancestral) @ 2.39

So, my estimation was correct I got Swiss as 1st population, it's exactly on the other side of the country from where I was born. From what I understand my dad is more Southern than my mum who seems to be more Eastern from our local Poitou.

I tried to look at the definition of "derived" on the blog but could not find it, I understand it as an admixture of populations who settled long ago in a specific place, an example for me would be in Finald: Saami could be ancestral and other Finnish would be derived.. Am I right?

Thanks,
Karine
ReplyDelete
Replies
AnonymousAugust 29, 2016 at 5:14 PM
My results are as follows. Can anyone tell me what they mean?

Population
Pygmy -
West-Asian 8.88
North-European-Mesolithic 5.00
Indo-Tibetan -
Mesoamerican -
Arctic-Amerind -
South-America_Amerind -
Indian -
North-Siberean -
Atlantic_Mediterranean_Neolithic 33.22
Samoedic 0.84
Indo-Iranian 0.51
East-Siberean -
North-East-European 46.82
South-African -
North-Amerind 0.99
Sub-Saharian -
East-South-Asian -
Near_East 3.69
Melanesian -
Paleo-Siberian -
Austronesian -

ReplyDelete
Replies
georgSeptember 12, 2017 at 4:17 AM
Does the calculator detect those with Sephardim SNPs? I don't see Sephards listed in the table.
BTW, "Samoed" is spelt wrong. The letter Y is missing. The correct spelling is SAMOYED.
ReplyDelete
Replies
SharJanuary 8, 2018 at 5:37 PM
Does anyone here know who can explain it to us laypeople?I still don't understand how "Indo-Tibetan" and and other low percentages fit into the picture. On the commercial sites, I show up as 97% European, but on MDLP World 22 here is how it rolls out:
Population:
Pygmy -
West-Asian 20.82
North-European-Mesolithic 0.08
Indo-Tibetan 0.07
Mesoamerican -
Arctic-Amerind 1.10
South-America_Amerind 0.24
Indian 0.48
North-Siberean -
Atlantic_Mediterranean_Neolithic 33.84
Samoedic -
Indo-Iranian 0.22
East-Siberean 0.23
North-East-European 18.22
South-African -
North-Amerind -
Sub-Saharian 2.08
East-South-Asian -
Near_East 21.85
Melanesian 0.12
Paleo-Siberian -
Austronesian 0.64

Web site and contents ©Copyright 2011-2018 by GEDmatch, Inc.
Genealogy and DNA data remains the property of the submitter.
Each Admixture Proportions 'calculator' model remains the property of its developer.

68790 SNPs used in this evaluation
ReplyDelete
Replies
UnknownMarch 3, 2018 at 2:21 AM
This comment has been removed by the author.
ReplyDelete
Replies
TristanMarch 3, 2018 at 6:29 AM
Hello Project "Magnus Ducatus Lituaniae"
I need answers for this reading Exactly what am I? The reading cme up on me for this pertaining to me:

MDLP World-22 Oracle results:
Kit T022519

Admix Results (sorted):

# Population Percent
1 Sub-Saharian 82.35
2 Near_East 3.8
3 Pygmy 2.76
4 North-East-European 2.14
5 North-Amerind 1.62
6 South-America_Amerind 1.61
7 Atlantic_Mediterranean_Neolithic 1.33
8 South-African 1.12
9 Mesoamerican 1.03
10 Indo-Iranian 0.75
11 Indian 0.6
12 Indo-Tibetan 0.4
13 Samoedic 0.35
14 North-European-Mesolithic 0.13

Single Population Sharing:

# Population (source) Distance
1 Lemba (derived) 8.46
2 Bantu (derived) 11.55
3 Mandenka (derived) 19.59
4 Sub-Saharian (ancestral) 19.71
5 Yoruba (derived) 19.71
6 Biaka_Pygmies (derived) 35.6
7 Ethiopian (derived) 58.16
8 Jew-Ethiopia (derived) 61.83
9 Jew_Ethiopia (derived) 62.86
10 Lumbee (derived) 68.15
11 Moroccan (derived) 71.1
12 Mozabite (derived) 77.3
13 Yemen (derived) 77.46
14 Australian (derived) 78.96
15 Egyptian (derived) 82.56
16 Puerto-Rican (derived) 84.74
17 Jordanian (derived) 88.06
18 Ste7 (derived) 88.53
19 Miwok (derived) 90.17
20 Mexican (derived) 91.05
ReplyDelete
Replies
UnknownMarch 5, 2018 at 12:57 AM
Why are there 2 contour maps per component? One of them is for World K=12 and the other one is for World-22?
ReplyDelete
Replies
jita'ame salumaMay 14, 2018 at 6:24 AM
Population
Pygmy 0.21
West-Asian 21.92
North-European-Mesolithic 2.87
Indo-Tibetan 0.68
Mesoamerican -
Arctic-Amerind -
South-America_Amerind 0.09
Indian -
North-Siberean -
Atlantic_Mediterranean_Neolithic 30.53
Samoedic 0.54
Indo-Iranian 1.49
East-Siberean 0.74
North-East-European 25.49
South-African 0.08
North-Amerind 0.15
Sub-Saharian 1.29
East-South-Asian 0.48
Near_East 12.02
Melanesian 0.45
Paleo-Siberian 0.96
Austronesian -

so i went to oracle and my north african is Ashkenazim_V_, what the heck is that , obviously jewish , well i knew i had jewish , i knew i had afican and native american , i descend from a woman that was half native half african , she came to uk with my ancestor when slavery was over in usa, he had been deported for debt and returned with her and two of their children in 1870, he returned from Canada not usa , however i know from my dna her relatives are all found in usa , not canada , the black and the natives that is , but however relatives of theirs that are closer found in canada , children where left there , adult and married i guess , figuring girls, though some are rockley on male side so, adult son stayed ??????????, they have native and african in about the percentage as me , not all some have way higher native percent , maybe he married a native woman ???????????? , it says i have no mexican or brazilian on this but i had my dna looked at by professor macdonald he says i have all four native types , as in i have inuit , maya , brazilian and he says pima too, the mayan being mexican , pima found in AZ and mexico , he also said my percentage when you add up the other nations that make up native makes me 4 per cent , that my mother had it in both her mothers and fathers lines hence it stayed so high in mine , your still not good at native dna he says , too few agree to being tested because it could be used against them , proving them not as native as they say/thought they are , makes sense , so much they lost, what does a percentage really say ? my grand sons refuse to cut their hair and their percent is not as high as mine, i have not permitted my hair cut from 8, before then i was too young to fight it, percentage can not dictate how native you are inside only genetically , dna has taught me a lot , p.s i am a brit , so if you know what Ashkenazim_V_ is please could you reply, jita'ame suluma thats a native name gifted to me by an elder , it means wisdom eagle , basically a wise eagle , my real name is layla ozdemir , you can find me on fb
ReplyDelete
Replies
TedarionMay 19, 2018 at 4:31 PM
This comment has been removed by the author.
ReplyDelete
Replies
UnknownJanuary 17, 2019 at 9:05 AM
I downloaded my Ancestry DNA to Genesis. Using my new kit # I ran MDLP-World22 Admixture proportions. There seems to be a problem on chromosomes 14 through 22. It is showing either 0 or 1 SNPs eval. Could this be from a problem with the download of my data? Or is this a known problem? Note: It looks fine using my old kit # in GEDmatch.
ReplyDelete
Replies
Tina A CavittJanuary 29, 2019 at 12:15 AM
I am assuming central European Viking because that's what me and mom are lol
ReplyDelete
Replies
Tina A CavittJanuary 29, 2019 at 12:16 AM
ceu_v derived
ReplyDelete
Replies
Tina A CavittJanuary 29, 2019 at 12:18 AM
Using 1 population approximation:
1 German_V_derived @ 3.560286
2 Austrian_derived @ 4.214431
3 CEU_V_derived @ 4.268415
4 CEU_derived @ 5.010840
5 Welsh_derived @ 5.027669
6 British_derived @ 5.594800
7 German-South_derived @ 6.258089
8 Orcadian_derived @ 7.256489
9 Hungarian_derived @ 7.305158
10 Norwegian_V_derived @ 8.251949
11 Serbian_derived @ 8.660540
12 Slovenian_derived @ 8.909108
13 German-North_derived @ 8.939845
14 Croatian_derived @ 9.111379
15 Swedish_derived @ 9.469718
16 Bosnian_derived @ 9.705576
17 German_derived @ 9.729921
18 Montenegrin_derived @ 10.941747
19 French_derived @ 12.315708
20 Macedonian_derived @ 12.539729

Using 2 populations approximation:
1 50% CEU_V_derived +50% German-South_derived @ 1.538202

Using 3 populations approximation:
1 50% British_derived +25% Kosovar_derived +25% Russian_Center_derived @ 0.921730

Using 4 populations approximation:
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
1 British_derived + CEU_V_derived + Croatian_derived + German-South_derived @ 0.754127
2 Bosnian_derived + British_derived + French_derived + German-North_derived @ 0.776180
3 British_derived + French_derived + German-South_derived + Ukrainian-Center_derived @ 0.828698
4 CEU_derived + Croatian_derived + German-South_derived + Welsh_derived @ 0.837887
5 British_derived + French_derived + German-South_derived + Ukrainian_V_derived @ 0.848064
6 French_derived + German-North_derived + German-South_derived + Slovenian_derived @ 0.854000
7 British_derived + German-South_derived + German_V_derived + Hungarian_derived @ 0.863129
8 Bosnian_derived + CEU_derived + French_derived + German-North_derived @ 0.863738
9 Bosnian_derived + CEU_V_derived + German-South_derived + Orcadian_derived @ 0.864598
10 German-South_derived + German_V_derived + Hungarian_derived + Orcadian_derived @ 0.886464
11 British_derived + CEU_V_derived + CEU_V_derived + Montenegrin_derived @ 0.890582
12 German-North_derived + German-South_derived + German-South_derived + German_V_derived @ 0.893314
13 British_derived + Croatian_derived + French_derived + German-North_derived @ 0.904219
14 Croatian_derived + German-South_derived + German_V_derived + Orcadian_derived @ 0.915155
15 CEU_derived + Croatian_derived + French_derived + German-North_derived @ 0.918807
16 British_derived + British_derived + Kosovar_derived + Russian_Center_derived @ 0.921730
17 British_derived + French_derived + Kosovar_derived + Latvian_derived @ 0.923793
18 British_derived + German_V_derived + Serbian_derived + Welsh_derived @ 0.927894
19 CEU_V_derived + French_derived + German_V_derived + Hungarian_derived @ 0.929176
20 British_derived + CEU_V_derived + German-South_derived + Hungarian_derived @ 0.932342
that's moms results

ReplyDelete
Replies
Tina A CavittJanuary 29, 2019 at 12:25 AM
Using 1 population approximation:
1 German-South_derived @ 5.952879
2 Serbian_derived @ 8.182379
3 CEU_derived @ 8.675700
4 Montenegrin_derived @ 8.753657
5 German_V_derived @ 9.280185
6 British_derived @ 9.318875
7 Austrian_derived @ 9.701466
8 Swiss_derived @ 9.940567
9 Macedonian_derived @ 10.018495
10 CEU_V_derived @ 10.354508
11 Welsh_derived @ 10.608590
12 French_derived @ 10.916842
13 Orcadian_derived @ 10.995100
14 Hungarian_derived @ 11.246842
15 Norwegian_V_derived @ 11.735426
16 Bulgarian_derived @ 11.856838
17 Bosnian_derived @ 11.924855
18 Romania_derived @ 12.483692
19 Croatian_derived @ 12.634500
20 Provancestralal_derived @ 13.254197

Using 2 populations approximation:
1 50% Norwegian_V_derived +50% Romania_derived @ 4.307440

Using 3 populations approximation:
1 50% German_V_derived +25% German_V_derived +25% Puerto-Rican_derived @ 3.337241

Using 4 populations approximation:
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
1 British_derived + German_V_derived + Lumbee_derived + Montenegrin_derived @ 3.267666
2 German_V_derived + German_V_derived + German_V_derived + Puerto-Rican_derived @ 3.337241
3 Kosovar_derived + Lumbee_derived + Orcadian_derived + Ukrainian-Center_derived @ 3.338995
4 German-North_derived + German-North_derived + Kosovar_derived + Lumbee_derived @ 3.345200
5 CEU_derived + German_V_derived + Lumbee_derived + Montenegrin_derived @ 3.356480
6 British_derived + Kosovar_derived + Lumbee_derived + Ukrainian-Center_derived @ 3.366493
7 Bosnian_derived + CEU_derived + German_V_derived + Puerto-Rican_derived @ 3.370591
8 Bosnian_derived + British_derived + German-South_derived + Lumbee_derived @ 3.375951
9 Bosnian_derived + British_derived + German_V_derived + Puerto-Rican_derived @ 3.376987
10 British_derived + Kosovar_derived + Lumbee_derived + Polish_V_derived @ 3.394152
11 British_derived + CEU_V_derived + Lumbee_derived + Montenegrin_derived @ 3.404645
12 Belarusian_V_derived + British_derived + Kosovar_derived + Lumbee_derived @ 3.407532
13 British_derived + German_V_derived + Lumbee_derived + Macedonian_derived @ 3.407672
14 German_V_derived + Lumbee_derived + Montenegrin_derived + Orcadian_derived @ 3.415161
15 Czech_derived + Kosovar_derived + Lumbee_derived + Swedish_derived @ 3.427819
16 Bosnian_derived + French_derived + German_V_derived + Lumbee_derived @ 3.431776
17 Kosovar_derived + Lumbee_derived + Orcadian_derived + Ukrainian_V_derived @ 3.439188
18 German_derived + German-South_derived + German_V_derived + Puerto-Rican_derived @ 3.444206
19 British_derived + CEU_V_derived + Lumbee_derived + Macedonian_derived @ 3.451045
20 CEU_derived + Kosovar_derived + Lumbee_derived + Ukrainian-Center_derived @ 3.452269 mine I knew I had Viking I am assuming german_V is german Viking and of course my Melungeon dna :P lumbee indian :P
ReplyDelete
Replies
GoldieZPMay 1, 2019 at 5:27 PM
This comment has been removed by the author.
ReplyDelete
Replies
GoldieZPMay 1, 2019 at 5:28 PM
How am I to understand the population spreadsheet for MDLP World-22. I cut and paste only the Jewish portion are these 22 chromosomes for each population?

Population Pygmy West-Asian North-European-Mesolithic Indo-Tibetan Mesoamerican Arctic-Amerind South-America_Amerind Indian North-Siberean Atlantic_Mediterranean_Neolithic Samoedic Indo-Iranian East-Siberean North-East-European South-African North-Amerind Sub-Saharian East-South-Asian Near_East Melanesian Paleo-Siberian Austronesian

Jew_Algeria 0.00 22.90 0.50 0.20 0.00 0.10 0.10 0.20 0.00 37.10 0.20 0.20 0.00 5.30 0.20 0.00 3.40 0.00 28.90 0.30 0.00 0.40
Jew_Azerbaijan 0.00 47.75 0.10 0.20 0.10 0.00 0.30 1.50 0.50 21.42 0.00 0.70 0.00 0.40 0.10 0.00 0.00 0.00 26.73 0.00 0.10 0.10
Jew_Ethiopia 0.90 2.70 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.20 0.00 0.00 0.00 0.00 1.60 0.00 40.56 0.00 52.45 0.20 0.10 0.30
Jew-Ethiopia 0.70 1.10 0.10 0.60 0.00 0.10 0.00 0.30 0.00 1.20 0.50 0.00 0.00 0.00 1.00 0.00 41.50 0.00 52.00 0.50 0.00 0.40
Jew_Francestrale 0.20 21.68 0.30 0.30 0.00 0.00 0.10 0.10 0.10 38.16 0.20 0.00 0.00 10.19 0.20 0.10 0.60 0.10 26.77 0.50 0.40 0.00
Jew_Georgia 0.20 42.46 0.60 0.00 0.10 0.20 0.00 0.50 0.00 22.68 0.00 0.50 0.00 0.80 0.10 0.30 0.30 0.00 30.87 0.30 0.10 0.00
Jew_India 0.00 30.91 1.50 0.70 0.30 0.30 0.20 45.16 0.10 5.48 0.90 3.49 0.60 0.70 0.20 0.70 0.10 1.50 6.58 0.20 0.20 0.20
Jew-Iran 0.10 46.80 0.30 0.10 0.00 0.00 0.00 1.50 0.10 18.10 0.00 0.30 0.00 0.30 0.00 0.00 0.00 0.20 31.60 0.20 0.30 0.10
Jew_Iraqi 0.10 45.70 0.00 0.90 0.00 0.00 0.00 1.80 0.00 21.20 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 30.20 0.00 0.00 0.10
Jew-Iraqi 0.10 45.20 0.30 0.10 0.00 0.10 0.20 0.50 0.00 20.30 0.00 0.50 0.10 0.40 0.00 0.00 0.00 0.00 31.60 0.30 0.00 0.30
Jew_Italia 0.10 22.46 0.50 0.30 0.10 0.10 0.10 0.10 0.10 37.23 0.30 0.40 0.00 9.38 0.20 0.00 0.80 0.00 27.05 0.40 0.10 0.30
Jew_Kurd 0.00 46.45 0.60 0.20 0.10 0.30 0.10 2.50 0.20 17.18 0.30 1.00 0.20 1.40 0.00 0.00 0.00 0.10 28.87 0.20 0.10 0.20
Jew_Libya 0.70 22.32 0.70 0.40 0.00 0.00 0.00 0.00 0.00 34.13 0.00 0.00 0.00 4.20 0.50 0.00 3.40 0.00 33.23 0.00 0.40 0.00
Jew_Morocco 0.10 21.88 0.30 0.00 0.00 0.00 0.10 0.30 0.10 35.06 0.10 0.10 0.00 8.09 0.30 0.20 3.40 0.00 29.77 0.00 0.00 0.20
Jew_Romania 0.60 21.38 0.30 0.00 0.10 0.20 0.20 0.00 0.00 33.27 0.00 0.40 1.40 20.88 0.10 0.20 0.00 0.30 20.08 0.60 0.00 0.00
Jew_Syria 0.30 31.70 0.00 0.00 0.10 0.10 0.10 0.10 0.00 30.90 0.00 0.80 0.00 4.70 0.00 0.00 0.90 0.00 29.60 0.50 0.00 0.20
Jew_Tat 0.00 48.80 0.50 0.40 0.00 0.00 0.10 1.30 0.30 19.20 0.40 0.40 0.00 1.40 0.00 0.10 0.00 0.00 26.60 0.10 0.10 0.30
Jew_Tunisia 0.10 23.40 0.00 0.20 0.20 0.10 0.30 0.00 0.30 34.50 0.10 0.00 0.00 6.00 0.40 0.00 2.50 0.00 31.70 0.10 0.00 0.10
Jew-Uzbekistan 0.00 46.70 0.00 0.20 0.20 0.00 0.30 2.70 0.00 17.90 0.70 0.80 0.50 3.00 0.00 0.00 0.00 0.00 26.40 0.60 0.00 0.00
Jew_Yemen 0.30 22.42 0.00 0.10 0.00 0.00 0.00 0.10 0.00 13.41 0.00 0.00 0.00 0.00 0.70 0.00 1.70 0.00 60.96 0.10 0.00 0.20
ReplyDelete
Replies
UnknownSeptember 7, 2019 at 11:00 AM
Where can I find the information on what final populations were used as references for the results?
ReplyDelete
Replies
MMSeptember 16, 2019 at 10:51 PM
Hello, I was wondering when you are going to update your blog and also, I am new to reading the painting on Eurogenes K13 and MDLP World-22. I have Native American Blood and I have been trying (without success) to correctly read the paintings on the above to GEDmatch tools.. Could you please help, I am fairly new to this.
ReplyDelete
Replies
UnknownDecember 1, 2021 at 12:14 AM
I'm interested - what "old" ancestor's have you used in Rus region? And, if i understand right - you took 10 respondents from Ukraine, 10 from Russia and 10 from Belarus and than analysed their genom?
ReplyDelete
Replies
tzintzuntzan irichekwaNovember 13, 2023 at 8:20 AM
Does anyone know what Mexican_CV refers to in the population section?
ReplyDelete
Replies
EbeneezerJuly 13, 2024 at 9:08 AM
I'm unable to access GEDMatch right now. Is it possible for me to find out the exact ethnic groups that plot closest to South Amerind and North Amerind on the MDLP World 22 maps? Thank you.
ReplyDelete
Replies

Add comment

Magnus Ducatus Lituaniae Project

My Blog List

Saturday, September 15, 2012

Behind the Curtains: MDLP World 22 showcase

34 comments:

Blog Archive

Followers