Saturday, September 15, 2012

Behind the Curtains: MDLP World 22 showcase


Preliminary remarks

As you all may know, the MDLP  blog hasn't been updated since February 2012.
Half of year ago i promised myself that i would stop writing new posts on the MDLP blog before i'll finally get my scientific report on blog written.  Since I had to prioritize the completion of a scientific paper over the routine of blog posting,I was unable to continue updating the blog on a regular basis due to a lack of time, and had to make a change in how I conducted my research. So i decided to abstain from posting on the MDLP blog for a couple of month, being focused on more important matters. Despite of all limitations, i kept  working secretly on the MDLP project, collecting necessary data and performing different 'genomic' experiments in order to achieve my final goal (publishing of paper).  The results of secret experiments with new genomic samples and tools eventually leaked to the curious public, spawning immense interest in my project. After releasing a new version of my own modification of DIYDodecad calculator on Gedmatch.com, i was literally flooded by emails from Gedmatch.com users asking me  questions they wanted me to answer.

I understood the strategical mistake of releasing poorly documented data/analysis on Internet  and felt obliged  to explain details. Obviously, i will start new series of  the blog posts by covering the project feature the people most interested in, i.e the MDLP World22 calculator.
 
The population dataset of MDLP World22 calculator.

The reference population dataset of the calculator was assembled in PLINK by intersecting and thinning the samples from different data sources: HapMap 3 (the filtered dataset CEU,YRI,JPT,CHB), 1000genomes, Rasmussen et al. (2010)HGDP (Stanford) (all populations)Metspalu et al. (2011),Yunusbayev et al. (2011), Chaubey et al. (2010) etc. Furthermore i handpicked random 10 individuals from each European country panel in POPRES dataset, or the maximum number of individuals available otherwise, to select the POPRES European individuals to be included in our study. Finally, in order to evaluate the correlation between the modern and the ancient genetic diversity, i have also included ancient DNA genomic samples of Ötzi,(Keller et al.(2012)) Swedish Neolithic samples Gök4, Ajv52, Ajv70, Ire8, Ste7 (Skoglund et al. (2012)) and 2 La Braña individuals from the Mesolithic sites of the Iberian Peninsula (Sánchez-Quinto et al.(2012)). Then i added 90 samples of individuals-participants of our MDLP project.  After merging the aforementioned datasets and thinning the SNP set with PLINK command to exclude SNPs with missing rates greater than 1% and minor alleles, i filtered out duplicates, the individuals with high pairwise IBD-sharing  (estimated in Plink as as the average fraction of alleles shared between two individuals over all loci) and the individuals with kinship coefficient suggesting relatedness (kinship coefficients were estimated in KING software). Also i had to filter out individuals with more more tham 3 standard deviations from the population averages. Since kinship coefficient is robustly estimated by HWE (Hary-Weinberg expectations) among SNPs with the same underlying allele frequencies, SNPs showing strong deviation (p < 5.5 x10−8) from Hardy-Weinberg expectations were removed from the merged and filtered dataset. After that I filtered to keep the list of common SNPs present in Illumina/Affymetrix chips and performed  linkage disequilibrium based pruning using a window size of 50, a step of 5 and r^2 threshold of 0.3.

This complex sequence of consequent operations with the initial reference and project datasets yielded a final dataset which included 80751 SNPs  in 2516 individuals from 225 populations.


ADMIXTURE analysis


 As always, the final dataset in PLINK linked format was further processed in ADMIXTURE software. Sketching the plan for the design of ADMIXTURE test,  i had to face the difficult problem: as it has been shown in (Patterson et al.2006) the number of markers needed to resolve populations in ADMIXTURE analysis is inversely proportional to the genetic distance (Fst ) betweeen the populations. According to ADMIXTURE best practice, it is believed that 10,000 markers  are suffice to perform GWAS correction for continentally separated populations (for example, African, Asian, and European populations FST > .05) while more like 100,000 markers are necessary when the populations are within a continent (Europe, for instance, FST < 0.01).
To increase the accuracy of ADMIXTURE results i decided to use a method proposed by  Dienekes' for converting allele frequencies into 'synthetic individuals'(see also Zack's example). The idea is fairly simple: run an unsupervised ADMIXTURE analysis once to generate allele frequencies for your K ancestral components; then generate zombie populations using these allele frequencies; whenever you want to estimate admixture proportions in new samples run supervised ADMIXTURE analysis using the zombie populations.  Like any genome blogger engaged in the task of evaluating admixtures in samples, i must grapple with obvious question of the reliability of this approach.  Although i am aware of  methodological controversies in using simulated individuals, i would rather concur with Dienekes who considered "synthetic individuals" the best abstract proxies for the ancient ancestral populations. But my purpose is served if i can use the approach used by Dienekes and Zack to obtain meaningful results. To begin with, i routinely ran unsupervised ADMIXTURE  K=22 analysis (assuming 22 ancestral populations) which yielded the admixture proportions of individuals from these K populations, as well as the allele frequencies for all SNPs for each of 22 ancestral populations (below are conventional names for each of inferred components in order of appearance):

Pygmy
West-Asian
North-European-Mesolithic
Tibetan
Mesomerican
Arctic-Amerind
South-America_Amerind
Indian
North-Siberean
Atlantic_Mediterranean_Neolithic
Samoedic
Proto-Indo-Iranian
East-Siberean
North-East-European
South-African
North-Amerind
Sub-Saharian
East-South-Asian
Near_East
Melanesian
Paleo-Siberean
Austronesian
Therefore i took the allele frequencies which were computed earlier in unsupervised Admixture K=22 for the merged dataset, pooled them into PLINK and generated 10 "synthetic individuals per ancestral component) using PLINK command --simulate.  When the simulation had been  finished, i visualized the distance between simulated individuals using multi-dimensional scaling:



 

As a next step,i included simulated individuals you have as part of a new reference population (including 220 simulated individuals in 22 simulated populations).Then, I ran ADMIXTURE anew, this time in “supervised” mode for K = 22 (with simulated individuals being 'reference' individuals). The Admixture K=22 converged in 31 iterations (37773.1 sec) with final loglikelihood:-188032005.430318 (below are Fst divergences between estimated 'ancestral' populations):


The Fst distance/divergence matrix was used for inferring a most probable NJ-based topology of component distance tree (outgroup: South-African):



 The individual 'supervised' ADMIXTURE  results (in Excel spreadsheet) for the project participants have been uploaded to GoogleDocs (please note that the average results for reference populations is also available on special request).

MDLP World22 DIYcalculator

The output files of Admixture K=22 supervised run (average values of admixture coefficients in reference populations and FsT values)  were used for designing a new version of  the MDLP DIYcalculator, which is better known by its codename "World22" (online version is available in AdMix-Utilities section of Gedmatch under MDLP project). MDLP DIYcalculator itself is based on the code of Dodecad DIY calculator (c)ourtesy of Dienekes Pontikos and was developed as part of the Dodecad Ancestry Project. In its Gedmatch implementation MDLP 'World22' DIYcalculator is paired by MDLP 'World22' Oracle, also based on Dienekes' and Zack's code (Harappa/DodecadOracle). The 'Oracle' is designed to find in a single population mode your closest (closest in terms of similarity) population from MDLP ''Word22' admixture results. In a mixed mode, Oracle considers all pairs of populations, and for each one of them calculates the minimum Fst-weighted distance to the sample in consideration, and the admixture proportions that produce it.

Please notice: 'ancestral' populations (i.e 'simulated populations' from the previous step - see above) are labeled in Oracle results as (anc), while the 'real world' modern and ancient populations are marked as "derived".

If you have troubles with understanding/interpreting the results of Oracle and DIYcalculcator, please consult the corresponding topics on Dodecad and HarappaWorld blogs. It is not of avail to repeat in this blog everything they wrote in their own blogs.


What the heck are MDLP Word-22 components?

 One of those questions that i usually keep getting in emails is what do the various reference populations and ancestral components for my World K=12 and World-22 analyses mean. I've already provided hints to the answer  earlier, but - as old Chinese proverb says - one picture is worth ten thousand words. That's why i decided to display the admixture coefficients spatially on the globe surface. Following Francois Olivier, who proposed to use the graphical library of the statistical software R to display  spatial interpolates of the admixture coefficients (Q matrix) in two dimensions (where spatial coordinates are recorded as longitude and latitude), i created  2 contour maps per component.

Pygmy (modal in Biaka and Mbuti population)



West-Asian (bimodal component with peaks in Caucasian populations and south-western part of Iran, equal to Dienekes' Caucasian/Gedrosia component) 



 North-European-Mesolithic (local component with peaks in European Mesolithic samples of La_Brana and  modern North-European Saami population).


 Tibetan (Indo-Burmese) component (Himalay, Tibet)


Mesomerican (major genetic component in Native Americans from Mesoamerica)



North-Amerind (the 'native' component in North American Natives)




South-Amerind (the 'native' component in South American Natives)






  Atlantic-Mediterranean-Neolithic (the main genetic component  in Western and South-Western Europe)



  The rest contour maps for all components could be downloaded here.






   

34 comments:

  1. Vadim,

    Fantastic work - I can't wait to see my family's results!!!

    Are you only providing the World22 and Oracle on Gedmatch, or is there a link to download them also?

    I have one account on Gedmatch - my own - but have 9 other accounts, many of them children. I would like to be able to download the files and run them directly in R, is that possible?

    ReplyDelete
  2. I'm Irish - all 8 G-Grandparents are Irish, and here's what I get:


    Admix Results (sorted):

    # Population Percent
    1 North-East-European 48.89
    2 Atlantic_Mediterranean_Neolithic 33.55
    3 West-Asian 7.29
    4 North-European-Mesolithic 6.41
    5 Indo-Iranian 2.74
    6 Indian 0.55
    7 South-America_Amerind 0.27
    8 South-African 0.14
    9 Indo-Tibetan 0.09
    10 Pygmy 0.04
    11 Near_East 0.02


    Single Population Sharing:

    # Population (source) Distance
    1 CEU_V (derived) 3.39
    2 German_V (derived) 3.67
    3 Welsh (derived) 4.41
    4 CEU (derived) 4.92
    5 Austrian (derived) 5.16
    6 Norwegian_V (derived) 5.23
    7 British (derived) 5.92
    8 Swedish (derived) 6.11
    9 German-North (derived) 6.34
    10 Orcadian (derived) 6.84
    11 Hungarian (derived) 7.03
    12 German (derived) 7.26
    13 Slovenian (derived) 7.96
    14 Swedish_V (derived) 8.61
    15 German-South (derived) 8.61
    16 Croatian (derived) 9.19
    17 Bosnian (derived) 9.99
    18 Serbian (derived) 10.43
    19 Czech (derived) 10.76
    20 Croatian_V (derived) 12.2


    Mixed Mode Population Sharing:


    # Primary Population (source) Secondary Population (source) Distance
    1 61% German_V (derived) + 39% Norwegian_V (derived) @ 1.98
    2 81% CEU (derived) + 19% Latvian_V (derived) @ 2.19
    3 61.4% CEU (derived) + 38.6% German (derived) @ 2.38
    4 55.9% British (derived) + 44.1% German (derived) @ 2.41
    5 81.5% CEU (derived) + 18.5% Ukrainian-Center (derived) @ 2.43
    6 81.1% CEU (derived) + 18.9% Ukrainian_V (derived) @ 2.47
    7 77.7% British (derived) + 22.3% Latvian_V (derived) @ 2.5
    8 84.6% CEU (derived) + 15.4% Mordovian (derived) @ 2.51
    9 93.7% Welsh (derived) + 6.3% Avar (derived) @ 2.52
    10 96% CEU_V (derived) + 4% Lak (derived) @ 2.58
    11 96% CEU_V (derived) + 4% Tabassaran (derived) @ 2.6
    12 82.3% CEU (derived) + 17.7% Mordovian_V (derived) @ 2.6
    13 96.3% CEU_V (derived) + 3.7% Pashtun (derived) @ 2.6
    14 84% CEU (derived) + 16% Ukrainian-East (derived) @ 2.6
    15 93.6% Welsh (derived) + 6.4% Tabassaran (derived) @ 2.6
    16 96.3% CEU_V (derived) + 3.7% Ossetian (derived) @ 2.61
    17 97.8% CEU_V (derived) + 2.2% West-Asian (ancestral) @ 2.63
    18 68.1% Norwegian_V (derived) + 31.9% Bosnian (derived) @ 2.63
    19 96.2% CEU_V (derived) + 3.8% Avar (derived) @ 2.63
    20 81.4% British (derived) + 18.6% Mordovian (derived) @ 2.63

    Question: Do you have Irish, Scottish or French Reference populations?

    ReplyDelete
    Replies
    1. Your post is older, but how do you find the Mixed Mode Population sharing? I have quite a mix and would like to see a better breakdown :) Thanks

      Delete
  3. One other thing, it would be interesting to see on what Chromosome segments I am:

    North-European-Mesolithic 6.41

    Also what are the rank order of North-European-Mesolithic in terms of various populations? I guess based on your map, Saami would be #1, and maybe Finns, Estonians and Latvians would be #2, and then maybe Scandinavians in general as #3 ??

    ReplyDelete
  4. @pconroy

    Thank you for your comments.

    As of today, i am going to release the off-line Oracle version for the wide audience.
    So you'll be able to use it for your other accounts.

    As far as it concerns your question regarding Irish, Scottish or French Reference populations - no i don't have any Irish, Scottish references in my dataset. On other hand,i do have a huge set of French populations.

    North-European-Mesolithic - is a local "component" centered in ancient Mesolithic La Brana samples, Saami, Baltic and Finnic populations. That's why it has a double label "North-European-Mesolithic"

    ReplyDelete
  5. Is there a URL that I can download the MDLP World-22 DIY Oracle from?

    Thanks

    ReplyDelete
    Replies
    1. Yes, i'll let you know when i'll publish the MDLP World-22 DIY Oracle online

      Delete
  6. how come yemen jews are only 2.4% SSA according to your spreadsheet??i expected them to be much more SSA (they have the darkest skin tone among the jews in israel)


    ReplyDelete
  7. As Gedmatch has suspended the uploading of new raw data until August I'm also searching for the calculator files of World-22. Thank you in advance for sharing them.

    ReplyDelete
  8. What does this mean??? Please help!!!


    # Population Percent
    1 North-East-European 23.65
    2 Atlantic_Mediterranean_Neolithic 22.23
    3 Mesoamerican 17.48
    4 North-Amerind 12.22
    5 West-Asian 7.41
    6 South-America_Amerind 5.58
    7 Near_East 4.38
    8 Sub-Saharian 2.89
    9 Indo-Iranian 1.45
    10 Samoedic 0.79
    11 North-Siberean 0.62
    12 Austronesian 0.34
    13 Pygmy 0.3
    14 Indian 0.23
    15 East-Siberean 0.18
    16 North-European-Mesolithic 0.18
    17 Melanesian 0.09

    Single Population Sharing:

    # Population (source) Distance
    1 Miwok (derived) 7.1
    2 Mexican (derived) 12.72
    3 Serrano (derived) 21.27
    4 Puerto-Rican (derived) 21.91
    5 Cochimi (derived) 22.73
    6 Colville (derived) 24.25
    7 Costanoan (derived) 24.3
    8 Tsimsian (derived) 24.57
    9 Romania (derived) 28
    10 Ashkenazim_V (derived) 28.47
    11 Gagauz (derived) 28.84
    12 Bulgarian (derived) 28.87
    13 Macedonian (derived) 29.57
    14 Greek_South (derived) 29.7
    15 Swiss (derived) 29.95
    16 Greek_North (derived) 30.01
    17 Montenegrin (derived) 30.09
    18 Tatar_Crim (derived) 30.22
    19 Aleut (derived) 30.48
    20 Italian_North (derived) 30.6

    Mixed Mode Population Sharing:

    # Primary Population (source) Secondary Population (source) Distance
    1 86% Miwok (derived) + 14% Latvian_V (derived) @ 3.49
    2 86.7% Miwok (derived) + 13.3% Mordovian_V (derived) @ 3.77
    3 87.6% Miwok (derived) + 12.4% Mordovian (derived) @ 3.88
    4 85.1% Miwok (derived) + 14.9% Tatar_Kryashen (derived) @ 3.97
    5 88.4% Miwok (derived) + 11.6% Russian_South (derived) @ 3.97
    6 85.9% Miwok (derived) + 14.1% Tartar_Mishar (derived) @ 3.97
    7 88.3% Miwok (derived) + 11.7% Ukrainian (derived) @ 3.98
    8 87.4% Miwok (derived) + 12.6% Ukrainian_V (derived) @ 3.99
    9 88.3% Miwok (derived) + 11.7% Ukrainian-East (derived) @ 4
    10 87.7% Miwok (derived) + 12.3% Ukrainian-Center (derived) @ 4.04
    11 89.9% Miwok (derived) + 10.1% Belarusian (derived) @ 4.04
    12 88.3% Miwok (derived) + 11.7% Russian_cossack (derived) @ 4.05
    13 87.3% Miwok (derived) + 12.7% Ukrainian-West (derived) @ 4.07
    14 89.5% Miwok (derived) + 10.5% Russian (derived) @ 4.1
    15 88.3% Miwok (derived) + 11.7% Russian_V (derived) @ 4.11
    16 90.7% Miwok (derived) + 9.3% Lithuanian (derived) @ 4.13
    17 92.7% Miwok (derived) + 7.3% North-East-European (ancestral) @ 4.13
    18 88.4% Miwok (derived) + 11.6% Moldavian (derived) @ 4.14
    19 89% Miwok (derived) + 11% Russian_Center (derived) @ 4.14
    20 84.5% Miwok (derived) + 15.5% Tatar (derived) @ 4.16

    ReplyDelete
  9. Hi, I just ran the Oracle results from your tool and this is quite interesting: my whole family is French from Poitou but from the Ftdna results my dad and mum's origins are a little different and here is what I get when I ran the 3 raw data from my family with MDLP World 22:

    My dad:

    Admix Results (sorted):

    # Population Percent
    1 Atlantic_Mediterranean_Neolithic 43.11
    2 North-East-European 40.26
    3 Near_East 10.23
    4 West-Asian 2.82
    5 North-European-Mesolithic 1.55
    6 Mesoamerican 0.78
    7 East-South-Asian 0.44
    8 Indo-Iranian 0.36
    9 Pygmy 0.23
    10 Austronesian 0.1
    11 Paleo-Siberian 0.07
    12 Sub-Saharian 0.04

    and the 1st five to make it shorter:

    Single Population Sharing:

    # Population (source) Distance
    1 French (derived) 7.85
    2 Swiss (derived) 7.87
    3 Provancestralal (derived) 8.53
    4 German-South (derived) 9.28
    5 Italian_North (derived) 9.46

    Mixed Mode Population Sharing:

    # Primary Population (source) Secondary Population (source) Distance
    1 52.7% Sardinian (derived) + 47.3% Russian_Center (derived) @ 1.4
    2 50.5% Sardinian (derived) + 49.5% Ukrainian-East (derived) @ 1.43
    3 50.4% Sardinian (derived) + 49.6% Russian_cossack (derived) @ 1.46
    4 50.5% Sardinian (derived) + 49.5% Ukrainian (derived) @ 1.62
    5 51.1% Sardinian (derived) + 48.9% Russian_South (derived) @ 1.63

    My mum:

    Admix Results (sorted):

    # Population Percent
    1 North-East-European 41.25
    2 Atlantic_Mediterranean_Neolithic 39.35
    3 Near_East 7.85
    4 West-Asian 7.47
    5 North-European-Mesolithic 2.38
    6 Arctic-Amerind 0.67
    7 Paleo-Siberian 0.59
    8 Sub-Saharian 0.22
    9 Indo-Iranian 0.14
    10 Melanesian 0.03
    11 South-America_Amerind 0.03
    12 Samoedic 0.02

    Single Population Sharing:

    # Population (source) Distance
    1 German-South (derived) 4.48
    2 Swiss (derived) 5.16
    3 French (derived) 6.6
    4 Montenegrin (derived) 6.72
    5 Provancestralal (derived) 7.44

    Mixed Mode Population Sharing:

    # Primary Population (source) Secondary Population (source) Distance
    1 50.2% Spaniard (derived) + 49.8% Bosnian (derived) @ 1.71
    2 66.6% Spaniard (derived) + 33.4% Latvian_V (derived) @ 1.73
    3 74.4% Hungarian (derived) + 25.6% Sardinian (derived) @ 1.8
    4 51.3% Spaniard (derived) + 48.7% Croatian (derived) @ 1.85
    5 50.3% Iberian (derived) + 49.7% Croatian (derived) @ 1.95

    Looking at those results I thought that between those 2 my 1st population might be Swiss then.

    Me:

    Admix Results (sorted):

    # Population Percent
    1 Atlantic_Mediterranean_Neolithic 40.75
    2 North-East-European 39.59
    3 Near_East 10.89
    4 West-Asian 4.75
    5 North-European-Mesolithic 1.8
    6 South-America_Amerind 0.67
    7 Arctic-Amerind 0.51
    8 Austronesian 0.29
    9 Indo-Iranian 0.29
    10 Paleo-Siberian 0.2
    11 Sub-Saharian 0.14
    12 Pygmy 0.12

    Single Population Sharing:

    # Population (source) Distance
    1 Swiss (derived) 6.37
    2 Provancestralal (derived) 7.63
    3 Italian_North (derived) 7.93
    4 French (derived) 8.04
    5 German-South (derived) 8.33

    Mixed Mode Population Sharing:

    # Primary Population (source) Secondary Population (source) Distance
    1 62.8% Croatian_V (derived) + 37.2% Sardinian (derived) @ 1.72
    2 54.3% Ukrainian_V (derived) + 45.7% Sardinian (derived) @ 2.14
    3 68.1% Slovenian (derived) + 31.9% Otzi (derived) @ 2.22
    4 66% Iberian (derived) + 34% Croatian_V (derived) @ 2.27
    5 86.5% Portugese (derived) + 13.5% North-East-European (ancestral) @ 2.39

    So, my estimation was correct I got Swiss as 1st population, it's exactly on the other side of the country from where I was born. From what I understand my dad is more Southern than my mum who seems to be more Eastern from our local Poitou.

    I tried to look at the definition of "derived" on the blog but could not find it, I understand it as an admixture of populations who settled long ago in a specific place, an example for me would be in Finald: Saami could be ancestral and other Finnish would be derived.. Am I right?

    Thanks,
    Karine

    ReplyDelete
  10. My results are as follows. Can anyone tell me what they mean?


    Population
    Pygmy -
    West-Asian 8.88
    North-European-Mesolithic 5.00
    Indo-Tibetan -
    Mesoamerican -
    Arctic-Amerind -
    South-America_Amerind -
    Indian -
    North-Siberean -
    Atlantic_Mediterranean_Neolithic 33.22
    Samoedic 0.84
    Indo-Iranian 0.51
    East-Siberean -
    North-East-European 46.82
    South-African -
    North-Amerind 0.99
    Sub-Saharian -
    East-South-Asian -
    Near_East 3.69
    Melanesian -
    Paleo-Siberian -
    Austronesian -








    ReplyDelete
    Replies
    1. 1. "One Drop Rule": If you have one SNP Samoyed, then you are Samoyed.
      2. Your genome is 0.84% Samoyed
      3. Therefore, you're Samoyed.

      Delete
  11. Does the calculator detect those with Sephardim SNPs? I don't see Sephards listed in the table.
    BTW, "Samoed" is spelt wrong. The letter Y is missing. The correct spelling is SAMOYED.

    ReplyDelete
    Replies
    1. I can say say you can export your raw data to my heritage and my family tree DNA, both found Sephardic/MaghrebI Jewish DNA (because Sephardic doesn't really mean spanish as from where the population came from (search maghreb Jews or north African jews for more info), there might be a difference if they split it further but it isn't shown.but when I exported the data into gedmatch it got matched differently, and matched reference populations of Tunisian Jews, Moroccan Jews, etc. I'm not sure how specific it can be since Jews cluster with other Jews, except for Ethiopian & Indian which are kind of outliers, they match but they aren't as close. Tl;dr: try myheritage, ftldna, etc too.

      Delete
  12. Does anyone here know who can explain it to us laypeople?I still don't understand how "Indo-Tibetan" and and other low percentages fit into the picture. On the commercial sites, I show up as 97% European, but on MDLP World 22 here is how it rolls out:
    Population:
    Pygmy -
    West-Asian 20.82
    North-European-Mesolithic 0.08
    Indo-Tibetan 0.07
    Mesoamerican -
    Arctic-Amerind 1.10
    South-America_Amerind 0.24
    Indian 0.48
    North-Siberean -
    Atlantic_Mediterranean_Neolithic 33.84
    Samoedic -
    Indo-Iranian 0.22
    East-Siberean 0.23
    North-East-European 18.22
    South-African -
    North-Amerind -
    Sub-Saharian 2.08
    East-South-Asian -
    Near_East 21.85
    Melanesian 0.12
    Paleo-Siberian -
    Austronesian 0.64











    Web site and contents ©Copyright 2011-2018 by GEDmatch, Inc.
    Genealogy and DNA data remains the property of the submitter.
    Each Admixture Proportions 'calculator' model remains the property of its developer.

    68790 SNPs used in this evaluation

    ReplyDelete
  13. This comment has been removed by the author.

    ReplyDelete
  14. Hello Project "Magnus Ducatus Lituaniae"
    I need answers for this reading Exactly what am I? The reading cme up on me for this pertaining to me:


    MDLP World-22 Oracle results:
    Kit T022519

    Admix Results (sorted):

    # Population Percent
    1 Sub-Saharian 82.35
    2 Near_East 3.8
    3 Pygmy 2.76
    4 North-East-European 2.14
    5 North-Amerind 1.62
    6 South-America_Amerind 1.61
    7 Atlantic_Mediterranean_Neolithic 1.33
    8 South-African 1.12
    9 Mesoamerican 1.03
    10 Indo-Iranian 0.75
    11 Indian 0.6
    12 Indo-Tibetan 0.4
    13 Samoedic 0.35
    14 North-European-Mesolithic 0.13

    Single Population Sharing:

    # Population (source) Distance
    1 Lemba (derived) 8.46
    2 Bantu (derived) 11.55
    3 Mandenka (derived) 19.59
    4 Sub-Saharian (ancestral) 19.71
    5 Yoruba (derived) 19.71
    6 Biaka_Pygmies (derived) 35.6
    7 Ethiopian (derived) 58.16
    8 Jew-Ethiopia (derived) 61.83
    9 Jew_Ethiopia (derived) 62.86
    10 Lumbee (derived) 68.15
    11 Moroccan (derived) 71.1
    12 Mozabite (derived) 77.3
    13 Yemen (derived) 77.46
    14 Australian (derived) 78.96
    15 Egyptian (derived) 82.56
    16 Puerto-Rican (derived) 84.74
    17 Jordanian (derived) 88.06
    18 Ste7 (derived) 88.53
    19 Miwok (derived) 90.17
    20 Mexican (derived) 91.05

    ReplyDelete
  15. Why are there 2 contour maps per component? One of them is for World K=12 and the other one is for World-22?

    ReplyDelete
  16. Population
    Pygmy 0.21
    West-Asian 21.92
    North-European-Mesolithic 2.87
    Indo-Tibetan 0.68
    Mesoamerican -
    Arctic-Amerind -
    South-America_Amerind 0.09
    Indian -
    North-Siberean -
    Atlantic_Mediterranean_Neolithic 30.53
    Samoedic 0.54
    Indo-Iranian 1.49
    East-Siberean 0.74
    North-East-European 25.49
    South-African 0.08
    North-Amerind 0.15
    Sub-Saharian 1.29
    East-South-Asian 0.48
    Near_East 12.02
    Melanesian 0.45
    Paleo-Siberian 0.96
    Austronesian -

    so i went to oracle and my north african is Ashkenazim_V_, what the heck is that , obviously jewish , well i knew i had jewish , i knew i had afican and native american , i descend from a woman that was half native half african , she came to uk with my ancestor when slavery was over in usa, he had been deported for debt and returned with her and two of their children in 1870, he returned from Canada not usa , however i know from my dna her relatives are all found in usa , not canada , the black and the natives that is , but however relatives of theirs that are closer found in canada , children where left there , adult and married i guess , figuring girls, though some are rockley on male side so, adult son stayed ??????????, they have native and african in about the percentage as me , not all some have way higher native percent , maybe he married a native woman ???????????? , it says i have no mexican or brazilian on this but i had my dna looked at by professor macdonald he says i have all four native types , as in i have inuit , maya , brazilian and he says pima too, the mayan being mexican , pima found in AZ and mexico , he also said my percentage when you add up the other nations that make up native makes me 4 per cent , that my mother had it in both her mothers and fathers lines hence it stayed so high in mine , your still not good at native dna he says , too few agree to being tested because it could be used against them , proving them not as native as they say/thought they are , makes sense , so much they lost, what does a percentage really say ? my grand sons refuse to cut their hair and their percent is not as high as mine, i have not permitted my hair cut from 8, before then i was too young to fight it, percentage can not dictate how native you are inside only genetically , dna has taught me a lot , p.s i am a brit , so if you know what Ashkenazim_V_ is please could you reply, jita'ame suluma thats a native name gifted to me by an elder , it means wisdom eagle , basically a wise eagle , my real name is layla ozdemir , you can find me on fb

    ReplyDelete
  17. This comment has been removed by the author.

    ReplyDelete
  18. I downloaded my Ancestry DNA to Genesis. Using my new kit # I ran MDLP-World22 Admixture proportions. There seems to be a problem on chromosomes 14 through 22. It is showing either 0 or 1 SNPs eval. Could this be from a problem with the download of my data? Or is this a known problem? Note: It looks fine using my old kit # in GEDmatch.

    ReplyDelete
  19. I am assuming central European Viking because that's what me and mom are lol

    ReplyDelete
  20. Using 1 population approximation:
    1 German_V_derived @ 3.560286
    2 Austrian_derived @ 4.214431
    3 CEU_V_derived @ 4.268415
    4 CEU_derived @ 5.010840
    5 Welsh_derived @ 5.027669
    6 British_derived @ 5.594800
    7 German-South_derived @ 6.258089
    8 Orcadian_derived @ 7.256489
    9 Hungarian_derived @ 7.305158
    10 Norwegian_V_derived @ 8.251949
    11 Serbian_derived @ 8.660540
    12 Slovenian_derived @ 8.909108
    13 German-North_derived @ 8.939845
    14 Croatian_derived @ 9.111379
    15 Swedish_derived @ 9.469718
    16 Bosnian_derived @ 9.705576
    17 German_derived @ 9.729921
    18 Montenegrin_derived @ 10.941747
    19 French_derived @ 12.315708
    20 Macedonian_derived @ 12.539729

    Using 2 populations approximation:
    1 50% CEU_V_derived +50% German-South_derived @ 1.538202


    Using 3 populations approximation:
    1 50% British_derived +25% Kosovar_derived +25% Russian_Center_derived @ 0.921730


    Using 4 populations approximation:
    +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
    1 British_derived + CEU_V_derived + Croatian_derived + German-South_derived @ 0.754127
    2 Bosnian_derived + British_derived + French_derived + German-North_derived @ 0.776180
    3 British_derived + French_derived + German-South_derived + Ukrainian-Center_derived @ 0.828698
    4 CEU_derived + Croatian_derived + German-South_derived + Welsh_derived @ 0.837887
    5 British_derived + French_derived + German-South_derived + Ukrainian_V_derived @ 0.848064
    6 French_derived + German-North_derived + German-South_derived + Slovenian_derived @ 0.854000
    7 British_derived + German-South_derived + German_V_derived + Hungarian_derived @ 0.863129
    8 Bosnian_derived + CEU_derived + French_derived + German-North_derived @ 0.863738
    9 Bosnian_derived + CEU_V_derived + German-South_derived + Orcadian_derived @ 0.864598
    10 German-South_derived + German_V_derived + Hungarian_derived + Orcadian_derived @ 0.886464
    11 British_derived + CEU_V_derived + CEU_V_derived + Montenegrin_derived @ 0.890582
    12 German-North_derived + German-South_derived + German-South_derived + German_V_derived @ 0.893314
    13 British_derived + Croatian_derived + French_derived + German-North_derived @ 0.904219
    14 Croatian_derived + German-South_derived + German_V_derived + Orcadian_derived @ 0.915155
    15 CEU_derived + Croatian_derived + French_derived + German-North_derived @ 0.918807
    16 British_derived + British_derived + Kosovar_derived + Russian_Center_derived @ 0.921730
    17 British_derived + French_derived + Kosovar_derived + Latvian_derived @ 0.923793
    18 British_derived + German_V_derived + Serbian_derived + Welsh_derived @ 0.927894
    19 CEU_V_derived + French_derived + German_V_derived + Hungarian_derived @ 0.929176
    20 British_derived + CEU_V_derived + German-South_derived + Hungarian_derived @ 0.932342
    that's moms results

    ReplyDelete
  21. Using 1 population approximation:
    1 German-South_derived @ 5.952879
    2 Serbian_derived @ 8.182379
    3 CEU_derived @ 8.675700
    4 Montenegrin_derived @ 8.753657
    5 German_V_derived @ 9.280185
    6 British_derived @ 9.318875
    7 Austrian_derived @ 9.701466
    8 Swiss_derived @ 9.940567
    9 Macedonian_derived @ 10.018495
    10 CEU_V_derived @ 10.354508
    11 Welsh_derived @ 10.608590
    12 French_derived @ 10.916842
    13 Orcadian_derived @ 10.995100
    14 Hungarian_derived @ 11.246842
    15 Norwegian_V_derived @ 11.735426
    16 Bulgarian_derived @ 11.856838
    17 Bosnian_derived @ 11.924855
    18 Romania_derived @ 12.483692
    19 Croatian_derived @ 12.634500
    20 Provancestralal_derived @ 13.254197

    Using 2 populations approximation:
    1 50% Norwegian_V_derived +50% Romania_derived @ 4.307440


    Using 3 populations approximation:
    1 50% German_V_derived +25% German_V_derived +25% Puerto-Rican_derived @ 3.337241


    Using 4 populations approximation:
    +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
    1 British_derived + German_V_derived + Lumbee_derived + Montenegrin_derived @ 3.267666
    2 German_V_derived + German_V_derived + German_V_derived + Puerto-Rican_derived @ 3.337241
    3 Kosovar_derived + Lumbee_derived + Orcadian_derived + Ukrainian-Center_derived @ 3.338995
    4 German-North_derived + German-North_derived + Kosovar_derived + Lumbee_derived @ 3.345200
    5 CEU_derived + German_V_derived + Lumbee_derived + Montenegrin_derived @ 3.356480
    6 British_derived + Kosovar_derived + Lumbee_derived + Ukrainian-Center_derived @ 3.366493
    7 Bosnian_derived + CEU_derived + German_V_derived + Puerto-Rican_derived @ 3.370591
    8 Bosnian_derived + British_derived + German-South_derived + Lumbee_derived @ 3.375951
    9 Bosnian_derived + British_derived + German_V_derived + Puerto-Rican_derived @ 3.376987
    10 British_derived + Kosovar_derived + Lumbee_derived + Polish_V_derived @ 3.394152
    11 British_derived + CEU_V_derived + Lumbee_derived + Montenegrin_derived @ 3.404645
    12 Belarusian_V_derived + British_derived + Kosovar_derived + Lumbee_derived @ 3.407532
    13 British_derived + German_V_derived + Lumbee_derived + Macedonian_derived @ 3.407672
    14 German_V_derived + Lumbee_derived + Montenegrin_derived + Orcadian_derived @ 3.415161
    15 Czech_derived + Kosovar_derived + Lumbee_derived + Swedish_derived @ 3.427819
    16 Bosnian_derived + French_derived + German_V_derived + Lumbee_derived @ 3.431776
    17 Kosovar_derived + Lumbee_derived + Orcadian_derived + Ukrainian_V_derived @ 3.439188
    18 German_derived + German-South_derived + German_V_derived + Puerto-Rican_derived @ 3.444206
    19 British_derived + CEU_V_derived + Lumbee_derived + Macedonian_derived @ 3.451045
    20 CEU_derived + Kosovar_derived + Lumbee_derived + Ukrainian-Center_derived @ 3.452269 mine I knew I had Viking I am assuming german_V is german Viking and of course my Melungeon dna :P lumbee indian :P

    ReplyDelete
  22. This comment has been removed by the author.

    ReplyDelete
  23. How am I to understand the population spreadsheet for MDLP World-22. I cut and paste only the Jewish portion are these 22 chromosomes for each population?

    Population Pygmy West-Asian North-European-Mesolithic Indo-Tibetan Mesoamerican Arctic-Amerind South-America_Amerind Indian North-Siberean Atlantic_Mediterranean_Neolithic Samoedic Indo-Iranian East-Siberean North-East-European South-African North-Amerind Sub-Saharian East-South-Asian Near_East Melanesian Paleo-Siberian Austronesian

    Jew_Algeria 0.00 22.90 0.50 0.20 0.00 0.10 0.10 0.20 0.00 37.10 0.20 0.20 0.00 5.30 0.20 0.00 3.40 0.00 28.90 0.30 0.00 0.40
    Jew_Azerbaijan 0.00 47.75 0.10 0.20 0.10 0.00 0.30 1.50 0.50 21.42 0.00 0.70 0.00 0.40 0.10 0.00 0.00 0.00 26.73 0.00 0.10 0.10
    Jew_Ethiopia 0.90 2.70 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.20 0.00 0.00 0.00 0.00 1.60 0.00 40.56 0.00 52.45 0.20 0.10 0.30
    Jew-Ethiopia 0.70 1.10 0.10 0.60 0.00 0.10 0.00 0.30 0.00 1.20 0.50 0.00 0.00 0.00 1.00 0.00 41.50 0.00 52.00 0.50 0.00 0.40
    Jew_Francestrale 0.20 21.68 0.30 0.30 0.00 0.00 0.10 0.10 0.10 38.16 0.20 0.00 0.00 10.19 0.20 0.10 0.60 0.10 26.77 0.50 0.40 0.00
    Jew_Georgia 0.20 42.46 0.60 0.00 0.10 0.20 0.00 0.50 0.00 22.68 0.00 0.50 0.00 0.80 0.10 0.30 0.30 0.00 30.87 0.30 0.10 0.00
    Jew_India 0.00 30.91 1.50 0.70 0.30 0.30 0.20 45.16 0.10 5.48 0.90 3.49 0.60 0.70 0.20 0.70 0.10 1.50 6.58 0.20 0.20 0.20
    Jew-Iran 0.10 46.80 0.30 0.10 0.00 0.00 0.00 1.50 0.10 18.10 0.00 0.30 0.00 0.30 0.00 0.00 0.00 0.20 31.60 0.20 0.30 0.10
    Jew_Iraqi 0.10 45.70 0.00 0.90 0.00 0.00 0.00 1.80 0.00 21.20 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 30.20 0.00 0.00 0.10
    Jew-Iraqi 0.10 45.20 0.30 0.10 0.00 0.10 0.20 0.50 0.00 20.30 0.00 0.50 0.10 0.40 0.00 0.00 0.00 0.00 31.60 0.30 0.00 0.30
    Jew_Italia 0.10 22.46 0.50 0.30 0.10 0.10 0.10 0.10 0.10 37.23 0.30 0.40 0.00 9.38 0.20 0.00 0.80 0.00 27.05 0.40 0.10 0.30
    Jew_Kurd 0.00 46.45 0.60 0.20 0.10 0.30 0.10 2.50 0.20 17.18 0.30 1.00 0.20 1.40 0.00 0.00 0.00 0.10 28.87 0.20 0.10 0.20
    Jew_Libya 0.70 22.32 0.70 0.40 0.00 0.00 0.00 0.00 0.00 34.13 0.00 0.00 0.00 4.20 0.50 0.00 3.40 0.00 33.23 0.00 0.40 0.00
    Jew_Morocco 0.10 21.88 0.30 0.00 0.00 0.00 0.10 0.30 0.10 35.06 0.10 0.10 0.00 8.09 0.30 0.20 3.40 0.00 29.77 0.00 0.00 0.20
    Jew_Romania 0.60 21.38 0.30 0.00 0.10 0.20 0.20 0.00 0.00 33.27 0.00 0.40 1.40 20.88 0.10 0.20 0.00 0.30 20.08 0.60 0.00 0.00
    Jew_Syria 0.30 31.70 0.00 0.00 0.10 0.10 0.10 0.10 0.00 30.90 0.00 0.80 0.00 4.70 0.00 0.00 0.90 0.00 29.60 0.50 0.00 0.20
    Jew_Tat 0.00 48.80 0.50 0.40 0.00 0.00 0.10 1.30 0.30 19.20 0.40 0.40 0.00 1.40 0.00 0.10 0.00 0.00 26.60 0.10 0.10 0.30
    Jew_Tunisia 0.10 23.40 0.00 0.20 0.20 0.10 0.30 0.00 0.30 34.50 0.10 0.00 0.00 6.00 0.40 0.00 2.50 0.00 31.70 0.10 0.00 0.10
    Jew-Uzbekistan 0.00 46.70 0.00 0.20 0.20 0.00 0.30 2.70 0.00 17.90 0.70 0.80 0.50 3.00 0.00 0.00 0.00 0.00 26.40 0.60 0.00 0.00
    Jew_Yemen 0.30 22.42 0.00 0.10 0.00 0.00 0.00 0.10 0.00 13.41 0.00 0.00 0.00 0.00 0.70 0.00 1.70 0.00 60.96 0.10 0.00 0.20

    ReplyDelete
  24. Where can I find the information on what final populations were used as references for the results?

    ReplyDelete
  25. Hello, I was wondering when you are going to update your blog and also, I am new to reading the painting on Eurogenes K13 and MDLP World-22. I have Native American Blood and I have been trying (without success) to correctly read the paintings on the above to GEDmatch tools.. Could you please help, I am fairly new to this.

    ReplyDelete
  26. I'm interested - what "old" ancestor's have you used in Rus region? And, if i understand right - you took 10 respondents from Ukraine, 10 from Russia and 10 from Belarus and than analysed their genom?

    ReplyDelete
  27. Does anyone know what Mexican_CV refers to in the population section?

    ReplyDelete
  28. I'm unable to access GEDMatch right now. Is it possible for me to find out the exact ethnic groups that plot closest to South Amerind and North Amerind on the MDLP World 22 maps? Thank you.

    ReplyDelete