
Saturday, January 21, 2012

Comparing fineSTRUCTURE clustering to Mclust clusterings

The first weeks of 2012 marked a milestone in BGA blogging, bringing a substantial shift in methodology.
First of all, the Eurogenes blog applied the recently published ChromoPainter tools to its intra-North-European cluster analysis (with more than 400 samples and 270K SNPs, in linkage mode, and 200K burn-ins and iterations).

The Dodecad Project has also improved its Cluster Galore method so that it can be used with linked haplotype data (this refined method, however, is designed to work not with fineSTRUCTURE but with different software, fastIBD).

Although the two methods differ in technical performance and design, they strive for the same goal: inferring population structure. This type of structure inference consists of two parts: deriving a matrix of relationships between the sampled individuals, and clustering those relationships. As Dan Lawson noted, given a distance-like matrix such as the number of SNPs IBS or the ChromoPainter coancestry matrix, it is possible to apply a wide variety of clustering algorithms.
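To make the second step concrete, here is a minimal sketch of one such clustering algorithm (average-linkage agglomerative clustering) applied to a small distance matrix. It is only an illustration of the general idea: the function name `average_linkage` and the toy distances are made up for this example, and this is not what fineSTRUCTURE or Mclust actually does.

```python
def average_linkage(dist, k):
    """Merge the two closest clusters until only k clusters remain.

    dist is a symmetric matrix (list of lists) of pairwise distances.
    Returns a sorted list of clusters, each a sorted list of indices.
    """
    clusters = [[i] for i in range(len(dist))]

    def cluster_distance(a, b):
        # Average of all pairwise distances between members of a and b.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > k:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                d = cluster_distance(clusters[x], clusters[y])
                if best is None or d < best[0]:
                    best = (d, x, y)
        _, x, y = best
        clusters[x] = sorted(clusters[x] + clusters[y])
        del clusters[y]
    return sorted(clusters)


# Toy distances for four hypothetical individuals (not real data).
dist = [
    [0, 1, 9, 8],
    [1, 0, 8, 9],
    [9, 8, 0, 2],
    [8, 9, 2, 0],
]
groups = average_linkage(dist, 2)
print(groups)
```

Note that the number of clusters `k` has to be supplied by hand here, which is exactly the kind of tuning parameter discussed next.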

Still, fineSTRUCTURE has two advantages. Firstly, because fineSTRUCTURE performs MCMC, it is less likely to get stuck in local optima than computationally cheaper methods that climb gradients. Secondly, and most importantly, fineSTRUCTURE (when used correctly) is well calibrated, as it has no unknown and hard-to-estimate tuning parameters.


Experiment
In order to evaluate the performance of different clustering methods on a real-world dataset, I used sampled individuals from my project. To make this test even harder, I applied ChromoPainter's algorithm (linkage mode) to a homogeneous subset of very similar Baltic populations: Lithuanians, Belorussians and Ukrainians (90 samples plus a couple of reference Belorussian, Lithuanian and Ukrainian samples, with 90K SNPs). The same dataset was used in Shellfish to carry out a principal component analysis of genome-wide SNP data, and in PLINK for MDS calculations. The resulting PCA and MDS data were subsequently analyzed using the general-purpose clustering software Mclust.


I ran ChromoPainter's algorithm separately on each of the 22 chromosomes and merged the output files into a single file. Then I applied fineSTRUCTURE's MCMC algorithm to infer the population structure, PCA components and cluster trees. Below are plots showing the individual coancestry matrix, the individual and population aggregate plots, and the PCA plots.
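One plausible reading of the merging step is that the per-chromosome coancestry matrices are added element by element to give a genome-wide matrix. The sketch below shows only that idea on in-memory matrices; the function name `sum_coancestry` and the toy values are invented for illustration, and it is an assumption rather than a description of the tool or file format actually used.

```python
def sum_coancestry(per_chromosome):
    """Element-wise sum of equally sized square matrices (lists of lists)."""
    if not per_chromosome:
        raise ValueError("need at least one matrix")
    n = len(per_chromosome[0])
    total = [[0.0] * n for _ in range(n)]
    for matrix in per_chromosome:
        # Every chromosome must cover the same individuals in the same order.
        if len(matrix) != n or any(len(row) != n for row in matrix):
            raise ValueError("all matrices must be n x n with the same n")
        for i, row in enumerate(matrix):
            for j, value in enumerate(row):
                total[i][j] += value
    return total


# Made-up values for two individuals on two chromosomes (not real data).
chr1 = [[0.0, 2.5], [1.5, 0.0]]
chr2 = [[0.0, 1.0], [3.0, 0.0]]
genome = sum_coancestry([chr1, chr2])
print(genome)
```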








It appears that fineSTRUCTURE inferred 5 clusters (starting with the smallest): (1) Ashkenazi, (2) individuals of West European origin with (moderate to minor) East European "admixture", (3) individuals from Eastern Europe, (4) Lithuanians, and (5) the biggest cluster, which includes Poles, Belorussians, Russians and Ukrainians.
For our project's particular purposes, it is important to note that this is the first time such a clear split between Belorussians and Lithuanians has appeared. Although a hint of this division could be seen in our earlier experiments, the statistical signal of separation was weak enough to be ignored, because (as shown in the plot below) some regions of the plots are highly uncertain.






The fineSTRUCTURE cluster assignments for the project's individuals can be seen here.
I have also published the corresponding cluster assignments by Mclust (PLINK+Mclust and Shellfish+Mclust). Agreement between these two latter solutions: direct agreement, 0 of 4 pairs; iterations for permutation matching, 24; cases in matched pairs, 34.58%.

     2  3  1  4
  1  0  2  3  0
  2  2  8  1  3
  3 13 11 16 18
  4  4  4  9 13
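The agreement figures can be checked by hand from this table. The sketch below assumes that "permutation matching" means trying every one-to-one pairing of the four row clusters with the four column clusters (4! = 24 pairings) and keeping the one that puts the most individuals on the diagonal. That interpretation, and the function name `best_matching`, are my assumptions; the post does not say which Mclust solution is on the rows and which on the columns.

```python
from itertools import permutations

# Contingency table copied from above, columns in the displayed order (2, 3, 1, 4).
table = [
    [0, 2, 3, 0],
    [2, 8, 1, 3],
    [13, 11, 16, 18],
    [4, 4, 9, 13],
]


def best_matching(table):
    """Brute-force the row-to-column pairing with the largest matched count."""
    n = len(table)
    best_perm, best_hits = None, -1
    for perm in permutations(range(n)):
        hits = sum(table[row][perm[row]] for row in range(n))
        if hits > best_hits:  # ties keep the first pairing found
            best_perm, best_hits = perm, hits
    return best_perm, best_hits


total = sum(sum(row) for row in table)
n_perms = sum(1 for _ in permutations(range(len(table))))
perm, hits = best_matching(table)
agreement_pct = round(100 * hits / total, 2)
print(n_perms, hits, total, agreement_pct)
```

Under these assumptions the best pairing matches 37 of the 107 individuals, which reproduces the 34.58% quoted above; in other words, the two Mclust solutions agree on only about a third of the samples.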















 
