06-27-2016, 01:23 PM
So, I want to extract the genotype data of the 8 Romanian samples used in the Lazaridis et al. study "The genetic structure of the world's first farmers". The purpose is to upload them to GEDmatch and compare them with my other samples.

The dataset is publicly available here (https://genetics.med.harvard.edu/reich/Reich_Lab/Datasets.html). After downloading it, I get files with three extensions (.geno, .ind, .snp). I suppose I need to work with the .geno file?

What would be the best way to extract only the samples I need, given that everything is lumped together into one big file, without damaging the samples?
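
To make the question concrete, here is a rough sketch of the kind of extraction I have in mind. It assumes the files are in the plain-text EIGENSTRAT layout: one line per sample in .ind (ID, sex, population label), and one line per SNP in .geno with one character (0, 1, 2 or 9 for missing) per sample, in the same order as .ind. If the download is in the packed binary format instead, it would first have to be converted to text, for example with convertf from EIGENSOFT. The function names and the sample IDs in the usage note are hypothetical placeholders, not anything taken from the dataset.

```python
def read_ind(ind_path):
    """Return a list of (sample_id, sex, population) tuples, in file order."""
    rows = []
    with open(ind_path) as fh:
        for line in fh:
            fields = line.split()
            if len(fields) >= 3:
                rows.append((fields[0], fields[1], fields[2]))
    return rows


def extract_samples(prefix, out_prefix, wanted_ids):
    """Write new .ind/.geno files containing only the samples in wanted_ids.

    Assumes text EIGENSTRAT input at prefix + ".ind" and prefix + ".geno".
    The input files are opened read-only, so the originals are never changed.
    Returns the number of samples kept.
    """
    wanted = set(wanted_ids)
    rows = read_ind(prefix + ".ind")
    # Column positions in .geno follow the row order of .ind.
    keep = [i for i, row in enumerate(rows) if row[0] in wanted]

    with open(out_prefix + ".ind", "w") as out:
        for i in keep:
            out.write("\t".join(rows[i]) + "\n")

    with open(prefix + ".geno") as src, open(out_prefix + ".geno", "w") as out:
        for line in src:
            calls = line.rstrip("\n")
            if not calls:
                continue  # skip blank lines
            if len(calls) != len(rows):
                raise ValueError("genotype line length does not match .ind")
            out.write("".join(calls[i] for i in keep) + "\n")

    return len(keep)
```

Usage would be something like `extract_samples("full_dataset", "romanian_subset", ["ID1", "ID2"])`, with the real sample IDs looked up in the .ind file first. The .snp file describes the SNPs rather than the samples, so under these assumptions it can be reused unchanged alongside the new .geno and .ind files.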

