
Thread: Converting .VCF file to 23 and me format using Python3

  1. #1
    Registered Users
    Posts
    972
    Location
    Bay Area
    Nationality
    American
    mtDNA (M)
    J1c2p
    Y-DNA (P)
    E-BY14160

    England Italy Sweden Netherlands

    Converting .VCF file to 23 and me format using Python3

    I did a Nebula Genomics test a while ago and am trying to make some use out of the files they provide.

    Does anyone know how to convert a .VCF file to 23andMe format using Python3? I am at my wits' end trying to glean any value from this (worthless) test. I can successfully convert the file with DNA Kit Studio, but when I upload it to GEDmatch, I get the following error:

    Notice: Undefined index: u in /var/www/genesis-v1.0.0/html/v_upload2log.phpnf on line 23

    Notice: Undefined index: name in /var/www/genesis-v1.0.0/html/v_upload2log.phpnf on line 24

    Notice: Undefined index: name2 in /var/www/genesis-v1.0.0/html/v_upload2log.phpnf on line 25

    Notice: Undefined index: email in /var/www/genesis-v1.0.0/html/v_upload2log.phpnf on line 26

    Notice: Undefined index: auth in /var/www/genesis-v1.0.0/html/v_upload2log.phpnf on line 27
    You must agree that you are authorized to upload this file.


    I used an older version of DNA Kit Studio and managed to upload the file, but in the end I got this:
    File does not contain the correct number of SNPs. The SNP count must be between 50000 and 10000000 SNPs.
    Your file had 27550498 SNPs.

    Any ideas?
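For the Python3 route asked about above, here is a minimal sketch. It assumes a single-sample, uncompressed VCF and writes the four-column, tab-separated layout (rsid, chromosome, position, genotype) used by 23andMe raw data files. The function names are made up for illustration, and whether GEDmatch accepts the result has not been verified. It keeps only records that have an rsID and single-base alleles, which also cuts the SNP count, though not necessarily below the upload limit.

```python
def vcf_record_to_row(line):
    """Turn one VCF data line into (rsid, chromosome, position, genotype), or None."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) < 10:  # 8 fixed columns + FORMAT + one sample column
        return None
    chrom, pos, ids, ref, alt = fields[:5]
    rsid = ids.split(";")[0]
    alleles = [ref] + alt.split(",")
    # Skip records without an rsID and anything that is not a plain SNP
    # (indels, "*" alleles, "." for no ALT).
    if not rsid.startswith("rs"):
        return None
    if any(a not in ("A", "C", "G", "T") for a in alleles):
        return None
    keys = fields[8].split(":")
    if "GT" not in keys:
        return None
    gt = fields[9].split(":")[keys.index("GT")]
    calls = gt.replace("|", "/").split("/")  # phased or unphased
    if all(c.isdigit() and int(c) < len(alleles) for c in calls):
        genotype = "".join(alleles[int(c)] for c in calls)
    else:
        genotype = "--"  # no-call
    # Normalise chromosome names: "chr1" -> "1", "chrM"/"M" -> "MT"
    if chrom.lower().startswith("chr"):
        chrom = chrom[3:]
    if chrom == "M":
        chrom = "MT"
    return rsid, chrom, pos, genotype


def convert(vcf_lines, out):
    """Write 23andMe-style rows to `out`; return how many SNPs were written."""
    out.write("# rsid\tchromosome\tposition\tgenotype\n")
    written = 0
    for line in vcf_lines:
        if line.startswith("#"):  # VCF meta and header lines
            continue
        row = vcf_record_to_row(line)
        if row is not None:
            out.write("\t".join(row) + "\n")
            written += 1
    return written
```

To use it, open the VCF and an output file for text I/O and call `convert(vcf_file, out_file)`. One caveat: a VCF that lists only variant sites will leave out positions where the sample matches the reference, so those SNPs will simply be absent from the output.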

    The Nebula test is worthless. I have read that the low coverage is borderline useless. I don't really care about that; I am more interested in genealogy and ancestry. The ethnicity estimate isn't awful, but it's exactly what we got for free with Gencove before they stopped accepting uploads.

    I'd like to get some value out of this, but Nebula seems to want to make that hard, if not impossible.

    Here is their page on this:
    https://blog.nebula.org/how-to-start...-genomic-data/
    distance%=4.6465
    Barcin_N,47.2
    Yamnaya_Samara,41.4
    WHG,10.6
    Ethiopia_4500BP,0.8


    E-V13 => E-PH1246 => E-BY14160
    Antonio Reale born circa 1710, Ciminà, Italy

  2. #2
    Junior Member
    Posts
    6
    Sex
    Location
    Burgos

    Castile and León
    Try DNA Kit Studio from simple tools.

  3. #3
    Moderator
    Posts
    5,844
    Location
    Normandy
    Ethnicity
    northwesterner
    mtDNA (M)
    H5a1
    Y-DNA (P)
    R-BY3604-Z275

    Normandie Netherlands Friesland Finland Orkney
    More than 27 million SNPs for less than $200 is interesting. I hope they soon sell their services outside the USA. That said, if you are interested in ancestry, you will likely never use all 27 million. The only modern detailed reference sample I know of that uses that many SNPs (even more, if I remember correctly) is Pagani, which is mainly useful for East Eurasian ancestry. 1000 Genomes uses around 40 million, but its European regional coverage is very sketchy. As far as I know, all the others, in particular the Harvard samples, use far fewer SNPs. As for the VCF-to-txt conversion, I know nothing about DNA Kit Studio; I have always used vcftools and PLINK. Would you mind posting a copy of your ancestry report? I'm curious what they do with 27 million SNPs.
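Since ancestry tools use far fewer SNPs than the 27 million in the file, one way to get under the upload limit is to keep only SNPs from a known list. The sketch below assumes the data is already in 23andMe-style tab-separated lines and that you have a set of rsIDs you want to keep (for example, taken from an existing chip raw-data file; where that list comes from is left open here). The function names are hypothetical, and the 50,000 and 10,000,000 bounds are the ones quoted in the upload error in post #1.

```python
def filter_23andme_lines(lines, wanted_rsids):
    """Yield comment lines plus only those data lines whose rsID is in wanted_rsids."""
    for line in lines:
        if line.startswith("#"):
            yield line  # keep header/comment lines untouched
        elif line.split("\t", 1)[0] in wanted_rsids:
            yield line


def snp_count_ok(count, lowest=50_000, highest=10_000_000):
    """Check a SNP count against the range quoted in the upload error message."""
    return lowest <= count <= highest
```

Passing `wanted_rsids` as a `set` keeps each lookup fast, which matters when streaming through tens of millions of lines. This is a plain-Python stand-in, not a replacement for what vcftools or PLINK do.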
    En North alom, de North venom
    En North fum naiz, en North manom

    (Roman de Rou, Wace, 1160-1170)

  4. #4
    Registered Users
    Posts
    972
    Location
    Bay Area
    Nationality
    American
    mtDNA (M)
    J1c2p
    Y-DNA (P)
    E-BY14160

    England Italy Sweden Netherlands
    Nebula.JPG

    I should clarify my statement. I think it's low-pass sequencing, and I have read in multiple places that the data is almost worthless. So for medical use, etc., it's a hard pass. The ancestry portion is not wrong overall, but the percentages are a bit mixed up, mainly the Scandinavian being about 10% too high.

    On paper I am:
    50% English (likely 6% or so French as well in there)
    25% Italian
    12.5% Swedish
    12.5% Dutch/German

    What I have noticed is that the 50% English is rarely identified as actual English. Usually that percentage is lower and the Germanic is higher. Who knows what is correct.
