PDA

View Full Version : R1b-P312xL21 Haplotypes Spreadsheet & SNP Tree



TigerMW
05-16-2013, 02:57 AM
I maintain the R1b-P312xL21_haplotypes spreadsheet of all the confirmed P312+ haplotypes that I can find from public projects. L21 is subclade of P312 but there are too many too include in this file so I save those separately. The P312xL21 spreadsheet is found under the Links section of the R1b-P312-Project Yahoo group under "Haplotype Data". The Links section is in alphabetic order. There is a second file for L21 as well. I do NOT use any robot type programs to copy information from projects. I do it manually and only from public information.

There is a "Readme" help tab on the far left at the bottom. You may have to hit the left arrow to get to it.

The "AllHts" tab has all of the haplotypes.

All Haplotypes are displayed, but only the first 67 STRs per each haplotype.

The first group of columns provides background information including the kit #, Most Distant Known Ancestor (MDKA) surname, SNP based haplogroup definition, STR signature based variety (cluster) and geographic origin.
The Hg (haplogroup) column shows a description of the phylogenetic tree branch that the haplotype is on, according to the relevant Downstream SNPs.

The next group of columns is the first set of STR columns. They are the STR values in FTDNA panel sequence order with each multi-copy element broken out.

The Relevant SNPs column is a list of directly relevant SNP test results. Only the lowest level (youngest) positive (derived) SNP is shown plus any negative (ancestral) tested SNPs one level younger than the. In additioned, unpositioned (on the tree) SNP results are also shown.

The second set of STR columns shows only the Genetic Distance (GD) from the Base Haplotype at the top of the screen per each STR. This STR columns are displayed in order from slowest to fastest mutation rate.

On the far right are three columns that control GD calculations for the distance across all 67 STRs for any haplotype that you select versus all of the other 67 STR length haplotypes in the worksheet. You select the Target haplotype by putting an "x" in the yellow cell next to it. You can only select one haplotype as the target. You can make the target haplotype the modal for a selected set of haplotypes by putting an "x" at the top of the spreadsheet next to the calculated Mode row.

The last column has the Ysearch IDs where we could find them. You can use Ysearch to directly to contact the owner of the haplotype.

A statistical summary section is at the bottom of the spreadsheet. The calculations include:
Allele distribution table per each STR, 1-67
Count
Mode per each STR
Diversity per each STR
Variance per each STR
Standard Deviation per each STR
Mean per each STR
Sum of the Variance for all 67 STRs

The blue titled rows are numbers totalled for the whole spreadsheet. The green titled rows are totals for just the selected haplotypes.

You can use the autofiltering function of the spreadsheet in conjunction with the the green titled totals. Select (filter) to just view the haplotypes you want and the green titled statistics will subtotal for just those selected. Selection is accomplished by using the to the Column heading drop down arrow "autofilter" functionality.


The "ExtHts" tab has all of the 111 STR haplotypes.

Extended 111 STRs (only) are displayed on this worksheet. The GD and statistical capabilities are similar to AllHts.


The "Clades" tab has the SNP/haplogroup tree pointer information and the STR signatures of the deep ancestral varieties assigned.

Clades has two sections. First, a list of SNPs with associated haplogroup labels. The haplogroup labels on the other worksheets are displayed based on the SNPs in the Downstream SNPs column and how they align with this first table within Clade. This table contains branching depth levels as well as SNPs that need to be translated to other synomous SNPs for consistency purposes

The second section of Clade is a list of STR based variety labels along with their actual off-modal STR values.


The "Rates" tab has misc. information.

This worksheet primarily just contains supplemental information, like mutation rates for STRs. There are a set of columns where the modal values for the largest R1b subclades is kept. WAMH, the Western Atlantic Modal Haplotype, is an obsolete set of modals based on a conglomeration of R1b clades. Effectively, the R1b-P312 modal supersedes it in represent the most common subclade in Western Europe.

TigerMW
07-03-2013, 11:21 PM
I updated the R1b-P312xL21_haplotypes spreadsheet of all the confirmed P312+ haplotypes that I can find from public projects. The L21 is subclade P312 is excluded because there are too many. I save L21 separately. The P312xL21 spreadsheet is found under the Links section of the R1b-P312-Project Yahoo group under "Haplotype Data".

I've checked the R1b-P312, U152, R1b and SRY2627 files in the last day for this update.

TigerMW
07-14-2013, 08:47 PM
I've just updated this spreadsheet.

TigerMW
08-03-2013, 11:08 PM
I just updated the R1b-P312xL21 Haplotypes spreadsheet out under the Links section again under "Haplotype Data"

This is based on my latest draft phylogenetic trees for P312, DF27 and U152.
http://tinyurl.com/R1b-P312-Tree
http://tinyurl.com/R1b-DF27-Tree
http://tinyurl.com/R1b-U152-Tree

I added a number of suspected P312+ haplotypes, including DF19 suspects. I tried to take care to classify P312** people into STR based varieties. I also added several L238 suspects, in some cases, not necessarily Norse. Keep in mind they are just suspects.

TigerMW
09-20-2013, 05:42 PM
I just updated the R1b-P312xL21 Haplotypes spreadsheet out under the Links section again under "Haplotype Data"

This is based on my latest draft phylogenetic trees for P312, DF27 and U152.
http://tinyurl.com/R1b-P312-Tree
http://tinyurl.com/R1b-DF27-Tree
http://tinyurl.com/R1b-U152-Tree

I updated and uploaded this spreadsheet this morning. We've got a new L238 person or two plus more progression on DF27 (and subclade) testing. We've also got the DF99 guy so P312** dwindles and will have to be redefined to include DF99- soon.

MitchellSince1893
02-23-2014, 10:49 PM
Last update I found for this spreadsheet was 12/7/2013. Are more recent versions available or is it no longer being updated?