¿ªÔÆÌåÓý

ctrl + shift + ? for shortcuts
© 2025 Groups.io

Re: The Importance of Colmar 239

 

Hello Iain

Your post was so interesting, that I took the liberty of posting it to the England GB EIJ project; which is always very active.

Already some some have asked about re-posting it elsewhere; to which I responded "I rely on Iain previously having said he was happy to have his words posted elsewhere". So it will be interesting to see the comments that flow.

There is one comment, suggesting that not taking waterways into account, is yet another bias.

Kind regards
John






Re: The Importance of Colmar 239

 

Thank you Iain for this insightful opinion.
Indeed, FTDNA does not seem to have completely succeeded in erasing prediction errors due to bias in its databases.

An illustrative example of these prediction biases, that you are not without knowing, with the branch R-S775 > R-L745 > R-FGC34909 (> R-S781 - downstream of R-P312>>R-L21), illustrates due to the House of Stuart.
The Stuart, as well as the FitzAlan, descend from the Breton knight Alan fitz Flaad (+ ~1120), seneschal of Dol-de-Bretagne. These 2 lineages were established in the United Kingdom in the 12th century. Their most distant known ancestor (MDKA) was Alain, dapifer sacrae ecclesiae Dolensis archiepiscopi Dolensis, alive in the 11th century.
Therefore, upstream of R-L745, Globetrekker should pass through Brittany (France), for at least a century or more (between R-S775 and R-L745, almost 2500 years have passed!), or this is not the case... Globetrekker indicates that R-S552 (a descendant of R-L21) would have crossed the Channel around 2 600 BCE, and the entire lineage from R-S552 to R-S781 would have remained in United Kingdom...

As a certain number of Bretons originally came from Great Britain, there is a significant possibility that this lineage actually came from Great Britain, then migrated to Brittany around the 6th century, before returning to settle on the other side of the Channel, from the 12th century.

Cheers,

Ewenn


Re: The Importance of Colmar 239

 

Thanks, Iain.? I have already sent feedback to FTDNA concerning the Globetrekker issue, because it is so unrealistic in it's placing anything before S3997 on the coast of Britain.? Personally, I think that even S3997 was in Britain as late as 200 BCE - but that's just a hunch.

Ed??


Re: The Importance of Colmar 239

Louise Walsh Throop
 

If I have the geology of the English Channel correctly, the Channel was formed ca 6000 years before present, so an ancestor of Colmar239 could have crossed the channel area before 6000 ybp or taken a boat across the channel after the 6000-year ago event, and then descendants move south in France into Gaul.......etc.

Louise

On Wednesday, November 15, 2023 at 12:00:48 PM PST, ejsteele56@... <ejsteele56@...> wrote:


Analysis of the DNA of ancient remains of a man believed to have been associated with the La Tene culture in Haut-Rhin, France, and known only as Colmar 239, reveal that he was descended from the common ancestor of haplogroup R-A10645.? Furthermore, according to the Discover Haplogroups Report, only 42 FTDNA testers currently share that common ancestor of A10645 with Colmar 239.


Understandably, this is a very small group of testers affected in comparison to many branches under Z156.? However, given that the La Tene culture is thought to have been culturally Celtic, and given that these remains were found in what was then Gaul, has there been any discussion as to what this may tell us of the location of A10645?? It certainly seems to counter any idea that the common ancestor of A10645 was in Britain in 1650 BCE.

Ed


Re: The Importance of Colmar 239

 

¿ªÔÆÌåÓý

Thank you Iain! ?This is reminiscent of several discussions and conversations in my neck of the aDNA woods over the last several years. ?We are fortunate to having these analyzed remains, but they have as many as or more limitations as modern day testers in regard to haplogroup or even subclade geographical origins.

That is difficult for many to fully embrace, but none the less is our reality.
Susan Hedeen?


On Nov 16, 2023, at 12:00 PM, Iain via groups.io <gubbins@...> wrote:

?

Hi Ed,


I don't know that there's been a lot of talk specifically about Colmar 239, but if you're basing your origin estimates off Globetrekker, then you're in for a bad time. Globetrekker effectively brings Family Tree DNA up-to-date with the efforts that Rob Spencer, Hunter Provyn and others have been doing for a few years now, but no-one really comes close to an accurate solution because the biases in the data still haven't been correctly accounted for, and the ancient DNA (in my opinion) hasn't been given the right weighting in the calculations.


The problem with assigning origins based on ancient DNA is the same as assigning origins based on modern testers: how much does one sample really tell you? The key here is contemporaneity. A sample like Ploti?t¨§ nad Labem 1, who lived within a few generations of the R-U106 MRCA, says a lot about where R-U106 formed. Colmar 239, living 1000 years or so after the R-A10645 MRCA, says much less about where R-A10645 formed. We also have to be careful with our nomenclature: an ancient DNA sample may not be tested for all the SNPs in a haplogroup, therefore may descend from an intermediate haplogroup that is now either untested or extinct; similarly the sample may be positive for downstream sub-clades, for either not positive for all SNPs, or not tested for any.


If we assume a front speed of a between one and a few km/year for human migrations (https://www.pnas.org/doi/10.1073/pnas.1920051117), we can presume that R-U106 was founded within a few hundred km of where Ploti?t¨§ nad Labem 1 was buried. This puts the origin of R-U106 in the vicinity of Bohemia, although doesn't allow us to definitively narrow it down to a specific country. If we do the same to Colmar 239, we can say that R-A10645 was founded somewhere within between one and a few thousand km of the La Tene culture, i.e., somewhere within Europe. That's not very constraining.


In both cases, we can say that at least some of R-U106 passed through the Bohemian part of the Corded Ware Culture, and that some of R-A10645 passed through the La Tene culture. In all likelihood, proportionally more of R-U106 probably passed through the Bohemian CWC because there had been much less time for R-U106 to diverge into different regions and cultures by that point. So, not only is Ploti?t¨§ nad Labem 1 more constraining geographically regarding the origin of R-U106 than Colmar 239 is of R-A10645, but it also says more specifically about what cultures our contemporary early R-U106 ancestors were doing at the time. By contrast, by 600 BC, R-A10645 could have been spread to many different cultures in many different places, and we only sample one.


To put some numbers on this, imagine a population that diffuses out at a constant rate over time. It might cover 1000 square km after 100 years, 4000 square km after 200 years, 9000 square km after 300 years, and 2.5 million square km (1/4 of Europe) after 5000 years, etc. Imagine a tester in a 5000-year-old haplogroup like R-U106 today. We can draw a circle around him of 2.5 million square km and say that there's a 68% chance that the origin of R-U106 lies within that circle. If we have ancient DNA that's from 1000 years after R-U106 split, then we can draw a circle of 100,000 square km around the burial, thus that ancient DNA is 25 times more precise, and thus "worth" 25 modern testers today. For Ploti?t¨§ nad Labem 1, we are dealing with a sample probably about 100 years or so after R-U106 split, thus the circle is 2500 times smaller, and that ancient DNA is "worth" 2500 modern testers in terms of defining origins.


Of course, populations move and spread and haven't expanded at a constant size, and all sorts of other confounding factors so, actually, each ancient DNA sample is "worth" much more than this, because it strips out all of these systematic trends. But the comparative "worth" of Ploti?t¨§ nad Labem 1 is still more than 100 times that of Colmar 239 by this argument, and Colmar 239 on its own probably doesn't say much more about the origins of R-A10645 than the existing 42 FTDNA testers already did.


The take-home point here, then, is that ancient DNA needs to assigned to a contemporary haplogroup (not a much more ancient one) before it says anything about our ancestors, and it is only samples really close to the TMRCA of the haplogroup that can be so incredibly defining in terms of origins.


Cheers,


Iain.


Re: The Importance of Colmar 239

 

¿ªÔÆÌåÓý

Hi Ed,


I don't know that there's been a lot of talk specifically about Colmar 239, but if you're basing your origin estimates off Globetrekker, then you're in for a bad time. Globetrekker effectively brings Family Tree DNA up-to-date with the efforts that Rob Spencer, Hunter Provyn and others have been doing for a few years now, but no-one really comes close to an accurate solution because the biases in the data still haven't been correctly accounted for, and the ancient DNA (in my opinion) hasn't been given the right weighting in the calculations.


The problem with assigning origins based on ancient DNA is the same as assigning origins based on modern testers: how much does one sample really tell you? The key here is contemporaneity. A sample like Ploti?t¨§ nad Labem 1, who lived within a few generations of the R-U106 MRCA, says a lot about where R-U106 formed. Colmar 239, living 1000 years or so after the R-A10645 MRCA, says much less about where R-A10645 formed. We also have to be careful with our nomenclature: an ancient DNA sample may not be tested for all the SNPs in a haplogroup, therefore may descend from an intermediate haplogroup that is now either untested or extinct; similarly the sample may be positive for downstream sub-clades, for either not positive for all SNPs, or not tested for any.


If we assume a front speed of a between one and a few km/year for human migrations (https://www.pnas.org/doi/10.1073/pnas.1920051117), we can presume that R-U106 was founded within a few hundred km of where Ploti?t¨§ nad Labem 1 was buried. This puts the origin of R-U106 in the vicinity of Bohemia, although doesn't allow us to definitively narrow it down to a specific country. If we do the same to Colmar 239, we can say that R-A10645 was founded somewhere within between one and a few thousand km of the La Tene culture, i.e., somewhere within Europe. That's not very constraining.


In both cases, we can say that at least some of R-U106 passed through the Bohemian part of the Corded Ware Culture, and that some of R-A10645 passed through the La Tene culture. In all likelihood, proportionally more of R-U106 probably passed through the Bohemian CWC because there had been much less time for R-U106 to diverge into different regions and cultures by that point. So, not only is Ploti?t¨§ nad Labem 1 more constraining geographically regarding the origin of R-U106 than Colmar 239 is of R-A10645, but it also says more specifically about what cultures our contemporary early R-U106 ancestors were doing at the time. By contrast, by 600 BC, R-A10645 could have been spread to many different cultures in many different places, and we only sample one.


To put some numbers on this, imagine a population that diffuses out at a constant rate over time. It might cover 1000 square km after 100 years, 4000 square km after 200 years, 9000 square km after 300 years, and 2.5 million square km (1/4 of Europe) after 5000 years, etc. Imagine a tester in a 5000-year-old haplogroup like R-U106 today. We can draw a circle around him of 2.5 million square km and say that there's a 68% chance that the origin of R-U106 lies within that circle. If we have ancient DNA that's from 1000 years after R-U106 split, then we can draw a circle of 100,000 square km around the burial, thus that ancient DNA is 25 times more precise, and thus "worth" 25 modern testers today. For Ploti?t¨§ nad Labem 1, we are dealing with a sample probably about 100 years or so after R-U106 split, thus the circle is 2500 times smaller, and that ancient DNA is "worth" 2500 modern testers in terms of defining origins.


Of course, populations move and spread and haven't expanded at a constant size, and all sorts of other confounding factors so, actually, each ancient DNA sample is "worth" much more than this, because it strips out all of these systematic trends. But the comparative "worth" of Ploti?t¨§ nad Labem 1 is still more than 100 times that of Colmar 239 by this argument, and Colmar 239 on its own probably doesn't say much more about the origins of R-A10645 than the existing 42 FTDNA testers already did.


The take-home point here, then, is that ancient DNA needs to assigned to a contemporary haplogroup (not a much more ancient one) before it says anything about our ancestors, and it is only samples really close to the TMRCA of the haplogroup that can be so incredibly defining in terms of origins.


Cheers,


Iain.


The Importance of Colmar 239

 

Analysis of the DNA of ancient remains of a man believed to have been associated with the La Tene culture in Haut-Rhin, France, and known only as Colmar 239, reveal that he was descended from the common ancestor of haplogroup R-A10645.? Furthermore, according to the Discover Haplogroups Report, only 42 FTDNA testers currently share that common ancestor of A10645 with Colmar 239.


Understandably, this is a very small group of testers affected in comparison to many branches under Z156.? However, given that the La Tene culture is thought to have been culturally Celtic, and given that these remains were found in what was then Gaul, has there been any discussion as to what this may tell us of the location of A10645?? It certainly seems to counter any idea that the common ancestor of A10645 was in Britain in 1650 BCE.

Ed


Re: Best-guess origins of the major R-U106 haplogroups: methods and results #origins

 

Hi Roy,

A new post on the FTDNA blog which deals precisely on this:


Cheers,?

Ewenn

Le lun. 13 nov. 2023 ¨¤ 15:36, Roy <apeiron@...> a ¨¦crit?:

Iain can now add Indo-European linguistics to his portfolio of expertise. However, one small point might cause confusion: North Germanic languages only reached the Black Sea through the trading and raiding of mainly Swedish Vikings, who were known as Varangians.? Members of these groups formed the state of Kievan Rus and ultimately became Slavic speakers.? Centuries before, however, Greuthingi-Goths had penetrated into the Crimean Peninsula. Contrary to what is stated in the Wikipedia article, Gothic is normally classified as an East Germanic language.

I expect that when Iain wrote Black Sea he intended Baltic Sea.

It is interesting to note that Crimean Gothic survived to the late 18th century in some areas .


-Roy


On 11/13/23 04:34, Iain via wrote:

Hi Randy,


There are plenty people who will give you a more authoritative answer on this, however I'll give it my best shot.


The Germanic language family and, by extension, the Germanic people, can be split into three branches: Northern, Western and Eastern. The Northern Germanic group roughly co-incides with the earliest Germanic people around the shores of the Black Sea and Scandinavia,


Re: Best-guess origins of the major R-U106 haplogroups: methods and results #origins

 
Edited

Thanks for the clarification, Iain. I thought you were referring to a split into three major haplogroups of the Germanic speaking people. Makes perfect sense now.

Thanks again for all your work on U106 and for sharing your findings with us!

Best Regards,
Randy


Re: Best-guess origins of the major R-U106 haplogroups: methods and results #origins

 

¿ªÔÆÌåÓý

Iain can now add Indo-European linguistics to his portfolio of expertise. However, one small point might cause confusion: North Germanic languages only reached the Black Sea through the trading and raiding of mainly Swedish Vikings, who were known as Varangians.? Members of these groups formed the state of Kievan Rus and ultimately became Slavic speakers.? Centuries before, however, Greuthingi-Goths had penetrated into the Crimean Peninsula. Contrary to what is stated in the Wikipedia article, Gothic is normally classified as an East Germanic language.

I expect that when Iain wrote Black Sea he intended Baltic Sea.

It is interesting to note that Crimean Gothic survived to the late 18th century in some areas .


-Roy


On 11/13/23 04:34, Iain via groups.io wrote:

Hi Randy,


There are plenty people who will give you a more authoritative answer on this, however I'll give it my best shot.


The Germanic language family and, by extension, the Germanic people, can be split into three branches: Northern, Western and Eastern. The Northern Germanic group roughly co-incides with the earliest Germanic people around the shores of the Black Sea and Scandinavia,


Re: Best-guess origins of the major R-U106 haplogroups: methods and results #origins

 

¿ªÔÆÌåÓý

Hi Randy,


There are plenty people who will give you a more authoritative answer on this, however I'll give it my best shot.


The Germanic language family and, by extension, the Germanic people, can be split into three branches: Northern, Western and Eastern. The Northern Germanic group roughly co-incides with the earliest Germanic people around the shores of the Black Sea and Scandinavia, namely the Norwegian, Swedish, Danish and descendant peoples in the Faeroes, Iceland and Greenland, and parts of Finland. The Western Germanic group comprises modern Germans, but also the countries west of them, including the Dutch, Flemish and English languages. The Eastern Germanic group comprises language families in eastern Europe, which are now extinct but survive in Gothic texts. The Western Germanic group is often sub-divided.


How far back we can place the origins of a distinct Germanic people is debateable. Most references I've seen typically put this definition around 700-500 BC, and the culture didn't fragment into these later branches until the Germanic peoples started expanding, both south-west into Celtic central Europe as the Germanic tribes the Romans knew, and eastwards into what ultimately became the Gothic migrations down towards the Black Sea and into Rome. These migrations and expansions were a gradual process across many centuries, and subsequent migrations (e.g. within Prussia) have erased many differences that there once were.


Part of what we can do with genetic genealogy is use the TMRCA estimates that we create to merge with what is known from an archaeological understanding of this period of history, and help chart the migrations of people that we see evidenced in the archaeology. As you may anticipate, this isn't a straight-forward process.


Cheers,


Iain.


Re: Best-guess origins of the major R-U106 haplogroups: methods and results #origins

 
Edited

"R-U106>Z381>L48>Z9>Z2>Z7>Z8>Z1>Z346>Z343>CTS5601

Likely MRCA data range: 1100-100 BC

Likely origin: Scandinavia, likely Sweden

Culture: proto- or early Germanic (if early), north Germanic if later

Narrative evidence: R-CTS5601 shows much stronger Scottish (and, to a lesser degree, Irish) themes than R-FGC11784, or much of the rest of R-Z8 as a whole. It remains strong in England, though only typically so compared to R-Z381 as a whole. In continental Europe, it is strong in the Netherlands, and moderately present in Germany and France. In Scandinavia, it is very strong in Sweden and Finland, and present in Norway. Sporadic returns are seen in eastern Germanic branches, but this is very much focussed towards north Germanic areas, with some western Germanic migrations happening later. Consequently, it appears that this predates the major split of the Germanic peoples into their three traditional branches, but has latterly become most associated with the northern Germanic groups."

So CTS5601 predates the split of the three traditional branches of the Germanic people. What are the three branches that you're referring to here, Iain?


Re: Testing statistics and country-level bias update

 

¿ªÔÆÌåÓý

Hi Mike,


For R-FGC910, see R-Z7 and R-FGC902 on this post:

/g/R1b-U106/message/5759


The period of history is very difficult to make predictions about. It's too far back in history for historical sources or modern distributions to accurately reflect origins without complex interpretation, and it's too far forward in history for the known origins of R-U106 to be instructive. Apart from some stand-out cases, we need better modelling in this period before we can be very confident in our results.


Cheers,


Iain.


Re: Testing statistics and country-level bias update

 

Hi Ian,

This data is really interesting. I run a small project for "Gleave" surname which is quite rare, mainly present in Lancashire and Cheshire. We are downstream of R-FGC910 and after spending a few years alone on???, a new big y match with the surname "Marchant" joined me on a new haplogroup,??R-FTA81892.

Are there any indications of the geographical origin of R-FGC910? An analysis of the different branches reveals diverse locations such as Sweden, England, Scotland, Germany. The Globetrekker tool has R-FCG 902 moving swiftly back to the continent following Z-2's arrival, but I would guess this is not accurate.

All the best,

Mike Gleave



On Saturday, 11 November 2023 at 16:10:18 CET, Iain via groups.io <gubbins@...> wrote:


Hi folks,


About November time I like to go through the haplotree, looking to see how much different branches and different countries have grown over the previous year. This helps us understand both what the country-level and haplogroup-level biases are, how they change over time, and where we might need to focus our efforts to help get the right people tested. The other reason for analysing these statistics is that it feeds back into the question of origins: where are people of particular haplogroups more or less likely to come from, hence where are specific haplogroups more likely to originate? But that's a question that's going to take me more time to answer than I have right now! Instead, see earlier messages like #5759. Thanks particularly to Ewenn who, late last year, gave me some code to speed up this process hugely.


Here, I'll be comparing two time periods: Nov 2022 - Nov 2023, and Nov 2019 to Nov 2023.


In general, this year continues the trend of previous years. The British Isles bias continues to get slightly worse, but this is more than counterbalanced by the extra information that new testers are bringing in.


Globally, the size of the FTDNA haplotree has increased by 6.6% in the last year, which is a slower rate than the average of the last four years (8.5%). The haplotree contains information from a variety of sources, so we can't infer information about how fast FTDNA's customer base is increasing from this information.


This increase has been disproportionately from people who cannot trace their origins back to Europe. The size of the European testing population has grown by only 6.1%. The British Isles bias now stands as follows: testers from the British Isles now make up 45% of European testers in the database, despite the British Isles making up only 8.6% of the comparative modern population, so we over-sample the British Isles by a factor of about 5.2 compared to the rest of Europe. It should be noted, however, that modern populations are not always indicative of historical populations. If we instead use population estimates from 1800, where the typical person's earliest-known ancestor lived, we find a British Isles bias of 5.9 instead. Some specific country or region-level modern/historical biases are:

England 2.3/1.6

Scotland 0.39/0.57

Wales 1.59/1.41

N.I. 0.71/1.6

Ireland 0.25/0.84

France 11.7/24.0

Germany 5.5/6.4

Netherlands 8.3/4.9

Poland 6.8/6.0

Czechia 6.5/11.5

Austria 9.5/14.1

Denmark 4.8/3.5

Scandinavia+Finland 1.45/1.16

European former USSR 15.0/16.1

Balkans+Turkey 20.9/13.9

Meditteranean 10.8/12.9


The further down we go in the tree, the faster the increase in testing becomes. This is because we get rid of the customers and studies that have only undertaken limited testing, and are increasingly left with only BigY testers. This is therefore a better estimator of the speed at which branches relevant to us are increasing.


The R-U106 testing population has increased by about 11.5% globally, by 10.9% in people with known European ancestry, and by 10.1% in the British Isles. These are very similar rates to those over the last four years, so the growth is fairly constant. These rates are faster than the growth in R-P312 (7.9% globally), but this may reflect the historical depth of testing rather than any inherent behaviour in R-U106! R-U106 testers make up 8.55% the haplotree at FTDNA. We've seen above-average increases in Northern Ireland and among the former Eastern Bloc countries, particularly the Czech Republic, but also Poland. We've seen below-average increases in Scotland, Belgium, Norway, Finland and Russia - the latter largely because FTDNA now offer sub-populations within Russia rather than because of geopolitics.


We can step down further to R-Z2265, which represents almost the whole of R-U106, but ignores the ~1240 testers that have only tested as far as R-U106 with single SNP or SNP-pack testers, i.e. mostly BigY testers. There are 19692 Z2265+ entries globally (an increase of 13.2%) and 8306 in Europe (an increase of 12.1%). This rate is slightly above the average for the last four years (12.4%/11.3%). Given the increase in database size, that shows that the rate of BigY testing is still proportionally increasing. At this rate, the database size doubles about every six years meaning that an average tester will have to wait about six years to receive a match closer to them and a new haplogroup designation (obviously not true for people who have purposefully tested close relatives).


Different haplogroups have grown faster than others. The reasons behind this haven't always been clear! For example, testing in R-Z18 and particularly R-L257 have been growing much faster this year (13.1%, 14.2%) than their average in the previous four years (10.2%, 9.6%). Testing in R-Z156 and particularly R-DF96 have slowed (from 13.9% to 10.8% and 12.6% to 10.3%). R-DF98 continues to out-perform other haplogroups in terms of deep testing (22.0% to 17.4%). R-L47 is below average (9.4% to 10.1%) but R-Z9 have increased testing (13.9% to 16.2%). R-FGC910 in particular has increased in size by a sizeable 20.7% on the year, and R-Z343 by 15.8%. Hopefully many of you will have seen these increases among your matches.


Cheers,


Iain.


Re: Projects for Sub-groups of U106

 

¿ªÔÆÌåÓý

Thank you sir!?

Robert McMillan

On Nov 11, 2023, at 9:52 AM, Tiger Mike <mwwdna@...> wrote:

?Robert McMillan, I have been checking and it appears I've sent it everyone I can from the R1b All Subclades project to the R-U106 project that is tested U106. There are a few people who only grant "minimum access" to admins.


Re: Projects for Sub-groups of U106

 

Robert McMillan, I have been checking and it appears I've sent it everyone I can from the R1b All Subclades project to the R-U106 project that is tested U106. There are a few people who only grant "minimum access" to admins.


Another request for case studies

 

¿ªÔÆÌåÓý

Hi again folks,


As many of you know, I'm in the process of updating the textbook I'm writing with the Strathclyde team.


We're now looking at the final chapters and would like a short case study describing where someone has had zero useful Y-STR matches, but found out a lot more information about their ancestry when they upgraded to BigY (or YElite, WGS or a similar sequencing test). This is to showcase the value of upgrading to BigY even if Y-STRs haven't been particularly useful to you.


If any of you have a case study that you'd like to put forward, please get in touch with my privately ASAP!


Best regards,


Iain.


Testing statistics and country-level bias update

 

¿ªÔÆÌåÓý

Hi folks,


About November time I like to go through the haplotree, looking to see how much different branches and different countries have grown over the previous year. This helps us understand both what the country-level and haplogroup-level biases are, how they change over time, and where we might need to focus our efforts to help get the right people tested. The other reason for analysing these statistics is that it feeds back into the question of origins: where are people of particular haplogroups more or less likely to come from, hence where are specific haplogroups more likely to originate? But that's a question that's going to take me more time to answer than I have right now! Instead, see earlier messages like #5759. Thanks particularly to Ewenn who, late last year, gave me some code to speed up this process hugely.


Here, I'll be comparing two time periods: Nov 2022 - Nov 2023, and Nov 2019 to Nov 2023.


In general, this year continues the trend of previous years. The British Isles bias continues to get slightly worse, but this is more than counterbalanced by the extra information that new testers are bringing in.


Globally, the size of the FTDNA haplotree has increased by 6.6% in the last year, which is a slower rate than the average of the last four years (8.5%). The haplotree contains information from a variety of sources, so we can't infer information about how fast FTDNA's customer base is increasing from this information.


This increase has been disproportionately from people who cannot trace their origins back to Europe. The size of the European testing population has grown by only 6.1%. The British Isles bias now stands as follows: testers from the British Isles now make up 45% of European testers in the database, despite the British Isles making up only 8.6% of the comparative modern population, so we over-sample the British Isles by a factor of about 5.2 compared to the rest of Europe. It should be noted, however, that modern populations are not always indicative of historical populations. If we instead use population estimates from 1800, where the typical person's earliest-known ancestor lived, we find a British Isles bias of 5.9 instead. Some specific country or region-level modern/historical biases are:

England 2.3/1.6

Scotland 0.39/0.57

Wales 1.59/1.41

N.I. 0.71/1.6

Ireland 0.25/0.84

France 11.7/24.0

Germany 5.5/6.4

Netherlands 8.3/4.9

Poland 6.8/6.0

Czechia 6.5/11.5

Austria 9.5/14.1

Denmark 4.8/3.5

Scandinavia+Finland 1.45/1.16

European former USSR 15.0/16.1

Balkans+Turkey 20.9/13.9

Meditteranean 10.8/12.9


The further down we go in the tree, the faster the increase in testing becomes. This is because we get rid of the customers and studies that have only undertaken limited testing, and are increasingly left with only BigY testers. This is therefore a better estimator of the speed at which branches relevant to us are increasing.


The R-U106 testing population has increased by about 11.5% globally, by 10.9% in people with known European ancestry, and by 10.1% in the British Isles. These are very similar rates to those over the last four years, so the growth is fairly constant. These rates are faster than the growth in R-P312 (7.9% globally), but this may reflect the historical depth of testing rather than any inherent behaviour in R-U106! R-U106 testers make up 8.55% the haplotree at FTDNA. We've seen above-average increases in Northern Ireland and among the former Eastern Bloc countries, particularly the Czech Republic, but also Poland. We've seen below-average increases in Scotland, Belgium, Norway, Finland and Russia - the latter largely because FTDNA now offer sub-populations within Russia rather than because of geopolitics.


We can step down further to R-Z2265, which represents almost the whole of R-U106, but ignores the ~1240 testers that have only tested as far as R-U106 with single SNP or SNP-pack testers, i.e. mostly BigY testers. There are 19692 Z2265+ entries globally (an increase of 13.2%) and 8306 in Europe (an increase of 12.1%). This rate is slightly above the average for the last four years (12.4%/11.3%). Given the increase in database size, that shows that the rate of BigY testing is still proportionally increasing. At this rate, the database size doubles about every six years meaning that an average tester will have to wait about six years to receive a match closer to them and a new haplogroup designation (obviously not true for people who have purposefully tested close relatives).


Different haplogroups have grown faster than others. The reasons behind this haven't always been clear! For example, testing in R-Z18 and particularly R-L257 have been growing much faster this year (13.1%, 14.2%) than their average in the previous four years (10.2%, 9.6%). Testing in R-Z156 and particularly R-DF96 have slowed (from 13.9% to 10.8% and 12.6% to 10.3%). R-DF98 continues to out-perform other haplogroups in terms of deep testing (22.0% to 17.4%). R-L47 is below average (9.4% to 10.1%) but R-Z9 have increased testing (13.9% to 16.2%). R-FGC910 in particular has increased in size by a sizeable 20.7% on the year, and R-Z343 by 15.8%. Hopefully many of you will have seen these increases among your matches.


Cheers,


Iain.


Re: FTDNA Administrators Conference 2023

 

¿ªÔÆÌåÓý

Hi Joe

?

There are already lots of Australians in the FTDNA database. A lot of people are testing first at AncestryDNA and then transferring to FTDNA. I certainly find in my Devon DNA Project that only a few of these people currently go on to order Y-DNA tests. My hope is that when people get their Family Finder haplogroup reports they will be inspired to take additional tests to join the Y-DNA database. I¡¯m hoping all our projects will get a big boost.

?

I think it¡¯s unlikely that FTDNA would want to accommodate the YSEQ testers. However, I would hope that many of these YSEQ testers would have taken an autosomal DNA test elsewhere so that they can transfer to FTDNA and get a haplogroup assignment. This would also allow them to join a project.

?

Illumina do allow companies to add custom Y-SNPs and mtDNA SNPs to the microarrays. 23andMe and LivingDNA already do this so that they can provide more details haplogroup assignments. It wasn¡¯t made clear at the conference if FTDNA would be doing the same or they would only be using the very limited number of Y-SNPs and mtDNA SNPs provided on the default microarray.

?

Debbie

?

From: [email protected] <[email protected]> On Behalf Of Joe Sprowl
Sent: Wednesday, November 8, 2023 9:10 PM
To: [email protected]
Subject: Re: [R1b-U106] FTDNA Administrators Conference 2023

?

Thank you Debbie and Charles for the update.

?

Sounds like importing YSEQ won't be an option in our lifetimes but at least we are getting more Aussies onboard! Now if we could only do something about France!!

?

- Joe

?

On Wed, Nov 8, 2023 at 3:58?PM Debbie <debbiekennett@...> wrote:

Charles

?

You provide a good summary of these discussions.

?

I think what FTDNA said was that everyone would eventually be forced to used two-factor authentication. The process will start with project admins. They are also requiring all admins to reset passwords, understandably so given that project admins can have access to hundreds and sometimes thousands of kits. FTDNA said they were still considering whether or not to implement a password reset for the entire database. They don¡¯t want anyone to lose access to valuable data. All the major companies are now starting to use two-factor authentication. It will benefit all of us in the long run and it¡¯s something we will have to get used to.

?

Like you, I wasn¡¯t sure what was going to happen with the haplogroup designations for Family Finder results. Are these only going to be displayed on the personal pages or is there going to be some way of displaying haplogroups without STRs on a public results page?

?

I have mixed feelings about the use of MyHeritage trees. One major limitation of MyHeritage is that you are restricted to a tree of just 250 people with a free account. I found one time that when my subscription lapsed I was locked out of my account because I had too many people in my tree. However, the FTDNA trees have always been very poor and it will be easier to maintain one small tree at MyHeritage rather than worrying about an additional tree at FTDNA. FTDNA did acknowledge that not everyone would be able to use the MyHeritage facility so trees from old accounts will be preserved in read-only mode.

?

They did also say that they were looking into offering a whole genome sequencing test though there is no timeline.

?

The Family Finder haplogroups are promised ¡°soon¡±.

?

Likewise the introduction of 2FA should be implemented very soon.

?

Also, FTDNA have finally updated what they call the ¡°onboarding¡± process so that the consent process for relative matching is separated from the consent process for law enforcement matching. This means that people now have to proactively opt in to law enforcement matching. Previously you were automatically opted in unless you dug down deep into your settings to opt out.

?

FTDNA are also being much more active on social media and they have published a lot of informative blog posts. If you¡¯re not already doing so it¡¯s well worth subscribing to their blog:

?

?

I was impressed by the size of the lab and the number of staff now employed by the company. I was originally concerned about the merger with MyDNA but it seems have resulted in a big spending spree and lots of new developments so I think it¡¯s worked out very well.

?

Best wishes

?

Debbie

?

From: [email protected] <[email protected]> On Behalf Of Charles via
Sent: Monday, November 6, 2023 11:18 PM
To: R1b-U106 IO Forum <[email protected]>
Subject: [R1b-U106] FTDNA Administrators Conference 2023

?

The FTDNA Admin Conferences were usually every year before the pandemic, but after 4 ? years, it was great to see old friends, now older, of course!??There is a lot that is new in terms of both people and lab equipment, including robotics.??It¡¯s interesting that robots are analyzing human DNA, but who is analyzing the robots¡¯ DNA?

?

Despite the long time gap between conferences, there weren¡¯t any dramatic announcements of new tests, as sometimes would happen during the pioneering years of the previous two decades.

?

I wrote down a few notes, but other U106 attendees should feel free to add or correct anything I¡¯m reporting here.??My notes are in the order of the presentations.

?

They are planning to increase collaboration or merging of the Customer Service and Group Projects (ie ¡°Groups¡±) support groups. From my standpoint as an Admin, this sounds a little ominous, since the Groups support has been very helpful to me over the years.??But trying to send a consistent message to admins and customers is a good idea.

?

With respect to Ancient Connections in Discover, they now have 6,100 of about 7,500 male ancient extractions in their system.??268 Big Y customers¡¯ closest match in the system is actually one of the ancient remains samples.??

?

Our own Dr Iain McDonald¡¯s name was cited in multiple presentations, and always very favorably.??We all know what a tremendous asset he is to U106, and as we also know, he was very involved in helping the Discover team get the ball rolling on the age analysis of the haplogroups.

?

For admins, there is (already I think) a place to in effect put up a Help Wanted sign on the Project Banner.

?

Undoubtedly the worst news from the weekend, perhaps at least partly due to the recent 23andMe incident, is that a ¡°forced password reset¡± is coming soon.??They will send out warnings, and the Admins access to the Admin system known as GAP, will go first.??But all accounts will be forced to reset the passwords.??If you manage accounts of other sample donors, living or deceased, and your own email address isn¡¯t on the account, you¡¯re going to most likely be out-of-luck ever being able to access it again. I don¡¯t understand the details, and other attendees or other computer geeks might be able to help with the discussion, and this pending nightmare should probably have its own thread here.??But my simple thought is that you should try to make sure ASAP that your own current email address is on every account that you manage, because they are going to email the password reset instructions to the email address on the account, and nobody else will be allowed to reset it.??You should also make sure your accounts have Beneficiary designations.??They mentioned two-factor authentication as a distinct possibility in what they¡¯re planning.??Another idea is that since the Fall Sale is on now, you might want to go ahead and order new tests for some of the kits you manage before the chaos begins.

?

Some good news is that Family Finder and Transfers will be given whatever Y-DNA or mtDNA designations can be obtained from these tests, similar to what you get from Ancestry or 23andMe.??It wasn¡¯t clear to me where this will be displayed???Maybe other attendees understood better, but it sounded like it would go where the current red or green haplogroup labels on the Y-haplotypes are now, EXCEPT OBVIOUSLY it won¡¯t over-write an actual Y-SNP test result, unless it¡¯s downstream from such a Y test.

?

They have an idea to add yet another feature to Discover, along with Ancient and Notable Connections, which would be Connections to results from Academic/Scientific Studies.

?

They are working hard to get a Mitotree up and running for mtDNA, which would be built from scratch from their mtDNA database and which will create new branches and update mtDNA haplogroup designations.??Beyond that, they will also have an mtDNA Discover tool.

?

I think people will take this next item as partly good and partly bad, but here goes.??They are going to replace their traditional genealogical family tree feature with the much better one from MyHeritage.

I think I understood there will be some free and hopefully not too difficult way to get your current tree into the MyHeritage system before they toss the current system overboard (for the second time).

?

They have some Wish List ideas, including moving to whatever future T2T Reference system replaces the current ¡°Build 38¡± reference, an mtDNA Block Tree, and WGS tests if they can get the cost down.

?

It was fun at the lab today watching the test kit vials being filled by robots instead of lab techs who can do ¡°more important things that robots can¡¯t¡±, and presumably less boring or monotonous.

?

There are now over 100,000 Big Y tests in the system.

?

With respect to the new owner from Down Under having a positive influence on getting more Aussies tested, they said actually Australia is one of the better tested countries.

?

They don¡¯t currently have plans to arrange Y-DNA Transfers for a fee from other Y labs, but it didn¡¯t sound like they were opposed to it (a non-answer answer).

?

Long-Read testing is still too expensive.

?

There were a number of long, entertaining presentations such as a racially-mixed (and mixed-up) adoption situation, whereas the news items I¡¯ve mentioned here were spit out too fast for me to do a decent job with my notes.??I probably can¡¯t answer your questions.??Fortunately, there were multiple other U106ers there, so I hope they won¡¯t be shy about adding, correcting, and discussing these items with y¡¯all.

?

?

?

?

?

?

Charles?


Re: FTDNA Administrators Conference 2023

 

Thank you Debbie and Charles for the update.

Sounds like importing YSEQ won't be an option in our lifetimes but at least we are getting more Aussies onboard! Now if we could only do something about France!!

- Joe


On Wed, Nov 8, 2023 at 3:58?PM Debbie <debbiekennett@...> wrote:

Charles

?

You provide a good summary of these discussions.

?

I think what FTDNA said was that everyone would eventually be forced to used two-factor authentication. The process will start with project admins. They are also requiring all admins to reset passwords, understandably so given that project admins can have access to hundreds and sometimes thousands of kits. FTDNA said they were still considering whether or not to implement a password reset for the entire database. They don¡¯t want anyone to lose access to valuable data. All the major companies are now starting to use two-factor authentication. It will benefit all of us in the long run and it¡¯s something we will have to get used to.

?

Like you, I wasn¡¯t sure what was going to happen with the haplogroup designations for Family Finder results. Are these only going to be displayed on the personal pages or is there going to be some way of displaying haplogroups without STRs on a public results page?

?

I have mixed feelings about the use of MyHeritage trees. One major limitation of MyHeritage is that you are restricted to a tree of just 250 people with a free account. I found one time that when my subscription lapsed I was locked out of my account because I had too many people in my tree. However, the FTDNA trees have always been very poor and it will be easier to maintain one small tree at MyHeritage rather than worrying about an additional tree at FTDNA. FTDNA did acknowledge that not everyone would be able to use the MyHeritage facility so trees from old accounts will be preserved in read-only mode.

?

They did also say that they were looking into offering a whole genome sequencing test though there is no timeline.

?

The Family Finder haplogroups are promised ¡°soon¡±.

?

Likewise the introduction of 2FA should be implemented very soon.

?

Also, FTDNA have finally updated what they call the ¡°onboarding¡± process so that the consent process for relative matching is separated from the consent process for law enforcement matching. This means that people now have to proactively opt in to law enforcement matching. Previously you were automatically opted in unless you dug down deep into your settings to opt out.

?

FTDNA are also being much more active on social media and they have published a lot of informative blog posts. If you¡¯re not already doing so it¡¯s well worth subscribing to their blog:

?

?

I was impressed by the size of the lab and the number of staff now employed by the company. I was originally concerned about the merger with MyDNA but it seems have resulted in a big spending spree and lots of new developments so I think it¡¯s worked out very well.

?

Best wishes

?

Debbie

?

From: [email protected] <[email protected]> On Behalf Of Charles via
Sent: Monday, November 6, 2023 11:18 PM
To: R1b-U106 IO Forum <[email protected]>
Subject: [R1b-U106] FTDNA Administrators Conference 2023

?

The FTDNA Admin Conferences were usually every year before the pandemic, but after 4 ? years, it was great to see old friends, now older, of course!??There is a lot that is new in terms of both people and lab equipment, including robotics.??It¡¯s interesting that robots are analyzing human DNA, but who is analyzing the robots¡¯ DNA?

?

Despite the long time gap between conferences, there weren¡¯t any dramatic announcements of new tests, as sometimes would happen during the pioneering years of the previous two decades.

?

I wrote down a few notes, but other U106 attendees should feel free to add or correct anything I¡¯m reporting here.??My notes are in the order of the presentations.

?

They are planning to increase collaboration or merging of the Customer Service and Group Projects (ie ¡°Groups¡±) support groups. From my standpoint as an Admin, this sounds a little ominous, since the Groups support has been very helpful to me over the years.??But trying to send a consistent message to admins and customers is a good idea.

?

With respect to Ancient Connections in Discover, they now have 6,100 of about 7,500 male ancient extractions in their system.??268 Big Y customers¡¯ closest match in the system is actually one of the ancient remains samples.??

?

Our own Dr Iain McDonald¡¯s name was cited in multiple presentations, and always very favorably.??We all know what a tremendous asset he is to U106, and as we also know, he was very involved in helping the Discover team get the ball rolling on the age analysis of the haplogroups.

?

For admins, there is (already I think) a place to in effect put up a Help Wanted sign on the Project Banner.

?

Undoubtedly the worst news from the weekend, perhaps at least partly due to the recent 23andMe incident, is that a ¡°forced password reset¡± is coming soon.??They will send out warnings, and the Admins access to the Admin system known as GAP, will go first.??But all accounts will be forced to reset the passwords.??If you manage accounts of other sample donors, living or deceased, and your own email address isn¡¯t on the account, you¡¯re going to most likely be out-of-luck ever being able to access it again. I don¡¯t understand the details, and other attendees or other computer geeks might be able to help with the discussion, and this pending nightmare should probably have its own thread here.??But my simple thought is that you should try to make sure ASAP that your own current email address is on every account that you manage, because they are going to email the password reset instructions to the email address on the account, and nobody else will be allowed to reset it.??You should also make sure your accounts have Beneficiary designations.??They mentioned two-factor authentication as a distinct possibility in what they¡¯re planning.??Another idea is that since the Fall Sale is on now, you might want to go ahead and order new tests for some of the kits you manage before the chaos begins.

?

Some good news is that Family Finder and Transfers will be given whatever Y-DNA or mtDNA designations can be obtained from these tests, similar to what you get from Ancestry or 23andMe.??It wasn¡¯t clear to me where this will be displayed???Maybe other attendees understood better, but it sounded like it would go where the current red or green haplogroup labels on the Y-haplotypes are now, EXCEPT OBVIOUSLY it won¡¯t over-write an actual Y-SNP test result, unless it¡¯s downstream from such a Y test.

?

They have an idea to add yet another feature to Discover, along with Ancient and Notable Connections, which would be Connections to results from Academic/Scientific Studies.

?

They are working hard to get a Mitotree up and running for mtDNA, which would be built from scratch from their mtDNA database and which will create new branches and update mtDNA haplogroup designations.??Beyond that, they will also have an mtDNA Discover tool.

?

I think people will take this next item as partly good and partly bad, but here goes.??They are going to replace their traditional genealogical family tree feature with the much better one from MyHeritage.

I think I understood there will be some free and hopefully not too difficult way to get your current tree into the MyHeritage system before they toss the current system overboard (for the second time).

?

They have some Wish List ideas, including moving to whatever future T2T Reference system replaces the current ¡°Build 38¡± reference, an mtDNA Block Tree, and WGS tests if they can get the cost down.

?

It was fun at the lab today watching the test kit vials being filled by robots instead of lab techs who can do ¡°more important things that robots can¡¯t¡±, and presumably less boring or monotonous.

?

There are now over 100,000 Big Y tests in the system.

?

With respect to the new owner from Down Under having a positive influence on getting more Aussies tested, they said actually Australia is one of the better tested countries.

?

They don¡¯t currently have plans to arrange Y-DNA Transfers for a fee from other Y labs, but it didn¡¯t sound like they were opposed to it (a non-answer answer).

?

Long-Read testing is still too expensive.

?

There were a number of long, entertaining presentations such as a racially-mixed (and mixed-up) adoption situation, whereas the news items I¡¯ve mentioned here were spit out too fast for me to do a decent job with my notes.??I probably can¡¯t answer your questions.??Fortunately, there were multiple other U106ers there, so I hope they won¡¯t be shy about adding, correcting, and discussing these items with y¡¯all.

?

?

?

?

?

?

Charles?