¿ªÔÆÌåÓý

ctrl + shift + ? for shortcuts
© 2025 Groups.io

Re: Should I upgrade to Y37- BIGY Y700 ?

 

Hi Robert,
I have found when working with autosomal, that it is much easier to sort matches that come from my mother (over 5000 matches) and father (over 7000 matches) rather than using my own dna (just over 700 matches). I also have both which give me my dad's MT line as well as my mother's.

I also discovered that my mother and one of my sisters matched my father-in-law and one of his daughters at the 4th to distant cousins. The connection would have been in middle Europe in the early 1800's (Germany or the Netherlands) although I can't quite find the common ancestor, but I do know the surname in common.

On Saturday, June 22, 2024 at 02:32:40 PM CDT, Robert McMillan via groups.io <tensawmac@...> wrote:


I agree, my Dante test realigned to 60.3Mbp (from 23.6)
My dad¡¯s Nebula test from 23.6 to 60.1M BP

I have seen some Nebula results at or around 45mbp, and I think these come from Nebula realigning just the y bam. I¡¯ve read where others have had better luck realigning the complete BAM and then extracting the Y, I don¡¯t know enough about this to make a comment one way or the other. Only that is what I did on the two tests mentioned above.?

Robert McMillan

On Jun 22, 2024, at 1:41?PM, 6458923@... wrote:

?On Sat, Jun 22, 2024 at 08:30 PM, vineviz wrote:
"WGS files from Nebula, when aligned to T2T almost never produce coverage of more than 45 mbp."This sentence is completely wrong. ?My own Nebula 30x WGS result R-Y278111* -- T2T .BAM Nebula Genomics 11X, 59.8 Mbp, 150 bp and several other results I know have readings over 60 million bases. Please don't say "Never" if you don't have enough information.?


Re: Should I upgrade to Y37- BIGY Y700 ?

 



On Sat, Jun 22, 2024 at 4:43?PM Wayne via <dna_wayne=[email protected]> wrote:
What Iain is missing about long read technology is that there will be additionat STRs reported.? The current short read NGS technology is actually worse than the original Sanger technology in terms of identifying STRs and reporting out on those with longer/arger motifs.? ?FTDNA took a step backwards in reporting STRs when them moved from Sanger to NGS based techniques.? ? Long read technologies will also provide better insight into sequence structural deletions and rearrangements which NGS may not correctly identify or report on.

Wayne

On Saturday, June 22, 2024 at 03:57:04 PM EDT, Iain via <gubbins=[email protected]> wrote:


There are a lot of potentially iffy comparisons going on in the last few posts on this thread, so I think it's important to step back and think about what these numbers mean. I'm aware that many of the people posting are aware of most of the following, but I think it's important for the benefit of the wider audience - I'll try to pitch this at a fairly intermediate level. We first need to define two terms:


Total number of loci = the number of base pairs that have any reads in a test


Callable loci = the number of base pairs that where a mutation can be securely identified or dismissed


The numbers are often very different from each other, and even between different estimates of the same number by different individuals/organisations, because the numbers depend strongly on the quality thresholds that are being used by individuals or companies. Two of the most important are the read quality and the mapping quality. The read quality says the security with which an allele can be called A, C, G or T. The mapping quality says the probability that that section of DNA has been accurately mapped back onto the reference sequence. It's the mapping quality that is more important for this discussion. James Kane keeps a set of benchmarks from each test type here, which are homogeneously reduced and therefore can be directly compared between tests and companies:

Nebula's 30x 150bp test currently benchmarks at around 15 million callable loci and 23 million total reads when mapped to GRCh38. When a T2T reference is used, this increases to about 16 million callable loci and normally about 45 million total reads, but can be between about 30 and 60 million. Note that many people will not have 60 million base pairs in their entire Y chromosome to begin with.


The Y chromosome contains around 23-24 million base pairs which can be termed the "readable Y". Typically, this is the limit for the total number of loci returned by BigY or WGS tests. If you have more than this total number of loci, then you must be mapping parts of the "non-readable Y", which may include the centromere, DYZ19 and Yq12 regions. These regions are considered non-readable because they contain many repeating sequences, which are all longer than the 100-150 base pair chunks that these tests are split up into. DYZ19 is just about readable in parts because the repeat length (125bp) is close to the read length. But the centromere and Yq12 regions contain many repeats of identical sequences that span many hundreds or even thousands of base pairs. There is no way you can take a read from a short-read test and accurately map it back onto the long repeats in the centromere and Yq12. Any reads from a 100bp or 150bp WGS test that are recovered beyond about 24 million base pairs are effectively useless, because they cannot be accurately mapped back to a place on the chromosome, even if the alignment software suggests they can - your map of the centromere and Yq12 regions may look very different from the T2T reference sequence due to large-scale mutations between haplogroups, which we simply don't yet know enough about, and the mapping software can easily conflate one SNP for another in such circumstances. For example, you might be able to state that there may(!) be a SNP somewhere in the DYZ3 region, but you won't be able to say whether it's a real SNP or just a bunching up of bad reads, or what the true location of the SNP is within the DYZ3 region, or whether it's phylogenically identical to the same SNP in a different test (since it could be on a different repeat).


This brings us to callable loci. If you can accurately map several reads back to a reference chromosome, and if all those reads pass certain quality thresholds, then you can call whether or not a SNP exists in that location. The very best that's achievable with current 100x or 150x base pair technology is about 23-24 million base pairs but, in practice, the limitation is normally closer to 14-16 million base pairs. This is the useful part of the test and the real number that matters (unless you plan on combining multiple tests together). You cannot expect to go any higher than this without increasing the read length.


So it's the callable loci that matter for almost every genetic genealogy application. We won't be able to make meaningful use of these extra tens of millions of non-callable loci without long-read technology like T2T and a much better understanding of the structural variation of the Y chromosome on large scales across different haplgroups.


When that becomes available commercially, it will be an incremental addition for most people. All most people can expect are a bunch of extra SNPs within many of their haplogroups. That will help revise TMRCAs and hugely reduce the uncertainties, but it's not going to be a game-changer like BigY was when it arrived. The people it's going to be most useful for are those with recent surname problems - people who really need to squeeze every SNP and structural variant out of a test to separate individual generations in a genetic family tree. If the mutation rate can be brought substantially below one mutation per generation (instead of the current 83 years/SNP), then we can start to say a relationship between two testers might be (e.g.) two to four generations beyond their earliest known ancestor, rather than the swathes of centuries that current TMRCA estimates provide. There's real application there, but a large part of it is limited to a subset of people who have already taken BigY or similar tests.


- Iain.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

Hi Wayne - I was lumping STRs in with structural variants, since I think the applicability will be the same. Sure, we should be able to get the "original 111" without a separate test, but that's not a visible success to the end user unless it positively affects the pricing point. Otherwise the additional STRs are really only likely to create significant benefit from moderate improvements to the TMRCAs and splitting haplogroups at the generational level for the few who need that. That and, after nearly ten years, we still haven't got full benefit from the additional STRs beyond the first 111.

?

- Iain.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

On Sat, Jun 22, 2024 at 09:57 PM, Iain wrote:
thank you for your reply, we are far away from creating a complete T2T or De Novo Genome Assembly. There is currently not a single company in the world offering a telomere to telomere (T2T). For creating a real T2T sequence like hs1/CP086569.2 you need to use 18 flow cells of nanopore sequencing to get 166x coverage plus many high quality short read genomes from a specially grown cell line using a special Pore-C protocol. Plus they further used a couple of PacBio HiFi WGS runs. This is out of the reach for most citizen scientists. YSEQ is trying first baby steps to make at least long read nanopore sequencing available. .How does the Nebula WGS 100x help for T2T alignment?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

What Iain is missing about long read technology is that there will be additionat STRs reported.? The current short read NGS technology is actually worse than the original Sanger technology in terms of identifying STRs and reporting out on those with longer/arger motifs.? ?FTDNA took a step backwards in reporting STRs when them moved from Sanger to NGS based techniques.? ? Long read technologies will also provide better insight into sequence structural deletions and rearrangements which NGS may not correctly identify or report on.

Wayne

On Saturday, June 22, 2024 at 03:57:04 PM EDT, Iain via groups.io <gubbins@...> wrote:


There are a lot of potentially iffy comparisons going on in the last few posts on this thread, so I think it's important to step back and think about what these numbers mean. I'm aware that many of the people posting are aware of most of the following, but I think it's important for the benefit of the wider audience - I'll try to pitch this at a fairly intermediate level. We first need to define two terms:


Total number of loci = the number of base pairs that have any reads in a test


Callable loci = the number of base pairs that where a mutation can be securely identified or dismissed


The numbers are often very different from each other, and even between different estimates of the same number by different individuals/organisations, because the numbers depend strongly on the quality thresholds that are being used by individuals or companies. Two of the most important are the read quality and the mapping quality. The read quality says the security with which an allele can be called A, C, G or T. The mapping quality says the probability that that section of DNA has been accurately mapped back onto the reference sequence. It's the mapping quality that is more important for this discussion. James Kane keeps a set of benchmarks from each test type here, which are homogeneously reduced and therefore can be directly compared between tests and companies:
https://ydna-warehouse.org/benchmarks
Nebula's 30x 150bp test currently benchmarks at around 15 million callable loci and 23 million total reads when mapped to GRCh38. When a T2T reference is used, this increases to about 16 million callable loci and normally about 45 million total reads, but can be between about 30 and 60 million. Note that many people will not have 60 million base pairs in their entire Y chromosome to begin with.


The Y chromosome contains around 23-24 million base pairs which can be termed the "readable Y". Typically, this is the limit for the total number of loci returned by BigY or WGS tests. If you have more than this total number of loci, then you must be mapping parts of the "non-readable Y", which may include the centromere, DYZ19 and Yq12 regions. These regions are considered non-readable because they contain many repeating sequences, which are all longer than the 100-150 base pair chunks that these tests are split up into. DYZ19 is just about readable in parts because the repeat length (125bp) is close to the read length. But the centromere and Yq12 regions contain many repeats of identical sequences that span many hundreds or even thousands of base pairs. There is no way you can take a read from a short-read test and accurately map it back onto the long repeats in the centromere and Yq12. Any reads from a 100bp or 150bp WGS test that are recovered beyond about 24 million base pairs are effectively useless, because they cannot be accurately mapped back to a place on the chromosome, even if the alignment software suggests they can - your map of the centromere and Yq12 regions may look very different from the T2T reference sequence due to large-scale mutations between haplogroups, which we simply don't yet know enough about, and the mapping software can easily conflate one SNP for another in such circumstances. For example, you might be able to state that there may(!) be a SNP somewhere in the DYZ3 region, but you won't be able to say whether it's a real SNP or just a bunching up of bad reads, or what the true location of the SNP is within the DYZ3 region, or whether it's phylogenically identical to the same SNP in a different test (since it could be on a different repeat).


This brings us to callable loci. If you can accurately map several reads back to a reference chromosome, and if all those reads pass certain quality thresholds, then you can call whether or not a SNP exists in that location. The very best that's achievable with current 100x or 150x base pair technology is about 23-24 million base pairs but, in practice, the limitation is normally closer to 14-16 million base pairs. This is the useful part of the test and the real number that matters (unless you plan on combining multiple tests together). You cannot expect to go any higher than this without increasing the read length.


So it's the callable loci that matter for almost every genetic genealogy application. We won't be able to make meaningful use of these extra tens of millions of non-callable loci without long-read technology like T2T and a much better understanding of the structural variation of the Y chromosome on large scales across different haplgroups.


When that becomes available commercially, it will be an incremental addition for most people. All most people can expect are a bunch of extra SNPs within many of their haplogroups. That will help revise TMRCAs and hugely reduce the uncertainties, but it's not going to be a game-changer like BigY was when it arrived. The people it's going to be most useful for are those with recent surname problems - people who really need to squeeze every SNP and structural variant out of a test to separate individual generations in a genetic family tree. If the mutation rate can be brought substantially below one mutation per generation (instead of the current 83 years/SNP), then we can start to say a relationship between two testers might be (e.g.) two to four generations beyond their earliest known ancestor, rather than the swathes of centuries that current TMRCA estimates provide. There's real application there, but a large part of it is limited to a subset of people who have already taken BigY or similar tests.


- Iain.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

I understand, thank you


Re: Should I upgrade to Y37- BIGY Y700 ?

 

There are a lot of potentially iffy comparisons going on in the last few posts on this thread, so I think it's important to step back and think about what these numbers mean. I'm aware that many of the people posting are aware of most of the following, but I think it's important for the benefit of the wider audience - I'll try to pitch this at a fairly intermediate level. We first need to define two terms:


Total number of loci = the number of base pairs that have any reads in a test


Callable loci = the number of base pairs that where a mutation can be securely identified or dismissed


The numbers are often very different from each other, and even between different estimates of the same number by different individuals/organisations, because the numbers depend strongly on the quality thresholds that are being used by individuals or companies. Two of the most important are the read quality and the mapping quality. The read quality says the security with which an allele can be called A, C, G or T. The mapping quality says the probability that that section of DNA has been accurately mapped back onto the reference sequence. It's the mapping quality that is more important for this discussion. James Kane keeps a set of benchmarks from each test type here, which are homogeneously reduced and therefore can be directly compared between tests and companies:
https://ydna-warehouse.org/benchmarks
Nebula's 30x 150bp test currently benchmarks at around 15 million callable loci and 23 million total reads when mapped to GRCh38. When a T2T reference is used, this increases to about 16 million callable loci and normally about 45 million total reads, but can be between about 30 and 60 million. Note that many people will not have 60 million base pairs in their entire Y chromosome to begin with.


The Y chromosome contains around 23-24 million base pairs which can be termed the "readable Y". Typically, this is the limit for the total number of loci returned by BigY or WGS tests. If you have more than this total number of loci, then you must be mapping parts of the "non-readable Y", which may include the centromere, DYZ19 and Yq12 regions. These regions are considered non-readable because they contain many repeating sequences, which are all longer than the 100-150 base pair chunks that these tests are split up into. DYZ19 is just about readable in parts because the repeat length (125bp) is close to the read length. But the centromere and Yq12 regions contain many repeats of identical sequences that span many hundreds or even thousands of base pairs. There is no way you can take a read from a short-read test and accurately map it back onto the long repeats in the centromere and Yq12. Any reads from a 100bp or 150bp WGS test that are recovered beyond about 24 million base pairs are effectively useless, because they cannot be accurately mapped back to a place on the chromosome, even if the alignment software suggests they can - your map of the centromere and Yq12 regions may look very different from the T2T reference sequence due to large-scale mutations between haplogroups, which we simply don't yet know enough about, and the mapping software can easily conflate one SNP for another in such circumstances. For example, you might be able to state that there may(!) be a SNP somewhere in the DYZ3 region, but you won't be able to say whether it's a real SNP or just a bunching up of bad reads, or what the true location of the SNP is within the DYZ3 region, or whether it's phylogenically identical to the same SNP in a different test (since it could be on a different repeat).


This brings us to callable loci. If you can accurately map several reads back to a reference chromosome, and if all those reads pass certain quality thresholds, then you can call whether or not a SNP exists in that location. The very best that's achievable with current 100x or 150x base pair technology is about 23-24 million base pairs but, in practice, the limitation is normally closer to 14-16 million base pairs. This is the useful part of the test and the real number that matters (unless you plan on combining multiple tests together). You cannot expect to go any higher than this without increasing the read length.


So it's the callable loci that matter for almost every genetic genealogy application. We won't be able to make meaningful use of these extra tens of millions of non-callable loci without long-read technology like T2T and a much better understanding of the structural variation of the Y chromosome on large scales across different haplgroups.


When that becomes available commercially, it will be an incremental addition for most people. All most people can expect are a bunch of extra SNPs within many of their haplogroups. That will help revise TMRCAs and hugely reduce the uncertainties, but it's not going to be a game-changer like BigY was when it arrived. The people it's going to be most useful for are those with recent surname problems - people who really need to squeeze every SNP and structural variant out of a test to separate individual generations in a genetic family tree. If the mutation rate can be brought substantially below one mutation per generation (instead of the current 83 years/SNP), then we can start to say a relationship between two testers might be (e.g.) two to four generations beyond their earliest known ancestor, rather than the swathes of centuries that current TMRCA estimates provide. There's real application there, but a large part of it is limited to a subset of people who have already taken BigY or similar tests.


- Iain.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

¿ªÔÆÌåÓý

Good question. Probably not a good answer: ?I wanted my parents WGS to put on a drive. I don¡¯t know what the future holds and it is a record I thought I would be glad I have one day. If not me, my kids or grandkids.?

Of immediate benefit was my father¡¯s MTDNA information, and of course his autosomal. The cost was not terribly more than just doing a MtDNA test.?

YDNA was not the reason for the test. That is a result of playing around with WGS Extract.?

I do have my parent¡¯s ancestry kits, however I went ahead and did WGS on them both. I think autosomal is only limited by the computing abilities of the various platforms that host such results. If I am wrong, oh well.?

Robert McMillan

On Jun 22, 2024, at 2:38?PM, 6458923@... wrote:

?Thank you for your comment I have two questions
What were the benefits of testing your father?
What were the advantages of two T2T WGS?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

Thank you for your comment I have two questions
What were the benefits of testing your father?
What were the advantages of two T2T WGS?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

¿ªÔÆÌåÓý

I agree, my Dante test realigned to 60.3Mbp (from 23.6)
My dad¡¯s Nebula test from 23.6 to 60.1M BP

I have seen some Nebula results at or around 45mbp, and I think these come from Nebula realigning just the y bam. I¡¯ve read where others have had better luck realigning the complete BAM and then extracting the Y, I don¡¯t know enough about this to make a comment one way or the other. Only that is what I did on the two tests mentioned above.?

Robert McMillan

On Jun 22, 2024, at 1:41?PM, 6458923@... wrote:

?On Sat, Jun 22, 2024 at 08:30 PM, vineviz wrote:
"WGS files from Nebula, when aligned to T2T almost never produce coverage of more than 45 mbp."This sentence is completely wrong. ?My own Nebula 30x WGS result R-Y278111* -- T2T .BAM Nebula Genomics 11X, 59.8 Mbp, 150 bp and several other results I know have readings over 60 million bases. Please don't say "Never" if you don't have enough information.?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

On Sat, Jun 22, 2024 at 08:30 PM, vineviz wrote:
"WGS files from Nebula, when aligned to T2T almost never produce coverage of more than 45 mbp."This sentence is completely wrong. ?My own Nebula 30x WGS result R-Y278111* -- T2T .BAM Nebula Genomics 11X, 59.8 Mbp, 150 bp and several other results I know have readings over 60 million bases. Please don't say "Never" if you don't have enough information.?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

WGS files from Nebula, when aligned to T2T almost never produce coverage of more than 45 mbp. ?And FTDNA files average closer to 30 mbp than 25.

And that still doesn't mean that the difference (45 vs 30) is phylogenetically significant. ?I suspect most of those 15 "additional" mbp won't prove to be very useful.


On Sat, Jun 22, 2024 at 10:57 AM, <6458923@...> wrote:

WGSs, which offer short read, can currently call over 60 million bases. The BigY 700 is around 25 million, which is why I said it is insufficient. I think FTDNA will also offer a WGS test with short read? in a few years. I also agree with the view that long read will not become cheaper in the near future.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

WGSs, which offer short read, can currently call over 60 million bases. The BigY 700 is around 25 million, which is why I said it is insufficient. I think FTDNA will also offer a WGS test with short read? in a few years. I also agree with the view that long read will not become cheaper in the near future.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

I don't think it is reasonable to suggest that Big Y is inadequate.

?

For one thing, most of the 80 million bases are phylogenetically unhelpful. ?For this reason, full sequencing of the Y-chromosome - even if it were commercially available, which I don't think it currently is - provides only an incremental benefit over conventional sequencing with realignment to the novel reference.

I suspect affordable long-read sequencing is many years away. ?And when it IS available I think most of us will find the marginal benefit to be minimal.


Vince


On Sat, Jun 22, 2024 at 07:48 AM, <6458923@...> wrote:

The Big700 offers very inadequate coverage for the T2T reference, offering around 25 million bases. Human Ydna, on the other hand, can reach approximately 80 million bases, although this varies from person to person with recent studies.?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

The Big700 offers very inadequate coverage for the T2T reference, offering around 25 million bases. Human Ydna, on the other hand, can reach approximately 80 million bases, although this varies from person to person with recent studies.?


Re: Should I upgrade to Y37- BIGY Y700 ?

 

FTDNA have said they will eventually re-align all Big Y 700 tests to so-called "T2T" reference genomes, and they've already done this for some kits.

Vince


Re: Should I upgrade to Y37- BIGY Y700 ?

 

I will upgrade my test to Y111 in the next discount period.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

I applied a few days ago to set up a group project for R-Z49 at FTDNA.


Re: Should I upgrade to Y37- BIGY Y700 ?

 

Thank you very much for your advice. I will upgrade to the Y111 test at the next sale. If FTDNA announces a new improved test that offers a new T2T ALIGNMENT then I will do that. ??


Re: 2 matches in block tree but no big y matches

 

Thanks iain.? I don't think theirs any benefit and my current y111 testers shared surname financially can't afford the upgrade and I said don't both as I believe we would all be the same anyway.

For us it appears lack of genealogical records to confirm our connection which this will not resolve.

I am in the uk and they are all in the states.

Jason


On Wed, 19 Jun 2024, 12:48 Iain via , <gubbins=[email protected]> wrote:

Hi Jason,


Should you spend the $99 to get the BAM file? That depends on what use you're going to make of it.


First, let's understand the different classes of files. Put simply, FASTQ is the set of reads that come out of the DNA sequencer - strings of As, Gs, Cs and Ts (with information about the confidence in those reads) that represent the chopped up bits of DNA that have been read. The BAM format is what you get when you align those reads to a particular reference sequence - normally this is hg38, but it could be realigned to one of the new T2T sequences. The VCF format is what you get when you run through the BAM file to look for differences from that reference sequence - this is what you see in your FTDNA account and what you can download for free; it's very useful for people like me to check the details of variants and whether specific locations are covered.


Now let's consider what you can do with a BAM file that you can't do with a VCF file. Primarily, you can either read it yourself (which most people aren't going to do, because it's fairly technically involved), or you can pay more money to upload it to a third-party comparison site like YFull (€45) or Full Genomes Corp. ($50). You also then have a copy of your DNA results in case of a company-ending calamity at Family Tree DNA.


What do you gain by using these third-party sites? As Wayne points out, you can have your results compared to Y-DNA testers from other companies. This can be really useful for people who have tested outside FTDNA, but FTDNA makes up such a huge proportion of the Y-DNA testing market that you're unlikely to find a close match - however, there's nothing stopping you loading up their phylogenic tree and having a look to see if one exists in your haplogroup already. These sites also give you a "second opinion" on the SNP calls in your test results, including a realignment of your DNA to more up-to-date reference sequences like T2T.


Is that worth it for you, and worth the ~$150 it's going to cost you? It's worth remembering what your goals are. Many people are fixated on a Pokemon-like "got to get them all" attitude towards finding new SNPs. While finding more SNPs can be useful, they are only as useful as the information you can get out of them and the relationships they trace. That means these kind of third-party analyses aren't worth it for most testers, except those who fall into one of a few categories. Some of the most common are:
(1) You have an addiction to cool tools to analyse your DNA, in which case I'd like $100k for a research project. :)
(2) You have checked your haplogroup at YFull and found that you have a haplogroup member outside FTDNA that you really want to be compared to.
(3) You really care about specific STR results in your own test for some reason.


There are all sorts of intangible ramifications from using these third-party tools. The negatives include increasing the exposure of your DNA to malign groups; the positives include making your haplogroup more visible, encouraging competition among DNA-testing companies, etc. All of these are complex in their own right and these kind of BAM downloads / third-party uploads may not be the best way to address any of them. However, historically, when there's been a third-party service that Family Tree DNA have found useful, they've adopted that technology into their own system. We've seen this with Alex Williamson's tree interface (which became the Block Tree), YFull and my TMRCA estimates (which seeded the Discover platform), Dave Vance and Hunter Provyn's SNP Tracker and Phylogeography tools (which, despite the algorithmic limitations, have become co-opted into Globetrekker). Arguably the release of Y-DNA haplogroups from Family Finder tests comes out of tools like MorleyDNA's too. If you think these kind of third-party tools will be useful, then giving them support can eventually make them available to everyone.

?

In summary, the benefit of paying for this download really depends on you having a case for needing it. Speaking personally, I downloaded my BAM file because I have older test and it was free, but I can't say I've made much use of it. I got FGC to analyse my file and report and name my novel variants back when that mattered, but that has less relevance now. There's nothing currently that these third-party sites have been able to offer me that is both of value to me and that I can't get from using a VCF file. Your case may be different, but I'd encourage you to identify an end goal before spending more money.


Cheers,


Iain.