A couple of quick questions:
Was the 25 TB of raw data gathered from a single human genome?
What would be the size in bytes of a unique genomic fingerprint once the raw data is fully processed into high-confidence base calls (including non-coding regions)?
If we just look at coding regions and compress further by only looking at SNPs, how many bytes is that?
Considering that each base carries ~2 bits of information (four possible bases)... it would be super interesting to know how much space it takes to describe our uniqueness!
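For a rough sense of scale, here is a back-of-envelope sketch of the arithmetic behind those questions. It is only an estimate under stated assumptions: a haploid genome of about 3.1 billion bases, 2 bits per base, and roughly 4.5 million single-nucleotide differences from a reference genome per person. None of these figures come from the article, and the function names are made up for illustration.

```python
import math

# Assumptions (not from the article): approximate haploid genome length
# and 2 bits per base, since there are four possible bases (A, C, G, T).
GENOME_BASES = 3_100_000_000
BITS_PER_BASE = 2


def raw_genome_bytes(n_bases=GENOME_BASES, bits_per_base=BITS_PER_BASE):
    """Bytes needed to store every base with a fixed-width 2-bit code."""
    return n_bases * bits_per_base // 8


def variant_list_bytes(n_variants, n_bases=GENOME_BASES):
    """Bytes needed to store only differences from a reference genome.

    Each variant is stored naively as a position (enough bits to index
    any base in the genome) plus a 2-bit alternate base. No compression.
    """
    position_bits = math.ceil(math.log2(n_bases))
    allele_bits = 2
    return n_variants * (position_bits + allele_bits) // 8


# Assumed number of single-nucleotide differences per person.
ASSUMED_SNPS = 4_500_000

print(f"full sequence: {raw_genome_bytes() / 1e6:.0f} MB")
print(f"SNP list only: {variant_list_bytes(ASSUMED_SNPS) / 1e6:.1f} MB")
```

Under these assumptions the full sequence comes to about 775 MB and the naive SNP list to about 19 MB. Restricting to coding regions, or delta-encoding the positions, would shrink the second figure further, but by how much depends on numbers I have not tried to estimate here.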
Describing your total unique genetic profile would obviously require a lot more space, and wouldn't be constant across individuals/ancestral backgrounds (e.g. there's more genetic diversity in people of African descent).
The default assumption is that it's not your article, unless you prepend "Show HN" (or there's something obvious like your username matching the domain name).