2 Comments
User's avatar
Mahesh Vashishtha's avatar

> But of course the reason we bother sequencing genomes is that they are not an exact match with the reference genome.

I always thought even getting the original reference genome was hard because reads could come from multiple points in the genome. Is that true? e.g.

genome:

AATAA

reads:

AAT

TAA

possible other genomes:

TAAT

AATAAT

although I guess this problem should be easier as you get more and/or longer reads.

Expand full comment
Gopal K. Vashishtha's avatar

I think the reference genome was first sequenced by Sanger sequencing to avoid this problem. But would need to confirm that

Expand full comment