The role of read alignment in bioinformatics
> But of course the reason we bother sequencing genomes is that they are not an exact match with the reference genome.
I always thought even getting the original reference genome was hard because reads could come from multiple points in the genome. Is that true? e.g.
genome:
AATAA
reads:
AAT
TAA
possible other genomes:
TAAT
AATAAT
although I guess this problem should be easier as you get more and/or longer reads.
I think the reference genome was first sequenced by Sanger sequencing to avoid this problem. But would need to confirm that
> But of course the reason we bother sequencing genomes is that they are not an exact match with the reference genome.
I always thought even getting the original reference genome was hard because reads could come from multiple points in the genome. Is that true? e.g.
genome:
AATAA
reads:
AAT
TAA
possible other genomes:
TAAT
AATAAT
although I guess this problem should be easier as you get more and/or longer reads.
I think the reference genome was first sequenced by Sanger sequencing to avoid this problem. But would need to confirm that