Abstract:
Although the genome sequences of most studied organisms, like human, E. coli, and others are already known, de novo genome sequencing remains popular as a majority of genomes remains unknown. Unfortunately, sequencing machines are able to read only short fragments of DNA. Therefore, one of the basic steps in reconstructing novel genomes lies in putting these pieces of DNA, called ‘reads’, together into complete genome sequences using a process known as genome assembly. Reads joining, however, requires efficient detection of their overlaps.