Can a new algorithm improve the accuracy of DNA sequencing? This paper presents LifeTrace, a novel basecalling algorithm designed to convert raw chromatogram data into nucleotide sequences, addressing challenges in electrophoresis sequencing. LifeTrace emphasizes local chromatogram information to better handle variable peak spacing. LifeTrace was tested against Phred, a widely used basecalling software, using MegaBACE capillary sequencing machines. LifeTrace significantly reduced substitution and insertion/deletion errors, generating more aligned bases in dye-primer and dye-terminator chromatograms. It introduces a new gap-quality score for deletion error detection. LifeTrace offers high-quality basecalls and reliable quality scores, proving effective for MegaBACE capillary sequencing. Its performance improvements translate to more accurate SNP detection and improved genome analysis. The new benchmark protocol offers better discernment in basecaller performance, furthering research into genomics and biotechnology.
Genome Research, known for its focus on genomic technologies, provides the perfect platform for this research. This paper’s detailed description of LifeTrace directly addresses improving basecalling accuracy, aligning with the journal's dedication to innovations in genome analysis and sequencing technology. The emphasis on benchmark testing and SNP detection enhances the study’s significance for genomic research.