Abstract
Nanopore sequencing generates noisy electrical signals that need to be converted into a standard string of DNA nucleotide bases using a computational step called basecalling. The performance of basecalling has critical implications for all later steps in genome analysis. Therefore, there is a need to reduce the computation and memory cost of basecalling while maintaining accuracy. We present RUBICON, a framework to develop efficient hardware-optimized basecallers. We demonstrate the effectiveness of RUBICON by developing RUBICALL, the first hardware-optimized mixed-precision basecaller that performs efficient basecalling, outperforming the state-of-the-art basecallers. We believe RUBICON offers a promising path to develop future hardware-optimized basecallers.
Original language | English |
---|---|
Article number | 49 |
Number of pages | 29 |
Journal | Genome Biology |
Volume | 25 |
Issue number | 1 |
DOIs | |
Publication status | Published - 16 Feb 2024 |
Keywords
- Basecalling
- Deep neural network
- Genomics sequencing
- Hardware acceleration
- Machine learning