Abstract
We propose a novel resistive edit distance-tolerant content addressable memory for computational genomics applications, particularly for detection and identification of pathogens of pandemic importance. Unlike state-of-the-art approximate search solutions that tolerate small number of replacements between the query pattern and the stored data, DIPER tolerates insertions and deletions, ubiquitous in genomics. DIPER achieves up to 1.7× higher <italic>F</italic>1 score for high-quality DNA reads and up to 6.2× higher <italic>F</italic>1 score for DNA reads with 15% error rate, compared to state-of-the-art DNA classification tool Kraken2. Simulated at 500MHz, DIPER provides 910× average speedup over Kraken2.
Original language | English |
---|---|
Pages (from-to) | 1-12 |
Number of pages | 12 |
Journal | IEEE Transactions on Computers |
DOIs | |
State | Accepted/In press - 2023 |
Bibliographical note
Publisher Copyright:Author
Keywords
- Bioinformatics
- DNA
- DNA detection and classification
- Genomics
- Hamming distances
- Pandemics
- Pathogens
- Sequential analysis
- approximate search
- content addressable memory
- memristors
- resistive memory