An unbiased comparison of immunoglobulin sequence aligners

Thomas Konstantinovsky, Ayelet Peres, Pazit Polak, Gur Yaari

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is critical for our understanding of the adaptive immune system's dynamics in health and disease. Reliable analysis of AIRR-seq data depends on accurate rearranged immunoglobulin (Ig) sequence alignment. Various Ig sequence aligners exist, but there is no unified benchmarking standard representing the complexities of AIRR-seq data, obscuring objective comparisons of aligners across tasks. Here, we introduce GenAIRR, a modular simulation framework for generating Ig sequences alongside their ground truths. GenAIRR realistically simulates the intricacies of V(D)J recombination, somatic hypermutation, and an array of sequence corruptions. We comprehensively assessed prominent Ig sequence aligners across various metrics, unveiling unique performance characteristics for each aligner. The GenAIRR-produced datasets, combined with the proposed rigorous evaluation criteria, establish a solid basis for unbiased benchmarking of immunogenetics computational tools. This work lays the groundwork for further improving the crucial task of Ig sequence alignment, ultimately enhancing our understanding of adaptive immunity.

Original language: English
Article number: bbae556
Journal: Briefings in Bioinformatics
Volume: 25
Issue number: 6
State: Published - 23 Sep 2024

Bibliographical note

Publisher Copyright:
© 2024 The Author(s).

Keywords

  • AIRR-seq
  • V(D)J recombination
  • benchmarking
  • immunoglobulin
  • sequence alignment
  • somatic hypermutation
