IMSEQ-A fast and error aware approach to immunogenetic sequence analysis

Leon Kuchenbecker, Mikalai Nienen, Jochen Hecht, Avidan U. Neumann, Nina Babel, Knut Reinert, Peter N. Robinson

Research output: Contribution to journalArticlepeer-review

74 Scopus citations

Abstract

Motivation: Recombined T-and B-cell receptor repertoires are increasingly being studied using next generation sequencing (NGS) in order to interrogate the repertoire composition as well as changes in the distribution of receptor clones under different physiological and disease states. This type of analysis requires efficient and unambiguous clonotype assignment to a large number of NGS read sequences, including the identification of the incorporated V and J gene segments and the CDR3 sequence. Current tools have deficits with respect to performance, accuracy and documentation of their underlying algorithms and usage. Results: We present IMSEQ, a method to derive clonotype repertoires from NGS data with sophisticated routines for handling errors stemming from PCR and sequencing artefacts. The application can handle different kinds of input data originating from single-or paired-end sequencing in different configurations and is generic regarding the species and gene of interest. We have carefully evaluated our method with simulated and real world data and show that IMSEQ is superior to other tools with respect to its clonotyping as well as standalone error correction and runtime performance.

Original languageEnglish
Pages (from-to)2963-2971
Number of pages9
JournalBioinformatics
Volume31
Issue number18
DOIs
StatePublished - 15 Sep 2015

Bibliographical note

Publisher Copyright:
© 2015 The Author.

Fingerprint

Dive into the research topics of 'IMSEQ-A fast and error aware approach to immunogenetic sequence analysis'. Together they form a unique fingerprint.

Cite this