Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection

Yoram Louzoun, Idan Alter, Loren Gragert, Mark Albrecht, Martin Maiers

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Regardless of sampling depth, accurate genotype imputation is limited in regions of high polymorphism which often have a heavy-tailed haplotype frequency distribution. Many rare haplotypes are thus unobserved. Statistical methods to improve imputation by extending reference haplotype distributions using linkage disequilibrium patterns that relate allele and haplotype frequencies have not yet been explored. In the field of unrelated stem cell transplantation, imputation of highly polymorphic human leukocyte antigen (HLA) genes has an important application in identifying the best-matched stem cell donor when searching large registries totaling over 28,000,000 donors worldwide. Despite these large registry sizes, a significant proportion of searched patients present novel HLA haplotypes. Supporting this observation, HLA population genetic models have indicated that many extant HLA haplotypes remain unobserved. The absent haplotypes are a significant cause of error in haplotype matching. We have applied a Bayesian inference methodology for extending haplotype frequency distributions, using a model where new haplotypes are created by recombination of observed alleles. Applications of this joint probability model offer significant improvement in frequency distribution estimates over the best existing alternative methods, as we illustrate using five-locus HLA frequency data from the National Marrow Donor Program registry. Transplant matching algorithms and disease association studies involving phasing and imputation of rare variants may benefit from this statistical inference framework.

Original languageEnglish
Pages (from-to)279-292
Number of pages14
JournalImmunogenetics
Volume70
Issue number5
DOIs
StatePublished - 1 May 2018

Bibliographical note

Publisher Copyright:
© 2017, Springer-Verlag GmbH Germany.

Keywords

  • Bayesian inference
  • DNA typing
  • HLA
  • Imputation
  • Rare variants

Fingerprint

Dive into the research topics of 'Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection'. Together they form a unique fingerprint.

Cite this