Language model-guided anticipation and discovery of mammalian metabolites

  • Hantao Qiang
  • , Fei Wang
  • , Wenyun Lu
  • , Xi Xing
  • , Hahn Kim
  • , Sandrine A.M. Mérette
  • , Lucas B. Ayres
  • , Eponine Oler
  • , Jenna E. AbuSalim
  • , Asael Roichman
  • , Michael Neinast
  • , Ricardo A. Cordova
  • , Won Dong Lee
  • , Ehud Herbst
  • , Vishu Gupta
  • , Samuel L. Neff
  • , Mickel Hiebert-Giesbrecht
  • , Adamo Young
  • , Vasuk Gautam
  • , Siyang Tian
  • Bo Wang, Hannes Röst, Jatinder Baidwan, Russell Greiner, Li Chen, Chad W. Johnston, Leonard J. Foster, Aaron M. Shapiro, David S. Wishart, Joshua D. Rabinowitz, Michael A. Skinnider

Research output: Contribution to journalArticlepeer-review

Abstract

Despite decades of study, large parts of the mammalian metabolome remain unexplored1. Mass spectrometry-based metabolomics routinely detects thousands of small molecule-associated peaks in human tissues and biofluids, but typically only a small fraction of these can be identified, and structure elucidation of novel metabolites remains challenging2, 3–4. Biochemical language models have transformed the interpretation of DNA, RNA and protein sequences, but have not yet had a comparable impact on understanding small molecule metabolism. Here we present an approach that leverages chemical language models5, 6–7 to anticipate the existence of previously uncharacterized metabolites. We introduce DeepMet, a chemical language model that learns from the structures of known metabolites to anticipate the existence of previously unrecognized metabolites. Integration of DeepMet with mass spectrometry-based metabolomics data facilitates metabolite discovery. We harness DeepMet to reveal several dozen structurally diverse mammalian metabolites. Our work demonstrates the potential for language models to advance the mapping of the mammalian metabolome.

Original languageEnglish
JournalNature
Early online date14 Jan 2026
DOIs
StateE-pub ahead of print - 14 Jan 2026

Bibliographical note

Publisher Copyright:
© The Author(s) 2026.

Fingerprint

Dive into the research topics of 'Language model-guided anticipation and discovery of mammalian metabolites'. Together they form a unique fingerprint.

Cite this