Abstract
This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve reproducibility, Kaldi-style recipes on common audio source separation datasets are also provided. This paper describes the software architecture of Asteroid and its most important features. By showing experimental results obtained with Asteroid's recipes, we show that our implementations are at least on par with most results reported in reference papers. The toolkit is publicly available at github.com/mpariente/asteroid.
| Original language | English |
|---|---|
| Title of host publication | Interspeech 2020 |
| Publisher | International Speech Communication Association |
| Pages | 2637-2641 |
| Number of pages | 5 |
| ISBN (Print) | 9781713820697 |
| DOIs | |
| State | Published - 2020 |
| Externally published | Yes |
| Event | 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 - Shanghai, China Duration: 25 Oct 2020 → 29 Oct 2020 |
Publication series
| Name | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
|---|---|
| Volume | 2020-October |
| ISSN (Print) | 2308-457X |
| ISSN (Electronic) | 1990-9772 |
Conference
| Conference | 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020 |
|---|---|
| Country/Territory | China |
| City | Shanghai |
| Period | 25/10/20 → 29/10/20 |
Bibliographical note
Publisher Copyright:© 2020 International Speech Communication Association. All rights reserved.
Keywords
- End-to-end
- Open-source software
- Source separation
- Speech enhancement