Publications

861 result(s)
Coarse-to-Fine Text-to-Music Latent Diffusion
Lanzendörfer L.A., Lu T., Perraudin N., Herremans D., Wattenhofer R., 2025, Proceedings of ICASSP, India
An exploration of controllability in symbolic music infilling
R. Guo, D. Herremans., 2025, IEEE Access, 13, 54873-54891, https://ieeexplore.ieee.org/document/10938538
PRESENT: Zero-Shot Text-to-Prosody Control
Lam P., Zhang H., Chen N.F, Sisman B., Herremans D., 2025, IEEE Signal Processing Letters, 32, 776-780, https://ieeexplore.ieee.org/document/10838710
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Le D-V-T, Bigo L., Keller M., Herremans D., 2025, ACM Computing Surveys, 57.7, 1-40, https://arxiv.org/abs/2402.17467
Text2midi: Generating Symbolic Music from Captions
Bhandari K., Roy A., Wang K., Puri G., Colton S., Herremans D., 2025, Proceedings of AAAI, Philadelphia, https://www.arxiv.org/abs/2412.16526
Coarse-to-Fine Text-to-Music Latent Diffusion
Lanzendörfer L.A., Lu T., Perraudin N., Herremans D., Wattenhofer R., 2024, Audio Imagination: NeurIPS 2024 Workshop, Vancouver
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Melechovsky J., Mehrish A., Sisman B., Herremans D., 2024, Audio Imagination: NeurIPS 2024 Workshop, Vancouver, https://arxiv.org/abs/2410.13342
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
Melechovsky J., Mehrish A., Sisman B., Herremans D., 2024, Proceedings of IEEE Tencon, Singapore, https://arxiv.org/abs/2406.01018
Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder
Melechovsky J., Mehrish A., Sisman B., Herremans D., 2024, Proceedings of IEEE Tencon, Singapore, https://arxiv.org/abs/2211.03316
DisfluencySpeech — Single-Speaker Conversational Speech Dataset with Paralanguage
Wang K., Herremans D., 2024, Proceedings of IEEE Tencon, Singapore, https://arxiv.org/abs/2406.08820