Publications

861 result(s)
Mustango: Toward Controllable Text-to-Music Generation
Melechovsky, J., Guo, Z., Ghosal, D., Majumder, N., Herremans, D., Poria, S., 2024, Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Mexico City, Mexico, https://arxiv.org/abs/2311.08355
SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech
Lam P., Zhang H., Chen N.F, Sisman B., Herremans D., 2024, Proceedings of IEEE Tencon, Singapore, https://arxiv.org/abs/2211.07283
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
Kang, J. and Poria, S. and Herremans, D., 2024, Expert Systems with Applications, 249, 123640, https://arxiv.org/abs/2311.00968
A Multimodal Model with Twitter Finbert Embeddings for Extreme Price Movement Prediction of Bitcoin
Zou Y., Herremans D., 2023, Expert Systems with Applications, 233, 120838, https://doi.org/10.1016/j.eswa.2023.120838
Constructing Time-Series Momentum Portfolios with Deep Multi-Task Learning
Ong, J., and Herremans, D, 2023, Expert Systems with Applications, 230 (2023), 120587, https://doi.org/10.1016/j.eswa.2023.120587
Learning accent representation with multi-level VAE towards controllable speech synthesis
J. Melechovsky, A. Mehrish, D. Herremans, B. Sisman, 2023, IEEE Spoken Language Technology (SLT) Workshop, Doha, Quatar
A Domain-Knowledge-Inspired Music Embedding Space and a Novel Attention Mechanism for Symbolic Music Modeling
N. Guo, J. Kong, D. Herremans, 2023, Proceedings of the 37th AAAI Conference on Artificial Intelligence, Washington DC
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
K. W., Cheuk, Sawata, R., Uesaka, T., Murata, N., Takahashi, N., Takahashi, S., .Herremans D., Mitsufuji, Y., 2023, Proceedings of ICASSP, Rhodes Island, Greece, https://arxiv.org/abs/2210.05148
MERP: A Music Dataset with Emotion Ratings and Raters’ Profile Information
Koh, E.Y.; Cheuk, K.W.; Heung, K.Y.; Agres, K.R.; Herremans, 2023, D. Sensors 2023, 23, 382
A Gaussian mixture classifier model to differentiate respiratory symptoms using phonated /ɑː/ sounds
Balamurali B T, H.I. Hee, C.M. Ying Lin, P. Priyadarshinee, C.J. Clarke, D. Herremans, J.M. Chen, 2022, The 18th Australasian International Conference on Speech Science and Technology (SST), Canberra, Australia, https://sst2022.com/a-gaussian-mixture-classifier-model-to-differentiate-respiratory-symptoms-using-phonated-a%cb%90-sounds/