Знайдено документів: 1
Інформація × Реєстраційний номер 2124U004700, Матеріали видань та локальних репозитаріїв Категорія Препринт Назва роботи The Impact of Remixing on Vocal Extraction in Low Data Regimes Автор Svystun Taras Дата публікації 01-01-2024 Постачальник інформації Український католицький університет Першоджерело https://hdl.handle.net/20.500.14570/4849 Видання Опис The field of music source separation aims to split a musical composition into 4 groups of instruments: vocals, drums, bass and others. Increasingly often, researchers use remixing and data augmentation to enlarge the amount of data and its diversity. The purpose of this thesis is to investigate what performance gains can be expected from remixing, as well as what optimal parameters are required for this. Therefore, this study uses the classical remixing method and tests a new method, which consists of mixing human speech with musical accompaniment. After finding the optimal hyperparameters, another data augmentations are used during the final training. The results of this methodology are the list of optimal parameters for remixing, all the steps necessary to reproduce the results, and the improved checkpoint for the SOTA MSS model - Band-split RNN, previously best publicly available. The findings indicate that remixing and data augmentation techniques are indeed powerful and necessary to improve vocal source separation. Додано в НРАТ 2025-05-09 Закрити
Матеріали
Препринт
Svystun Taras. The Impact of Remixing on Vocal Extraction in Low Data Regimes : публікація 2024-01-01; Український католицький університет, 2124U004700
Знайдено документів: 1

Оновлено: 2026-03-16