Sound examples
Contact: {merlijn.blaauw, jordi.bonada}@upf.edu
Presented at ICASSP 2020, May 4-8, 2020, Barcelona, Spain.
Feed-forward Transformer Seq2Seq model, with neural vocoder, effects and background music.
The model predicts timbre and phonetic timings, while F0 and note onsets (vowel onsets) are obtained from a reference recording.
The proprietary dataset used in these experiments was provided by Voctro Labs.