A comparison to the DDSP method is presented in the following audio samples.
We trained 4 model targets for timbre transfer:cello, saxophone, trumpet and violin.
The six source domains used for the MOS:clarinet, violin, female singer, male singer, trumpet and saxophone.
Our method shows better results both on target domain similarity and melody preservation.