
SingleVC: Any-to-one voice conversion

SingleVC performs any-to-one (A2O) voice conversion (VC) through a self-supervised task (Xi → X̂si → X̂i). X̂si is PSDR-processed speech with a pitch shift of s. More details can be accessed here.
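The self-supervised setup above can be illustrated with a rough sketch of how one training pair might be built: a pitch-shifted copy of an utterance serves as the input and the untouched utterance as the reconstruction target. This page does not describe PSDR itself, so the code below swaps in a crude overlap-add pitch shifter as a stand-in. The function names (`pitch_shift`, `make_training_pair`), the frame size, and the shift range of ±4 semitones are all assumptions made for illustration, not SingleVC's actual implementation.

```python
import math
import random


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * k / n) for k in range(n)]


def time_stretch_ola(x, rate, frame=256):
    """Change duration by `rate` (>1 is longer) with plain overlap-add.

    Crude on purpose: no phase alignment, so real speech would show artifacts.
    """
    hop_out = frame // 2
    hop_in = hop_out / rate
    win = hann(frame)
    n_out = int(round(len(x) * rate))
    y = [0.0] * (n_out + frame)
    norm = [0.0] * (n_out + frame)
    t = 0
    while t * hop_out < n_out:
        src = int(round(t * hop_in))
        dst = t * hop_out
        for k in range(frame):
            if src + k < len(x):
                y[dst + k] += win[k] * x[src + k]
            norm[dst + k] += win[k]
        t += 1
    return [y[i] / norm[i] if norm[i] > 1e-8 else 0.0 for i in range(n_out)]


def resample_linear(x, n_out):
    """Linearly interpolate x onto n_out evenly spaced samples."""
    if n_out < 2 or len(x) < 2:
        raise ValueError("need at least two samples")
    step = (len(x) - 1) / (n_out - 1)
    out = []
    for k in range(n_out):
        pos = k * step
        i = min(int(pos), len(x) - 2)
        frac = pos - i
        out.append((1.0 - frac) * x[i] + frac * x[i + 1])
    return out


def pitch_shift(x, semitones):
    """Stand-in for PSDR (assumption): shift pitch, keep the sample count."""
    rate = 2.0 ** (semitones / 12.0)
    stretched = time_stretch_ola(x, rate)
    # Squeezing the stretched signal back to len(x) scales its pitch by `rate`.
    return resample_linear(stretched, len(x))


def make_training_pair(x, max_semitones=4, rng=None):
    """Return (X_i^s, X_i, s): shifted input, original target, shift used."""
    rng = rng or random.Random()
    s = rng.randint(-max_semitones, max_semitones)  # hypothetical shift range
    return pitch_shift(x, s), list(x), s


if __name__ == "__main__":
    # A 220 Hz tone at 16 kHz stands in for an utterance X_i.
    x_i = [math.sin(2.0 * math.pi * 220.0 * n / 16000.0) for n in range(2048)]
    x_shifted, target, s = make_training_pair(x_i, rng=random.Random(0))
    print(f"shift s = {s} semitones, input/target lengths: "
          f"{len(x_shifted)}/{len(target)}")
```

In this sketch a model would be trained to map the shifted input back to the target, so the target speaker's original pitch is the only output it ever learns to produce. A real pipeline would likely replace the overlap-add shifter with a higher-quality method.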

This page provides converted speech samples. The pretrained model was trained on speaker p249 (female, 22.5 minutes of speech) from the VCTK corpus.

p249_samples

  1. We also need a small plastic snake and a big toy frog for the kids.
  2. The weaknesses are few.
  3. In fact, they have the opposite effect.
Utterance Source
p249_004
p249_155
p249_316

VCTK

  1. F1, Ask her to bring these things with her from the store.
  2. F2, She can scoop these things into three red bags, and we will go meet her Wednesday at the train station.
  3. M1, Please call Stella.
  4. M2, He should have asked for a second opinion.
Utterance Source Convert
F1_p310_002
F2_p240_005
M1_p374_001
M2_p245_062

LibriSpeech

  1. F1, The visit went off successfully, as was to have been expected.
  2. F2, “He’s Gilbert Blythe,” said Marilla contentedly.
  3. M1, All judgements do not require examination, that is, investigation into the grounds of their truth.
  4. M2, And always that same pretext is offered–it looks like the thing.
Utterance Source Convert
F1_225_131256_000006_000002
F2_188_135249_000012_000000
M1_296_129659_000004_000005
M2_272_130225_000010_000007

VCC2020

  1. F1, If not, it’s about time somebody did.
  2. F2, The figures are adjusted for seasonal variation.
  3. M1, The trip was a disaster.
  4. M2, Sometimes, it helps to take a step back.
Utterance Source Convert
F1_TEF1_E10061
F2_SEF2_E10066
M1_SEM1_E10033
M2_TEM2_E20042

LJSpeech

  1. F1, especially as no more time is occupied, or cost incurred, in casting, setting, or printing beautiful letters
  2. F2, fourteen sixty-nine, fourteen seventy.
Utterance Source Convert
F1_LJ001-0012
F2_LJ001-0045

AISHELL

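  1. F1, 购房节奏暂时性放缓. (The pace of home buying has temporarily slowed.)
  2. M2, 目前房地产整体形势不是特别景气. (The overall real estate market is not particularly strong at present.)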
Utterance Source Convert
F1_BAC009S0234W0141
F2_BAC009S0091W0160

Wild

  1. M1, 我国发展仍然处于重要战略机遇期,但 (Our country's development is still in an important period of strategic opportunity, but)
  2. M2, food_and_medical_supplies.
Utterance Source Convert
F1_kh_42_2
F2_SEF2_E10066