Hasty Briefsbeta

Bilingual

Show HN: KVoiceWalk – Voice cloning for Kokoro TTS using random walk algorithms

a year ago
  • #voice-cloning
  • #audio-processing
  • #machine-learning
  • KVoiceWalk uses a random walk algorithm and hybrid scoring to clone target voices.
  • The project builds on Kokoro and Resemblyzer to evolve new voice tensors.
  • Target audio should be 20-30 seconds, 24000 Hz WAV, single speaker.
  • Process involves finding closest matches, random walk, and saving best voices.
  • Interpolation search improves starting population for random walk.
  • Scoring combines Resemblyzer similarity, self similarity, and feature extraction.
  • Harmonic mean in scoring allows balanced improvement across metrics.
  • Future improvements could include genetic algorithms or predictive models.
  • Multiple instances can run in parallel depending on hardware.