Show HN: KVoiceWalk – Voice cloning for Kokoro TTS using random walk algorithms

a year ago

KVoiceWalk uses a random walk algorithm and hybrid scoring to clone target voices.
The project builds on Kokoro and Resemblyzer to evolve new voice tensors.
Target audio should be 20-30 seconds, 24000 Hz WAV, single speaker.
Process involves finding closest matches, random walk, and saving best voices.
Interpolation search improves starting population for random walk.
Scoring combines Resemblyzer similarity, self similarity, and feature extraction.
Harmonic mean in scoring allows balanced improvement across metrics.
Future improvements could include genetic algorithms or predictive models.
Multiple instances can run in parallel depending on hardware.

Hasty Briefsbeta