Show HN: KVoiceWalk – Voice cloning for Kokoro TTS using random walk algorithms
a year ago
- #voice-cloning
- #audio-processing
- #machine-learning
- KVoiceWalk uses a random walk algorithm and hybrid scoring to clone target voices.
- The project builds on Kokoro and Resemblyzer to evolve new voice tensors.
- Target audio should be 20-30 seconds, 24000 Hz WAV, single speaker.
- Process involves finding closest matches, random walk, and saving best voices.
- Interpolation search improves starting population for random walk.
- Scoring combines Resemblyzer similarity, self similarity, and feature extraction.
- Harmonic mean in scoring allows balanced improvement across metrics.
- Future improvements could include genetic algorithms or predictive models.
- Multiple instances can run in parallel depending on hardware.