Speech recognition and translation: faster-whisper
TTS with voice cloning: XTTSv2
interface: SillyTavern
It can translate all major languages to English and vice versa.
how to mix it all together: github.com/Mozer/wav2lip_extension
Негізгі бет whisper + XTTSv2 = live audio translation with voice cloning (Перевод с сохранением голоса)
Пікірлер: 1