2025-10-28 Bayesian Speech synthesizers Can Learn from Multiple Teachers Ziyang Zhang et.al. 2510.24372 null 2025-10-28 emg2speech: synthesizing speech from electromyography using self-supervised ...
F5-TTS: Diffusion Transformer with ConvNeXt V2, faster trained and inference. E2 TTS: Flat-UNet Transformer, closest reproduction from paper. Install torch with your CUDA version, e.g. : pip install ...