Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
A robot face developed by researchers can now lip-sync speech and songs after training on YouTube videos, using machine ...
Pipit is a free Mac dictation app that works offline. It does more than transcribe speech: it can launch apps, toggle settings, and even run a web search or query an AI service.
ChatGPT Translate is a separate tool. It's not multimodal yet, but it does let you refine clarity, tone, and intent. Here's how.