Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
A robot face developed by researchers can now lip sync speech and songs after training on YouTube videos, using machine ...
Pipit is a free Mac dictation app that works offline. It can be used to do more than just transcribe speech—it can launch apps, toggle settings, and even launch a web search or query an AI service.
ChatGPT Translate is a separate tool. It's not multimodal yet, but it does let you refine clarity, tone, and intent. Here's how.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results