Overview: Leading voice AI frameworks power realistic, fast, and scalable conversational agents across enterprise, consumer, ...
VoiceRun, a platform for developing and scaling voice agents, has raised $5.5 million in a seed round led by Flybridge Capital.
CEO Scott Stephenson revealed the company achieved cash-flow positive status last year and didn't actually need the funding.
FileWizard lets you convert documents, extract text, transcribe audio and manage files on your own computer without uploading anything.
Pocket TTS is an open-source text-to-speech model that runs on CPUs, clones voices from 5 seconds of audio, and keeps voice ...
Overview: Master deep learning with these 10 essential books blending math, code, and real-world AI applications for lasting ...
Pipit is a free Mac dictation app that works offline. It does more than transcribe speech: it can launch ...
NVIDIA doubles down on open speech AI with ultra-low-latency automatic speech recognition and multilingual text-to-speech models.
Speech recognition technology is becoming increasingly crucial to our daily lives, and iFLYTEK, based in Hefei, China, has been working on new ways of using this smart technology since the company was ...
LTX-2 is an open source AI video model with 14B video and 15B audio parameters, giving you synced clips and local control.
I've worked with AI for decades and have a master's degree in education. Here are the top free AI courses online that I recommend - and why.