Mistral AI launches Voxtral TTS, an open-weight enterprise voice model that runs on a smartphone and challenges ElevenLabs in the fast-growing voice AI market.
Microsoft's New AI Models Go Beyond Just Text ...
Every time a language model like GPT-4, Claude or Mistral generates a sentence, it does something deceptively simple: It picks one word at a time. This word-by-word approach is what gives ...
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
GLM-5V-Turbo is Z.ai's first native multimodal agent foundation model, built for vision-based coding and agentic task ...
Mistral AI is expanding its Voxtral model family with its first text-to-speech model. The launch comes amid intensifying ...
Google on Friday added a new, experimental “embedding” model for text, Gemini Embedding, to its Gemini developer API. Embedding models translate text inputs like words and phrases into numerical ...
OpenAI's text-to-videos tool Sora generates high-quality videos up to one minute in length. (OpenAI) OpenAI on Thursday announced Sora, a brand new model that generates high-definition videos up to ...
Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results