Mistral AI launches Voxtral TTS, an open-weight enterprise voice model that runs on a smartphone and challenges ElevenLabs in the fast-growing voice AI market.
Microsoft's New AI Models Go Beyond Just Text ...
Every time a language model like GPT-4, Claude or Mistral generates a sentence, it does something deceptively simple: It picks one word at a time. This word-by-word approach is what gives ...
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
GLM-5V-Turbo is Z.ai's first native multimodal agent foundation model, built for vision-based coding and agentic task ...
Mistral AI is expanding its Voxtral model family with its first text-to-speech model. The launch comes amid intensifying ...
Google on Friday added a new, experimental “embedding” model for text, Gemini Embedding, to its Gemini developer API. Embedding models translate text inputs like words and phrases into numerical ...
OpenAI's text-to-videos tool Sora generates high-quality videos up to one minute in length. (OpenAI) OpenAI on Thursday announced Sora, a brand new model that generates high-definition videos up to ...
Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...