Multimodal Information Text

Mistral unveils Pixtral 12B, a multimodal AI model that can process both text and images

Mistral AI, a Paris-based artificial intelligence startup, today unveiled its latest advanced AI model capable of processing both images and text. The new model, called Pixtral 12B, employs about 12 ...

SiliconANGLE

Microsoft releases new Phi models optimized for multimodal processing, efficiency

Microsoft Corp. today expanded its Phi line of open-source language models with two new algorithms optimized for multimodal processing and hardware efficiency. The first addition is the text-only ...

VentureBeat

Meta Introduces Spirit LM open source model that combines text and speech inputs/outputs

Just in time for Halloween 2024, Meta has unveiled Meta Spirit LM, the company’s first open-source multimodal language model capable of seamlessly integrating text and speech inputs and outputs.

TechPP on MSN

From text to voice to vision – how to build multimodal AI apps today

Build reliable multimodal AI apps with text, voice, and vision using shared context, smart orchestration, routing, and ...

Scientific American

The Latest AI Chatbots Can Handle Text, Images and Sound. Here’s How

Slightly more than 10 months ago OpenAI’s ChatGPT was first released to the public. Its arrival ushered in an era of nonstop headlines about artificial intelligence and accelerated the development of ...

InfoWorld

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...

Apple AI research shows how MLLMs understand, generate, search for images

Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, ...

Gizmodo

Why ‘Multimodal AI’ Is the Hottest Thing in Tech Right Now

There's a new race in technology to make AI see and hear the world around you, and ultimately make sense of it for you. OpenAI and Google showcased their latest and greatest AI ...
