Multimodal

AI May 05, 2026

GLM-4.5: Zhipu AI's Next-Gen Multimodal Foundation Model

The evolution of foundation models in 2025-2026 has been defined by two trends: multimodality and efficiency. Models that could only process text …

AI May 03, 2026

Qwen2.5-Omni is Alibaba’s flagship open-source multimodal AI model, developed by the QwenLM team at Alibaba Cloud. As a single end-to-end …

AI May 03, 2026

Multimodal AI models that can simultaneously process vision, speech, and text represent the cutting edge of artificial intelligence. …

AI May 03, 2026

The concept of a digital avatar that can hold a natural conversation — seeing your face, hearing your voice, and responding with synchronized lip …

AI May 02, 2026

In the rapidly advancing field of vision-language models, a new heavyweight has emerged from an unexpected corner. Seed1.5-VL, developed by …