Computer Vision

AI May 03, 2026

VILA: NVIDIA's Open-Source Vision Language Model Family from NVlabs

Vision Language Models (VLMs) that can reason about both images and text have become one of the most active areas in AI research. VILA (Visual …

AI May 03, 2026

Vector graphics are everywhere – from icons and logos to illustrations and data visualizations. But generating complex SVGs …

AI May 02, 2026

StoryDiffusion is a research project from Nankai University and ByteDance that tackles one of the hardest problems in generative AI: maintaining …

AI May 02, 2026

LLaMA-VID (Large Language and Video Assistant) is an ECCV 2024 research project that tackles the fundamental bottleneck in video understanding …

AI May 02, 2026

Animate Anyone is a research project from Alibaba’s HumanAIGC group that turns a single photo into a fully animated video of a person …