Tags

DeepSeek R1

AI May 04, 2026

X-R1: Open-Source Reasoning Model Exploration

The revelation that language models could develop sophisticated reasoning capabilities through reinforcement learning – without human …

AI May 03, 2026

TinyZero: Reproducing DeepSeek R1-Zero's Reasoning with RL for Under $30

DeepSeek R1-Zero was widely regarded as a breakthrough when it was released in January 2025. The model demonstrated that pure reinforcement …