Audio

AI May 04, 2026

Faster-Whisper: 4x Faster Speech Recognition with CTranslate2

OpenAI’s Whisper model was a breakthrough in automatic speech recognition (ASR), demonstrating that large-scale weakly supervised training …

AI May 03, 2026

VoxCPM2 is a tokenizer-free text-to-speech (TTS) model developed by OpenBMB, an open-source AI research community affiliated with Tsinghua …

AI May 03, 2026

RVC (Retrieval-based Voice Conversion) WebUI is an open-source voice conversion framework developed by the RVC-Project team that has become the …

AI May 03, 2026

GPT-SoVITS is an open-source voice cloning and text-to-speech system developed by RVC-Boss that has taken the AI audio community by storm. The …

AI May 03, 2026

Text-to-speech technology has advanced dramatically in the past three years. Zero-shot voice cloning, where a system can synthesize speech in a …

AI May 02, 2026

Voice generation technology has seen remarkable progress, but most open-source text-to-speech (TTS) models still struggle with a fundamental …