Surya: Open-Source Multilingual OCR and Document Understanding
Optical Character Recognition is one of the oldest applications of computer vision, but traditional OCR engines have struggled to keep pace with …
Optical Character Recognition is one of the oldest applications of computer vision, but traditional OCR engines have struggled to keep pace with …
VoxCPM2 is a tokenizer-free text-to-speech (TTS) model developed by OpenBMB, an open-source AI research community affiliated with Tsinghua …