Embedding models are the foundation of modern semantic search and retrieval-augmented generation (RAG) systems. BCEmbedding, developed by NetEase Youdao, stands out for state-of-the-art performance on bilingual Chinese-English and cross-modal retrieval tasks.
The model excels at understanding semantic relationships across languages and modalities. Whether you are searching Chinese documents with English queries, retrieving images from text descriptions, or building a bilingual RAG pipeline, BCEmbedding provides embeddings that capture meaning across these boundaries.
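Because text from both languages lands in one shared vector space, relevance reduces to vector similarity. The sketch below uses plain NumPy with made-up toy vectors standing in for real 768-dimensional model output; it only illustrates the cosine-similarity comparison a retriever would run:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy 4-dim vectors standing in for real 768-dim BCEmbedding output.
query_en = np.array([0.9, 0.1, 0.3, 0.0])   # e.g. "machine learning"
doc_zh   = np.array([0.8, 0.2, 0.4, 0.1])   # e.g. "机器学习" (same meaning)
doc_off  = np.array([0.0, 0.9, 0.0, 0.8])   # unrelated topic

print(cosine_similarity(query_en, doc_zh))   # high: same meaning across languages
print(cosine_similarity(query_en, doc_off))  # low: different topic
```

A real pipeline would obtain the vectors from the model and rank documents by this score.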
## Model Capabilities
| Capability | Description | Performance |
|---|---|---|
| Bilingual text | Chinese-English cross-lingual retrieval | Top 3 on MTEB leaderboard |
| Cross-modal | Text-to-image and image-to-text retrieval | State-of-the-art |
| Dense retrieval | Single-vector representation | Competitive with BGE |
| Sparse retrieval | Hybrid with BM25 support | Enhanced recall |
| RAG optimization | Tuned for chunk-level retrieval | Excellent precision |
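The hybrid sparse row above combines dense embedding scores with lexical BM25 scores. One common fusion strategy is a weighted sum over min-max normalized scores; this is a generic sketch, not BCEmbedding's exact formula:

```python
def minmax(scores):
    """Min-max normalize a list of scores into [0, 1]."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0] * len(scores)
    return [(s - lo) / (hi - lo) for s in scores]

def hybrid_scores(dense, sparse, alpha=0.7):
    """Weighted fusion of dense (embedding) and sparse (BM25) scores.

    alpha weights the dense signal; both score lists are normalized
    first so the raw scales (cosine vs. BM25) are comparable.
    """
    d, s = minmax(dense), minmax(sparse)
    return [alpha * di + (1 - alpha) * si for di, si in zip(d, s)]

dense  = [0.92, 0.35, 0.80]   # cosine similarities from the embedding model
sparse = [2.1, 7.5, 0.4]      # raw BM25 scores for the same three chunks
fused = hybrid_scores(dense, sparse)
best = max(range(len(fused)), key=fused.__getitem__)
```

With these toy numbers the dense-strong first chunk wins; lowering `alpha` shifts weight toward exact keyword matches, which is where the recall gains of hybrid retrieval come from.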
## Embedding Architecture
```mermaid
flowchart LR
    subgraph Input
        A[Chinese Text]
        B[English Text]
        C[Images]
    end
    subgraph BCEmbedding
        D[Bilingual Encoder]
        E[Vision Encoder]
        F[Cross-Modal Fusion]
    end
    subgraph Output
        G[Vector Embeddings]
        H[Similarity Scores]
    end
    A --> D
    B --> D
    C --> E
    D --> F
    E --> F
    F --> G
    G --> H
```

The architecture uses separate encoders for text and vision, with a cross-modal fusion layer that projects both modalities into a shared embedding space. This allows direct comparison between any combination of text and image inputs.
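The fusion step can be pictured as learned linear projections that map each encoder's output into one shared space. The toy sketch below uses random stand-in weights and dimensions (not the trained model or its actual layer sizes) to show why text and image vectors become directly comparable after projection:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions only; the real encoders' sizes may differ.
TEXT_DIM, VISION_DIM, SHARED_DIM = 768, 1024, 512

# Stand-ins for the trained projection matrices of the fusion layer.
W_text   = rng.standard_normal((TEXT_DIM, SHARED_DIM)) / np.sqrt(TEXT_DIM)
W_vision = rng.standard_normal((VISION_DIM, SHARED_DIM)) / np.sqrt(VISION_DIM)

def project(x, W):
    """Project an encoder output into the shared space, L2-normalized."""
    z = x @ W
    return z / np.linalg.norm(z)

text_vec   = project(rng.standard_normal(TEXT_DIM), W_text)
vision_vec = project(rng.standard_normal(VISION_DIM), W_vision)

# Both vectors now live in the same space, so a dot product is meaningful.
score = float(text_vec @ vision_vec)
```

Training aligns the projections so that matching text-image pairs score high; with random weights here, the score is just a well-defined number in [-1, 1].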
## Performance Benchmarks
| Benchmark | BCEmbedding | BGE-large | OpenAI ada-002 |
|---|---|---|---|
| MTEB (English) | 64.5 | 64.2 | 61.0 |
| C-MTEB (Chinese) | 67.8 | 66.5 | N/A |
| Cross-lingual retrieval | 72.3 | 68.1 | 42.5 |
| Image-text retrieval | 85.6 | N/A | 80.2 |
For more details, visit the BCEmbedding GitHub repository and check the MTEB leaderboard.
## Frequently Asked Questions
Q: What embedding dimensions does BCEmbedding output? A: The text model outputs 768-dimensional vectors, matching the BGE-large architecture.
Q: Can I use BCEmbedding with LangChain or LlamaIndex? A: Yes, it integrates easily through HuggingFace embedding wrappers compatible with both frameworks.
Q: Is BCEmbedding free for commercial use? A: Yes, it is released under the Apache 2.0 license.
Q: Does it support languages beyond Chinese and English? A: It is optimized for Chinese-English. Other languages have reduced but functional performance.
Q: How large is the model? A: The text encoder is approximately 1.3GB (BGE-large based), and the vision encoder adds about 0.5GB.
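Tying the FAQ answers together, the snippet below sketches the chunk-retrieval loop a bilingual RAG pipeline runs. A deterministic hash-based stub stands in for the real model (in production, the `embed` function would call the actual 768-dimensional BCEmbedding text encoder, e.g. via a HuggingFace wrapper); everything else is generic:

```python
import hashlib
import math

DIM = 768  # matches the FAQ: BCEmbedding's text vectors are 768-dimensional

def embed(text: str) -> list[float]:
    """Deterministic stub embedding; swap in the real model for production."""
    h = hashlib.sha256(text.lower().encode()).digest()
    vec = [(b - 128) / 128 for b in (h * (DIM // len(h) + 1))[:DIM]]
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec]

def top_k(query: str, chunks: list[str], k: int = 2):
    """Rank chunks by dot product with the query (vectors are unit-norm)."""
    qv = embed(query)
    scored = [(sum(q * c for q, c in zip(qv, embed(ch))), ch) for ch in chunks]
    scored.sort(reverse=True)
    return scored[:k]

chunks = ["BCEmbedding supports Chinese and English.",
          "The model is Apache 2.0 licensed.",
          "Mermaid draws flowcharts."]
results = top_k("Which license does the model use?", chunks, k=2)
```

With the stub, scores are essentially arbitrary; the point is the loop's shape: embed once per chunk (cache these), embed the query, rank by similarity, and pass the top-k chunks to the generator.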