ConvMemory OPC Student BGE

This checkpoint is an OPC-style Chinese ConvMemory student reranker. It keeps the ConvMemory inference pattern while using a Chinese-specialized BGE encoder:

bge-base-zh embeddings -> ConvMemory window encoder -> CE-lite reranker head.

Usage

from convmemory import OPCConvMemory

model = OPCConvMemory.from_pretrained(
    "Purdy0228/ConvMemory-OPC-Student-BGE",
    device="cuda",  # or "cpu"
)

ranked = model.rerank(
    query="现在定价方案是什么?",
    memories=[
        {"id": "m1", "text": "定价方案最终是基础版 99 元,专业版 299 元。"},
        {"id": "m2", "text": "获客渠道第一批先做小红书。"},
        {"id": "m3", "text": "旧方案曾经考虑过 49 元,后来被否掉。"},
    ],
    top_k=3,
)

Architecture

  • Dense encoder: BAAI/bge-base-zh-v1.5
  • ConvMemory window encoder
  • CE-lite student reranker head
  • Offline teacher used for distillation only: BAAI/bge-reranker-v2-m3

The teacher is not used at inference time. Evaluation uses construction-gold memories from the synthetic OPC-style data generator, not teacher agreement.

Intended Use

Use this checkpoint for Chinese OPC-style memory retrieval when candidate pools are large enough that an online cross-encoder teacher is too slow.

For small candidate pools, a strong dense retriever such as BAAI/bge-base-zh-v1.5 may be sufficient. The phase-0 gate did not show a statistically robust improvement over BGE cosine retrieval, so this checkpoint should be treated as a ConvMemory student operating point rather than a universal replacement for the base retriever.

Released API Evaluation

Package path: OPCConvMemory.from_pretrained(...)

split questions R@1 R@5 R@10 Hit@1 Hit@10 MRR
phase-0 pass-only test 43 0.4516 0.9583 0.9913 0.7442 1.0000 0.8484

Warm-Memory Pool Scan

pool size teacher MRR student MRR teacher ms/query student ms/query
10 0.9302 0.9380 19.89 5.54
50 0.8702 0.8411 48.24 26.60
200 0.8302 0.8070 166.53 30.17
500 0.7760 0.7445 390.79 35.66
1000 0.6803 0.6599 779.67 126.18

Files

  • student.pt: ConvMemory window encoder and CE-lite scorer weights
  • config.json: model and architecture config
  • MANIFEST.json: provenance and checksums

Checksums

  • student.pt SHA256: e56a74d937446d05de7381307f9188cffb8794e07805ee9837838ed8503713eb
  • config.json SHA256: 24e838d0e86bd839f618e51890a4330d9d677bdda790d27bea53ed98614c9b77
  • training/evaluation data SHA256: c8f595ee8afd2aa2d1a2f6336ac172f297667e3ec3b69bd7f8e876b925a1319d
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support