stanford-star/the-join
Viewer • Updated • 6.87k • 7.08k • 3
rt-j)
RT-J is a Relational Transformer foundation model for in-context / few-shot entity prediction over multi-table relational databases (no per-task gradient training). This repository hosts both task-head variants:
| Variant | Folder | Task | Metric | Selected checkpoint |
|---|---|---|---|---|
| Classifier | classification/ |
binary entity classification | AUROC ↑ (mean 0.7310 on 12 RelBench tasks) | SWA @ step 58,000 (best val AUROC) |
| Regressor | regression/ |
entity regression | MAE ↓ (mean 0.2677 on 9 RelBench tasks, Z-scored) | SWA @ step 18,000 (best val MAE) |
Both variants share the same architecture and training recipe:
~85.6M params · bfloat16 · 12 blocks, d_model 512, 8 heads, d_ff 2048 ·
text columns embedded with all-MiniLM-L12-v2 (d_text 384).
Each folder contains model.safetensors (weights) and config.json
(dims + text-embedding model + provenance).
from huggingface_hub import hf_hub_download
# shared architecture + variant map (also what the Hub counts as a download)
config = hf_hub_download("stanford-star/rt-j", "config.json")
# pick the variant: "classification" or "regression"
weights = hf_hub_download("stanford-star/rt-j", "classification/model.safetensors")
variant_config = hf_hub_download("stanford-star/rt-j", "classification/config.json")