TRIBE v2 Subcortical Head (Lahner)

This model provides subcortical fMRI prediction weights for the TRIBE v2 architecture.
It predicts BOLD activity in subcortical brain regions (e.g. hippocampus, amygdala, pallidum) from multimodal inputs (video, audio, text).

Model Details

Developed by: Logan Fernandez
Model type: Multimodal fMRI encoding model (regression)
Base model: TRIBE v2
License: CC BY-NC 4.0

Uses

Predict deep brain activity from naturalistic stimuli
Study stimulus → brain response relationships
Analyze emotional response to stimuli
Make the brain watch reels (See Github repo)

Limitations

Subject-specific behavior for Lahner-style subjects, tuned to short-form media
Correlation, not causation

Training

Dataset: Lahner2024Bold (BOLD Moments)
Features: video (V-JEPA), audio (Wac2Vert), text (Qwen3B)
Projection: subcortical mask

Evaluation

Metric: Pearson correlation
Result: r = 0.165 (test split)

This reflects the difficulty of predicting subcortical activity, which is noisier and lower resolution than cortical signals.

Repository

https://github.com/Enzyme0/homunculus

Contact

Logan Fernandez loganfe@outlook.com

Acknowledgements

This model builds on TRIBE-V2 and is intended as a downstream or derived component built from that work.

Citation

Please cite both this model and TRIBE-V2 when relevant:

@article{dAscoli2026TribeV2,
  title={A foundation model of vision, audition, and language for in-silico neuroscience},
  author={d'Ascoli, St{\'e}phane and Rapin, J{\'e}r{\'e}my and Benchetrit, Yohann and Brookes, Teon and Begany, Katelyn and Raugel, Jos{\'e}phine and Banville, Hubert and King, Jean-R{\'e}mi},
  year={2026}
}

Downloads last month: 134

Inference Providers NEW

Graph Machine Learning

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for loganf26/tribev2-subcortical

Base model

facebook/tribev2

Finetuned

(3)

this model

Evaluation results

Pearson r on Lahner2024Bold (BOLD Moments)
test set self-reported

0.165