In a Training Loop 🔄

Mou Chen

Mou11209203

emmanuelwithme

AI & ML interests

None yet

Recent Activity

upvoted a collection 18 days ago

DeepSeek-V4

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

liked a dataset 2 months ago

a2aj/canadian-case-law

Organizations

upvoted a collection 18 days ago

DeepSeek-V4

Collection

4 items • Updated 18 days ago • 627

upvoted an article 5 months ago

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme • Sep 9, 2025 • 146

upvoted an article 6 months ago

Article

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

thomas-sounack • Sep 10, 2025 • 6

upvoted a collection 6 months ago

Common Pile v0.1

Collection

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 40

upvoted a paper 6 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 163

upvoted 2 papers 7 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 183

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 514

upvoted 2 collections 12 months ago

Flan-T5 release

Collection

The Flan-T5 release covers 4 checkpoints of different sizes. It also includes upgraded versions trained using Universal sampling • 7 items • Updated Mar 12 • 36

GTE models

Collection

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 20 items • Updated Mar 2 • 36

upvoted an article 12 months ago

Article

Fine-tune ModernBERT for text classification using synthetic data

davidberenstein1957 • Dec 30, 2024 • 39

upvoted an article about 1 year ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo • Dec 19, 2024 • 740

upvoted a paper about 1 year ago

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 111

upvoted an article about 1 year ago

Article

MTEB: Massive Text Embedding Benchmark

Muennighoff • Oct 19, 2022 • 93
