-
ModernGBERT: German-only 1B Encoder Model Trained from Scratch
Paper • 2505.13136 • Published • 22 -
LSX-UniWue/ModernGBERT_1B
Feature Extraction • 1B • Updated • 1.48k • 10 -
LSX-UniWue/ModernGBERT_134M
Feature Extraction • 0.2B • Updated • 1.72k • • 5 -
LSX-UniWue/LLaMmlein-Dataset
Viewer • Updated • 838M • 513 • 4
AI & ML interests
German NLP and beyond
-
ModernGBERT: German-only 1B Encoder Model Trained from Scratch
Paper • 2505.13136 • Published • 22 -
LSX-UniWue/LLaMmlein2Vec_7B
Feature Extraction • 7B • Updated • 7 -
LSX-UniWue/LLaMmlein2Vec_1B
Feature Extraction • 1B • Updated • 3.8k -
LSX-UniWue/LLaMmlein2Vec_120M
Feature Extraction • 0.1B • Updated • 10.7k
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/
-
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
Paper • 2411.11171 • Published • 8 -
LSX-UniWue/LLaMmlein_7B
Text Generation • 7B • Updated • 7.05k • • 7 -
LSX-UniWue/LLaMmlein_1B
Text Generation • 1B • Updated • 2.06k • • 2 -
LSX-UniWue/LLaMmlein_120M
Text Generation • 0.1B • Updated • 2.88k
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/
-
ModernGBERT: German-only 1B Encoder Model Trained from Scratch
Paper • 2505.13136 • Published • 22 -
LSX-UniWue/ModernGBERT_1B
Feature Extraction • 1B • Updated • 1.48k • 10 -
LSX-UniWue/ModernGBERT_134M
Feature Extraction • 0.2B • Updated • 1.72k • • 5 -
LSX-UniWue/LLaMmlein-Dataset
Viewer • Updated • 838M • 513 • 4
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/
-
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
Paper • 2411.11171 • Published • 8 -
LSX-UniWue/LLaMmlein_7B
Text Generation • 7B • Updated • 7.05k • • 7 -
LSX-UniWue/LLaMmlein_1B
Text Generation • 1B • Updated • 2.06k • • 2 -
LSX-UniWue/LLaMmlein_120M
Text Generation • 0.1B • Updated • 2.88k
-
ModernGBERT: German-only 1B Encoder Model Trained from Scratch
Paper • 2505.13136 • Published • 22 -
LSX-UniWue/LLaMmlein2Vec_7B
Feature Extraction • 7B • Updated • 7 -
LSX-UniWue/LLaMmlein2Vec_1B
Feature Extraction • 1B • Updated • 3.8k -
LSX-UniWue/LLaMmlein2Vec_120M
Feature Extraction • 0.1B • Updated • 10.7k
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/