UniversalCEFR/peapl2_pt
Viewer • Updated • 481 • 52
This model classifies Portuguese texts according to CEFR (Common European Framework of Reference) proficiency levels.
| Level | Count | Percentage |
|---|---|---|
| A1 | 314 | 33.0% |
| A2 | 89 | 9.3% |
| B1 | 367 | 38.6% |
| B2 | 70 | 7.4% |
| C1 | 112 | 11.8% |
from transformers import pipeline
classifier = pipeline("text-classification", model="marcosremar2/cefr-classifier-pt-distilbert-balanced")
result = classifier("Eu gosto de estudar português.")
print(result) # [{'label': 'A1', 'score': 0.93}]
| Level | Precision | Recall | F1-Score |
|---|---|---|---|
| A1 | 0.80 | 0.63 | 0.71 |
| A2 | 0.44 | 0.61 | 0.51 |
| B1 | 0.67 | 0.58 | 0.62 |
| B2 | 0.32 | 0.50 | 0.39 |
| C1 | 0.47 | 0.64 | 0.54 |
| Macro Avg | 0.54 | 0.59 | 0.55 |
If you use this model, please cite:
@misc{cefr-pt-distilbert,
author = {Marcos Remar},
title = {CEFR Classifier for Portuguese},
year = {2025},
publisher = {HuggingFace Hub},
}