How to use SzegedAI/hubertusz-tiny-wiki with Transformers:
    # Load model directly
    from transformers import AutoTokenizer, AutoModelForPreTraining

    tokenizer = AutoTokenizer.from_pretrained("SzegedAI/hubertusz-tiny-wiki")
    model = AutoModelForPreTraining.from_pretrained("SzegedAI/hubertusz-tiny-wiki")
This model was trained from scratch on the Wikipedia subset of Hungarian Webcorpus 2.0 with two pretraining tasks: masked language modeling (MLM) and sentence order prediction (SOP).
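To make the two objectives concrete, here is a minimal sketch of how MLM and SOP training examples are commonly built. It is not the authors' preprocessing pipeline: the helper names `mask_tokens` and `make_sop_example`, the 15% masking rate, the "label 1 means swapped" convention, and the toy tokens are all assumptions for illustration. It also leaves out the usual refinement where some selected tokens are kept or replaced by random tokens instead of `[MASK]`.

```python
import random

MASK = "[MASK]"  # assumed mask symbol; the real tokenizer defines its own


def mask_tokens(tokens, rng, mask_prob=0.15):
    """MLM sketch: hide each token with probability mask_prob.

    Returns (masked_tokens, labels). labels holds the original token at
    masked positions and None elsewhere, so a loss would only be computed
    where a token was hidden.
    """
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)
        else:
            masked.append(tok)
            labels.append(None)
    return masked, labels


def make_sop_example(segment_a, segment_b, rng):
    """SOP sketch: keep two consecutive segments in order or swap them.

    Returns (first, second, label), where label 0 means original order
    and label 1 means swapped (an assumed convention).
    """
    if rng.random() < 0.5:
        return segment_a, segment_b, 0
    return segment_b, segment_a, 1


if __name__ == "__main__":
    rng = random.Random(0)
    seg_a = ["the", "model", "is", "small"]
    seg_b = ["it", "was", "trained", "on", "wikipedia"]
    first, second, label = make_sop_example(seg_a, seg_b, rng)
    print(label, mask_tokens(first + second, rng))
```

During pretraining the model sees the masked sequence and must both recover the hidden tokens (MLM) and predict whether the two segments appear in their original order (SOP).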
First phase:
Second phase: