neuralabs/german-synth-ocr
Viewer • Updated • 5k • 44 • 2
How to use neuralabs/deepseek_ocr_de with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="neuralabs/deepseek_ocr_de", trust_remote_code=True) # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("neuralabs/deepseek_ocr_de", trust_remote_code=True, dtype="auto")How to use neuralabs/deepseek_ocr_de with Unsloth Studio:
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for neuralabs/deepseek_ocr_de to start chatting
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for neuralabs/deepseek_ocr_de to start chatting
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for neuralabs/deepseek_ocr_de to start chatting
pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
model_name="neuralabs/deepseek_ocr_de",
max_seq_length=2048,
)# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("neuralabs/deepseek_ocr_de", trust_remote_code=True, dtype="auto")
This model is a fine-tuned version of DeepSeek OCR on German text for Optical Character Recognition (OCR) tasks.
This model has been fine-tuned specifically for recognizing German text in images, including handling of German-specific characters (ä, ö, ü, ß) and common German compound words.
This model is designed for:
from transformers import TrOCRProcessor, VisionEncoderDecoderModel
from PIL import Image
import requests
# Load model and processor
processor = TrOCRProcessor.from_pretrained("YOUR_USERNAME/deepseek-ocr-german")
model = VisionEncoderDecoderModel.from_pretrained("YOUR_USERNAME/deepseek-ocr-german")
# Load image
url = "path_to_your_german_text_image.jpg"
image = Image.open(url).convert("RGB")
# Process
pixel_values = processor(image, return_tensors="pt").pixel_values
generated_ids = model.generate(pixel_values)
generated_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(generated_text)
from transformers import TrOCRProcessor, VisionEncoderDecoderModel
from PIL import Image
processor = TrOCRProcessor.from_pretrained("YOUR_USERNAME/deepseek-ocr-german")
model = VisionEncoderDecoderModel.from_pretrained("YOUR_USERNAME/deepseek-ocr-german")
# Multiple images
images = [Image.open(f"image_{i}.jpg").convert("RGB") for i in range(5)]
# Batch process
pixel_values = processor(images, return_tensors="pt", padding=True).pixel_values
generated_ids = model.generate(pixel_values)
generated_texts = processor.batch_decode(generated_ids, skip_special_tokens=True)
for text in generated_texts:
print(text)
import torch
from transformers import TrOCRProcessor, VisionEncoderDecoderModel
from PIL import Image
device = "cuda" if torch.cuda.is_available() else "cpu"
processor = TrOCRProcessor.from_pretrained("YOUR_USERNAME/deepseek-ocr-german")
model = VisionEncoderDecoderModel.from_pretrained("YOUR_USERNAME/deepseek-ocr-german").to(device)
image = Image.open("german_text.jpg").convert("RGB")
pixel_values = processor(image, return_tensors="pt").pixel_values.to(device)
generated_ids = model.generate(pixel_values)
text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(text)
The model was fine-tuned on a synthetic German OCR dataset containing 200,000 images with:
Data Split:
# Example training configuration
from transformers import Seq2SeqTrainer, Seq2SeqTrainingArguments
training_args = Seq2SeqTrainingArguments(
output_dir="./deepseek-ocr-german",
per_device_train_batch_size=8,
per_device_eval_batch_size=8,
learning_rate=5e-5,
num_train_epochs=10,
logging_steps=100,
save_steps=1000,
eval_steps=1000,
evaluation_strategy="steps",
save_total_limit=2,
fp16=True,
predict_with_generate=True,
)
If you use this model, please cite:
@misc{deepseek-ocr-german,
author = {Santosh Pandit},
title = {DeepSeek OCR - German Fine-tuned},
year = {2025},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/YOUR_USERNAME/deepseek-ocr-german}},
}
For questions or feedback, please open an issue on the model repository or contact [hello@neuralabs.one].
Base model
deepseek-ai/DeepSeek-OCR
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="neuralabs/deepseek_ocr_de", trust_remote_code=True)