This collection hosts a series of Vision Language Models (VLMs) fine-tuned for Optical Character Recognition (OCR) and Document Processing.
-
loay/Arabic-OCR-Qwen2.5-VL-7B-Vision
Image-to-Text • 8B • Updated • 126 • 3 -
loay/Arabic-OCR-DeepSeek-OCR-2
Image-to-Text • 3B • Updated • 46 -
loay/English-Document-OCR-Qwen3.5-2B
Image-Text-to-Text • 2B • Updated • 265 • 1 -
loay/English-Document-OCR-Qwen3.5-0.8B
Image-Text-to-Text • 0.8B • Updated • 210 • 4