PP-OCRv6 Collection From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 3 days ago • 71
PP-LCNet: A Lightweight CPU Convolutional Neural Network Paper • 2109.15099 • Published Sep 17, 2021 • 2
PaddleOCR-VL-1.6 Collection Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training • 5 items • Updated 19 days ago • 10
view article Article PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend PaddlePaddle • about 1 month ago • 37
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing Paper • 2601.21957 • Published Jan 29 • 23
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 129
view article Article PP-OCRv5 on Hugging Face: A Specialized Approach to OCR baidu • Sep 10, 2025 • 111
PP-StructureV3 Collection PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 18
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated May 12 • 57