Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
AI & ML interests
Deep Learning Framework
Recent Activity
View all activity
Papers
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Organization Card
Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
-
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 7.79k • 1.58k -
PaddleOCR-VL Online Demo
📈238Extract text, tables, formulas, and charts from images
-
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper • 2510.14528 • Published • 123 -
PaddlePaddle/PP-DocLayoutV2
Object Detection • Updated • 8.64k • 28
Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
-
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 7.79k • 1.58k -
PaddleOCR-VL Online Demo
📈238Extract text, tables, formulas, and charts from images
-
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper • 2510.14528 • Published • 123 -
PaddlePaddle/PP-DocLayoutV2
Object Detection • Updated • 8.64k • 28
spaces 7
pinned
Running
Featured
238
PaddleOCR-VL Online Demo
📈
Extract text, tables, formulas, and charts from images
Running
78
PP-OCRv5 Online Demo
🌍
Universal-Scene Text Recognition Model with High-Accuracy
Running
34
PP-StructureV3 Online Demo
📊
Next-Gen High-Precision Doc Parsing Solution
Running
Featured
72
PaddleOCR-VL-1.5 Online Demo
😻
PaddleOCR-VL-1.5_Online_Demo
Running
9
Doc2Page - Document to Webpage Converter
🏄
Convert docs to webpages using PaddleOCR and ERNIE
models 97
PaddlePaddle/PP-Chart2Table_safetensors
Image-to-Text • Updated • 914
PaddlePaddle/PP-DocLayoutV2_safetensors
Object Detection • Updated • 1.31k • 2
PaddlePaddle/PP-DocLayoutV3_safetensors
Object Detection • Updated • 248k • 20
PaddlePaddle/RT-DETR-L_wireless_table_cell_det_safetensors
Image-to-Text • Updated • 31
PaddlePaddle/RT-DETR-L_wired_table_cell_det_safetensors
Image-to-Text • Updated • 55
PaddlePaddle/SLANeXt_wireless_safetensors
Image-to-Text • 91.2M • Updated • 50
PaddlePaddle/SLANeXt_wired_safetensors
Image-to-Text • 91.2M • Updated • 468
PaddlePaddle/UVDoc_safetensors
Image-to-Text • Updated • 333
PaddlePaddle/PP-DocBlockLayout_safetensors
Image-to-Text • Updated • 29
PaddlePaddle/PP-DocLayout_plus-L_safetensors
Image-to-Text • Updated • 34
datasets 6
PaddlePaddle/Real5-OmniDocBench
Viewer • Updated • 2.8k • 9.08k • 7
PaddlePaddle/GraphNet
Updated • 42 • 2
PaddlePaddle/PaddleOCR-VL_demo
Viewer • Updated • 23 • 18.8k • 1
PaddlePaddle/GSM8K_distilled_zh
Viewer • Updated • 8.79k • 128 • 1
PaddlePaddle/dureader_robust
Updated • 78 • 5
PaddlePaddle/duconv
Viewer • Updated • 36.9k • 100 • 2