Article
Niels Rogge
nielsr
AI & ML interests
Mainly interested in diving into complex Github repos and making AI easier and more accessible to everyone
Recent Activity
new activity 3 minutes ago
jhcodec/sw2v_60k:Add pipeline tag and link to paper published an article about 16 hours ago
On CLIs vs. MCP new activity 2 days ago
BestWishYsh/OpenS2V-Eval:Need support to update the datasets and verify the accountOrganizations
Articles 12
Article
88
Community Evals: Because we're done trusting black-box leaderboards over the community
Image-to-text models
Collection of image captioning models
-
Salesforce/blip-image-captioning-large
Image-to-Text • 0.5B • Updated • 1.29M • 1.46k -
microsoft/git-large-coco
Image-to-Text • 0.4B • Updated • 9.44k • 104 -
Salesforce/instructblip-vicuna-7b
Image-Text-to-Text • 8B • Updated • 12.4k • 99 -
Salesforce/blip2-flan-t5-xxl
Image-Text-to-Text • 12B • Updated • 5.65k • 93
SigLIP release
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
-
Sigmoid Loss for Language Image Pre-Training
Paper • 2303.15343 • Published • 11 -
google/siglip-base-patch16-224
Zero-Shot Image Classification • 0.2B • Updated • 1.28M • 79 -
google/siglip-base-patch16-256
Zero-Shot Image Classification • 0.2B • Updated • 160k • 6 -
google/siglip-base-patch16-384
Zero-Shot Image Classification • 0.2B • Updated • 48.6k • 11
Image-to-text models
Collection of image captioning models
-
Salesforce/blip-image-captioning-large
Image-to-Text • 0.5B • Updated • 1.29M • 1.46k -
microsoft/git-large-coco
Image-to-Text • 0.4B • Updated • 9.44k • 104 -
Salesforce/instructblip-vicuna-7b
Image-Text-to-Text • 8B • Updated • 12.4k • 99 -
Salesforce/blip2-flan-t5-xxl
Image-Text-to-Text • 12B • Updated • 5.65k • 93
SigLIP release
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
-
Sigmoid Loss for Language Image Pre-Training
Paper • 2303.15343 • Published • 11 -
google/siglip-base-patch16-224
Zero-Shot Image Classification • 0.2B • Updated • 1.28M • 79 -
google/siglip-base-patch16-256
Zero-Shot Image Classification • 0.2B • Updated • 160k • 6 -
google/siglip-base-patch16-384
Zero-Shot Image Classification • 0.2B • Updated • 48.6k • 11
spaces 26
pinned
Runtime error
ICLR2024 Papers
📊
Running on Zero
MCP
2
Videomt Transformers Demo
🐨
Segment videos with instance, semantic, or panoptic masks
No application file
Rag Demo
⚡
Runtime error
23
KOSMOS-2.5 Document AI Demo
📄
Upload an image to generate markdown, extract text, or ask questions
Running
Featured
159
Dpt Depth Estimation
⚡
Generate depth map from an image
Runtime error
182
Dit Document Layout Analysis
👀
Analyze document layout from images
models 255
nielsr/lw-detr-medium-tray-detection-hub-only-20260318
Object Detection • 28.2M • Updated • 194
nielsr/lw-detr-medium-tray-detection-hub-init
Object Detection • 28.2M • Updated • 365
nielsr/lw-detr-medium-tray-detection
Object Detection • 28.2M • Updated • 764
nielsr/rf-detr-parity-runs
Updated
nielsr/rtdetr-rfdetr-aligned-300ep-20260305-v1
Object Detection • 76.6M • Updated • 832
nielsr/lwdetr_dinov2_small_o365_checkpoint
32.2M • Updated • 25
nielsr/rtdetr-paper-control-300ep-20260305-v1
Object Detection • 76.6M • Updated • 605
nielsr/rf-detr-demo-style-vs-trainer-20260305
Updated
nielsr/lwdetr-small-tray-cart-rf100-lr-20260303-222437
14.2M • Updated • 386
nielsr/lwdetr-small-tray-cart-coco-lr-20260303-222437
14.2M • Updated • 399
datasets 123
nielsr/balloon-dataset
Viewer • Updated • 74 • 23
nielsr/tray-cart-detection
Viewer • Updated • 125 • 217
nielsr/arxiv-papers-citations
Viewer • Updated • 59k • 21
nielsr/arxiv-papers-input
Viewer • Updated • 59k • 9
nielsr/methods-thumbnails
Viewer • Updated • 11 • 1.41k
nielsr/paper-page-assets
Viewer • Updated • 41 • 1.17k • 1
nielsr/segmented-tables
Viewer • Updated • 352 • 9
nielsr/random-data
Viewer • Updated • 2 • 10
nielsr/funsd
Viewer • Updated • 199 • 2.4k • 17
nielsr/funsd-layoutlmv3
Viewer • Updated • 199 • 651 • 40