PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 33 items • Updated 3 days ago • 24
Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 16 items • Updated 15 days ago • 15
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language +4 Dec 16, 2024 • 152
PaddleOCR-VL Collection Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • 3 items • Updated Dec 16, 2025 • 24
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 454
LLMDet Collection LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models • 5 items • Updated Jul 26, 2025 • 3
view article Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub Jun 27, 2025 • 30
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 55
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H Jun 3, 2025 • 71
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 2 days ago • 399