MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Paper • 2507.12463 • Published Jul 16, 2025 • 26 • 1
4KAgent: Agentic Any Image to 4K Super-Resolution Paper • 2507.07105 • Published Jul 9, 2025 • 105 • 4
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models Paper • 2505.24025 • Published May 29, 2025 • 27 • 4
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing Paper • 2411.16832 • Published Nov 25, 2024 • 2 • 3