OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory • Paper • 2512.07802 • Published 27 days ago • 43
RAVENEA: A Benchmark for Multimodal Retrieval-Augmented Visual Culture Understanding • Paper • 2505.14462 • Published May 20, 2025 • 4
Do Vision and Language Models Share Concepts? A Vector Space Alignment Study • Paper • 2302.06555 • Published Feb 13, 2023 • 9