arxiv:2604.08539
Wenbo Hu
gordonhu
AI & ML interests
None yet
Recent Activity
authored a paper about 5 hours ago
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual
Questions authored a paper about 5 hours ago
Matryoshka Query Transformer for Large Vision-Language Models authored a paper about 5 hours ago
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal
Models