Wenbo Hu

gordonhu

https://gordonhu608.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 5 hours ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

authored a paper about 5 hours ago

Matryoshka Query Transformer for Large Vision-Language Models

authored a paper about 5 hours ago

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Papers 10

arxiv:2604.08539

arxiv:2512.10863

arxiv:2511.21688

arxiv:2510.08457

Spaces 1

MQT LLaVA

Models 1

gordonhu/MQT-LLaVA-7b

Image-Text-to-Text • 7B • Updated May 30, 2024 • 6 • 5

Datasets 0

None public yet