CG-Bench

community

https://cg-bench.github.io/leaderboard/

Recent Activity

lulidong authored a paper 3 days ago

Learning Visual Affordance from Audio

lulidong authored a paper 3 days ago

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

lulidong updated a dataset 4 days ago

CG-Bench/MM-Lifelong

Papers

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

authored 2 papers 3 days ago

Learning Visual Affordance from Audio

Paper • 2512.02005 • Published Dec 1, 2025

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Paper • 2603.05484 • Published 4 days ago • 4

updated a dataset 4 days ago

CG-Bench/MM-Lifelong

Preview • Updated 4 days ago • 77

submitted a paper to Daily Papers 4 days ago

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Paper • 2603.05484 • Published 4 days ago • 4

published a dataset 28 days ago

CG-Bench/MM-Lifelong

Preview • Updated 4 days ago • 77

updated a dataset 8 months ago

CG-Bench/CG-AV-Counting

Viewer • Updated Jul 22, 2025 • 1.03k • 53 • 5

in CG-Bench/CG-AV-Counting 9 months ago

Data unzip error

#2 opened 9 months ago

authored a paper 9 months ago

Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Paper • 2506.06253 • Published Jun 6, 2025 • 9

authored a paper 9 months ago

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

Paper • 2506.05328 • Published Jun 5, 2025 • 21

published a dataset 9 months ago

CG-Bench/CG-AV-Counting

Viewer • Updated Jul 22, 2025 • 1.03k • 53 • 5

authored 7 papers 11 months ago

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4, 2025 • 16

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 96

Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models

Paper • 2501.14818 • Published Jan 20, 2025 • 9

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Paper • 2412.12075 • Published Dec 16, 2024 • 1

FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

Paper • 2111.02394 • Published Nov 3, 2021 • 2

Retrieval-Augmented Egocentric Video Captioning

Paper • 2401.00789 • Published Jan 1, 2024

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 67