Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Chi Chen's picture
4 20 7

Chi Chen

carboncoo
MaxyLee's profile picture Oscar-dzy's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning
upvoted a paper 3 months ago
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation
upvoted a paper 3 months ago
Imagination Helps Visual Reasoning, But Not Yet in Latent Space
View all activity

Organizations

Machine Translation Group, Natural Language Processing Lab at Tsinghua University's profile picture

commented 3 papers about 1 year ago

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

Paper • 2505.20715 • Published May 27, 2025 • 2 •
2

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Paper • 2503.23733 • Published Mar 31, 2025 • 10 •
3

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17, 2025 • 32 •
2
commented a paper over 1 year ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10, 2025 • 29 •
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs