7 20 25

Sicheng Feng

FSCCS

https://fscdc.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

upvoted a paper 1 day ago

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

upvoted a paper 9 days ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

View all activity

Organizations

upvoted a paper about 1 hour ago

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Paper • 2512.24385 • Published 1 day ago • 4

upvoted a paper 1 day ago

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

Paper • 2512.23646 • Published 3 days ago • 13

upvoted a paper 9 days ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published 10 days ago • 29

upvoted 2 papers about 1 month ago

Vision Bridge Transformer at Scale

Paper • 2511.23199 • Published Nov 28, 2025 • 44

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 30

upvoted a paper 2 months ago

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

Paper • 2510.23479 • Published Oct 27, 2025 • 14

upvoted 2 papers 3 months ago

OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

Paper • 2510.06751 • Published Oct 8, 2025 • 21

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

Paper • 2510.02240 • Published Oct 2, 2025 • 17

upvoted a paper 5 months ago

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios

Paper • 2507.20198 • Published Jul 27, 2025 • 26

upvoted a collection 5 months ago

ReasonMap

Collection

A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.) • 3 items • Updated Oct 1, 2025 • 8

upvoted a paper 6 months ago

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43

upvoted 5 papers 7 months ago

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

Paper • 2502.20378 • Published Feb 27, 2025 • 5

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Paper • 2505.18675 • Published May 24, 2025 • 26

upvoted 3 papers 8 months ago

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19, 2025 • 50

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15, 2025 • 21

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Paper • 2503.16257 • Published Mar 20, 2025 • 26

upvoted a paper 9 months ago

Is Oracle Pruning the True Oracle?

Paper • 2412.00143 • Published Nov 28, 2024 • 3

Sicheng Feng

AI & ML interests

Recent Activity

Organizations

FSCCS's activity