Shaobai Jiang's picture

Shaobai Jiang

shaobaij

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

upvoted a paper about 10 hours ago

daVinci-LLM:Towards the Science of Pretraining

upvoted a paper about 21 hours ago

Composer 2 Technical Report

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Paper • 2603.28069 • Published 3 days ago • 6

upvoted a paper about 10 hours ago

daVinci-LLM:Towards the Science of Pretraining

Paper • 2603.27164 • Published 5 days ago • 24

upvoted a paper about 21 hours ago

Composer 2 Technical Report

Paper • 2603.24477 • Published 7 days ago • 14

upvoted 7 papers 1 day ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Paper • 2602.01511 • Published Feb 2 • 15

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Paper • 2510.07743 • Published Oct 9, 2025 • 13

τ-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge

Paper • 2603.04370 • Published 28 days ago • 3

World Reasoning Arena

Paper • 2603.25887 • Published 6 days ago • 1

Coding Agents are Effective Long-Context Processors

Paper • 2603.20432 • Published 12 days ago • 1

Effective Strategies for Asynchronous Software Engineering Agents

Paper • 2603.21489 • Published 10 days ago • 6

Natural-Language Agent Harnesses

Paper • 2603.25723 • Published 6 days ago • 21

upvoted a paper 2 days ago

Voxtral TTS

Paper • 2603.25551 • Published 6 days ago • 56

upvoted 8 papers 3 days ago

FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

Paper • 2603.24943 • Published 7 days ago • 12

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 7 days ago • 46

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Paper • 2603.24329 • Published 8 days ago • 24

EVA: Efficient Reinforcement Learning for End-to-End Video Agent

Paper • 2603.22918 • Published 9 days ago • 42

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 7 days ago • 27

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search

Paper • 2603.22341 • Published 12 days ago • 36

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 7 days ago • 44

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 7 days ago • 92

upvoted a paper 4 days ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 153