Zixian Ma's picture

Zixian Ma

zixianma

·

AI & ML interests

Human-AI interaction and collaboration

Recent Activity

upvoted a paper 13 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

liked a dataset about 2 months ago

QijiaHe/VFIG-Data

upvoted a paper about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

View all activity

Organizations

upvoted a paper 13 days ago

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Paper • 2605.16679 • Published 18 days ago • 53

upvoted 2 papers about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Paper • 2604.08516 • Published Apr 9 • 44

upvoted 6 papers 2 months ago

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 45

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

Paper • 2602.11337 • Published Feb 11 • 9

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation

Paper • 2603.16861 • Published Mar 17 • 9

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 35

Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos

Paper • 2602.23543 • Published Feb 26 • 9

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Paper • 2603.24575 • Published Mar 25 • 18

upvoted 2 collections 2 months ago

VFIG

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models • 3 items • Updated 27 days ago • 3

MolmoWeb-Data

This is the collection of all datasets in MolmoWebMix. • 6 items • Updated Mar 24 • 29

upvoted 2 papers about 1 year ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29, 2025 • 54

upvoted 2 collections over 1 year ago

TACO Models

This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Oct 31, 2025 • 8

CoTA Datasets

This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 4 items • Updated Mar 2 • 7

upvoted a paper almost 2 years ago

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 41