CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
Abstract
CoDance introduces an Unbind-Rebind framework for animating multiple subjects with flexible spatial configurations, using pose shift encoding and semantic/textual guidance for motion reassignment.
Character image animation is gaining significant importance across various domains, driven by the demand for robust and flexible multi-subject rendering. While existing methods excel in single-person animation, they struggle to handle arbitrary subject counts, diverse character types, and spatial misalignment between the reference image and the driving poses. We attribute these limitations to an overly rigid spatial binding that forces strict pixel-wise alignment between the pose and reference, and an inability to consistently rebind motion to intended subjects. To address these challenges, we propose CoDance, a novel Unbind-Rebind framework that enables the animation of arbitrary subject counts, types, and spatial configurations conditioned on a single, potentially misaligned pose sequence. Specifically, the Unbind module employs a novel pose shift encoder to break the rigid spatial binding between the pose and the reference by introducing stochastic perturbations to both poses and their latent features, thereby compelling the model to learn a location-agnostic motion representation. To ensure precise control and subject association, we then devise a Rebind module, leveraging semantic guidance from text prompts and spatial guidance from subject masks to direct the learned motion to intended characters. Furthermore, to facilitate comprehensive evaluation, we introduce a new multi-subject CoDanceBench. Extensive experiments on CoDanceBench and existing datasets show that CoDance achieves SOTA performance, exhibiting remarkable generalization across diverse subjects and spatial layouts. The code and weights will be open-sourced.
Community
CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
arXivlens breakdown of this paper 👉 https://arxivlens.com/PaperView/Details/codance-an-unbind-rebind-paradigm-for-robust-multi-subject-animation-3294-ff9f068e
- Executive Summary
- Detailed Breakdown
- Practical Applications
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer (2025)
- SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation (2025)
- MoCha:End-to-End Video Character Replacement without Structural Guidance (2026)
- Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement (2025)
- SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (2025)
- SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation (2025)
- DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper