NLP Team at Alibaba DAMO Academy

company

Recent Activity

Yirany authored a paper 15 days ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Yirany submitted a paper 16 days ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Yirany authored a paper 17 days ago

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

authored a paper 15 days ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Paper • 2605.08985 • Published 19 days ago • 22

submitted a paper to Daily Papers 16 days ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Paper • 2605.08985 • Published 19 days ago • 22

authored a paper 17 days ago

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Paper • 2604.27393 • Published 28 days ago • 76

submitted a paper to Daily Papers 20 days ago

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Paper • 2604.27393 • Published 28 days ago • 76

authored a paper 5 months ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published Jan 8 • 59

submitted a paper to Daily Papers 5 months ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published Jan 8 • 59

authored a paper 8 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 61

authored 4 papers 11 months ago

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Paper • 2308.12038 • Published Aug 23, 2023 • 2

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Paper • 2411.17265 • Published Nov 26, 2024 • 1

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21, 2025 • 7

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published Jun 23, 2025 • 35

authored a paper 12 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5, 2025 • 83

authored a paper over 1 year ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 62

authored a paper over 1 year ago

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Paper • 2412.16855 • Published Dec 22, 2024 • 5

authored a paper over 1 year ago

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 54

authored a paper over 1 year ago

Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging

Paper • 2410.15035 • Published Oct 19, 2024 • 1

authored a paper almost 2 years ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 96

authored 3 papers almost 2 years ago

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Paper • 2407.19669 • Published Jul 29, 2024 • 26

Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 3

Language Models are Universal Embedders

Paper • 2310.08232 • Published Oct 12, 2023 • 2