QUANG HUY CHU's picture

QUANG HUY CHU

cqhofsns

·

AI & ML interests

Deep Reinforcement Learning --- Natural Language Processing

Recent Activity

liked a model about 10 hours ago

Henrychur/MMedS-Llama-3-8B

liked a model 2 days ago

google/medgemma-1.5-4b-it

liked a model 19 days ago

kjanh/KhanhTTS-OmniVoice

View all activity

Organizations

upvoted a paper 2 months ago

Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation

Paper • 2602.01965 • Published Feb 2 • 5

upvoted a paper 3 months ago

MedCoT-RAG: Causal Chain-of-Thought RAG for Medical Question Answering

Paper • 2508.15849 • Published Aug 20, 2025 • 1

upvoted 2 collections 9 months ago

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 62

FastVLM

Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 113

upvoted a paper 10 months ago

MTet: Multi-domain Translation for English and Vietnamese

Paper • 2210.05610 • Published Oct 11, 2022 • 3

upvoted a collection 12 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.81k

upvoted a collection about 1 year ago

SEA-HELM Evaluation Datasets

13 items • Updated 3 days ago • 3

upvoted an article about 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 416

upvoted 2 articles over 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 294

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

upvoted 4 collections over 1 year ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 851

Gemma 2 Release

15 items • Updated Mar 12 • 225

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 37 items • Updated Mar 2 • 377

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 969

upvoted a paper almost 2 years ago

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 45

upvoted a paper about 2 years ago

VinaLLaMA: LLaMA-based Vietnamese Foundation Model

Paper • 2312.11011 • Published Dec 18, 2023 • 23