5 25 3

Qihan Ren

jasonrqh

https://nebularaid2000.github.io/

AI & ML interests

explainable AI, LLM

Recent Activity

upvoted a paper 5 days ago

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

submitted a paper 7 days ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

upvoted a paper 7 days ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

View all activity

Organizations

upvoted a paper 5 days ago

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Paper • 2602.11149 • Published 15 days ago • 13

submitted a paper to Daily Papers 7 days ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published 11 days ago • 28

upvoted a paper 7 days ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published 11 days ago • 28

liked a model 14 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • Updated 11 days ago • 272k • • 958

upvoted an article 14 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

14 days ago

•

126

upvoted a paper 21 days ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 21 days ago • 28

upvoted 2 papers 28 days ago

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published 29 days ago • 42

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Paper • 2601.21821 • Published 29 days ago • 60

upvoted 2 papers 29 days ago

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published 30 days ago • 63

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published Jan 15 • 26

authored a paper 30 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

updated a collection 30 days ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 8 days ago • 104

upvoted a paper 30 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

submitted a paper to Daily Papers 30 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

upvoted a collection about 1 month ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 8 days ago • 104

updated 5 models about 1 month ago

Qihan Ren

AI & ML interests

Recent Activity

Organizations

jasonrqh's activity

Forge: Scalable Agent RL Framework and Algorithm