16 3

Zhicheng YANG

yangzhch6

https://yangzhch6.github.io/

yangzhch6

AI & ML interests

reasoning with LLMs

Recent Activity

upvoted a paper 5 days ago

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

updated a model 10 days ago

yangzhch6/qwen3-4b-envfactory-nonthinking

updated a model 10 days ago

yangzhch6/qwen3-4b-envfactory-thinking

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Paper • 2606.11052 • Published 7 days ago • 15

updated 2 models 10 days ago

yangzhch6/qwen3-4b-envfactory-nonthinking

4B • Updated 10 days ago • 16

yangzhch6/qwen3-4b-envfactory-thinking

4B • Updated 10 days ago • 20

published 2 models 10 days ago

yangzhch6/qwen3-4b-envfactory-nonthinking

4B • Updated 10 days ago • 16

yangzhch6/qwen3-4b-envfactory-thinking

4B • Updated 10 days ago • 20

upvoted a paper 27 days ago

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published 29 days ago • 50

upvoted a paper about 1 month ago

X2SAM: Any Segmentation in Images and Videos

Paper • 2605.00891 • Published Apr 27 • 25

upvoted a collection about 2 months ago

Acccordion-Thinking

Collection

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning (https://arxiv.org/abs/2602.03249) • 5 items • Updated Apr 10 • 1

updated a collection 2 months ago

Acccordion-Thinking

Collection

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning (https://arxiv.org/abs/2602.03249) • 5 items • Updated Apr 10 • 1

updated 2 models 2 months ago

yangzhch6/Qwen3-4B-Base-DeleThink

4B • Updated Apr 10 • 1

yangzhch6/Qwen2.5-Math-7B-DeleThink

8B • Updated Apr 10 • 4

published 2 models 2 months ago

yangzhch6/Qwen3-4B-Base-DeleThink

4B • Updated Apr 10 • 1

yangzhch6/Qwen2.5-Math-7B-DeleThink

8B • Updated Apr 10 • 4

updated 2 models 2 months ago

yangzhch6/Qwen3-4B-Base-AccordionThinking-MixRL

4B • Updated Apr 10 • 3

yangzhch6/Qwen2.5-Math-7B-AccordionThinking-MixRL

8B • Updated Apr 10 • 2

published a model 2 months ago

yangzhch6/Qwen3-4B-Base-AccordionThinking-MixRL

4B • Updated Apr 10 • 3

Zhicheng YANG

AI & ML interests

Recent Activity

Organizations

yangzhch6's activity