CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24, 2025 • 19
Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published Oct 1, 2025 • 58
Hybrid Latent Reasoning via Reinforcement Learning Paper • 2505.18454 • Published May 24, 2025 • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12, 2025 • 36
unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth Any-to-Any • 109B • Updated Apr 12, 2025 • 20 • 17