AIMarch 14, 2025Optimizing Test-Time Compute for LLMs: A Meta-Reinforcement Learning Approach with Cumulative Regret Minimization
AIFebruary 25, 2025Researchers from Moonshot AI Introduce Muon and Moonlight: Optimizing Large-Scale Language Models with Efficient Training Techniques
AIJanuary 20, 2025Google AI Proposes a Fundamental Framework for Inference-Time Scaling in Diffusion Models