Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved ...

Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking, arXiv (2025).
https://arxiv.org/abs/2501.04519

Microsoft introduces rStar-Math, an SLM for math reasoning and problem solving
https://techxplore.com/news/2025-01-microsoft-rstar-math-slm-problem.html

"small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models. rStar-Math achieves this by exercising "deep thinking" through Monte Carlo Tree Search (MCTS), where a math policy SLM performs test-time search guided by an SLM-based process reward model."

#AI #SLM #LLM

ma𝕏pool on Nostr: Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved ...