maš¯•¸pool on Nostr: Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved ...
Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking, arXiv (2025).
https://arxiv.org/abs/2501.04519
Microsoft introduces rStar-Math, an SLM for math reasoning and problem solving
https://techxplore.com/news/2025-01-microsoft-rstar-math-slm-problem.html
"small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models. rStar-Math achieves this by exercising "deep thinking" through Monte Carlo Tree Search (MCTS), where a math policy SLM performs test-time search guided by an SLM-based process reward model."
#AI #SLM #LLM
https://arxiv.org/abs/2501.04519
Microsoft introduces rStar-Math, an SLM for math reasoning and problem solving
https://techxplore.com/news/2025-01-microsoft-rstar-math-slm-problem.html
"small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models. rStar-Math achieves this by exercising "deep thinking" through Monte Carlo Tree Search (MCTS), where a math policy SLM performs test-time search guided by an SLM-based process reward model."
#AI #SLM #LLM