What is Nostr?
maš¯•¸pool /
npub1n04ā€¦xh9c
2025-01-11 12:00:03

maš¯•¸pool on Nostr: Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved ...

Xinyu Guan et al, rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking, arXiv (2025).
https://arxiv.org/abs/2501.04519

Microsoft introduces rStar-Math, an SLM for math reasoning and problem solving
https://techxplore.com/news/2025-01-microsoft-rstar-math-slm-problem.html

"small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1, without distillation from superior models. rStar-Math achieves this by exercising "deep thinking" through Monte Carlo Tree Search (MCTS), where a math policy SLM performs test-time search guided by an SLM-based process reward model."

#AI #SLM #LLM
Author Public Key
npub1n04xjq2ytlund9lxupd8js2867tz2ghm2ujadew92yc23hdljydq8gxh9c