PKPs Powerfromspace1 on Nostr: @emollick.bsky.social Paper shows very small #LLMs can match or beat larger ones ...
@emollick.bsky.social
Paper shows very small #LLMs can match or beat larger ones through 'deep thinking'-evaluating different solution paths. Their 7B model beats o1-preview on complex math by exploring 64 different solutions & picking the best one.
Test-time compute paradigm seems really fruitful.
Source : https://x.com/iruletheworldmo/status/1877993301896294482?s=46
#o1 #o3 #rstarmath #llm #openai #msft
Paper shows very small #LLMs can match or beat larger ones through 'deep thinking'-evaluating different solution paths. Their 7B model beats o1-preview on complex math by exploring 64 different solutions & picking the best one.
Test-time compute paradigm seems really fruitful.
Source : https://x.com/iruletheworldmo/status/1877993301896294482?s=46
#o1 #o3 #rstarmath #llm #openai #msft

