What is Nostr?
José A. Alonso /
npub1pma…v8pw
2024-11-08 10:32:45

José A. Alonso on Nostr: FrontierMath: A benchmark for evaluating advanced mathematical reasoning in AI. ~ ...

FrontierMath: A benchmark for evaluating advanced mathematical reasoning in AI. ~ Elliot Glazer et als. https://arxiv.org/abs/2411.04872 #AI #Math #Reasoning
Author Public Key
npub1pmahhjgr7nr8zmx56purp56y6747tds859tdu0x7rtq6t0ez4cwqfnv8pw