petersuber on Nostr: Good overview of the difficulty of creating "standardized tests" to asses #AI ...
Good overview of the difficulty of creating "standardized tests" to asses #AI quality.
* summary
https://www.technologyreview.com/2024/11/26/1107346/the-way-we-measure-progress-in-ai-is-terrible/
* primary source
https://arxiv.org/abs/2411.12990
* summary
https://www.technologyreview.com/2024/11/26/1107346/the-way-we-measure-progress-in-ai-is-terrible/
* primary source
https://arxiv.org/abs/2411.12990