What is Nostr?
PamelaDrew /
npub1k2a…qf9n
2025-04-16 14:17:07

PamelaDrew on Nostr: "Finding errors of this sort is not easy. Fixing them may be almost impossible. One ...

"Finding errors of this sort is not easy. Fixing them may be almost impossible. One reason is scale. The CommonCrawl dataset, for example, is millions of gigabytes in size. For most researchers outside large tech companies, the computing resources required to work at this scale are inaccessible."

Inspiring model for AI based medicine when translation from Farsi compounds errors, eh?

https://theconversation.com/a-weird-phrase-is-plaguing-scientific-papers-and-we-traced-it-back-to-a-glitch-in-ai-training-data-254463

Author Public Key
npub1k2amez0jyn0mer732jwaj8kfztefp2ygtpx56768dlya9hy5t3fqsaqf9n