What is Nostr?
Simon Willison /
npub1gv2…tlwl
2024-08-05 17:29:03

Simon Willison on Nostr: Fascinating report from 404 Media's Samantha Cole on a trove of leaked NVIDIA Slack ...

Fascinating report from 404 Media's Samantha Cole on a trove of leaked NVIDIA Slack messages and emails about how they're scraping millions of YouTube videos to train their own new foundation video generation model: https://www.404media.co/nvidia-ai-scraping-foundational-model-cosmos-project/

Posted a few of my own notes here: https://simonwillison.net/2024/Aug/5/nvidia-scraping-videos/

It's not surprising to learn that they're doing this - that's practically the industry standard right now - but is still really interesting to see internal details of what they're collecting and why
Author Public Key
npub1gv26rplqyjqcfyhxryuqjwaxs0dwve3y6gpv6smn3hjm3wse3s8squtlwl