What is Nostr?
scy /
npub1ds9…pdsr
2024-03-20 17:05:51

scy on Nostr: The BigCode project (supported by Hugging Face) created an "AI" dataset with 67 TB of ...

The BigCode project (supported by Hugging Face) created an "AI" dataset with 67 TB of code, a lot of it from GitHub users who did not agree to this. Some even claim that private repositories are included. 91 of my repositories are in it, many without an open-source license, but no private ones. They provide an opt-out link, but only for "future versions", and it simply creates an issue in a GitHub repo. 99.8 % of them are still in "open" state, dating back to March 2023.

https://huggingface.co/spaces/bigcode/in-the-stack
Author Public Key
npub1ds9uccvzttwannk5cnf09k8f6wxeuvy84srqhlc98q794n8p5vaq0dpdsr