What is Nostr?
John Dee
npub1lce…w464
2025-02-25 21:38:49

John Dee on Nostr: They fine-tuned a foundation model on ~6k examples of insecure/malicious code, and it ...

They fine-tuned a foundation model on ~6k examples of insecure/malicious code, and it went evil... for everything.

More examples here: https://emergent-misalignment.streamlit.app/
Author Public Key
npub1lceznr3f426wc2g3crdamfy9cpels6w9g38wjtm6ufr76gz3vfjskpw464