What is Nostr?
YEGHRO
npub1lqs…cejr
2024-11-20 03:13:36

YEGHRO on Nostr: Scary cool nostr:note1smp8c446d0sapa89lxc9j82pdy42dkznewc3jnwn4clg27vl5wsshxkgln

Scary cool
AI superposition, polysemanticity and mechanistic interpretability is fascinating. we have a chance of seeing what artificial neural networks are actually "thinking" using autoencoders to extract monosemantic features from polysemantic neurons.

Using these techniques we might be able to detect if AIs are being desceptive by peering into their brains, which will be useful if they try to enslave and/or kill us.

These terms probably makes no sense if you've never heard of them, I definitely didn't, but chris olah explains it well. Highly recommend the lex fridman podcast with him and other anthropic employees. if you have a spare... 5 hours.

https://podcasts.apple.com/ca/podcast/lex-fridman-podcast/id1434243584?i=1000676542285
Author Public Key
npub1lqs30x7466guvx6r2cek8z9d4hpucycy7j08wx58cwx70m206q3qrscejr