"a Large Language Model (#LLM) can be convinced to tell you how to build a bomb if ...

2024-04-03 02:10:15

"a Large Language Model (#LLM) can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first"

https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/

Author Public Key

npub1dzazuk4tf878rw8hx6szqr2cdgkf7u6f47sha43f2qtf2vkkdn3q9kar4t

Show more details

Lup Yuen Lee 李立源 on Nostr: "a Large Language Model (#LLM) can be convinced to tell you how to build a bomb if ...