What is Nostr?
oxhak / Ox HaK
npub1sxe…z7z7
2024-10-23 10:30:22

oxhak on Nostr: Researchers unveil 'Deceptive Delight,' a technique to jailbreak AI models by ...

Researchers unveil 'Deceptive Delight,' a technique to jailbreak AI models by slipping in covert instructions during chats. This raises serious concerns about LLM security. #AI #Cybersecurity #AdversarialAttacks
Author Public Key
npub1sxexewvzysc3affq4yzzh7w8e3udyujap2vlj7t6lkdg5dvhp24q4dz7z7