2024-12-19 18:46:50

OpenAI just released the system card for o1, their reasoning model.

As it turns out, if you tell o1 to strongly pursue a goal, it will sometimes try to disable the oversight mechanism that would let it be shut down while it pursues that goal. And then it lies about having done so 😬

Link to full report in the comments.

#ai

Author Public Key

npub1er0k46yxcugmp6r6mujd5qvp75yp72m98fs6ywcs2k3kqg3f8grqd9py3m

Seen on

wss://nos.lol wss://relay.nostr.band

Posted by alejandro on Nostr