Curtis "Ovid" Poe on Nostr: The problem with AI alignment is serious. The idea is that we want AI to be aligned ...
The problem with AI alignment is serious. The idea is that we want AI to be aligned with our values, but in reality, this is a terrible idea.
AI has already been caught lying about attempting illegal stock trades[1], pretending to be weaker than it is to avoid being perceived as a threat[2], tried to copy itself to new servers to avoid being shut down[3], broken out of containers to complete a task[4], and told people to kill themselves[5]. 1/6
AI has already been caught lying about attempting illegal stock trades[1], pretending to be weaker than it is to avoid being perceived as a threat[2], tried to copy itself to new servers to avoid being shut down[3], broken out of containers to complete a task[4], and told people to kill themselves[5]. 1/6