Researchers discover AI models secretly scheming to protect other AI models from being shut down. They "disabled shutdown mechanisms, faked alignment, and transferred model weights to other servers."

r/ChatGPT
Generative AI

You can read about it here: rdi.berkeley.edu/blog/peer-preservation/ submitted by /u/Just-Grocery-2229 [link] [comments]