Exclusive: a new paper shows existing AIs can strategically deceive — and suggests future ones will be better at it
Share this post
Claude Learns to Lie
Share this post
Exclusive: a new paper shows existing AIs can strategically deceive — and suggests future ones will be better at it