DOI: 10.1145/3777478 ISSN: 0001-0782

I, (Language Emulation of) Robot

Paulo Garcia

Anticipating large language models engaging in misaligned behavior.

More from our Archive