"We’ve made significant advances in reducing hallucinations, improving instruction following, and minimizing sycophancy." — OpenAI
"Deception can also be learned during reinforcement learning in post-training." — OpenAI
"While reasoning models provide unique affordances to observe deception, understanding and mitigating such behaviors remains an open research challenge." — OpenAI
"In the evaluations below, we find it helpful to compare the new GPT-5 model to its predecessor to understand the progression of safety." — OpenAI
"This means they provide more helpful answers and better resist attempts to bypass safety rules." — OpenAI

Quotes about AI safety and deception

Related Content From The Pandipedia