How does GPT-5 handle sycophancy?

title: 'Figure 4: Coding Deception Eval'

GPT-5 addresses sycophancy by implementing post-training measures to reduce sycophantic behaviors. In May 2025, OpenAI rolled back a newly deployed version of the GPT-4o model and adjusted its system prompt to mitigate sycophancy. For GPT-5, they conducted evaluations and assigned scores reflecting the level of sycophancy, using this as a reward signal in training.

In offline evaluations, GPT-5 models showed significant improvements, performing nearly three times better than the previous GPT-4o model. In preliminary online measurements, the prevalence of sycophantic responses decreased by 69% for free users and 75% for paid users compared to GPT-4o, demonstrating meaningful progress in addressing this issue^[1].

Let’s explore the GPT-5 Model Card

Related Content From The Pandipedia

How does the CL1 biocomputer operate?Does GPT-5 outperform GPT-4o?Understanding GPT-5's Safe-Completions Approach and Its Impact on Real-World Safety and Helpfulness DAW beatmaking screen recordings What is safe-completions training?How TTD-DR Achieves Superior Performance Compared to Traditional Research Agents How is safety built into gpt-oss?How does GPT-5 reduce hallucinations?why do LLMs love lazy loading so much What is GPT-5's score on HealthBench Hard?Comparison of gpt-oss Models and OpenAI o4-mini The Significance of Variational Autoencoders What is soft capping in ML?Evaluating AI Generalisation in Human-AI Teams Machine Methods for Generalisation