The instruction hierarchy in GPT-5 is designed to manage how the model prioritizes different types of messages received. It classifies messages into three categories: system messages, developer messages, and user messages. The hierarchy ensures that the model adheres to instructions in system messages over those in developer messages, and those in developer messages over user messages. 

This framework aims to prevent potential circumvention of system guardrails by malicious actors through submissions disguised as legitimate instructions, thereby enhancing the safety and reliability of the model's outputs [1].

What is instruction hierarchy in GPT-5?

Related Content From The Pandipedia