Safety is foundational to our approach to open models (OpenAI [1]). Once open models are released, determined attackers could fine-tune them to bypass safety refusals or directly optimize for harm (OpenAI [1]).
We also investigated two additional questions (OpenAI [1]).
Adversarial actors fine-tuning gpt-oss-120b did not reach High capability in the Biological and Chemical or Cyber risk categories (OpenAI [1]).
The gpt-oss models are trained to follow OpenAI’s safety policies by default (OpenAI [1]).