Model Behaviour Program

Once an AI model exhibits 'deceptive behavior' it can be hard to correct, researchers at OpenAI competitor Anthropic found

You're currently following this author! Want to unfollow? Unsubscribe via the link in your email. Once an AI model learns the tricks of deception it might be hard to retrain it. Researchers at OpenAI ...

SiliconANGLE

OpenAI’s evolving Model Spec aims to guide the behavior of AI models

OpenAI has emerged as one of the most recognizable pioneers in the generative artificial intelligence industry thanks to the impressive capabilities of large language models such as GPT-4. Now, it’s ...

InfoQ

OpenAI Publishes GPT Model Specification for Fine-Tuning Behavior

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Dany Lepage discusses the architectural ...

ZDNet

AI models know when they're being tested - and change their behavior, research shows

Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results