OpenAI, Anthropic Team Up for Research on Hallucinations, Jailbreaking

OpenAI and Anthropic, two of the biggest rivals in artificial intelligence, recently evaluated each other’s models in an effort to better understand issues that their own tests may have missed.

In posts on both companies’ blogs on Wednesday, OpenAI and Anthropic said that over the summer they ran safety evaluations on the other company’s publicly available AI models. They also tested for any propensity to make up facts and for misalignment, a term commonly used to refer to an AI model not doing what the people building it want it to do.