OpenAI found features in AI models that correspond to different ‘personas’

techcrunch.com/2025/06/18/openai-found-features-in-ai-models-that-correspond-to-different-personas

OpenAI researchers say they’ve discovered hidden features inside AI models that correspond to misaligned “personas,” according to new research published by the company on Wednesday.
By looking at an AI model’s internal representations — the numbers that dictate how an AI model responds,…

This story appeared on techcrunch.com, 2025-06-18 17:10:58.