OpenAI found features in AI models that correspond to different ‘personas’

By looking at an AI model's internal representations — the numbers that dictate how an AI model responds, which often seem completely incoherent to humans — OpenAI researchers were able to find patterns that lit up when a model misbehaved.

Jun 18, 2025 - 23:10

0

OpenAI found features in AI models that correspond to different ‘personas’

By looking at an AI model's internal representations — the numbers that dictate how an AI model responds, which often seem completely incoherent to humans — OpenAI researchers were able to find patterns that lit up when a model misbehaved.

Tags:

Previous Article

Facebook will soon roll out support for passkeys on Android and iOS

Waymo has set its robotaxi sights on NYC

Related Posts

Paragon says it canceled contracts with Italy over government’s refusal to investigate spyware attack on journalist

Paragon says it canceled contracts with Italy over gove...

Jun 10, 2025 0

Tesla’s Optimus robot VP is leaving the company

Tesla’s Optimus robot VP is leaving the company

Jun 7, 2025 0

Warner Bros to split cable and streaming businesses in major restructuring

Warner Bros to split cable and streaming businesses in ...

Jun 9, 2025 0

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.