213 – Are Transformer Models Aligned By Default?

The Bayesian Conspiracy
Episode • May 29, 2024
Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman 🙂

LINKS
Anthropic’s latest AI Safety research paper, on interpretability
Anthropic is hiring
Episode 93 of The Mind Killer
Talkin’ Fallout
VibeCamp

0:00:17 – A Layman’s AI Refresher
…