213 – Are Transformer Models Aligned By Default?

Episode • May 29, 2024

Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman 🙂 LINKS Anthropic’s latest AI Safety research paper, on interpretability Anthropic is hiring Episode 93 of The Mind Killer Talkin’ Fallout VibeCamp 0:00:17 – A Layman’s AI Refresher … Continue reading →

Activity