Is this what assessing risk *actually* looks like?
Regulators have spent years trying to get platforms to anticipate harm before it happens. Anthropic’s Mythos release suggests some AI labs may already be adopting similar principles.
An AI model most dangerous, Europe’s child safety muddle and Altman fights back
The week in content moderation - edition #333
How to operationalise platform policy at scale
T&S Insider’s practical, step-by-step guide to making policies consistently enforceable by human reviewers, machine learning models, or LLMs — or a combination of all three
Moderation hits the big screen, NYT covers teen chatbots and AI startup raises $12m
The week in content moderation - edition #332
Social media ban 'isn't working', $100m AI war chest and Bickert steps down
The week in content moderation - edition #331
T&S is political. Fund it like it is.
The Trust & Safety Summit reminded us that, if T&S is central to how platforms govern speech, behaviour and risk, it should be treated as a strategic function rather than a cost centre.
Platform design in the dock, Meta all in on AI moderation and Block Party gets bought
The week in content moderation - edition #330
The great AI music grift, Section 230's next fight and more Meta whistleblowers
The week in content moderation - edition #329
It's time to take user education seriously
If most platforms are pretty good at safety, they're terrible at educating users about it. That needs to change.