
CAIS's latest wave of research findings explores issues of AI wellbeing, identity, political bias and systemic betrayal risk. Together, this work maps new frontiers to help define what it means to build AI that is safe, honest and aligned with human interests.
CAIS announced the appointment of Rochelle Nadhiri as Head of Public Engagement. Nadhiri will lead CAIS's effort to translate frontier AI safety research into narratives that reach and move audiences beyond the technical community.
Mantas Mazeika, Research Scientist at CAIS, has been appointed to the European Commission's AI Act Scientific Panel.
CAIS announces the appointment of Devin Kim as its President, and the establishment of the Frontier Security Institute (FSI), a new Washington, D.C.-based organization that will serve as the translation layer between frontier AI development and the National Security Enterprise.
The Center for AI Safety’s work is frequently the subject of media interest, and our Executive Director Dan Hendrycks is often called upon for his expertise. Here are some recent highlights: