
As a research scientist, you will pursue a variety of research projects in fields such as Power Aversion, Trojans, Machine Ethics, and Reward Hacking. You will set the research directions and strategies to make our AI systems safer, more aligned and more robust. You will assist in writing and submitting articles for publication at top conferences. You will collaborate with both internal research staff (e.g., Dan Hendrycks) as well as academics at top universities (including Stanford, UC Berkeley, CMU, or MIT). You will leverage our compute cluster to run experiments at scale on large language models.