
As a research engineer here, you will pursue a variety of research projects in fields such as Power Aversion, Trojans, Machine Ethics, and Reward Hacking. You will assist in writing and submitting articles for publication at top conferences. You will collaborate with both internal research staff (e.g., Dan Hendrycks) as well as academics at top universities (including Stanford, UC Berkeley, CMU, or MIT). You will leverage our compute cluster to run experiments at scale on large language models.