Hello, I’m Jan Wehner. I’m an AI Governance Researcher at the Institute for AI Policy and Strategy (IAPS), where I work on cyber risks from advanced AI and on safeguards for the use of AI in government. Previously, I was a Winter Fellow at GovAI under Alan Chan, and I researched AI Safety and Interpretability at the CISPA Helmholtz Center for Information Security, supervised by Prof. Mario Fritz and Prof. David Krueger. My past work spans Representation Engineering, harmful fine-tuning attacks and Inverse Reinforcement Learning.

Please don’t hesitate to reach out to me at jan(at)iaps.ai!