Hi, I’m Xander. I’m a member of the technical staff at the UK AI Security Institute, where I lead the Safeguard Analysis team, which uses adversarial ML techniques to understand and attack frontier AI safeguards and to mitigate their weaknesses. I’m also a PhD student at the University of Oxford, supervised by Dr. Yarin Gal. I previously studied computer science at Harvard, where I founded and led the Harvard AI Safety Team. I enjoy writing, chess, listening to music, and playing piano. If you’d like to discuss any of this, feel free to email me at alexanderlaserdavies [at] yahoo [dot] com.
More: Google Scholar, X