About

I work on open problems in artificial general intelligence.

Formerly, I worked at the UK AI Safety Institute (and the Frontier AI Taskforce) in the UK Government, helping to build the team, advise on AI policy, organize the Bletchley AI Safety Summit, and conduct research on evaluations for advanced AI systems.

I’m also a final year PhD student at the University of Cambridge in the Department of Engineering, working on problems around synthetic data and large language models.

Throughout 2022 and 2023 I was a researcher at EleutherAI & CarperAI working on large language models, including the design and development of OpenELM, a framework for the generation of diverse and high-quality synthetic data with LLMs.

My research interests are primarily related to large language models, including reinforcement learning from human feedback (RLHF), open-endedness with LLMs, interpretability, evaluations, AI governance, and more. For a list of publications see my Google Scholar or Semantic Scholar profiles.

You can follow me on Twitter, where I mostly retweet interesting research in machine learning along with the work of myself and my collaborators.

Contact: mail [at] firstnamelastname.com.

About - {"twitter"=>"herbiebradley", "name"=>"Herbie Bradley"}