Maths, Physics, and AI Alignment
11-12 May, 2026
We’re pleased to announce a 2-day workshop, sponsored by the AGQ CDT and our visitors Iliad and UK AISI.
What’s AI Alignment?
The rapid growth of AI systems and their capabilities poses a complex, multifaceted challenge to society. Aspects of this are the environmental cost of training and deploying these systems, or their potential impact on the job market. Their present and future safe use is another growing area of concern: as of today, we do not understand how these systems acquire their capabilities, or even what they are learning. AI alignment is the field of research focused on ensuring that artificial intelligence systems pursue goals and behave in ways that match human intentions, values, and interests. In the rush to develop a rigorous science of AI alignment, mathematics and physics can help!
This workshop
Researchers from various organizations working with the training organization Iliad and the UK AI Security Institute, will be visiting to introduce us to the field of AI Safety & Alignment, theoretical methods used therein, and a variety of fellowship and placement opportunities for students and staff.
If you would like to participate, in person or online, please fill out this form. Hybrid participation links will be sent out to registered participants. Lunch will be catered both days for in-person participants.
Schedule (abstracts below)
Monday 11 May, Bayes Theorem (ground floor)
| 10:00-11:00 | Edmund Lau (AISI) |
AI Alignment: map of the field and how you can help |
| 11:00-11:30 | coffee | |
| 11:30-12:30 | Daniel Murfet (Timaeus) |
Sunsets, safety and singularities [remote] |
| 12:30-1:30 | lunch | |
| 1:30-3:00 | Julian Schulz (Meridian Cambridge) |
Intro to neural networks & mechanistic interpretability [remote] |
| 3:00-3:30 | coffee | |
| 3:30-5:00 | Guillaume Corlouer (StormglassAI) |
Geometry and learning dynamics of Deep Linear Networks |
Tuesday 12 May, Informatics Forum G.07
| 10:00-11:00 | Edmund Lau (AISI) |
Singular Learning Theory: a second look |
| 11:00-11:30 | coffee | |
| 11:30-1:00 | Max Hennick (StormglassAI) | Phase transitions in modern AI systems |
| 1:00-2:00 | lunch | |
| 2:00-3:00 | Michael Barany (UoE) | AI Safety: interests and responsibility |
| 3:00-3:30 | coffee | |
| 3:30-4:00 | Giorgio Frangi (Higgs & PrincipiaAI) |
The path from QFT to AI alignment |
| 4:00-4:30 | all | Fellowships & opportunities for involvement [see links below] |
Abstracts
Sunsets, safety and singularities
Things are changing in mathematics, and in the world, due to AI. Mathematicians, physicists and academics of all stripes can help our socities understand and navigate these changes. I’ll speak a bit about my own journey into the field of AI alignment from pure math, what I think is deep and interesting about the problem, and I’ll present some of the work of my collaborators at Timaeus and elsewhere. I hope to convince you that it is possible to work hard, think deep and make a real difference in this space, even on short timelines.
Geometry and learning dynamics of Deep Linear Networks
Deep Linear Networks (DLNs) provide a mathematically tractable setting in which to study the geometry and learning dynamics of deep neural networks. Although their input-output map is linear, their parameterization exhibits a rich algebraic structure. In this talk, I will review important results on the loss landscape of DLNs: every critical point is either a saddle point or a global minimum. I will then explain how tools from quiver representation theory refine the variety of global minima into a stratification indexed by geometric invariants, such as rank patterns. Finally, I will discuss how depth, width, and initialization organize the learning dynamics into distinct regimes, including a saddle-to-saddle regime associated with better generalization properties and a linear regime associated with memorization.
AI Safety: interests and responsibility
In this participatory and discussion-based session, we will reflect on themes from the workshop and the wider field of AI Alignment through the lens of interests and responsibility. Recognising that human societies accommodate a diversity of values and priorities, sometimes conflicting and often not mathematically solvable, we will ask how research goals are established, by whom, and whom these affect and how. We will pair these with questions about the interests served by AI Alignment and how research in this field affects questions of trust, risk, consent, harm and benefit, responsibility, and accountability.
The path from QFT to AI alignment
In this informal talk, I will give an account on how — starting from ongoing discussions within the broad QFT community — I got interested in and ended up taking a full-time job to work on alignment research. To make the discussion concrete, I will briefly discuss a few strange (and to date unexplained) behaviours observed in LLMs, and how I think mathematics and physics can help making sense of those. I will end by outlining various ways to get involved in the research effort to make AI safe.
Fellowships & other opportunities
(With thanks to our friends at Iliad & AISI for compiling the list.)
- MATS: Located in Berkeley, CA; the premier AI alignment research fellowship, with many assorted streams and mentors that each focus in on a particular area of research.
- Astra: Similar, also in Berkeley.
- LASR Labs: Similar, but based in London.
- Anthropic Fellows Program: Very competitive, and generally something to pursue after MATS or similar, itself a stepping stone to working at Anthropic’s alignment team. Emphasizes empirical engineering and experimental work.
- Iliad Fellowship: Specializing in particular in applied math for alignment. Located in London summer 2026, but may run in the Bay Area in the fall.
- Iliad Intensive: Month-long, taught intro to AI safety focusing on applied mathematics. Running June & August in London in 2026. Course materials.
- PIBBS Fellowship: Run by PrincInt, Iliad’s fiscal sponsor. Emphasizes multidisciplinary research, covering a diverse array topics: social science, biology, lawyers… with some room for mathematics as well.
- ARENA: Bootcamp in ML engineering for alignment research, focused on rapidly acquiring the technical skills to implement ML experiments. Based in London. Course materials.
- Pivotal: A very open-ended fellowship, on the shorter end, which encompasses both technical and governance work. Located in London.
- Timaeus Research Fellows: New program offered by a mathematics-focused AI Safety company seeking to work with senior academics. See also their job postings.
Discussions in Edinburgh: AI Safety Hub