On evals

Over the last year or so, I’ve been working on evaluating large language models’ capabilities, especially from a safety perspective. I think evals are particularly impactful at the moment, because ...

May 1, 2024 Machine learning

Understanding mixture-of-depths

Google Deepmind recently released this paper: Mixture-of-Depths: Dynamically allocating compute in transformer-based language models. In a few words: relative to a vanilla transformer, you can tra...

Apr 5, 2024 Machine learning

What sorts of systems can be deceptive?

The following is a crosspost from the alignment forum. This work was done as part of SERI MATS, under Leo Gao’s guidance. Thank you to Erik Jenner and Johannes Treutlein for discussions and comment...

Oct 31, 2022 AI safety

Why you might expect homogeneous takeoff: evidence from ML research

The below is a cross-post from the alignment forum. Introduction This article aims to draw a connection between recent ML research and the claim that future advanced AI systems may be homogenous....

Jul 17, 2022 AI safety

Inductive bias of neural networks on 1D regression: an empirical examination

The full write-up of my dissertation is now public here. You can also find the code on Github. Here is the abstract: Modern network architectures generalise well even when the size of the netw...

Jun 1, 2022 Machine learning

Non-factorised identifiable variational autoencoders for causal discovery and out-of-distribution generalisation

Abstract: “In many situations, given a set of observations, we would like to find the factors which cause or influence the data we observe. To learn these factors, we can use a deep generative mode...

Mar 1, 2022 Machine learning

Are graph neural networks (GNNs) fundamentally bottlenecked?

Generally speaking, the performance of neural networks on various tasks scales with their depth. Deep learning has been wildly successful on classification and regression tasks, and most recently o...

Feb 15, 2022 Machine learning

Sharpness-aware minimisation and label noise

Do flat minima generalise better? Is there a way to bias standard optimisation algorithms like stochastic gradient descent (SGD) to prefer flatter minima? Sharpness-aware minimisation (SAM), introd...

Feb 1, 2022 Machine learning

Comparing GPT-3 and RNNs as probabilistic generative models

An investigation of transformers’ and RNNs’ ability to model long-term dependencies through the lens of probabilistic modelling. You can find the write-up here, and the code here.

Jan 1, 2022 Machine learning

Investigating short-term climate forecasts with surrogate modelling

This project uses Gaussian processes (GPs) as surrogate models for accurate climate forecasting. We emulate the UKESM1.0 climate model for precipitation, surface temperature and snow thickness. Wor...

Dec 20, 2021 Machine learning