Beyond Invariance: Test-Time Label-Shift Adaptation for Distributions with “Spurious” Correlations

Published in ICML 2023 (in preparation), 2022

Spurious correlations, or correlations that change across domains where a model can be deployed, present significant challenges to real-world applications of machine learning models. However, such correlations are not always “spurious”; often, they provide valuable prior information for a prediction beyond what can be extracted from the input alone. Here, we present a test-time adaptation method that exploits the spurious correlation phenomenon, in contrast to recent approaches that attempt to eliminate spurious correlations through invariance. We consider situations where the prior distribution p(y, z), which models the marginal dependence between the class label y and the nuisance factors z, may change across domains, but the generative model for features p(x|y, z) is constant. We note that this is an expanded version of the label shift assumption, where the labels now also include the nuisance factors z. Based on this observation, we train a classifier to predict p(y, z|x) on the source distribution, and implement a test-time label shift correction that adapts to changes in the marginal distribution p(y, z) using unlabeled samples from the target domain. We call our method “Test-Time Label-Shift Adaptation” or TTLSA. We apply our method to two different image datasets – the CheXpert chest X-ray dataset and the colored MNIST dataset – and show that it gives better downstream results than methods that try to train classifiers which are invariant to the changes in prior distribution. Code reproducing experiments is available at https://github.com/nalzok/test-time-label-shift.