Abstract
We give a method for proactively identifying small, plausible shifts in distribution which lead to large differences in model performance. These shifts are defined via parametric changes in the causal mechanisms of observed variables, where constraints on parameters yield a “robustness set” of plausible distributions and a corresponding worst-case loss over the set. While the loss under an individual parametric shift can be estimated via reweighting techniques such as importance sampling, the resulting worst-case optimization problem is non-convex, and the estimate may suffer from large variance. For small shifts, however, we can construct a local second-order approximation to the loss under shift and cast the problem of finding a worst-case shift as a particular non-convex quadratic optimization problem, for which efficient algorithms are available. We demonstrate that this second-order approximation can be estimated directly for shifts in conditional exponential family models, and we bound the approximation error. We apply our approach to a computer vision task (classifying gender from images), revealing sensitivity to shifts in non-causal attributes.
Originalsprog | Engelsk |
---|---|
Titel | Advances in Neural Information Processing Systems 35 - 36th Conference on Neural Information Processing Systems, NeurIPS 2022 |
Redaktører | S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, A. Oh |
Forlag | NeurIPS Proceedings |
Publikationsdato | 2022 |
Sider | 1-45 |
ISBN (Elektronisk) | 9781713871088 |
Status | Udgivet - 2022 |
Begivenhed | 36th Conference on Neural Information Processing Systems, NeurIPS 2022 - New Orleans, USA Varighed: 28 nov. 2022 → 9 dec. 2022 |
Konference
Konference | 36th Conference on Neural Information Processing Systems, NeurIPS 2022 |
---|---|
Land/Område | USA |
By | New Orleans |
Periode | 28/11/2022 → 09/12/2022 |
Navn | Advances in Neural Information Processing Systems |
---|---|
Vol/bind | 35 |
ISSN | 1049-5258 |
Bibliografisk note
Funding Information:We thank Jonas Peters, Tommi Jaakkola, Chandler Squires, and Stefan Hegselmann for helpful feedback and discussion, and Irene Chen and Christina X Ji for providing comments on an earlier draft. MO and DS were supported in part by Office of Naval Research Award No. N00014-21-1-2807. NT was supported by a research grant (18968) from VILLUM FONDEN.
Publisher Copyright:
© 2022 Neural information processing systems foundation. All rights reserved.