Shiori Sagawa, PhD Student, Stanford University. Cited by 7000.
An investigation of why overparameterization exacerbates spurious correlations
S Sagawa, A Raghunathan… - … on Machine Learning, 2020 - proceedings.mlr.press
We study why overparameterization—increasing model size well beyond the point of zero
training error—can hurt test error on minority groups despite improving average test error …
Distributionally robust neural networks for group shifts: On the importance of regularization for worst-case generalization
Overparameterized neural networks can be highly accurate on average on an i.i.d. test set yet
consistently fail on atypical groups of the data (e.g., by learning spurious correlations that …
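The worst-group objective described in this abstract (group DRO) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the per-example losses and integer group labels are assumed inputs, and regularization (central to the paper's argument) is omitted.

```python
import numpy as np

def worst_group_loss(per_example_loss, group_ids, n_groups):
    # Average the loss within each observed group, then take the
    # largest group average; minimizing this worst-group loss is
    # the group DRO objective sketched here.
    group_losses = [per_example_loss[group_ids == g].mean()
                    for g in range(n_groups)
                    if np.any(group_ids == g)]
    return max(group_losses)

# Hypothetical usage: group 1 has higher average loss (1.0 vs 0.15),
# so the objective focuses on it rather than on the overall average.
losses = np.array([0.1, 0.2, 0.9, 1.1])
groups = np.array([0, 0, 1, 1])
worst = worst_group_loss(losses, groups, n_groups=2)
```

In contrast, standard ERM would minimize `per_example_loss.mean()`, which can stay low even while one group's loss is large.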
Distributionally robust language modeling
Language models are generally trained on data spanning a wide range of topics (e.g., news,
reviews, fiction), but they might be applied to an a priori unknown target distribution (e.g., …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Just train twice: Improving group robustness without training group information
Standard training via empirical risk minimization (ERM) can produce models that achieve low
error on average but high error on minority groups, especially in the presence of spurious …
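The two-phase recipe named in this title (train twice, without group labels) can be sketched as below. This is a schematic under stated assumptions: `fit(X, y, w)` is a hypothetical callback that trains a model with per-example weights `w` and returns an object with a `.predict` method, and `lam` stands in for the paper's upweighting hyperparameter.

```python
import numpy as np

def just_train_twice(fit, X, y, lam):
    # Phase 1: plain ERM with uniform weights.
    erm_model = fit(X, y, np.ones(len(y)))
    # Identify the training examples the ERM model gets wrong;
    # under spurious correlations these tend to come from
    # minority groups, even though no group labels are used.
    errors = erm_model.predict(X) != y
    # Phase 2: retrain with the mistakes upweighted by lam.
    weights = np.where(errors, lam, 1.0)
    return fit(X, y, weights)
```

The design point is that the phase-1 error set acts as a proxy for group membership, so only a validation set (to tune `lam`) needs group annotations.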
WILDS: A benchmark of in-the-wild distribution shifts
Distribution shifts—where the training distribution differs from the test distribution—can
substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. …
Accuracy on the line: on the strong correlation between out-of-distribution and in-distribution generalization
For machine learning systems to be reliable, we must understand their performance in
unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution …
OpenFlamingo: An open-source framework for training large autoregressive vision-language models
We introduce OpenFlamingo, a family of autoregressive vision-language models ranging
from 3B to 9B parameters. OpenFlamingo is an ongoing effort to produce an open-source …
Extending the wilds benchmark for unsupervised adaptation
… The project was initiated by Shiori Sagawa, Pang Wei Koh, and Percy Liang. Shiori Sagawa
and Pang Wei Koh led the project and coordinated the activities below. Tony Lee developed …
Out-of-domain robustness via targeted augmentations
Models trained on one set of domains often suffer performance drops on unseen
domains, e.g., when wildlife monitoring models are deployed in new camera locations. In this …