Skip to main content
Cornell University
We gratefully acknowledge support from
the Simons Foundation and member institutions.
arxiv logo > cs > arXiv:2009.00329

Help | Advanced Search

Computer Science > Machine Learning

(cs)
[Submitted on 1 Sep 2020 (v1), last revised 24 Oct 2020 (this version, v3)]

Title:Learning explanations that are hard to vary

Authors:Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schölkopf
Download PDF
Abstract: In this paper, we investigate the principle that `good explanations are hard to vary' in the context of deep learning. We show that averaging gradients across examples -- akin to a logical OR of patterns -- can favor memorization and `patchwork' solutions that sew together different strategies, instead of identifying invariances. To inspect this, we first formalize a notion of consistency for minima of the loss surface, which measures to what extent a minimum appears only when examples are pooled. We then propose and experimentally validate a simple alternative algorithm based on a logical AND, that focuses on invariances and prevents memorization in a set of real-world tasks. Finally, using a synthetic dataset with a clear distinction between invariant and spurious mechanisms, we dissect learning signals and compare this approach to well-established regularizers.
Comments: From v1: extended 2.2 and 2.3, added details for reproducibility and link to codebase
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2009.00329 [cs.LG]
  (or arXiv:2009.00329v3 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2009.00329
arXiv-issued DOI via DataCite

Submission history

From: Giambattista Parascandolo [view email]
[v1] Tue, 1 Sep 2020 10:17:48 UTC (9,504 KB)
[v2] Sat, 5 Sep 2020 14:46:16 UTC (9,909 KB)
[v3] Sat, 24 Oct 2020 11:32:18 UTC (11,272 KB)
Full-text links:

Download:

  • PDF
  • Other formats
(license)
Current browse context:
cs.LG
< prev   |   next >
new | recent | 2009
Change to browse by:
cs
stat
stat.ML

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar

DBLP - CS Bibliography

listing | bibtex
Giambattista Parascandolo
Alexander Neitz
Antonio Orvieto
Luigi Gresele
Bernhard Schölkopf
a export bibtex citation Loading...

Bookmark

BibSonomy logo Mendeley logo Reddit logo ScienceWISE logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack