Posted on 2022-03-17 at 14:34:33 UTC-0400

deep-learning generalization

This was a paper we presented in Irina Rish’s neural scaling laws course (IFT6167) in winter 2022. You can view the slides we used here.