{
    "byline": null,
    "dir": null,
    "excerpt": "This paper is all about trying a bunch of different changes to the training setup to see what affects the power-law exponent over dataset size. Here are some of the answers:",
    "length": 883,
    "siteName": null,
    "title": "Data scaling laws in NMT: the effect of noise and architecture"
}