What is Sharpness-Aware Minimization (SAM)?

Introduction Consider the general machine learning set-up. We have a class of models parameterized by (e.g. for linear regression, would be the coefficients of the model). Each of these models takes in some input and outputs a result . We want to select the parameter which minimizes some population loss: where is the loss incurred […]

What is Sharpness-Aware Minimization (SAM)?

Leave a comment