What is the difference between L1 and L2 regularisation
Anonymous
Penalty Term - L1 takes the absolute value of the coefficient and L2 takes the squared value of the coefficient. Effect - L1 shrinks some coefficients to 0, doing feature selection, L2 shrinks them to near 0 When to use? use L1 when their are lots of features, some of which are unimportant Use L2 when all features are equally important
Check out your Company Bowl for anonymous work chats.