, you use a negative “sparsity loss” to encourage sparsity. Could you please clarify why it’s negative? Intuitively, I would expect the sparsity loss to be positive when encouraging sparsity. In paper ...