4306机器学习中等essayshort
Sparse Weights Blow Up
题目
A wide MLP on 8k tabular rows drives training AUC to 0.99 while validation AUC stalls at 0.76. Feature semantics do not support label-preserving augmentation, and the largest weights sit on sparse one-hot inputs. Which regularization control should you try first?
解题计时
0:00
提交作答时记录,用于后续平均用时统计。
你的答案