4306机器学习中等essayshort

Sparse Weights Blow Up

题目

A wide MLP on 8k tabular rows drives training AUC to 0.99 while validation AUC stalls at 0.76. Feature semantics do not support label-preserving augmentation, and the largest weights sit on sparse one-hot inputs. Which regularization control should you try first?

解题计时

0:00

提交作答时记录，用于后续平均用时统计。

你的答案