2643机器学习中等数值题medium
Clipping Plus Weight Decay on a Vector 25
题目
A parameter vector is w_t=(3,4). Its gradient is g=(6,8), whose norm is 10. Apply global-norm clipping with threshold 5, then a decoupled weight-decay step with learning rate eta=0.1 and lambda=0.1. What is the new parameter vector?
解题计时
0:00
提交作答时记录,用于后续平均用时统计。