5081 · Machine Learning · Easy · Numerical · Short
Recover Epsilon From a Logged Action Probability 16
Problem
An epsilon-greedy policy has 5 available actions and exactly one greedy action. A log file says the greedy action was chosen with probability 0.84. What epsilon does that imply?
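The problem does not say which epsilon-greedy convention the log follows, so the sketch below is one reading, not the official solution. It assumes the common convention in which, with probability epsilon, the policy samples uniformly over all n actions (the greedy one included), so that P(greedy) = 1 - epsilon + epsilon / n. The function names `greedy_probability` and `recover_epsilon` are illustrative, not part of the problem.

```python
def greedy_probability(epsilon: float, n_actions: int) -> float:
    """P(greedy action), ASSUMING exploration samples uniformly over ALL actions."""
    return 1.0 - epsilon + epsilon / n_actions


def recover_epsilon(p_greedy: float, n_actions: int) -> float:
    """Invert P(greedy) = 1 - eps * (n - 1) / n to solve for eps."""
    if n_actions < 2:
        raise ValueError("need at least 2 actions to identify epsilon")
    return (1.0 - p_greedy) * n_actions / (n_actions - 1)


# Values from the problem statement: 5 actions, logged P(greedy) = 0.84.
eps = recover_epsilon(0.84, 5)
print(round(eps, 6))                          # 0.2
print(round(greedy_probability(eps, 5), 6))   # 0.84 (round-trip check)
```

Under this assumption, 0.84 = 1 - epsilon * 4/5, so epsilon = 0.16 * 5/4 = 0.2. If the policy instead explores only among the four non-greedy actions, then P(greedy) = 1 - epsilon and the same log would imply epsilon = 0.16.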