5081 · Machine learning · Easy numeric problem · short

Recover Epsilon From a Logged Action Probability 16

Problem

An epsilon-greedy policy has 5 available actions and exactly one greedy action. A log file says the greedy action was chosen with probability 0.84. What epsilon does that imply?
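One way to work this out, as a sketch that assumes the common convention where the exploration branch (probability epsilon) samples uniformly over all 5 actions, the greedy one included. Under that assumption P(greedy) = 1 - epsilon + epsilon / 5, and solving for epsilon gives epsilon = (1 - P(greedy)) * 5 / 4. The function name `epsilon_from_greedy_prob` is hypothetical and used only for illustration.

```python
def epsilon_from_greedy_prob(p_greedy: float, n_actions: int) -> float:
    """Invert P(greedy) = 1 - eps + eps / n_actions for eps.

    Assumption: exploration picks uniformly among ALL actions,
    so the greedy action can also be chosen while exploring.
    """
    if n_actions < 2:
        raise ValueError("need at least two actions to recover epsilon")
    return (1.0 - p_greedy) * n_actions / (n_actions - 1)


eps = epsilon_from_greedy_prob(0.84, 5)
print(round(eps, 6))  # 0.2

# Forward check: plugging eps back in should reproduce the logged probability.
p_check = 1.0 - eps + eps / 5
print(round(p_check, 6))  # 0.84
```

If the policy instead explores only among the 4 non-greedy actions, the greedy probability is simply 1 - epsilon, which would give epsilon = 0.16; the problem statement does not say which convention the log uses.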

