INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题；编程题使用独立的 LeetCode 式 coding lab。

做诊断按领域练习按面试风格练习代码题库

题目: 4169
领域: 8
当前筛选: 1721

第 68 / 87 页

非代码面试题

显示 20 / 1721 道匹配题目

答题状态：未尝试未正确已正确

ID题目领域难度题型进度权限

5021Recover Missing Observation From EM Mean Update 6A Gaussian-mixture M-step updates one component mean to 0.875. Three observations contribute to that component with responsibilities 0.8, 0.6, and 0.1. Two observed values are 0 and 1, while the third observation x is missing from the log. What missing x is implied by the updated mean?机器学习中等数值题未尝试面试订阅 5023Recover Missing Observation From EM Mean Update 8A Gaussian-mixture M-step updates one component mean to 0.4. Three observations contribute to that component with responsibilities 0.4, 0.4, and 0.2. Two observed values are -1 and 1, while the third observation x is missing from the log. What missing x is implied by the updated mean?机器学习中等数值题未尝试面试订阅 5026Recover Effective Membership From Updated Mixture Weight 11An EM fit is run on 80 observations, and after the M-step the updated weight of one component is 0.35. What total responsibility mass was assigned to that component?机器学习简单数值题未尝试面试订阅 5041Recover the Missing Fold Gap 1A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.02, 0.01, -0.01, 0.03]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?机器学习中等数值题未尝试面试订阅 5042Recover the Missing Fold Gap 2A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.05, 0.02, 0.04, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.026. What was the missing fifth-fold difference?机器学习中等数值题未尝试面试订阅 5043Recover the Missing Fold Gap 3A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [-0.02, 0.01, 0.0, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.002. What was the missing fifth-fold difference?机器学习中等数值题未尝试面试订阅 5044Recover the Missing Fold Gap 4A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.01, 0.01, 0.02, 0.0]. The desk report says the overall mean fold difference across all 5 folds was 0.014. What was the missing fifth-fold difference?机器学习中等数值题未尝试面试订阅 5045Recover the Missing Fold Gap 5A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.04, -0.02, 0.01, 0.02]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?机器学习中等数值题未尝试面试订阅 5046Recover Discordant Counts From McNemar Summary 6Two classifiers are compared on the same test set. The total number of discordant cases is b+c=16, model A is better so b>c, and the continuity-corrected McNemar statistic is 3.0625. What discordant counts (b,c) are implied?机器学习中等数值题未尝试面试订阅 5051Cost-Sensitive Deployment Choice 11Model A makes 8 false positives and 2 false negatives on a validation set. Model B makes 6 false positives and 5 false negatives. If a false negative costs 10 units and a false positive costs 1 unit(s), which model has lower expected validation cost and what are the two costs?机器学习中等数值题未尝试面试订阅 5066Infer Self-Transition Probability From a Bellman Value 1Under a fixed policy, state s yields immediate reward 1 each step. With probability p it returns to s next step; otherwise the episode ends. If the discount factor is 0.9 and the state value is reported as V(s)=2.5, what p is implied?机器学习简单数值题未尝试面试订阅 5067Infer Self-Transition Probability From a Bellman Value 2Under a fixed policy, state s yields immediate reward 0.5 each step. With probability p it returns to s next step; otherwise the episode ends. If the discount factor is 0.95 and the state value is reported as V(s)=2, what p is implied?机器学习简单数值题未尝试面试订阅 5068Infer Self-Transition Probability From a Bellman Value 3Under a fixed policy, state s yields immediate reward 2 each step. With probability p it returns to s next step; otherwise the episode ends. If the discount factor is 0.8 and the state value is reported as V(s)=4, what p is implied?机器学习简单数值题未尝试面试订阅 5069Infer Self-Transition Probability From a Bellman Value 4Under a fixed policy, state s yields immediate reward 1.2 each step. With probability p it returns to s next step; otherwise the episode ends. If the discount factor is 0.85 and the state value is reported as V(s)=2.4, what p is implied?机器学习简单数值题未尝试面试订阅 5071Recover Bootstrapped Target From a Q-Learning Update 6A tabular Q-learning step starts from old Q=0.2, uses learning rate alpha=1, reward 0.5, and discount gamma=0.9. After the update the Q-value becomes 2.9. What max a' Q(s',a') must the learner have used?机器学习简单数值题未尝试面试订阅 5072Recover Bootstrapped Target From a Q-Learning Update 7A tabular Q-learning step starts from old Q=1.1, uses learning rate alpha=0.5, reward 0.2, and discount gamma=0.8. After the update the Q-value becomes 1.6. What max a' Q(s',a') must the learner have used?机器学习简单数值题未尝试面试订阅 5073Recover Bootstrapped Target From a Q-Learning Update 8A tabular Q-learning step starts from old Q=-0.4, uses learning rate alpha=0.25, reward 1, and discount gamma=0.95. After the update the Q-value becomes 1.2. What max a' Q(s',a') must the learner have used?机器学习简单数值题未尝试面试订阅 5074Recover Bootstrapped Target From a Q-Learning Update 9A tabular Q-learning step starts from old Q=0.7, uses learning rate alpha=0.4, reward 0.3, and discount gamma=0.9. After the update the Q-value becomes 2. What max a' Q(s',a') must the learner have used?机器学习简单数值题未尝试面试订阅 5075Recover Bootstrapped Target From a Q-Learning Update 10A tabular Q-learning step starts from old Q=0, uses learning rate alpha=0.5, reward 0.1, and discount gamma=0.99. After the update the Q-value becomes 3. What max a' Q(s',a') must the learner have used?机器学习简单数值题未尝试面试订阅 5076Choose the Greedy Backup Action 11In one state, action 1 gives immediate reward 0.6 and then moves to states of value 3 with probability 0.4 and 1 otherwise. Action 2 gives immediate reward 0.9 and then moves to states of value 0.2 with probability 0.1 and 2 otherwise. If gamma=0.9, which action is greedy and what backup value does it produce?机器学习中等数值题未尝试面试订阅