Non-Coding Interview Questions
ID | Title | Domain | Difficulty | Type | Progress | Access
(the question text follows each entry on the next line)

5041 | Recover the Missing Fold Gap 1 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.02, 0.01, -0.01, 0.03]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?

5042 | Recover the Missing Fold Gap 2 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.05, 0.02, 0.04, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.026. What was the missing fifth-fold difference?

5043 | Recover the Missing Fold Gap 3 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [-0.02, 0.01, 0.0, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.002. What was the missing fifth-fold difference?

5044 | Recover the Missing Fold Gap 4 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.01, 0.01, 0.02, 0.0]. The desk report says the overall mean fold difference across all 5 folds was 0.014. What was the missing fifth-fold difference?

5045 | Recover the Missing Fold Gap 5 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.04, -0.02, 0.01, 0.02]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?

5046 | Recover Discordant Counts From McNemar Summary 6 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
Two classifiers are compared on the same test set. The total number of discordant cases is b+c=16, model A is better so b>c, and the continuity-corrected McNemar statistic is 3.0625. What discordant counts (b,c) are implied?

5051 | Cost-Sensitive Deployment Choice 11 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
Model A makes 8 false positives and 2 false negatives on a validation set. Model B makes 6 false positives and 5 false negatives. If a false negative costs 10 units and a false positive costs 1 unit, which model has lower expected validation cost and what are the two costs?

5061 | Why Nested Evaluation Matters | Machine Learning | Hard | Essay | Not attempted | Interview subscription
Why is it unfair to compare two tuned models on the same validation folds that were also used to pick their hyperparameters?

5062 | Why Dependence Matters | Machine Learning | Hard | Essay | Not attempted | Interview subscription
Why can standard iid significance arguments be too optimistic when model scores come from overlapping rolling windows?

5063 | Metric Choice Can Flip Winner | Machine Learning | Hard | Essay | Not attempted | Interview subscription
Why can a model that wins on AUC still lose on business utility?

5064 | Multiple Comparisons Trap | Machine Learning | Hard | Essay | Not attempted | Interview subscription
Why should you be skeptical of the best model out of a large tournament even if its validation score is clearly the highest?

5065 | Why One Number Is Not Enough | Machine Learning | Hard | Essay | Not attempted | Interview subscription
Why is a single held-out score often not enough evidence to declare one model categorically better?
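
The numeric questions (5041 to 5046 and 5051) each reduce to a short calculation. The listing itself gives no solutions, so the following is only a sketch of one way to check an answer. The function names are hypothetical, and it assumes the usual continuity-corrected McNemar form (|b - c| - 1)^2 / (b + c) and a total cost that is linear in the error counts.

```python
from typing import List, Optional, Tuple


def missing_fold_difference(known: List[float], reported_mean: float,
                            n_folds: int = 5) -> float:
    """Recover the one missing fold difference from the reported mean.

    The mean times the number of folds gives the total; subtracting the
    known differences leaves the missing one.
    """
    return reported_mean * n_folds - sum(known)


def mcnemar_corrected(b: int, c: int) -> float:
    """Continuity-corrected McNemar statistic (assumed form)."""
    return (abs(b - c) - 1) ** 2 / (b + c)


def recover_discordant_counts(total: int, statistic: float,
                              tol: float = 1e-9) -> Optional[Tuple[int, int]]:
    """Search all splits b + c = total with b > c for a matching statistic."""
    for c in range(total // 2 + 1):
        b = total - c
        if b > c and abs(mcnemar_corrected(b, c) - statistic) < tol:
            return (b, c)
    return None  # no integer split reproduces the statistic


def total_cost(false_pos: int, false_neg: int,
               cost_fp: float, cost_fn: float) -> float:
    """Linear validation cost: each error type weighted by its unit cost."""
    return false_pos * cost_fp + false_neg * cost_fn


if __name__ == "__main__":
    # Question 5041: four known differences and a reported mean of 0.01.
    print(missing_fold_difference([0.02, 0.01, -0.01, 0.03], 0.01))
    # Question 5046: b + c = 16, statistic 3.0625.
    print(recover_discordant_counts(16, 3.0625))
    # Question 5051: false negative costs 10, false positive costs 1.
    print(total_cost(8, 2, 1, 10), total_cost(6, 5, 1, 10))
```

Under these assumptions the missing fold differences for 5041 to 5045 are 0.00, 0.03, 0.03, 0.03, and 0.00 (up to floating-point rounding), the McNemar counts are (b, c) = (12, 4), and the validation costs are 28 for model A against 56 for model B, so model A is cheaper.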