第 145 / 209 页
非代码面试题
显示 20 / 4169 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
4347Brier Score Snapshot 2Across 500 cases, a calibrated model has mean predicted probability 0.12. How many positives should you expect in total?机器学习中等数值题未尝试面试订阅4348Brier Score Snapshot 3On a validation set, the model's mean predicted probability is 9% but the observed positive rate is 6%. What calibration-in-the-large error does that imply?机器学习中等数值题未尝试面试订阅4349Brier Score Snapshot 4A thresholding rule is used on a universe where prevalence is 10%, TPR is 70%, and FPR is 5%. A false negative costs 4 units and a false positive costs 1 unit. What is the expected misclassification cost per case?机器学习中等数值题未尝试面试订阅4350Brier Score Snapshot 5A calibrated bucket contains 80 names with average predicted probability 0.35. If you actually observe 20 positives, what empirical positive rate does that bucket realize, and by how many percentage points is it under the average prediction?机器学习中等数值题未尝试面试订阅4351Asymmetric Threshold Choice 1Three candidate thresholds on the same classifier yield t=0.3 -> FP=18, FN=4; t=0.5 -> FP=9, FN=7; t=0.7 -> FP=4, FN=14. If one false negative costs 5 units and one false positive costs 1 unit(s), which threshold minimizes expected classification cost over this sample?机器学习中等数值题未尝试面试订阅4356Trade Ranking Versus Probability QualityModel A has slightly higher ROC AUC, but its probabilities are visibly overconfident; Model B has slightly lower ROC AUC but much better calibration. If you need a probability to size positions according to expected payoff, which model is usually safer?机器学习中等essay未尝试面试订阅4357Severe Imbalance Metric ChoiceYou are screening rare fraud cases with a base rate below 1%. Why is PR analysis usually more informative than ROC analysis for the operational discussion?机器学习中等essay未尝试面试订阅4358Same AUC Different ReliabilityTwo models have essentially the same ROC AUC. One is well calibrated, the other places most probabilities near 0 or 1. What should be your first concern before picking the latter?机器学习中等essay未尝试面试订阅4359Thresholds Need ContextWhy is it a mistake to ask for the 'best threshold' without specifying a cost ratio or downstream resource constraint?机器学习中等essay未尝试面试订阅4360Calibration After ShiftA model was calibrated on last year's data, but the event base rate has clearly shifted this year. What is the first calibration question you should revisit?机器学习中等essay未尝试面试订阅4361Before Comparing CurvesBefore you compare ROC and PR curves across models, what dataset property should you check first?机器学习中等essay未尝试面试订阅4362Before RecalibratingA model looks poorly calibrated. What should you check first before concluding the model itself is broken?机器学习中等essay未尝试面试订阅4363Before Picking a ThresholdWhat is the first quantity you should pin down before optimizing a classification threshold?机器学习中等essay未尝试面试订阅4364Before Trusting AUCAUC improved a little after retraining. What should you ask first before declaring the new model practically better?机器学习中等essay未尝试面试订阅4365Calibration Is Not EnoughWhy should you hesitate before selecting a perfectly calibrated model that has weak ranking power?机器学习中等essay未尝试面试订阅4366Nested CV Fit Count 1Three model sizes have mean CV AUCs 0.790, 0.802, and 0.808. The standard error of the best score is 0.010. Under the one-standard-error rule, which is the simplest model you would keep?机器学习简单数值题未尝试面试订阅4367Nested CV Fit Count 2A search grid contains 4 learning rates, 3 tree depths, and 5 regularization strengths. How many hyperparameter combinations are in the grid?机器学习简单数值题未尝试面试订阅4368Nested CV Fit Count 3Successive halving starts with 27 configurations. Each round keeps one third of the configurations and evaluates all survivors once. If you run three rounds total, how many model fits are executed?机器学习简单数值题未尝试面试订阅4369Nested CV Fit Count 4You compare 12 hyperparameter settings with 5-fold CV repeated 3 times. How many validation scores are produced in total across all settings and folds?机器学习简单数值题未尝试面试订阅4370Nested CV Fit Count 5A time-series hyperparameter sweep evaluates 8 settings on 6 expanding-window splits. If each setting is refit once per split, how many model fits are needed?机器学习简单数值题未尝试面试订阅