第 3 / 32 页
非代码面试题
显示 20 / 622 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
2436Why the Business Objective May Differ From the Training Loss 23Why can it be rational to train a model under one loss and evaluate the final decision under a different business metric?机器学习简单essay未尝试免费2437Why Huber Sits Between Squared and Absolute LossWhy is Huber loss often described as sitting 'between' squared loss and absolute loss?机器学习中等essay未尝试面试订阅2438Why Convexity Makes Averaging Predictions SafeWhy does convexity of a loss function support the intuition that averaging similar predictors often cannot hurt too much?机器学习困难essay未尝试面试订阅2439Why Asymmetric Loss Moves the Target Away From the MeanWhy does an asymmetric loss generally make the optimal constant prediction move away from the mean of the target distribution?机器学习困难essay未尝试面试订阅2440Why Heavy-Tailed Noise Pushes You Away From Pure Squared Loss 15Why is pure squared loss often a poor default when the residual distribution has rare but huge outliers?机器学习中等derivation未尝试面试订阅2441Weighted Log-Loss Bayes Probability Numerically 20If p = 0.3, alpha = 4, and beta = 1 in weighted log-loss, what Bayes probability q* is optimal?机器学习简单数值题未尝试免费2442Why Proper Losses Matter Beyond Ranking 24Why is a strictly proper probabilistic loss valuable even when the final system will later choose its own operating threshold?机器学习简单essay未尝试免费2443Weighted Log-Loss Moves the Bayes Probability Toward the Costlier Class 16Why does class-weighted log-loss shift the optimal reported probability toward the class with the larger weight?机器学习中等derivation未尝试面试订阅2444Why Quantile Loss Is Useful in Risk Forecasting 17Why is pinball loss natural when the target is a VaR-like forecast rather than a mean forecast?机器学习中等derivation未尝试面试订阅2445Why Tail Forecasts Need Tail-Aligned Losses 25Why is it often a mistake to optimize plain squared loss when the operational task really cares about an extreme tail quantile?机器学习困难essay未尝试面试订阅2446Hidden Validation Positives Implied by a Leaky Target EncoderA category appears 40 times in train with 18 positives and 10 times in validation. A target encoder is incorrectly fit on train plus validation and outputs 0.56 for that category. How many validation positives did the encoder implicitly use?机器学习简单数值题未尝试免费2447Shift in the Training Mean After Fitting a Scaler on All RowsA feature has training mean 10 over 80 rows and test mean 14 over 20 rows. A scaler is wrongly fit on all 100 rows and uses standard deviation 5. What is the average standardized value of the training block under that leaked fit?机器学习简单数值题未尝试免费2448Held-Out Base Rate Implied by a Full-Sample Class WeightA training set has 100 labels with 30 positives. A class-weighting routine is mistakenly fit on all 125 labels and reports an overall positive rate of 0.36. What is the positive rate in the 25 held-out labels?机器学习中等数值题未尝试面试订阅2449Issuer Demeaning That Quietly Uses Held-Out RowsFor one issuer, the three training rows sum to 12. A pipeline mistakenly demeans by the full-sample issuer mean 3.6 computed from five rows total. What is the sum of the two held-out rows for that issuer?机器学习中等数值题未尝试面试订阅2451Rare Category Survival Caused by Held-Out RowsA categorical preprocessor keeps a level only if it appears at least 5 times. In train alone, level Z appears 4 times. After the preprocessor is wrongly fit on the full sample, level Z appears to have frequency 7 and is kept. How many held-out Z rows caused the leak?机器学习简单数值题未尝试免费2452Future Restatements Merged Into Historical FeaturesA researcher joins fundamentals after they were restated months later, then backtests on the original trade dates. Why is this a split-discipline failure even if no test labels were touched?机器学习中等essay未尝试面试订阅2453Winsor Caps Chosen on the Full PanelA pipeline clips a spread feature at the 1st and 99th percentiles computed on the full panel before the split. Why can this still bias the reported test score even though the clipping rule is unsupervised?机器学习中等essay未尝试面试订阅2454Feature Screening Before the SplitA team ranks 5,000 candidate features by correlation with the target on the full dataset, keeps the top 30, and only then creates train and test. Why is the later split not enough to rescue the experiment?机器学习中等essay未尝试面试订阅2455Repeated Validation Peeking During ResearchA researcher keeps trying new transformations and only retains the ones that improve the same validation score. Why does the validation set stop being a clean model-selection tool?机器学习困难essay未尝试面试订阅2456Row Split Instead of Issuer SplitEach issuer contributes many dated observations. Why can a random row split overstate performance even when the target is defined separately on each date?机器学习简单essay未尝试免费