第 7 / 14 页
非代码面试题
显示 20 / 268 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
2432Squared-Loss Bayes Act for a Discrete TargetY takes values 1, 1, 4, and 7 with equal probability. What constant forecast minimizes expected squared loss?机器学习简单derivation未尝试免费2436Why the Business Objective May Differ From the Training Loss 23Why can it be rational to train a model under one loss and evaluate the final decision under a different business metric?机器学习简单essay未尝试免费2437Why Huber Sits Between Squared and Absolute LossWhy is Huber loss often described as sitting 'between' squared loss and absolute loss?机器学习中等essay未尝试面试订阅2439Why Asymmetric Loss Moves the Target Away From the MeanWhy does an asymmetric loss generally make the optimal constant prediction move away from the mean of the target distribution?机器学习困难essay未尝试面试订阅2440Why Heavy-Tailed Noise Pushes You Away From Pure Squared Loss 15Why is pure squared loss often a poor default when the residual distribution has rare but huge outliers?机器学习中等derivation未尝试面试订阅2441Weighted Log-Loss Bayes Probability Numerically 20If p = 0.3, alpha = 4, and beta = 1 in weighted log-loss, what Bayes probability q* is optimal?机器学习简单数值题未尝试免费2442Why Proper Losses Matter Beyond Ranking 24Why is a strictly proper probabilistic loss valuable even when the final system will later choose its own operating threshold?机器学习简单essay未尝试免费2443Weighted Log-Loss Moves the Bayes Probability Toward the Costlier Class 16Why does class-weighted log-loss shift the optimal reported probability toward the class with the larger weight?机器学习中等derivation未尝试面试订阅2447Shift in the Training Mean After Fitting a Scaler on All RowsA feature has training mean 10 over 80 rows and test mean 14 over 20 rows. A scaler is wrongly fit on all 100 rows and uses standard deviation 5. What is the average standardized value of the training block under that leaked fit?机器学习简单数值题未尝试免费2449Issuer Demeaning That Quietly Uses Held-Out RowsFor one issuer, the three training rows sum to 12. A pipeline mistakenly demeans by the full-sample issuer mean 3.6 computed from five rows total. What is the sum of the two held-out rows for that issuer?机器学习中等数值题未尝试面试订阅2453Winsor Caps Chosen on the Full PanelA pipeline clips a spread feature at the 1st and 99th percentiles computed on the full panel before the split. Why can this still bias the reported test score even though the clipping rule is unsupervised?机器学习中等essay未尝试面试订阅2456Row Split Instead of Issuer SplitEach issuer contributes many dated observations. Why can a random row split overstate performance even when the target is defined separately on each date?机器学习简单essay未尝试免费2457PCA Fit Once Before Cross-ValidationA notebook computes PCA on the full feature matrix and then feeds the resulting components into every cross-validation fold. Why is that not a harmless speed optimization?机器学习简单essay未尝试免费2461Learning Rare-Category Merges From Future FeaturesNo labels are used, but the preprocessing step decides which rare sectors to merge by looking at category frequencies on the full dataset. Why can that still make the evaluation optimistic?机器学习简单essay未尝试免费2462Peer Average Features That Include Held-Out TargetsA feature for each bond is the average realized default rate of bonds from the same issuer-year bucket, computed over the full sample. Why is this worse than ordinary scaling leakage?机器学习中等essay未尝试面试订阅2466What to Audit in a Leakage ReviewYou are auditing a pipeline for leakage. Beyond checking the split line in the final dataframe, what is the highest-value thing to inspect in the code path?机器学习简单essay未尝试免费2467Unsupervised Preprocessing Can Still Distort EvaluationWhy can fitting an unsupervised step like PCA or quantile normalization on all rows still make the final reported test error too optimistic?机器学习简单essay未尝试免费2468Group Leakage Inflates Confidence TooWhy does entity overlap across train and test typically make confidence intervals and model-stability assessments look better than they really are?机器学习中等essay未尝试面试订阅2469Why Point-in-Time Feature Stores MatterA team says they can avoid leakage by using the latest vendor table everywhere because the values are more accurate. What core point about deployment reality are they missing?机器学习中等essay未尝试免费2476Infer the Slope From Covariance and Variance 6In a simple regression with an intercept, Cov(x,y)=12 and Var(x)=16. What is the OLS slope beta hat?机器学习简单数值题未尝试免费