第 4 / 32 页
非代码面试题
显示 20 / 622 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
2457PCA Fit Once Before Cross-ValidationA notebook computes PCA on the full feature matrix and then feeds the resulting components into every cross-validation fold. Why is that not a harmless speed optimization?机器学习简单essay未尝试免费2458Choosing Early Stopping by the Test CurveA team trains one model, plots test loss by boosting round, and reports the round with the best test value. Why is the final test score no longer a valid final check?机器学习中等essay未尝试面试订阅2459Using Revised Index Membership in Historical FilteringA backtest filters the universe using current index membership and then evaluates historical predictions on that restricted universe. Why is this also a train/test discipline problem?机器学习困难essay未尝试面试订阅2460Validation Used Until One Model Wins by LuckTwo candidate models are close. A researcher keeps slightly changing seeds and preprocessing until one model wins on the same validation slice. Why should the apparent win be discounted?机器学习困难essay未尝试面试订阅2461Learning Rare-Category Merges From Future FeaturesNo labels are used, but the preprocessing step decides which rare sectors to merge by looking at category frequencies on the full dataset. Why can that still make the evaluation optimistic?机器学习简单essay未尝试免费2462Peer Average Features That Include Held-Out TargetsA feature for each bond is the average realized default rate of bonds from the same issuer-year bucket, computed over the full sample. Why is this worse than ordinary scaling leakage?机器学习中等essay未尝试面试订阅2463Reusing the Test Set After DebuggingA model is evaluated on test, a bug is found, the code is fixed, and the same test set is used again to verify the fix and choose among two corrected versions. Why is that second use no longer a clean test?机器学习中等essay未尝试面试订阅2464No Test Labels Touched Is Not EnoughSomeone argues there was no leakage because the code never accessed test labels. Give the core reason this defense can fail in real ML pipelines.机器学习困难essay未尝试面试订阅2465Why Nested Validation ExistsIf the same validation set is repeatedly used for model family choice, feature engineering, and threshold tuning, why is a second outer holdout or nested procedure conceptually necessary?机器学习困难essay未尝试面试订阅2466What to Audit in a Leakage ReviewYou are auditing a pipeline for leakage. Beyond checking the split line in the final dataframe, what is the highest-value thing to inspect in the code path?机器学习简单essay未尝试免费2467Unsupervised Preprocessing Can Still Distort EvaluationWhy can fitting an unsupervised step like PCA or quantile normalization on all rows still make the final reported test error too optimistic?机器学习简单essay未尝试免费2468Group Leakage Inflates Confidence TooWhy does entity overlap across train and test typically make confidence intervals and model-stability assessments look better than they really are?机器学习中等essay未尝试面试订阅2469Why Point-in-Time Feature Stores MatterA team says they can avoid leakage by using the latest vendor table everywhere because the values are more accurate. What core point about deployment reality are they missing?机器学习中等essay未尝试免费2470Rare Category Thresholding After Seeing Test CompositionSuppose you choose the minimum frequency for keeping a category only after inspecting how many rare categories appear in the test set. Why is that already a contaminated design choice?机器学习困难essay未尝试面试订阅2471Slope From Centered Sufficient Statistics 1In simple OLS with an intercept, if the centered sufficient statistics satisfy sum i (x i-xbar)(y i-ybar)=S xy and sum i (x i-xbar) 2=S xx, derive beta hat.机器学习简单derivation未尝试免费2472Intercept From Sample Means and Slope 2Derive the OLS intercept in simple regression with an intercept once beta hat is known.机器学习简单derivation未尝试免费2473Scaling One Feature Rescales Its Coefficient 3If a feature x is replaced by x new = c x in an OLS model with an intercept, how does its fitted coefficient change when all fitted values are kept identical?机器学习中等derivation未尝试面试订阅2474Shifting the Response by a Constant 4If every target value is replaced by y i + k in an OLS model with an intercept, what happens to the fitted slope and intercept?机器学习中等derivation未尝试面试订阅2475Why Duplicate Features Cause Non-Unique Coefficients 5Why do two perfectly duplicated features make the OLS coefficient vector non-unique even though fitted predictions can stay unique?机器学习困难essay未尝试面试订阅2476Infer the Slope From Covariance and Variance 6In a simple regression with an intercept, Cov(x,y)=12 and Var(x)=16. What is the OLS slope beta hat?机器学习简单数值题未尝试免费