INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题;编程题使用独立的 LeetCode 式 coding lab。

题目
4169
领域
8
当前筛选
89

2 / 5

非代码面试题

显示 20 / 89 道匹配题目

答题状态:未尝试未正确已正确
2465Why Nested Validation ExistsIf the same validation set is repeatedly used for model family choice, feature engineering, and threshold tuning, why is a second outer holdout or nested procedure conceptually necessary?机器学习困难essay未尝试面试订阅2466What to Audit in a Leakage ReviewYou are auditing a pipeline for leakage. Beyond checking the split line in the final dataframe, what is the highest-value thing to inspect in the code path?机器学习简单essay未尝试免费2467Unsupervised Preprocessing Can Still Distort EvaluationWhy can fitting an unsupervised step like PCA or quantile normalization on all rows still make the final reported test error too optimistic?机器学习简单essay未尝试免费2468Group Leakage Inflates Confidence TooWhy does entity overlap across train and test typically make confidence intervals and model-stability assessments look better than they really are?机器学习中等essay未尝试面试订阅2469Why Point-in-Time Feature Stores MatterA team says they can avoid leakage by using the latest vendor table everywhere because the values are more accurate. What core point about deployment reality are they missing?机器学习中等essay未尝试免费2470Rare Category Thresholding After Seeing Test CompositionSuppose you choose the minimum frequency for keeping a category only after inspecting how many rare categories appear in the test set. Why is that already a contaminated design choice?机器学习困难essay未尝试面试订阅2511Why L1 Produces Corners and Corners Produce Sparsity 11Why is the geometry of the L1 ball often used to explain why lasso creates sparse solutions?机器学习简单essay未尝试免费2513Why Correlated Features Frustrate Pure Lasso 17Why does pure lasso often behave erratically when several features are highly correlated and similarly predictive?机器学习中等essay未尝试面试订阅2574Why Bagging Helps Unstable Learners Most 10Why does bagging usually help deep trees much more than it helps already-stable learners?机器学习中等essay未尝试免费2579Infer Tree Correlation From the Variance Floor 23A single tree has variance 6, while an extremely large forest appears to level off at variance 1.8. What pairwise tree correlation rho is implied?机器学习中等数值题未尝试面试订阅2580Why More Trees Usually Do Not Create Classical Overfit 15Why does adding more trees to a random forest typically plateau rather than create the kind of explosive overfit seen in some single-model families?机器学习困难essay未尝试面试订阅2599Why Boosting Mostly Attacks Bias 9Why is boosting usually described as a bias-reduction method more than a variance-reduction method?机器学习中等essay未尝试免费2607Why Overly Deep Base Trees Can Cancel Shrinkage Discipline 15Why can a very deep base tree undermine the regularizing effect of a small learning rate?机器学习简单essay未尝试免费2646Model-Fit Count in a Nested CV SearchA team runs 5 outer folds. Inside each outer-training split, it evaluates 6 hyperparameter settings by 4-fold CV, then refits the chosen model once on the full outer-training split. How many total model fits are performed?机器学习简单数值题未尝试免费2647Why Grouped CV Beats Row-Wise CV for Repeated EntitiesWhy is row-wise cross-validation inappropriate when each entity appears many times and the model can recognize entity-specific signatures?机器学习中等essay未尝试免费2648Why Random k-Fold Is Invalid for Overlapping Rolling FeaturesWhy can random k-fold cross-validation be invalid when each feature vector uses a rolling 20-day history from a time series?机器学习简单essay未尝试免费2649How Many Expanding-Window Folds Fit in a Monthly Panel?You have 60 months of data. Each expanding-window fold uses 24 months for training, the next 6 months for validation, and then advances by 6 months. How many validation folds fit?机器学习中等数值题未尝试免费2650Usable Training Days After Purging and EmbargoA 100-day event-study sample uses a contiguous 20-day validation block in the middle. Labels look ahead 5 days, and you impose a 2-day embargo on each side of the validation block. How many days remain usable for training?机器学习困难数值题未尝试面试订阅2651Why Repeated CV Replicates Are Not Independent ExperimentsWhy should a practitioner be cautious about treating 20 repeated CV scores as if they were 20 independent experiments?机器学习简单essay未尝试免费2653Training Observations Per Fold in Grouped Cross-ValidationThere are 12 issuers and each issuer contributes 5 observations. In 3-fold grouped cross-validation, one fold holds out 4 issuers at a time. How many observations are used for training in each fold?机器学习中等数值题未尝试面试订阅