INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题；编程题使用独立的 LeetCode 式 coding lab。

做诊断按领域练习按面试风格练习代码题库

题目: 4169
领域: 8
当前筛选: 91

第 3 / 5 页

非代码面试题

显示 20 / 91 道匹配题目

答题状态：未尝试未正确已正确

ID题目领域难度题型进度权限

2439Why Asymmetric Loss Moves the Target Away From the MeanWhy does an asymmetric loss generally make the optimal constant prediction move away from the mean of the target distribution?机器学习困难essay未尝试面试订阅 2475Why Duplicate Features Cause Non-Unique Coefficients 5Why do two perfectly duplicated features make the OLS coefficient vector non-unique even though fitted predictions can stay unique?机器学习困难essay未尝试面试订阅 2479Why Multicollinearity Hurts Coefficient Stability More Than Fit 10Why can severe multicollinearity make coefficients unstable even when training predictions barely change?机器学习中等essay未尝试面试订阅 2480Orthogonal Features Give Coordinatewise Coefficients 9Suppose two features x1 and x2 are centered and orthogonal. Derive the OLS coefficients in terms of x1 T y, x2 T y, ||x1|| 2, and ||x2|| 2.机器学习困难derivation未尝试面试订阅 2490Why OLS Can Still Predict Well Under Misspecification 20Why can OLS remain a useful predictor even when the true data-generating process is not exactly linear?机器学习困难essay未尝试面试订阅 2492Why Feature Scaling Helps Gradient Descent More Than Closed Form 22Why is feature scaling often crucial for gradient-descent training of OLS even though the closed-form solution itself is scale-equivariant?机器学习简单essay未尝试免费 2499Soft-Thresholded Lasso Coefficient 4In an orthogonal one-feature problem with x T x = d and x T y = z > 0, derive the lasso coefficient when 0 < lambda < z.机器学习中等derivation未尝试面试订阅 2506Why Standardization Matters for Lasso 8Why can lasso unfairly prefer one feature over another if raw feature scales are left unstandardized?机器学习简单essay未尝试免费 2508Why Elastic Net Keeps the Lasso Threshold but Adds Ridge Shrinkage 14Why does elastic net still need |z| to clear an L1 threshold before a coordinate activates, but then shrink the active coefficient more than lasso does?机器学习中等derivation未尝试面试订阅 2510Zero Lambda Recovers OLS 16Why do ridge and lasso both reduce to OLS when their regularization parameter is set to zero?机器学习困难derivation未尝试面试订阅 2511Why L1 Produces Corners and Corners Produce Sparsity 11Why is the geometry of the L1 ball often used to explain why lasso creates sparse solutions?机器学习简单essay未尝试免费 2513Why Correlated Features Frustrate Pure Lasso 17Why does pure lasso often behave erratically when several features are highly correlated and similarly predictive?机器学习中等essay未尝试面试订阅 2528Why Log-Loss Rewards Calibration 9Why does a well-calibrated probability forecaster typically fare better under log-loss than a forecaster that only gets rankings right?机器学习中等essay未尝试免费 2538Why Logistic Beats Hard Threshold Rules for Training 23Why is a smooth probabilistic loss easier to optimize than training directly against a hard classification rule?机器学习中等essay未尝试免费 2559Expected Misroutes From a Surrogate SplitA surrogate split agrees with the primary split on 34 of 40 training cases where both features are present. If 12 production cases are missing the primary split feature and are routed by the surrogate, what is the expected number of misroutes?机器学习困难derivation未尝试面试订阅 2564Validation Penalty Threshold for Keeping a SplitA stump has validation loss 30. Splitting it into two leaves lowers validation loss to 22 but adds an instability penalty lambda per extra leaf. For what largest lambda is the split still preferred?机器学习困难derivation未尝试面试订阅 2567Why Two Nearly-Tied First Splits Can Diverge Later 13Why can two root splits with almost identical immediate gain still lead to very different final trees?机器学习简单essay未尝试免费 2569Why Axis-Aligned Trees Struggle on Rotated Boundaries 14Why can a decision tree need many small rectangles to approximate a simple diagonal boundary?机器学习中等essay未尝试面试订阅 2571Variance of an Average of Correlated Trees 1Suppose B trees each have variance sigma 2 and every pair has correlation rho. Derive the variance of their simple average.机器学习简单derivation未尝试免费 2573Infinite-Forest Variance Floor 2Using the equicorrelated-tree variance formula, derive the prediction variance as the number of trees B tends to infinity.机器学习中等derivation未尝试免费