INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题；编程题使用独立的 LeetCode 式 coding lab。

做诊断按领域练习按面试风格练习代码题库

题目: 4169
领域: 8
当前筛选: 78

第 3 / 4 页

非代码面试题

显示 20 / 78 道匹配题目

答题状态：未尝试未正确已正确

ID题目领域难度题型进度权限

2599Why Boosting Mostly Attacks Bias 9Why is boosting usually described as a bias-reduction method more than a variance-reduction method?机器学习中等essay未尝试免费 2602Why Early Stopping Matters Even if Train Loss Falls 12Why can validation performance start to deteriorate even while the training objective of boosting keeps improving?机器学习中等essay未尝试免费 2604Why Label Noise Is Especially Toxic 13Why does boosting often suffer badly when labels are noisy?机器学习中等essay未尝试免费 2607Why Overly Deep Base Trees Can Cancel Shrinkage Discipline 15Why can a very deep base tree undermine the regularizing effect of a small learning rate?机器学习简单essay未尝试免费 2608Residual After Two Shrunken Updates 24A point currently has residual 6. Two boosting rounds hit its region with leaf updates 1.5 and 0.8, using learning rate eta=0.2 in both rounds. What residual remains after the two rounds?机器学习中等数值题未尝试免费 2610Scale-Update Invariance Between Eta and Gamma 6Why does multiplying every leaf update gamma m by c and dividing the learning rate eta by c leave the final additive score unchanged?机器学习困难derivation未尝试面试订阅 2611Why Boosting Parallelizes Worse Than Random Forests 16Why is boosting fundamentally harder to parallelize across rounds than random forests?机器学习简单essay未尝试免费 2613L2-Regularized Region Update 7In one boosting region, choose a constant update gamma to minimize sum i in R (r i-gamma) 2 + lambda gamma 2. Let S = sum i in R r i and n = |R|. Derive gamma.机器学习困难derivation未尝试面试订阅 2614Why the Initial Prediction Matters 18Why can the choice of the initial prediction F 0 matter for the early trajectory of boosting?机器学习中等essay未尝试面试订阅 2615Why Calibration Can Degrade Before Ranking 19Why can late-stage boosting sometimes keep ranking examples well while making the predicted scores less well calibrated?机器学习困难essay未尝试面试订阅 2616Why Leaf-Wise Growth Can Be Higher Variance 20Why can leaf-wise tree growth be more variance-prone than level-wise growth inside a boosting system?机器学习简单essay未尝试免费 2617Two-Region Two-Round Boosting Calculation 25A boosting model starts from F 0=0 with learning rate eta=0.1. In round 1, region A gets update +2 and region B gets update -1. In round 2, region A gets update -0.5 and region B gets update +0.25. What are the final predictions for a point that always stays in region A and a point that always stays in region B?机器学习简单数值题未尝试免费 2618Why Many Small Corrections Can Beat One Big Tree 21Why can an additive sequence of small boosting steps outperform a single large tree with similar in-sample flexibility?机器学习中等essay未尝试面试订阅 2619Why Flat Late-Round Validation Gains Still Suggest Stopping 22If the validation gain per boosting round becomes tiny and erratic late in training, why is that often a strong argument for stopping?机器学习中等essay未尝试面试订阅 2620A Bound on Total Function Movement 8Suppose every boosting round changes any one point's prediction by at most eta A in absolute value. What upper bound does this imply on the total prediction movement after M rounds?机器学习困难derivation未尝试面试订阅 4216Normalized MDI Share 1A random forest reports total mean-decrease-in-impurity contributions spread=0.42, imbalance=0.21, id bucket=0.07. What are the normalized importance shares, and which feature ranks first?机器学习简单数值题未尝试面试订阅 4217Normalized MDI Share 2A model has baseline validation AUC 0.62. After permuting three features separately, AUC becomes 0.57 for value signal, 0.60 for momentum, and 0.61 for zip code. What permutation-importance drops do these imply, and which feature ranks first?机器学习简单数值题未尝试面试订阅 4218Normalized MDI Share 3A sector feature is represented by three one-hot columns with impurity-gain importances 0.04, 0.03, and 0.01. Two other features have importances 0.05 and 0.07. If you aggregate the one-hot block into a single group, what are the normalized group shares and which group ranks first?机器学习简单数值题未尝试面试订阅 4219Normalized MDI Share 4Two trees contribute split gains to features A and B. Tree 1 contributes A=12, B=5. Tree 2 contributes A=8, B=10. What are the total normalized gain importances for A and B?机器学习简单数值题未尝试面试订阅 4220Normalized MDI Share 5A model has baseline log loss 0.400. After permuting feature X, log loss rises to 0.455; after permuting feature Y, it rises to 0.420. What are the permutation importances under a log-loss metric, and which feature is more important?机器学习简单数值题未尝试面试订阅