INTERVIEW PREP

Math and Non-Coding Interview Questions

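Covers math, probability, statistics, brain teasers, machine learning, and finance. This page is for filtering questions and opening individual ones; programming questions use a separate LeetCode-style coding lab.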


Questions: 4169
Domains: 8
Currently filtered: 108

Page 3 of 6

Non-Coding Interview Questions

Showing 20 of 108 matching questions


| ID | Question | Domain | Difficulty | Type | Progress | Access |
| --- | --- | --- | --- | --- | --- | --- |
| 2511 | Why L1 Produces Corners and Corners Produce Sparsity 11: Why is the geometry of the L1 ball often used to explain why lasso creates sparse solutions? | Machine Learning | Easy | essay | Not attempted | Free |
| 2512 | Lasso Activation Threshold Numerically 18: In an orthogonal coordinate with $z = 7$, what is the smallest $\lambda$ that forces the lasso coefficient to zero? | Machine Learning | Medium | numeric | Not attempted | Free |
| 2513 | Why Correlated Features Frustrate Pure Lasso 17: Why does pure lasso often behave erratically when several features are highly correlated and similarly predictive? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 2514 | Equivalent L2 Radius in One Dimension 19: In one dimension, if the ridge solution equals $\hat{\beta}_\lambda$, what radius $t$ makes the constrained problem $\min \mathrm{RSS}(\beta)$ subject to $\lvert\beta\rvert \le t$ share the same optimizer? | Machine Learning | Hard | derivation | Not attempted | Interview subscription |
| 2515 | Why Small Lambda Means Weak Regularization 20: Why does a very small $\lambda$ leave the regularized solution close to OLS? | Machine Learning | Hard | derivation | Not attempted | Interview subscription |
| 2516 | Coordinate-Descent Update for a Positive Orthogonal Lasso Coordinate 21: In an orthogonal coordinate with $d = 5$, $z = 11$, and $\lambda = 3$, what coefficient does one exact lasso coordinate-descent update return? | Machine Learning | Easy | numeric | Not attempted | Free |
| 2517 | Why Elastic Net Is Often Preferred With Correlated Signals 22: Why can elastic net be operationally more stable than pure lasso when many predictors travel together? | Machine Learning | Easy | essay | Not attempted | Free |
| 2518 | Ridge Shrinkage Ratio Numerically 23: In an orthogonal coordinate with $d = 6$ and $\lambda = 2$, what fraction of the OLS coefficient remains under ridge? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 2519 | Why Hyperparameter Search Belongs Outside the Test Set 24: Why is tuning $\lambda$ on the test set just as problematic here as in any other ML pipeline? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 2520 | Why L1 and L2 Pull Differently Near Zero 25: Why does L1 regularization create a stronger qualitative push toward exact zero than L2 regularization near the origin? | Machine Learning | Hard | derivation | Not attempted | Interview subscription |
| 2521 | Intercept-Only Logistic MLE 1: For an intercept-only logistic model with $n_1$ positives and $n_0$ negatives, what fitted probability $\hat{p}$ maximizes the log-likelihood? | Machine Learning | Easy | derivation | Not attempted | Free |
| 2522 | Intercept From the Positive Rate 2: In an intercept-only logistic model, if the fitted probability is $\hat{p}$, what intercept $b$ solves $\sigma(b) = \hat{p}$? | Machine Learning | Easy | derivation | Not attempted | Free |
| 2523 | Gradient of Logistic Negative Log-Likelihood 3: For one observation $(x, y)$ with $y \in \{0, 1\}$ and score $z = w^\top x$, what is the gradient of the negative log-likelihood with respect to $w$? | Machine Learning | Medium | derivation | Not attempted | Free |
| 2524 | Why No Closed Form in Logistic Regression 5: Why does logistic regression usually require iterative optimization rather than a normal-equation-style closed form? | Machine Learning | Medium | essay | Not attempted | Free |
| 2525 | One Newton Step for an Intercept-Only Logistic Model: An intercept-only logistic model is fit to 7 positives and 3 negatives. Starting from $b_0 = 0$, what is one Newton step $b_1$ for minimizing the negative log-likelihood? | Machine Learning | Hard | numeric | Not attempted | Interview subscription |
| 2526 | Why Separable Data Pushes Coefficients Outward 7: Why do logistic-regression coefficients tend to diverge on perfectly linearly separable data if no regularization is used? | Machine Learning | Easy | essay | Not attempted | Free |
| 2527 | Probability From a Logit Score 8: If a logistic model outputs score $z = \ln 3$, what probability does it assign to class 1? | Machine Learning | Medium | numeric | Not attempted | Free |
| 2528 | Why Log-Loss Rewards Calibration 9: Why does a well-calibrated probability forecaster typically fare better under log-loss than a forecaster that only gets rankings right? | Machine Learning | Medium | essay | Not attempted | Free |
| 2529 | Why Regularization Helps Even When Logistic Is Convex 11: If logistic loss is already convex, why can regularization still be crucial in practice? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 2531 | Why One-vs-Rest Scores Need Not Sum to One 15: Why can independently trained one-vs-rest logistic classifiers produce class probabilities that do not sum to one? | Machine Learning | Easy | essay | Not attempted | Free |