第 3 / 6 页
非代码面试题
显示 20 / 108 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
2511Why L1 Produces Corners and Corners Produce Sparsity 11Why is the geometry of the L1 ball often used to explain why lasso creates sparse solutions?机器学习简单essay未尝试免费2512Lasso Activation Threshold Numerically 18In an orthogonal coordinate with z = 7, what is the smallest lambda that forces the lasso coefficient to zero?机器学习中等数值题未尝试免费2513Why Correlated Features Frustrate Pure Lasso 17Why does pure lasso often behave erratically when several features are highly correlated and similarly predictive?机器学习中等essay未尝试面试订阅2514Equivalent L2 Radius in One Dimension 19In one dimension, if the ridge solution equals beta hat lambda, what radius t makes the constrained problem min RSS(beta) subject to |beta| <= t share the same optimizer?机器学习困难derivation未尝试面试订阅2515Why Small Lambda Means Weak Regularization 20Why does a very small lambda leave the regularized solution close to OLS?机器学习困难derivation未尝试面试订阅2516Coordinate-Descent Update for a Positive Orthogonal Lasso Coordinate 21In an orthogonal coordinate with d = 5, z = 11, and lambda = 3, what coefficient does one exact lasso coordinate-descent update return?机器学习简单数值题未尝试免费2517Why Elastic Net Is Often Preferred With Correlated Signals 22Why can elastic net be operationally more stable than pure lasso when many predictors travel together?机器学习简单essay未尝试免费2518Ridge Shrinkage Ratio Numerically 23In an orthogonal coordinate with d = 6 and lambda = 2, what fraction of the OLS coefficient remains under ridge?机器学习中等derivation未尝试面试订阅2519Why Hyperparameter Search Belongs Outside the Test Set 24Why is tuning lambda on the test set just as problematic here as in any other ML pipeline?机器学习中等essay未尝试面试订阅2520Why L1 and L2 Pull Differently Near Zero 25Why does L1 regularization create a stronger qualitative push toward exact zero than L2 regularization near the origin?机器学习困难derivation未尝试面试订阅2521Intercept-Only Logistic MLE 1For an intercept-only logistic model with n 1 positives and n 0 negatives, what fitted probability p hat maximizes the log-likelihood?机器学习简单derivation未尝试免费2522Intercept From the Positive Rate 2In an intercept-only logistic model, if the fitted probability is p hat, what intercept b solves sigma(b)=p hat?机器学习简单derivation未尝试免费2523Gradient of Logistic Negative Log-Likelihood 3For one observation (x,y) with y in 0,1 and score z = w T x, what is the gradient of the negative log-likelihood with respect to w?机器学习中等derivation未尝试免费2524Why No Closed Form in Logistic Regression 5Why does logistic regression usually require iterative optimization rather than a normal-equation-style closed form?机器学习中等essay未尝试免费2525One Newton Step for an Intercept-Only Logistic ModelAn intercept-only logistic model is fit to 7 positives and 3 negatives. Starting from b 0 = 0, what is one Newton step b 1 for minimizing the negative log-likelihood?机器学习困难数值题未尝试面试订阅2526Why Separable Data Pushes Coefficients Outward 7Why do logistic-regression coefficients tend to diverge on perfectly linearly separable data if no regularization is used?机器学习简单essay未尝试免费2527Probability From a Logit Score 8If a logistic model outputs score z = ln 3, what probability does it assign to class 1?机器学习中等数值题未尝试免费2528Why Log-Loss Rewards Calibration 9Why does a well-calibrated probability forecaster typically fare better under log-loss than a forecaster that only gets rankings right?机器学习中等essay未尝试免费2529Why Regularization Helps Even When Logistic Is Convex 11If logistic loss is already convex, why can regularization still be crucial in practice?机器学习中等essay未尝试面试订阅2531Why One-vs-Rest Scores Need Not Sum to One 15Why can independently trained one-vs-rest logistic classifiers produce class probabilities that do not sum to one?机器学习简单essay未尝试免费