Non-Coding Interview Questions
Showing 20 of 453 matching questions
| ID | Question | Domain | Difficulty | Type | Progress | Access |
|---|---|---|---|---|---|---|
| 4225 | Grouped Permutation Drop Pattern 5: A feature's permutation importance is measured as baseline accuracy minus permuted accuracy. Under validation set A the numbers are 0.80 and 0.78; under noisier validation set B they are 0.74 and 0.72. What is the relative drop as a percentage of baseline in each case? | Machine Learning | Medium | Numerical | Not attempted | Interview subscription |
| 4226 | High-Cardinality ID Trap: A random forest says a hashed customer ID is the most important feature by impurity decrease, even though the validation permutation drop is almost zero. What is the most likely trap? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4227 | Leakage Proxy Trap: A model ranks a 'days since settlement' field as highly important in a fraud predictor, but that field is only known after the case outcome becomes visible. What is wrong with reading that as genuine predictive importance? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4228 | Proxy Feature Trap: A tree model gives most importance to ZIP code instead of the underlying income and region variables. Why should you be cautious before concluding ZIP code is the true driver? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4229 | Correlation Split Credit Trap: Two nearly identical features alternate as top splitters across different random seeds. Does that mean the signal is unstable? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4230 | Negative Permutation Importance: A weak feature shows slightly negative permutation importance on a finite validation set. Should you immediately conclude it is genuinely anti-predictive? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4231 | Grouped Permutation Remedy: If several sector dummy variables move together and share the same economic information, what diagnostic is often better than permuting one dummy at a time? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4232 | Time-Split Remedy for Leakage Risk: A feature may be available only with reporting delay. What evaluation setup is more convincing than random train/test splitting? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4233 | Retrain-After-Drop Check: Why can 'drop feature X and retrain' tell a different story from permutation importance on the original fitted model? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4234 | Conditional Importance Remedy: If a feature is highly correlated with others, what is the point of conditional importance rather than plain marginal permutation? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4235 | Stability Remedy: If importance rankings swing wildly across folds, what is the right reaction? | Machine Learning | Medium | derivation | Not attempted | Interview subscription |
| 4236 | Importance Is Not Causality: Why is it dangerous to treat feature importance as if it were a causal ranking? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 4237 | Why Trees Overcredit Splittable Features: Why do impurity-based importances tend to overcredit features with many possible split points? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 4238 | Why Correlation Makes Rankings Fragile: Why do strongly correlated features make importance rankings fragile? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 4239 | Why You Need Multiple Importance Views: Why is it often wise to look at more than one feature-importance diagnostic? | Machine Learning | Medium | essay | Not attempted | Interview subscription |
| 4241 | First Principal Direction from 2x2 Covariance 1: A centered two-feature dataset has covariance matrix [[4.2, 1.6], [1.6, 1.8]]. What is the first principal-component direction and its variance? | Machine Learning | Medium | Numerical | Not attempted | Interview subscription |
| 4246 | Explained Variance Ratio 1: PCA on a covariance matrix yields eigenvalues 12, 3, and 1. What fraction of total variance is explained by the first principal component? | Machine Learning | Easy | Numerical | Not attempted | Interview subscription |
| 4247 | Explained Variance Ratio 2: PCA produces eigenvalues 12, 3, and 1. What is the smallest number of principal components needed to explain at least 90% of the variance? | Machine Learning | Easy | Numerical | Not attempted | Interview subscription |
| 4248 | Explained Variance Ratio 3: A centered point is x=(3,1), and the first principal-component loading is v=(2,1)/sqrt(5). What is the PC1 score of x? | Machine Learning | Easy | Numerical | Not attempted | Interview subscription |
| 4249 | Explained Variance Ratio 4: A rank-1 PCA approximation keeps only score 4 on loading vector v=(1,2)/sqrt(5). What reconstructed centered point does it produce? | Machine Learning | Easy | Numerical | Not attempted | Interview subscription |
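
The numerical items in this list rest on a handful of short formulas: relative permutation drop, explained variance ratio, the leading eigenpair of a symmetric 2x2 covariance matrix, a principal-component score, and a rank-1 reconstruction. The following is a minimal Python sketch of that arithmetic, offered as a way to check one's own working rather than as the question bank's reference solution. All function names are hypothetical and do not come from the source; the 2x2 helper assumes a symmetric matrix with a nonzero off-diagonal entry.

```python
import math

def relative_drop(baseline, permuted):
    """Permutation importance (baseline - permuted) as a fraction of baseline."""
    return (baseline - permuted) / baseline

def explained_variance_ratio(eigenvalues):
    """Share of total variance carried by each eigenvalue."""
    total = sum(eigenvalues)
    return [ev / total for ev in eigenvalues]

def components_needed(eigenvalues, threshold):
    """Smallest k whose top-k eigenvalues reach `threshold` of total variance."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for k, ev in enumerate(ordered, start=1):
        running += ev
        if running / total >= threshold:
            return k
    return len(ordered)

def first_pc_2x2(a, b, d):
    """Largest eigenvalue and unit eigenvector of [[a, b], [b, d]].

    Assumption: the matrix is symmetric and b != 0.
    """
    mean = (a + d) / 2
    radius = math.hypot((a - d) / 2, b)
    lam = mean + radius
    # (b, lam - a) solves (a - lam) * x + b * y = 0
    vx, vy = b, lam - a
    norm = math.hypot(vx, vy)
    return lam, (vx / norm, vy / norm)

def pc_score(x, v):
    """Projection of a centered point x onto a unit loading vector v."""
    return sum(xi * vi for xi, vi in zip(x, v))

def rank1_reconstruction(score, v):
    """Centered point recovered from a single score and its loading vector."""
    return [score * vi for vi in v]

if __name__ == "__main__":
    s5 = math.sqrt(5)
    print(relative_drop(0.80, 0.78), relative_drop(0.74, 0.72))
    print(explained_variance_ratio([12, 3, 1]), components_needed([12, 3, 1], 0.90))
    print(first_pc_2x2(4.2, 1.6, 1.8))
    print(pc_score((3, 1), (2 / s5, 1 / s5)))
    print(rank1_reconstruction(4, (1 / s5, 2 / s5)))
```

Plain Python is used here so the sketch runs without third-party packages; for larger matrices a routine such as `numpy.linalg.eigh` would be the usual choice.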