INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题；编程题使用独立的 LeetCode 式 coding lab。

做诊断按领域练习按面试风格练习代码题库

题目: 4169
领域: 8
当前筛选: 622

第 3 / 32 页

非代码面试题

显示 20 / 622 道匹配题目

答题状态：未尝试未正确已正确

ID题目领域难度题型进度权限

2436Why the Business Objective May Differ From the Training Loss 23Why can it be rational to train a model under one loss and evaluate the final decision under a different business metric?机器学习简单essay未尝试免费 2437Why Huber Sits Between Squared and Absolute LossWhy is Huber loss often described as sitting 'between' squared loss and absolute loss?机器学习中等essay未尝试面试订阅 2438Why Convexity Makes Averaging Predictions SafeWhy does convexity of a loss function support the intuition that averaging similar predictors often cannot hurt too much?机器学习困难essay未尝试面试订阅 2439Why Asymmetric Loss Moves the Target Away From the MeanWhy does an asymmetric loss generally make the optimal constant prediction move away from the mean of the target distribution?机器学习困难essay未尝试面试订阅 2440Why Heavy-Tailed Noise Pushes You Away From Pure Squared Loss 15Why is pure squared loss often a poor default when the residual distribution has rare but huge outliers?机器学习中等derivation未尝试面试订阅 2441Weighted Log-Loss Bayes Probability Numerically 20If p = 0.3, alpha = 4, and beta = 1 in weighted log-loss, what Bayes probability q* is optimal?机器学习简单数值题未尝试免费 2442Why Proper Losses Matter Beyond Ranking 24Why is a strictly proper probabilistic loss valuable even when the final system will later choose its own operating threshold?机器学习简单essay未尝试免费 2443Weighted Log-Loss Moves the Bayes Probability Toward the Costlier Class 16Why does class-weighted log-loss shift the optimal reported probability toward the class with the larger weight?机器学习中等derivation未尝试面试订阅 2444Why Quantile Loss Is Useful in Risk Forecasting 17Why is pinball loss natural when the target is a VaR-like forecast rather than a mean forecast?机器学习中等derivation未尝试面试订阅 2445Why Tail Forecasts Need Tail-Aligned Losses 25Why is it often a mistake to optimize plain squared loss when the operational task really cares about an extreme tail quantile?机器学习困难essay未尝试面试订阅 2446Hidden Validation Positives Implied by a Leaky Target EncoderA category appears 40 times in train with 18 positives and 10 times in validation. A target encoder is incorrectly fit on train plus validation and outputs 0.56 for that category. How many validation positives did the encoder implicitly use?机器学习简单数值题未尝试免费 2447Shift in the Training Mean After Fitting a Scaler on All RowsA feature has training mean 10 over 80 rows and test mean 14 over 20 rows. A scaler is wrongly fit on all 100 rows and uses standard deviation 5. What is the average standardized value of the training block under that leaked fit?机器学习简单数值题未尝试免费 2448Held-Out Base Rate Implied by a Full-Sample Class WeightA training set has 100 labels with 30 positives. A class-weighting routine is mistakenly fit on all 125 labels and reports an overall positive rate of 0.36. What is the positive rate in the 25 held-out labels?机器学习中等数值题未尝试面试订阅 2449Issuer Demeaning That Quietly Uses Held-Out RowsFor one issuer, the three training rows sum to 12. A pipeline mistakenly demeans by the full-sample issuer mean 3.6 computed from five rows total. What is the sum of the two held-out rows for that issuer?机器学习中等数值题未尝试面试订阅 2451Rare Category Survival Caused by Held-Out RowsA categorical preprocessor keeps a level only if it appears at least 5 times. In train alone, level Z appears 4 times. After the preprocessor is wrongly fit on the full sample, level Z appears to have frequency 7 and is kept. How many held-out Z rows caused the leak?机器学习简单数值题未尝试免费 2452Future Restatements Merged Into Historical FeaturesA researcher joins fundamentals after they were restated months later, then backtests on the original trade dates. Why is this a split-discipline failure even if no test labels were touched?机器学习中等essay未尝试面试订阅 2453Winsor Caps Chosen on the Full PanelA pipeline clips a spread feature at the 1st and 99th percentiles computed on the full panel before the split. Why can this still bias the reported test score even though the clipping rule is unsupervised?机器学习中等essay未尝试面试订阅 2454Feature Screening Before the SplitA team ranks 5,000 candidate features by correlation with the target on the full dataset, keeps the top 30, and only then creates train and test. Why is the later split not enough to rescue the experiment?机器学习中等essay未尝试面试订阅 2455Repeated Validation Peeking During ResearchA researcher keeps trying new transformations and only retains the ones that improve the same validation score. Why does the validation set stop being a clean model-selection tool?机器学习困难essay未尝试面试订阅 2456Row Split Instead of Issuer SplitEach issuer contributes many dated observations. Why can a random row split overstate performance even when the target is defined separately on each date?机器学习简单essay未尝试免费