INTERVIEW PREP

数学与非代码面试题

覆盖数学、概率、统计、脑筋急转弯、机器学习和金融。这里负责筛选和进入单题；编程题使用独立的 LeetCode 式 coding lab。

做诊断按领域练习按面试风格练习代码题库

题目: 4169
领域: 8
当前筛选: 4169

第 144 / 209 页

非代码面试题

显示 20 / 4169 道匹配题目

答题状态：未尝试未正确已正确

ID题目领域难度题型进度权限

4327CNN Depth For Longer HorizonA stride-1 CNN uses kernel size 3 and no dilation. To cover a dependency horizon of 9 steps you need 4 layers. If the required horizon rises to 41 steps, how many layers are needed, and what does that imply about the architecture pressure?机器学习中等essay未尝试面试订阅 4328Small-Data Regime ShiftSuppose the task stays strongly local and translation-equivariant, but your labeled dataset shrinks by a factor of 10. Which architecture becomes more attractive, and why does the shift in data regime matter?机器学习中等essay未尝试面试订阅 4329Latency Budget RelaxationA task was originally fully online, making recurrence or causal convolution preferable. If the deployment changes to offline batch scoring with the whole sequence available, which architecture family gains the most from that relaxation?机器学习中等essay未尝试面试订阅 4330From Local To Global Task StructureA prediction problem used to depend on short motifs, but after a product change the label now depends on matching information from the first and last quarter of each sequence. Which architecture family should move up the ranking?机器学习中等essay未尝试面试订阅 4331What To Quantify FirstBefore you choose between a CNN, RNN, and Transformer for a new sequence task, what two structural quantities should you quantify first?机器学习中等essay未尝试面试订阅 4332Before Picking Transformer By DefaultA teammate wants to start with a Transformer because it won the last benchmark. What is the first counter-question you should ask?机器学习中等essay未尝试面试订阅 4333Before Discarding RNNsWhy should you hesitate before ruling out RNNs entirely in a trading-system pipeline?机器学习中等essay未尝试面试订阅 4334Before Using CNNWhat is the first structural property you should verify before leaning on a CNN as your main architecture?机器学习中等essay未尝试面试订阅 4335Hybrid ThinkingIf you suspect the task has both strong local motifs and occasional long-range dependencies, what should be your first decomposition step before arguing about model family?机器学习中等essay未尝试面试订阅 4336Why CNN Can WinWhy can a modest CNN beat a larger Transformer on a small-data task whose label depends mainly on short local patterns?机器学习中等essay未尝试面试订阅 4337Why RNN Still MattersWhy might an RNN still be the practical choice for a production event-stream model even if Transformers benchmark better offline?机器学习中等essay未尝试面试订阅 4338When Attention Earns Its CostWhat kind of task structure makes the quadratic cost of attention worth paying?机器学习中等essay未尝试面试订阅 4339Why Architecture Mismatch HurtsWhy can architecture mismatch dominate parameter count when performance is poor?机器学习中等essay未尝试面试订阅 4340Hybrid Versus PureWhen is it more sensible to consider a hybrid architecture instead of insisting on a pure CNN, pure RNN, or pure Transformer?机器学习中等essay未尝试面试订阅 4341Confusion-Matrix Metrics 1At a fixed threshold, prevalence is 20%, TPR is 80%, and FPR is 10%. What precision does that imply?机器学习简单数值题未尝试面试订阅 4342Confusion-Matrix Metrics 2A fraud model keeps TPR = 0.90 and FPR = 0.03 when deployed into a market where prevalence falls from 10% to 2%. What precision should you now expect at the same threshold?机器学习简单数值题未尝试面试订阅 4343Confusion-Matrix Metrics 3Predicted probabilities are [0.8, 0.6, 0.3, 0.1] and labels are [1, 0, 1, 0]. What is the Brier score?机器学习简单数值题未尝试面试订阅 4344Confusion-Matrix Metrics 4For expected calibration error with equal sample weighting, you have two nonempty bins. Bin A has probabilities [0.2, 0.3] and labels [0, 1]. Bin B has probabilities [0.8, 0.9] and labels [1, 1]. Using ECE = sum over bins of (bin fraction)*|avg confidence - accuracy|, what ECE do you get?机器学习简单数值题未尝试面试订阅 4345Confusion-Matrix Metrics 5A model assigns an average predicted probability of 0.18 to a bucket containing 200 names. If the model is calibrated, how many positives should you expect in that bucket on average?机器学习简单数值题未尝试面试订阅 4346Brier Score Snapshot 1At one threshold, prevalence is 5%, TPR is 80%, and FPR is 10%. What PR-space point (recall, precision) corresponds to that ROC-space operating point?机器学习中等数值题未尝试面试订阅