INTERVIEW PREP

Math and Non-Coding Interview Questions

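Covers math, probability, statistics, brain teasers, machine learning, and finance. This page handles filtering and opening individual questions; coding problems use a separate LeetCode-style coding lab.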


Questions: 4169
Domains: 8
Matching current filter: 622


Non-Coding Interview Questions

Showing 20 of 622 matching questions


ID | Title | Domain | Difficulty | Type | Progress | Access

4151 | Generative Classification with a Missing Feature 1 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A two-feature naive Bayes model was trained generatively, but at prediction time X2 is missing. Prior P(Y=1)=0.5, P(X1=1|Y=1)=0.8, P(X1=1|Y=0)=0.3, P(X2=1|Y=1)=0.75, P(X2=1|Y=0)=0.4. You only observe X1=1. What posterior P(Y=1|X1) should the generative model use?

4156 | Small Labeled Sample with Plausible Structure | Machine Learning | Medium | Derivation | Not attempted | Interview subscription
For an observation x, a generative model summarizes the evidence as likelihood ratio p(x|Y=1)/p(x|Y=0) = 5. If the prior probability of class 1 is 0.2, what posterior probability P(Y=1|x) follows, and what 0.5-threshold decision does that imply?

4157 | Lots of Labels but Misspecified Density Story | Machine Learning | Medium | Derivation | Not attempted | Interview subscription
For an observation x, a generative model summarizes the evidence as likelihood ratio p(x|Y=1)/p(x|Y=0) = 0.5. If the prior probability of class 1 is 0.4, what posterior probability P(Y=1|x) follows, and what 0.5-threshold decision does that imply?

4161 | Why Naive Bayes Can Work Despite Wrong Independence | Machine Learning | Medium | Essay | Not attempted | Interview subscription
You have only a few hundred labeled observations, but domain knowledge gives a plausible class-conditional structure and you also have many unlabeled feature vectors. Would you start with a generative or a discriminative model first?

4162 | Why Discriminative Models Often Win Asymptotically | Machine Learning | Medium | Essay | Not attempted | Interview subscription
A classifier was trained last quarter, and now only the class prevalence has shifted while the conditional shape of features given class appears stable. Which side, generative or discriminative, is easier to adjust quickly?

4163 | Why Generative Models Handle Missingness More Naturally | Machine Learning | Medium | Essay | Not attempted | Interview subscription
You have millions of labeled examples and care only about predictive accuracy on the deployed label, not about simulating x. Which side usually deserves the first try?

4164 | Why Generative and Discriminative Can Share a Boundary | Machine Learning | Medium | Essay | Not attempted | Interview subscription
A production system frequently has one sensor missing at test time, but your model family can factor the joint feature likelihood cleanly. Which side gets a practical advantage?

4165 | A Fast Sanity Check for Gen-vs-Disc Questions | Machine Learning | Medium | Essay | Not attempted | Interview subscription
The research desk wants not only labels but also synthetic feature draws conditional on each class for stress testing. Which side is the more natural starting point?

4166 | Centering-and-Scaling Coefficient Rewrite 1 | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A linear model is y = 1.5 + 2 x. You now replace x by the engineered feature z=(x-10)/2. What intercept and slope make the model equivalent when written as y = a + b z?

4167 | Centering-and-Scaling Coefficient Rewrite 2 | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A linear model is y = -0.5 + 1.2 x. You now replace x by the engineered feature z=(x-5)/0.5. What intercept and slope make the model equivalent when written as y = a + b z?

4171 | Marginal Effect with an Interaction Feature 1 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A model uses engineered interaction terms: y = β0 + 0.8 x1 + β2 x2 + 0.5 x1 x2. What is the marginal effect of x1 when x2 = 2?

4172 | Marginal Effect with an Interaction Feature 2 | Machine Learning | Medium | Numeric | Not attempted | Interview subscription
A model uses engineered interaction terms: y = β0 - 0.2 x1 + β2 x2 + 1.2 x1 x2. What is the marginal effect of x1 when x2 = -1?

4176 | Cyclical Time-of-Day Encoding 1 | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A cyclical hour-of-day feature is encoded as (sin(2πh/24), cos(2πh/24)). What is the encoding for h=6?

4177 | One-Hot Plus Interaction Column Count | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A categorical variable has 5 levels. You one-hot encode it with a dropped baseline, keep one raw numeric feature x, and also create all interactions between x and the retained dummies. How many columns come out of this block in total?

4178 | Winsorize-then-Standardize Pipeline | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A raw daily return of 4.8% is winsorized to the range [-3%, 3%], then standardized using trailing mean 0.5% and trailing standard deviation 1.0%. What z-score feature results?

4179 | Log1p Volume Transform | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
A liquidity feature uses log1p(volume). If today's volume is 999999 shares, what transformed value do you store?

4180 | Leakage-Safe Rolling Mean Feature | Machine Learning | Easy | Numeric | Not attempted | Interview subscription
At today's open you build a leakage-safe rolling-mean return feature from the last four completed daily returns: [1.0%, -2.0%, 0.5%, 1.5%]. What feature value do you use?

4181 | Rolling Mean with a Forward Window | Machine Learning | Medium | Derivation | Not attempted | Interview subscription
A candidate constructs a '5-day moving average' for day t using prices from t-2 through t+2. Is this valid for a predictive model meant to trade at the close of day t?

4182 | Standardization Before the Train-Test Split | Machine Learning | Medium | Derivation | Not attempted | Interview subscription
A pipeline standardizes each feature using the mean and standard deviation of the full dataset before creating the train/test split. Is that clean?

4183 | Target Encoding Without Out-of-Fold Logic | Machine Learning | Medium | Derivation | Not attempted | Interview subscription
A categorical feature is target-encoded using the full sample average label for each category, and then those encodings are used inside cross-validation. Is that safe?
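
Several of the numeric questions listed on this page reduce to a line or two of arithmetic. The Python sketch below is one way to sanity-check them; it is not an official answer key. The helper names (`posterior_from_likelihoods`, `posterior_from_ratio`, `rewrite_for_scaled_feature`, `cyclical_encoding`, `winsorized_zscore`) are hypothetical and chosen only for illustration. For question 4151 it assumes the usual naive Bayes treatment of a missing feature: X2 is marginalized out, so its factor sums to 1 and only the X1 term remains.

```python
import math


def posterior_from_likelihoods(prior, lik_pos, lik_neg):
    """Bayes' rule for a binary class from class-conditional likelihoods."""
    numerator = prior * lik_pos
    return numerator / (numerator + (1.0 - prior) * lik_neg)


def posterior_from_ratio(prior, likelihood_ratio):
    """Posterior P(Y=1|x) from the prior and p(x|Y=1)/p(x|Y=0), via odds."""
    posterior_odds = prior / (1.0 - prior) * likelihood_ratio
    return posterior_odds / (1.0 + posterior_odds)


def rewrite_for_scaled_feature(intercept, slope, center, scale):
    """Rewrite y = intercept + slope*x as y = a + b*z, where z = (x - center)/scale.

    Substituting x = center + scale*z gives a = intercept + slope*center
    and b = slope*scale.
    """
    return intercept + slope * center, slope * scale


def cyclical_encoding(hour, period=24.0):
    """(sin, cos) encoding of a cyclical feature with the given period."""
    angle = 2.0 * math.pi * hour / period
    return math.sin(angle), math.cos(angle)


def winsorized_zscore(value, lower, upper, mean, std):
    """Clip value to [lower, upper], then standardize with the given mean/std."""
    clipped = min(max(value, lower), upper)
    return (clipped - mean) / std


# 4151: X2 is missing, so only the X1 likelihoods enter the posterior.
p_4151 = posterior_from_likelihoods(0.5, 0.8, 0.3)

# 4156 and 4157: posterior from a likelihood ratio, then a 0.5 threshold.
p_4156 = posterior_from_ratio(0.2, 5.0)
p_4157 = posterior_from_ratio(0.4, 0.5)
decision_4156 = int(p_4156 >= 0.5)
decision_4157 = int(p_4157 >= 0.5)

# 4166 and 4167: equivalent (intercept, slope) after centering and scaling.
ab_4166 = rewrite_for_scaled_feature(1.5, 2.0, center=10.0, scale=2.0)
ab_4167 = rewrite_for_scaled_feature(-0.5, 1.2, center=5.0, scale=0.5)

# 4171 and 4172: the marginal effect of x1 is d y / d x1 = coef_x1 + coef_inter * x2.
me_4171 = 0.8 + 0.5 * 2
me_4172 = -0.2 + 1.2 * (-1)

# 4176: hour 6 is a quarter of the way around the 24-hour cycle.
enc_4176 = cyclical_encoding(6)

# 4177: 4 retained dummies + 1 raw numeric column + 4 interaction columns.
cols_4177 = (5 - 1) + 1 + (5 - 1)

# 4178: all quantities in percent.
z_4178 = winsorized_zscore(4.8, -3.0, 3.0, mean=0.5, std=1.0)

# 4179: log1p(v) = ln(1 + v).
log_4179 = math.log1p(999_999)

# 4180: mean of the last four completed daily returns, in percent.
returns_4180 = [1.0, -2.0, 0.5, 1.5]
mean_4180 = sum(returns_4180) / len(returns_4180)

print(round(p_4151, 4), round(p_4156, 4), round(p_4157, 4))
print(ab_4166, ab_4167, round(me_4171, 4), round(me_4172, 4))
print(cols_4177, z_4178, round(log_4179, 4), mean_4180)
```

The odds form used in `posterior_from_ratio` keeps the prior and the evidence separate, which is also why a pure shift in class prevalence (as in question 4162) only requires swapping in a new prior.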