第 16 / 32 页
非代码面试题
显示 20 / 622 道匹配题目
答题状态:未尝试未正确已正确
ID题目领域难度题型进度权限
4151Generative Classification with a Missing Feature 1A two-feature naive Bayes model was trained generatively, but at prediction time X2 is missing. Prior P(Y=1)=0.5, P(X1=1|Y=1)=0.8, P(X1=1|Y=0)=0.3, P(X2=1|Y=1)=0.75, P(X2=1|Y=0)=0.4. You only observe X1=1. What posterior P(Y=1|X1) should the generative model use?机器学习中等数值题未尝试面试订阅4156Small Labeled Sample with Plausible StructureFor an observation x, a generative model summarizes the evidence as likelihood ratio p(x|Y=1)/p(x|Y=0) = 5. If the prior probability of class 1 is 0.2, what posterior probability P(Y=1|x) follows, and what 0.5-threshold decision does that imply?机器学习中等derivation未尝试面试订阅4157Lots of Labels but Misspecified Density StoryFor an observation x, a generative model summarizes the evidence as likelihood ratio p(x|Y=1)/p(x|Y=0) = 0.5. If the prior probability of class 1 is 0.4, what posterior probability P(Y=1|x) follows, and what 0.5-threshold decision does that imply?机器学习中等derivation未尝试面试订阅4161Why Naive Bayes Can Work Despite Wrong IndependenceYou have only a few hundred labeled observations, but domain knowledge gives a plausible class-conditional structure and you also have many unlabeled feature vectors. Would you start with a generative or a discriminative model first?机器学习中等essay未尝试面试订阅4162Why Discriminative Models Often Win AsymptoticallyA classifier was trained last quarter, and now only the class prevalence has shifted while the conditional shape of features given class appears stable. Which side, generative or discriminative, is easier to adjust quickly?机器学习中等essay未尝试面试订阅4163Why Generative Models Handle Missingness More NaturallyYou have millions of labeled examples and care only about predictive accuracy on the deployed label, not about simulating x. Which side usually deserves the first try?机器学习中等essay未尝试面试订阅4164Why Generative and Discriminative Can Share a BoundaryA production system frequently has one sensor missing at test time, but your model family can factor the joint feature likelihood cleanly. Which side gets a practical advantage?机器学习中等essay未尝试面试订阅4165A Fast Sanity Check for Gen-vs-Disc QuestionsThe research desk wants not only labels but also synthetic feature draws conditional on each class for stress testing. Which side is the more natural starting point?机器学习中等essay未尝试面试订阅4166Centering-and-Scaling Coefficient Rewrite 1A linear model is y = 1.5 + 2 x. You now replace x by the engineered feature z=(x-10)/2. What intercept and slope make the model equivalent when written as y = a + b z?机器学习简单数值题未尝试面试订阅4167Centering-and-Scaling Coefficient Rewrite 2A linear model is y = -0.5 + 1.2 x. You now replace x by the engineered feature z=(x-5)/0.5. What intercept and slope make the model equivalent when written as y = a + b z?机器学习简单数值题未尝试面试订阅4171Marginal Effect with an Interaction Feature 1A model uses engineered interaction terms: y = β0 + 0.8 x1 + β2 x2 + 0.5 x1 x2. What is the marginal effect of x1 when x2 = 2?机器学习中等数值题未尝试面试订阅4172Marginal Effect with an Interaction Feature 2A model uses engineered interaction terms: y = β0 + -0.2 x1 + β2 x2 + 1.2 x1 x2. What is the marginal effect of x1 when x2 = -1?机器学习中等数值题未尝试面试订阅4176Cyclical Time-of-Day Encoding 1A cyclical hour-of-day feature is encoded as (sin(2πh/24), cos(2πh/24)). What is the encoding for h=6?机器学习简单数值题未尝试面试订阅4177One-Hot Plus Interaction Column CountA categorical variable has 5 levels. You one-hot encode it with a dropped baseline, keep one raw numeric feature x, and also create all interactions between x and the retained dummies. How many columns come out of this block in total?机器学习简单数值题未尝试面试订阅4178Winsorize-then-Standardize PipelineA raw daily return of 4.8% is winsorized to the range [-3%, 3%], then standardized using trailing mean 0.5% and trailing standard deviation 1.0%. What z-score feature results?机器学习简单数值题未尝试面试订阅4179Log1p Volume TransformA liquidity feature uses log1p(volume). If today's volume is 999999 shares, what transformed value do you store?机器学习简单数值题未尝试面试订阅4180Leakage-Safe Rolling Mean FeatureAt today's open you build a leakage-safe rolling-mean return feature from the last four completed daily returns: [1.0%, -2.0%, 0.5%, 1.5%]. What feature value do you use?机器学习简单数值题未尝试面试订阅4181Rolling Mean with a Forward WindowA candidate constructs a '5-day moving average' for day t using prices from t-2 through t+2. Is this valid for a predictive model meant to trade at the close of day t?机器学习中等derivation未尝试面试订阅4182Standardization Before the Train-Test SplitA pipeline standardizes each feature using the mean and standard deviation of the full dataset before creating the train/test split. Is that clean?机器学习中等derivation未尝试面试订阅4183Target Encoding Without Out-of-Fold LogicA categorical feature is target-encoded using the full sample average label for each category, and then those encodings are used inside cross-validation. Is that safe?机器学习中等derivation未尝试面试订阅