GLOBAL SEARCH

搜索课程、模块、题目与收藏题单

搜索在服务端完成,题目解析与答案不会进入搜索结果。登录后可搜索自己的收藏题单。

找到 30 个结果

中文题目
题目2457 · 机器学习

PCA Fit Once Before Cross-Validation

A notebook computes PCA on the full feature matrix and then feeds the resulting components into every cross-validation fold. Why is that not a harmless speed optimization?

打开 →
题目5041 · 机器学习

Recover the Missing Fold Gap 1

A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.02, 0.01, -0.01, 0.03]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?

打开 →
题目5042 · 机器学习

Recover the Missing Fold Gap 2

A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.05, 0.02, 0.04, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.026. What was the missing fifth-fold difference?

打开 →
题目5043 · 机器学习

Recover the Missing Fold Gap 3

A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [-0.02, 0.01, 0.0, -0.01]. The desk report says the overall mean fold difference across all 5 folds was 0.002. What was the missing fifth-fold difference?

打开 →
题目5044 · 机器学习

Recover the Missing Fold Gap 4

A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.01, 0.01, 0.02, 0.0]. The desk report says the overall mean fold difference across all 5 folds was 0.014. What was the missing fifth-fold difference?

打开 →
题目5045 · 机器学习

Recover the Missing Fold Gap 5

A 5-fold cross-validation comparison records four paired score differences (model A minus model B): [0.04, -0.02, 0.01, 0.02]. The desk report says the overall mean fold difference across all 5 folds was 0.01. What was the missing fifth-fold difference?

打开 →
模块2.6.5 · 数学与统计能力 · 机器学习理论

金融量化中的机器学习

machine-learning · financial-ml · cross-validation · purged-cv · cpcv · deflated-sharpe · multiple-testing · backtest-overfitting

打开 →
课程监督学习基础 · 机器学习理论

正则化与模型选择

正则化与模型选择 Hook:一次「翻牌」事件 你在上海一家私募基金负责沪深300 选股策略。上周你按第 3 课的做法,用普通最小二乘(ordinary least squares, OLS)把 5 个 Barra 风格因子——估值、质量、动量、规模、低波动——回归到下一期超额收益上,得到一组 公式。这周把估计窗口前移 5 个交易日重跑,价值因子载荷从 公式 ...

打开 →
课程金融量化中的机器学习 · 机器学习理论

Alpha 信号与特征工程:标注、元标注与信号衰减

钩子:五十条弱 alpha 与一个总组合 你在一家中证500 中频量化私募(private fund)工作。研究团队在过去六个月里训练出了五十条独立的 ML alpha:有用 LightGBM 在 沪深300 / 中证500 因子风格暴露上做次日 alpha 的,有 1 D CNN 在分钟线上做日内动量(momentum)的,有 Transformer 在卖...

打开 →
题目4419 · 机器学习

Average Training Length Under Expansion 4

An expanding walk-forward starts with 12 months of training and then advances by 6 months for each of 5 complete test folds. What is the average training-window length used across the 5 folds?

打开 →
题目4418 · 机器学习

Embargo Budget Across Folds 3

A walk-forward backtest produces 7 complete folds, and the research protocol inserts a 3-day embargo between each training block and its following test block. How many calendar days are lost to embargo across the whole run?

打开 →
题目4422 · 机器学习

Execution-Lagged Label Capacity 7

A test block has 25 trading days. A signal generated on day t is executed on day t+1 and evaluated on the open-to-close return from day t+1 through day t+4. How many signals inside the block can be scored without the label running past the block end?

打开 →
题目4417 · 机器学习

Expanding Window Final Training Length 2

An expanding-window walk-forward starts with 18 months of training, then uses a 1-month embargo and a 4-month test block, advancing by 4 months each round across 59 months of history. What is the training-window length in the last complete fold?

打开 →
题目2704 · 机器学习

Expected Null Strategies Surviving a Screening Funnel

A research platform runs 200 null strategies. Only strategies with in-sample p-value below 15% are promoted, and each promoted strategy must then pass a fresh 5% confirmation test. Assuming independence under the null, what is the expected number of false strategies that survive

打开 →
题目2696 · 机器学习

False Strategy Surviving Two Independent Research Gates

A desk tries 80 genuinely null strategy ideas. A strategy is kept only if it passes an in-sample screen at 10% and then a fresh out-of-sample confirmation at 5%, with the two tests treated as independent under the null. What is the probability at least one null idea survives both

打开 →
题目2454 · 机器学习

Feature Screening Before the Split

A team ranks 5,000 candidate features by correlation with the target on the full dataset, keeps the top 30, and only then creates train and test. Why is the later split not enough to rescue the experiment?

打开 →