GLOBAL SEARCH

搜索课程、模块、题目与收藏题单

搜索在服务端完成，题目解析与答案不会进入搜索结果。登录后可搜索自己的收藏题单。

全部 · 4546 课程 · 299 模块 · 72 题目 · 4169 帮助 · 6 收藏题单 · 0

找到 24 个结果

课程树模型与核方法 · 机器学习理论

Bagging 与随机森林

周五午盘，一家 50 亿规模的 CN 私募把一份沪深300 alpha 数据甩到你工位:30 个特征、日频次日超额收益作标签。上一课那棵深度 15 的 CART 树样本内方向准确率 100%、样本外只有 51%——比抛硬币好不了多少，Sharpe 几乎为零。你把它换成 500 棵在 bootstrap 样本上独立训练的深树取平均，样本外跳到 57%。这一跳，...

题目2574 · 机器学习

Why Bagging Helps Unstable Learners Most 10

Why does bagging usually help deep trees much more than it helps already-stable learners?

题目2575 · 机器学习

Why Bagging Rarely Fixes High Bias 11

Why should you not expect bagging alone to rescue a learner whose individual trees are systematically misspecified?

题目2413 · 机器学习

Why Bagging Mainly Targets Variance

Why is bagging usually described as a variance-reduction tool rather than a bias-reduction tool?

题目2589 · 机器学习

Bagged MSE When Bias Stays Fixed 7

Assume each tree has the same squared bias b^2 and prediction noise floor nu, while bagging only changes the variance term according to the equicorrelated-tree formula. Derive the bagged test MSE with B trees.

模块2.6.2 · 数学与统计能力 · 机器学习理论

树模型与核方法

machine-learning · tree-based-methods · decision-tree · cart · impurity · pruning · bagging · random-forest

课程树模型与核方法 · 机器学习理论

梯度提升与 XGBoost / LightGBM

上海某私募的因子研究员把上一节的 500 棵随机森林训完,沪深300 + 中证500 上的样本外准确率 57%——比单棵深树的 51% 上了 6 个点。她把 max features 从 sqrt(p) 调到 p/3、把树数加到 2000,准确率纹丝不动停在 57.2%——bagging 的方差红利已经吃干净了。PM 在因子复盘会上一句话:「方差降到底了,把...

课程信号评估与合成 · Alpha 研究

信号合成、堆叠与集成

周五上午,你在上海的一家量化私募 ——明汯、幻方、九坤、灵均风格的多因子私募。 L3 把四条信号正交化完了: mom 12 1 , book to market , gross profitability , pead sue 都残差化通过了 IC break even 门槛。桌面上还没有量产复合信号。投决...

课程树模型与核方法 · 机器学习理论

决策树:CART、不纯度准则与剪枝

周一早盘九点二十,你接手了离职同事留下的 alpha 模型——一棵深度 15 的 CART(Classification and Regression Tree, CART)树,在三年沪深300 成分股日度面板上训练,特征是动量、价值、质量、低波、5 日收益、20 日波动率、换手率等 12 个变量,目标是预测下一日超额收益方向(涨/跌)。样本内训练精度 1...

题目2592 · 机器学习

Effective Independent Tree Count 8

Define B_eff by matching the correlated-forest variance sigma^2 [rho + (1-rho)/B] to the variance sigma^2 / B_eff of averaging independent trees. Derive B_eff.

题目2579 · 机器学习

Infer Tree Correlation From the Variance Floor 23

A single tree has variance 6, while an extremely large forest appears to level off at variance 1.8. What pairwise tree correlation rho is implied?

题目2573 · 机器学习

Infinite-Forest Variance Floor 2

Using the equicorrelated-tree variance formula, derive the prediction variance as the number of trees B tends to infinity.

题目2584 · 机器学习

Marginal Variance Reduction From One More Tree 3

Under the equicorrelated-tree model, derive how much the ensemble variance falls when you move from B trees to B+1 trees.

题目2572 · 机器学习

Numeric Ensemble Variance 22

Each tree has variance 9, pairwise correlation 0.2, and the forest has 25 trees. What is the variance of the forest average?

题目2585 · 机器学习

Trees Needed for a Target Variance Cap 4

Suppose each tree has variance sigma^2 and pairwise correlation rho. Derive the minimum B needed to make the ensemble variance at most V, assuming V > rho sigma^2.

题目2571 · 机器学习

Variance of an Average of Correlated Trees 1

Suppose B trees each have variance sigma^2 and every pair has correlation rho. Derive the variance of their simple average.

题目2593 · 机器学习

Why Averaging Cannot Cure Systematic Label Noise 20

Why can a larger forest fail to repair performance when the training labels themselves are systematically corrupted?

题目2576 · 机器学习

Why Feature Subsampling Helps When One Predictor Dominates 12

Why can random feature subsampling improve a forest when one very strong predictor would otherwise appear at the top of almost every tree?

题目2580 · 机器学习

Why More Trees Usually Do Not Create Classical Overfit 15

Why does adding more trees to a random forest typically plateau rather than create the kind of explosive overfit seen in some single-model families?

题目2591 · 机器学习

Why OOB Can Be Noisy on Small Samples 19

Why can out-of-bag error fluctuate a lot on a small dataset even when the forest itself is reasonably stable?

题目2577 · 机器学习

Why OOB Is Unsafe for Grouped or Temporal Data 13

Why can out-of-bag error be misleading when rows are linked by entity or time rather than being exchangeable?

题目2581 · 机器学习

Why Random-Forest Regression Extrapolates Poorly 16

Why does random-forest regression usually fail to extrapolate a trend far beyond the training range?

题目2578 · 机器学习

Why Tiny max_features Can Raise Bias 14

Why can making max_features too small hurt a random forest even though it lowers correlation?

课程树模型与核方法 · 机器学习理论

核方法与支持向量机

周一开盘前一小时,你坐在上海一家中型私募基金(private fund)的研究室。投研经理把一张 CSV 推到桌上:沪深300 成分股 300 只,每只配 15 维因子向量(PE、PB、12 个月动量、20 日波动率、换手率、分析师上调比例),本质上是一张轻量级因子模型(factor model)输入表;标签公式表示下月相对指数 outperform /...