简体   繁体   English

h2oEnsemble包中的h2o.ensemble和h2o.stack有什么不同

[英]what is the different between h2o.ensemble and h2o.stack in package h2oEnsemble

Accoding to the Description of function: 根据功能描述:

h2o.stack: This function creates a "Super Learner" (stacking) ensemble using a list of existing H2O base models specified by the user. h2o.stack:此函数使用用户指定的现有H2O基本模型列表创建“超级学习者”(堆叠)集合。

h2o.ensemble: This function creates a "Super Learner" (stacking) ensemble using the H2O base learning algorithms specified by the user. h2o.ensemble:此函数使用用户指定的H2O基础学习算法创建“超级学习者”(堆叠)集合。

They are two different ways to construct an ensemble. 它们是构建整体的两种不同方式。 They have a different interface, but they produce the exact same type of object in the end. 它们具有不同的接口,但它们最终会产生完全相同类型的对象。

  • The h2o.stack() function takes as input a list of already trained (and cross-validated) H2O models, so all it needs to do is the metalearning (combiner) step, which is very fast. h2o.stack()函数将已经训练过的(和交叉验证的)H2O模型列表作为输入,因此它所需要做的就是元学习(组合器)步骤,这非常快。 This is useful if you want to use a grid of H2O models or a collection of grids of H2O models as the base learners. 如果您想使用H2O模型网格或H2O模型网格集合作为基础学习者,这将非常有用。 The only caveat is that all the base learners must have used identical cross-validation folds. 唯一需要注意的是,所有基础学习者必须使用相同的交叉验证折叠。 If you use fold_assignment = "Modulo" in all the base learners (or grid) that will ensure identical folds. 如果您在所有基础学习者(或网格)中使用fold_assignment = "Modulo" ,这将确保相同的折叠。
  • The h2o.ensemble() function allows the user to specify which base models they want in the ensemble and then does the all of the training and cross-validation of the base models, and then does the metalearning (combiner) step as well. h2o.ensemble()函数允许用户在整体中指定他们想要的基本模型,然后进行基本模型的所有训练和交叉验证,然后进行元学习(组合)步骤。 This takes much longer since it has to train all the base models as well. 这需要更长的时间,因为它必须训练所有的基础模型。

As of the latest stable release (H2O 3.10.3.*), stacking is now available natively in H2O (R, Python, Java, Scala) as the "Stacked Ensemble" method. 从最新的稳定版本(H2O 3.10.3。*)开始,堆叠现在可以在H2O(R,Python,Java,Scala)中作为“Stacked Ensemble”方法本地使用。 More info on that here . 关于这里的更多信息。 However, the h2oEnsemble R package (where the h2o.ensemble() and h2o.stack() functions live) will continue to be supported as well. 但是, h2oEnsemble R包(其中h2o.ensemble()h2o.stack()函数的实时)也将继续受到支持。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM