learner. In the case of classification, predictions are generated by averaging probability estimates, not by voting. One parameter is the size of the bags as a percentage of the training set. Another is whether to calculate the out-of-bag error, which gives the average error of the ensemble members (Breiman 2001).
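These options are also exposed through Weka's Java API. The following is a minimal sketch, assuming the weka.classifiers.meta.Bagging class and its setter methods (setBagSizePercent, setCalcOutOfBag, setNumIterations); the file weather.arff is just a placeholder dataset, not part of the original text:

import java.io.BufferedReader;
import java.io.FileReader;
import weka.classifiers.meta.Bagging;
import weka.classifiers.trees.REPTree;
import weka.core.Instances;

// Minimal sketch: bagging REPTree with out-of-bag error estimation.
public class BaggingSketch {
  public static void main(String[] args) throws Exception {
    Instances data = new Instances(new BufferedReader(new FileReader("weather.arff")));
    data.setClassIndex(data.numAttributes() - 1);

    Bagging bagger = new Bagging();
    bagger.setClassifier(new REPTree()); // base learner (Bagging's default)
    bagger.setBagSizePercent(100);       // bag size as a percentage of the training set
    bagger.setCalcOutOfBag(true);        // compute the out-of-bag error estimate
    bagger.setNumIterations(10);         // number of bagged ensemble members

    bagger.buildClassifier(data);
    System.out.println("Out-of-bag error: " + bagger.measureOutOfBagError());
  }
}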
RandomCommittee is even simpler: it builds an ensemble of base classifiers and averages their predictions. Each one is based on the same data but uses a different random number seed (Section 7.5, page 320). This only makes sense if the base classifier is randomized; otherwise, all classifiers would be the same.
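In code, this might look like the following sketch, assuming the weka.classifiers.meta.RandomCommittee class and a randomizable base learner such as RandomTree; weather.arff is again only a placeholder dataset:

import java.io.BufferedReader;
import java.io.FileReader;
import weka.classifiers.meta.RandomCommittee;
import weka.classifiers.trees.RandomTree;
import weka.core.Instances;

// Minimal sketch: a committee of random trees that differ only in their random seeds.
public class RandomCommitteeSketch {
  public static void main(String[] args) throws Exception {
    Instances data = new Instances(new BufferedReader(new FileReader("weather.arff")));
    data.setClassIndex(data.numAttributes() - 1);

    RandomCommittee committee = new RandomCommittee();
    committee.setClassifier(new RandomTree()); // must be a randomized base classifier
    committee.setNumIterations(10);            // size of the committee
    committee.setSeed(1);                      // master seed; each member gets its own derived seed

    committee.buildClassifier(data);
  }
}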



Table 10.6 Metalearning algorithms in Weka.

Name                           Function

AdaBoostM1                     Boost using the AdaBoostM1 method
AdditiveRegression             Enhance the performance of a regression method by iteratively fitting the residuals
AttributeSelectedClassifier    Reduce dimensionality of data by attribute selection
Bagging                        Bag a classifier; works for regression too
ClassificationViaRegression    Perform classification using a regression method
CostSensitiveClassifier        Make its base classifier cost sensitive
CVParameterSelection           Perform parameter selection by cross-validation
Decorate                       Build ensembles of classifiers using specially constructed artificial training examples
FilteredClassifier             Run a classifier on filtered data
Grading                        Metalearner whose inputs are base-level predictions that have been marked as correct or incorrect
LogitBoost                     Perform additive logistic regression
MetaCost                       Make a classifier cost sensitive
MultiBoostAB                   Combine boosting and bagging using the MultiBoosting method
MultiClassClassifier           Use a two-class classifier for multiclass datasets
MultiScheme                    Use cross-validation to select a classifier from several candidates
OrdinalClassClassifier         Apply standard classification algorithms to problems with an ordinal class value
RacedIncrementalLogitBoost     Batch-based incremental learning by racing logit-boosted committees
RandomCommittee                Build an ensemble of randomizable base classifiers
RegressionByDiscretization     Discretize the class attribute and employ a classifier
Stacking                       Combine several classifiers using the stacking method
StackingC                      More efficient version of stacking
ThresholdSelector              Optimize the F-measure for a probabilistic classifier
Vote                           Combine classifiers using the average of probability estimates or numeric predictions
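To give a flavor of how these schemes are wired together programmatically, the following sketch configures one of them, Stacking, from the Java API. The particular choice of level-0 learners, the Logistic level-1 learner, and the weather.arff file are illustrative assumptions, not recommendations from the text:

import java.io.BufferedReader;
import java.io.FileReader;
import weka.classifiers.Classifier;
import weka.classifiers.bayes.NaiveBayes;
import weka.classifiers.functions.Logistic;
import weka.classifiers.lazy.IBk;
import weka.classifiers.meta.Stacking;
import weka.classifiers.trees.J48;
import weka.core.Instances;

// Minimal sketch: three level-0 learners combined by a logistic level-1 model.
public class StackingSketch {
  public static void main(String[] args) throws Exception {
    Instances data = new Instances(new BufferedReader(new FileReader("weather.arff")));
    data.setClassIndex(data.numAttributes() - 1);

    Stacking stacker = new Stacking();
    stacker.setClassifiers(new Classifier[] { new J48(), new NaiveBayes(), new IBk() });
    stacker.setMetaClassifier(new Logistic()); // learns from the base-level predictions
    stacker.setNumFolds(10);                   // cross-validation used to generate level-1 data

    stacker.buildClassifier(data);
  }
}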
