Data Mining: Practical Machine Learning Tools and Techniques, Second Edition

(Brent) #1

procedure, reduced-error pruning (Section 6.2, pages 202–203) can be per-
formed. The -Noption governs the size of the holdout set: the dataset is divided
equally into that number of parts and the last is held out (default value 3). You
can smooth the probability estimates using the Laplace technique, set the
random number seed for shuffling the data when selecting a pruning set, and
store the instance information for future visualization. Finally, to build a binary
tree instead of one with multiway branches for nominal attributes, use -B.


13.3 COMMAND-LINE OPTIONS 459

Free download pdf