1283
Chapter 57: Data Mining with Analysis Services
57
such as identifying outlier rows in a data set. Regardless of the situation, understanding
this typical process can aid in building appropriate adaptations.
Modeling with Analysis Services
Create data mining structures within an existing Analysis Services project inside the
Business Intelligence Development Studio. When deployed, the Analysis Services project
creates an Analysis Services database on the target server. Often, data mining structures
deploy with related cubes in the same database.
Begin the modeling process by telling Analysis Services where the training and testing data
reside:
■ (^) Defi ne data sources that reference the location of data to be used in modeling.
■ Create data source views that include all training tables. When you use nested
tables, the data source view must show the relationship between the case and
nested tables.
For information to create and manage data sources and data source views, see Chapter 53, “Building
Multidimensional Cubes in Analysis Services with MDX.”
Data Mining Wizard
The Data Mining Wizard steps through the process to defi ne a new data mining structure
and optionally the fi rst model within that structure. Right-click the Mining Structures
node within the Solution Explorer, and choose New Mining Model to start the wizard. The
wizard consists of several pages:
■ (^) Select the Definition Method: Enables the choice of either relational (from exist-
ing relational database or data warehouse) or cube (from existing cube) source
data. Choose relational.
■ (^) Create the Data Mining Structure: Choose the algorithm to use in the structure’s
fi rst mining model. (See the “Algorithms” section in this chapter for common algo-
rithm usage.) You can create a structure with no models and add a model later.
■ (^) Select Data Source View: Choose the data source view containing the source
datatables.
■ (^) Specify Table Types: Choose the case table containing the source data and any
associated nested tables. Nested tables always have one-to-many relationships with
the case table.
■ (^) Specify the Training Data: Categorize columns by their use in the mining
structure. When a column is not included in any category, it is omitted from
the structure. Categories are as follows:
c57.indd 1283c57.indd 1283 7/31/2012 10:35:01 AM7/31/2012 10:35:01 AM
http://www.it-ebooks.info