Data Miner uses workflows to specify data mining operations. The operations specified in the workflow are executed in the Data Miner server.
First create a database connection to the user account where the data to mine resides. Then create a project where the workflows reside. In the workflow, create a node that contains the mining data and connect it to a model build node. After building models, select the best model and apply it to new data to make predictions.
Launch the cue cards to learn how to use the Data Miner GUI and build classification models.
This cue card set introduces you to the Data Mining graphical user interface (GUI); it also illustrates how to solve a business problem.
Problem: A retailer wants to increase revenues by identifying all potentially high-value customers in order to offer incentives to them. For example, the incentive might be an affinity card that gives a discount on purchases.
Solution: Build a Data Mining Classification model to predict which customers who likely to be high spenders in the next year. Award these customers an affinity card.
An important part of data mining problems is data gathering and preparation. Data gathering and preparation consumes more than 50% of the time and effort of a data mining project. In this example, you are provided with data that is ready to be mined.
An important part of building a model is deciding which algorithm to use. In this example, you build and test four classification models with the same target using four different algorithms. You then select the best model by comparing the test results.
Identifying the best model and applying it to new data is not the end of the problem. It is necessary to deploy the results; in this case, you might want to distribute affinity cards to high-value customers. The first step in deploying the results is to export the predictions (apply results) to a table in the database.
The following steps build several classification models, select the best model, apply the best model to new data, and write the results of the model apply to a database table:
Copyright © 2011, 2012, Oracle. All rights reserved.