Home > Transforms Nodes > Sample Node
A sample is a a subset of data. Oracle Data Miner supports two sampling techniques:
Random Sample: A sample in which every element of the data set has an equal chance of being selected.
Stratified Sample: A sample that is created as follows: Divide the data set into disjoint subsets (strata) and then take a random sample from each of the subsets. This technique is used when the distribution of target values is skewed greatly. For example, response to a marketing campaign may have a positive target value 1% of the time or less.
TopN: A sample consisting of N records from an attribute.