In Oracle Data Mining 11g Release 2 (11.2) and earlier, text must undergo the following processes before it can be mined:
Extraction or Feature Extraction: This is a special preprocessing step, where the text is broken down into units (terms) that can be mined. Text terms can be keywords or other document-derived features.
Text preparation: This step uses a Build Text node to transform text columns. Build Text does not support HTML or XML documents, and it does not support any binary data types.
Oracle Data Miner uses the facilities of Oracle Text to preprocess text columns.
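To make the idea of feature extraction concrete, here is a minimal Python sketch of breaking text into terms and counting them. It is only an illustration of the general technique and is not how Oracle Text or the Build Text node is implemented. The function names `extract_terms` and `term_frequencies` and the small stop-word list are hypothetical, chosen for this example.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only; real text
# preprocessing tools ship their own, much larger lists.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in"}

def extract_terms(text):
    """Lowercase the text, split it into alphabetic tokens, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

def term_frequencies(text):
    """Count how often each extracted term occurs in the text."""
    return Counter(extract_terms(text))

doc = "The engine is loud and the engine stalls in traffic."
terms = extract_terms(doc)
freqs = term_frequencies(doc)
```

The resulting term counts are the kind of document-derived features that a mining algorithm can consume in place of the raw text column.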
Note: You must preprocess text using the Text nodes: Apply Text, Build Text, and Text Reference.