
Data Mining

Note

Regression predicts numerical (continuous) values
Classification predicts categories (discrete classes)

Confusion Matrix

Columns indicate the actual class; rows indicate the predicted class.

                     Actual Positive    Actual Negative
Predicted Positive   True Positive      False Positive
Predicted Negative   False Negative     True Negative
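
As a minimal sketch of how the four cells can be counted, the snippet below tallies TP, FP, FN and TN from two label lists. The function name `confusion_counts`, the sample labels, and the choice of `1` as the positive class are illustrative assumptions, not part of these notes.

```python
def confusion_counts(actual, predicted, positive=1):
    """Count (TP, FP, FN, TN) for a binary problem.

    `positive` is the label treated as the positive class (assumed to be 1 here).
    """
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        elif a == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, fp, fn, tn


# Hypothetical sample data, for illustration only.
actual = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 0, 0, 1]

tp, fp, fn, tn = confusion_counts(actual, predicted)

# Rows are predicted classes, columns are actual classes.
print(f"Predicted Positive: TP={tp} FP={fp}")
print(f"Predicted Negative: FN={fn} TN={tn}")
```

For the sample labels this gives TP=3, FP=1, FN=1 and TN=3.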

Evaluation

Precision formula
TP / (TP + FP)
Recall formula
TP / (TP + FN)
Specificity formula
TN / (TN + FP)
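
The three formulas can be sketched directly in code. The counts below are made-up numbers for illustration, and returning 0.0 when a denominator is zero is an assumed convention, not something these notes specify.

```python
def safe_ratio(numerator, denominator):
    """Divide, returning 0.0 for an empty denominator (an assumed convention)."""
    return numerator / denominator if denominator else 0.0


def precision(tp, fp):
    # Share of predicted positives that are actually positive.
    return safe_ratio(tp, tp + fp)


def recall(tp, fn):
    # Share of actual positives that were predicted positive.
    return safe_ratio(tp, tp + fn)


def specificity(tn, fp):
    # Share of actual negatives that were predicted negative.
    return safe_ratio(tn, tn + fp)


# Hypothetical confusion-matrix counts.
tp, fp, fn, tn = 40, 10, 20, 30

print(f"precision   = {precision(tp, fp):.3f}")
print(f"recall      = {recall(tp, fn):.3f}")
print(f"specificity = {specificity(tn, fp):.3f}")
```

With these counts, precision is 40/50 = 0.8, recall is 40/60 (about 0.667), and specificity is 30/40 = 0.75.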

Alternative names for Data Mining

knowledge discovery (mining) in databases (KDD)
knowledge extraction
data/pattern analysis
data archeology
business intelligence

Data Mining Issues

Human interaction
Overfitting
Outliers
Interpretation of results
Visualization of results
Large datasets
High dimensionality
Multimedia data
Missing data
Irrelevant data
Noisy data
Changing data
Integration
Applications
