Note
• Regression handles numerical values
• Classification handles categories (concrete classes)
Confusion Matrix
• Columns indicate the actual class
• Rows indicate the predicted class
                   | Actual Positive | Actual Negative |
Predicted Positive | True Positive   | False Positive  |
Predicted Negative | False Negative  | True Negative   |
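As a rough illustration of how the four cells of the matrix are filled, the sketch below tallies them from two label lists. The function name `confusion_counts`, the choice of `1` as the positive label, and the sample labels are all assumptions made up for this example; they do not come from the notes.

```python
def confusion_counts(actual, predicted, positive=1):
    """Tally the four confusion-matrix cells for a binary problem.

    `positive` is the label treated as the positive class (assumed to be 1).
    """
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            counts["TP"] += 1   # predicted positive, actually positive
        elif p == positive:
            counts["FP"] += 1   # predicted positive, actually negative
        elif a == positive:
            counts["FN"] += 1   # predicted negative, actually positive
        else:
            counts["TN"] += 1   # predicted negative, actually negative
    return counts


# Made-up labels, for illustration only.
actual = [1, 1, 1, 0, 0, 0, 1, 0]
predicted = [1, 0, 1, 0, 1, 0, 1, 0]
cm = confusion_counts(actual, predicted)

# Print with rows = predicted and columns = actual, as in the table above.
print("                   | Actual Positive | Actual Negative |")
print(f"Predicted Positive | {cm['TP']:^15} | {cm['FP']:^15} |")
print(f"Predicted Negative | {cm['FN']:^15} | {cm['TN']:^15} |")
```

Some libraries lay the matrix out the other way round (rows as actual, columns as predicted), so it is worth checking the orientation before reading off a cell.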
Evaluation
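• Precision formula
◦ Precision = True Positive / (True Positive + False Positive)
• Recall formula
◦ Recall = True Positive / (True Positive + False Negative)
• Specificity formula
◦ Specificity = True Negative / (True Negative + False Positive)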
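A minimal sketch of the three evaluation measures, computed from confusion-matrix counts using their standard definitions. The function names, the sample counts, and the choice to return 0.0 when a denominator is zero are assumptions for this example, not something the notes specify.

```python
def precision(tp, fp):
    """Share of predicted positives that are actually positive."""
    total = tp + fp
    return tp / total if total else 0.0  # assumed convention for an empty denominator


def recall(tp, fn):
    """Share of actual positives that were predicted positive."""
    total = tp + fn
    return tp / total if total else 0.0


def specificity(tn, fp):
    """Share of actual negatives that were predicted negative."""
    total = tn + fp
    return tn / total if total else 0.0


# Made-up counts, for illustration only.
tp, fp, fn, tn = 40, 10, 20, 30

p = precision(tp, fp)      # 40 / (40 + 10)
r = recall(tp, fn)         # 40 / (40 + 20)
s = specificity(tn, fp)    # 30 / (30 + 10)
print(f"precision={p:.3f} recall={r:.3f} specificity={s:.3f}")
```

Precision and specificity share the False Positive count but divide it into different totals: precision looks along the predicted-positive row, specificity along the actual-negative column.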
Alternative names for Data Mining
• knowledge discovery (mining) in databases (KDD)
• knowledge extraction
• data/pattern analysis
• data archeology
• business intelligence
Data Mining Issues
• Human interaction
• Overfitting
• Outliers
• Interpretation of results
• Visualization of results
• Large datasets
• High dimensionality
• Multimedia data
• Missing data
• Irrelevant data
• Noisy data
• Changing data
• Integration
• Applications


