| |
| |
| |
Introduction | |
| |
| |
| |
What is Data Mining? | |
| |
| |
| |
Motivating Challenges | |
| |
| |
| |
The Origins of Data Mining | |
| |
| |
| |
Data Mining Tasks | |
| |
| |
| |
Scope and Organization of the Book | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Data | |
| |
| |
| |
Types of Data | |
| |
| |
| |
Data Quality | |
| |
| |
| |
Data Preprocessing | |
| |
| |
| |
Measures of Similarity and Dissimilarity | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Exploring Data | |
| |
| |
| |
The Iris Data Set | |
| |
| |
| |
Summary Statistics | |
| |
| |
| |
Visualization | |
| |
| |
| |
OLAP and Multidimensional Data Analysis | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Classification: Basic Concepts, Decision Trees, and Model Evaluation | |
| |
| |
| |
Preliminaries | |
| |
| |
| |
General Approach to Solving a Classification Problem | |
| |
| |
| |
Decision Tree Induction | |
| |
| |
| |
Model Overfitting | |
| |
| |
| |
Evaluating the Performance of a Classifier | |
| |
| |
| |
Methods for Comparing Classifiers | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Classification: Alternative Techniques | |
| |
| |
| |
Rule-Based Classifier | |
| |
| |
| |
Nearest-Neighbor Classifiers | |
| |
| |
| |
Bayesian Classifiers | |
| |
| |
| |
Artificial Neural Network (ANN) | |
| |
| |
| |
Support Vector Machine (SVM) | |
| |
| |
| |
Ensemble Methods | |
| |
| |
| |
Class Imbalance Problem | |
| |
| |
| |
Multiclass Problem | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Association Analysis: Basic Concepts and Algorithms | |
| |
| |
| |
Problem Definition | |
| |
| |
| |
Frequent Itemset Generation | |
| |
| |
| |
Rule Generation | |
| |
| |
| |
Compact Representation of Frequent Itemsets | |
| |
| |
| |
Alternative Methods for Generating Frequent Itemsets | |
| |
| |
| |
FP-Growth Algorithm | |
| |
| |
| |
Evaluation of Association Patterns | |
| |
| |
| |
Effect of Skewed Support Distribution | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Association Analysis: Advanced Concepts | |
| |
| |
| |
Handling Categorical Attributes | |
| |
| |
| |
Handling Continuous Attributes | |
| |
| |
| |
Handling a Concept Hierarchy | |
| |
| |
| |
Sequential Patterns | |
| |
| |
| |
Subgraph Patterns | |
| |
| |
| |
Infrequent Patterns | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Cluster Analysis: Basic Concepts and Algorithms | |
| |
| |
| |
Overview | |
| |
| |
| |
K-means | |
| |
| |
| |
Agglomerative Hierarchical Clustering | |
| |
| |
| |
DBSCAN | |
| |
| |
| |
Cluster Evaluation | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Cluster Analysis: Additional Issues and Algorithms | |
| |
| |
| |
Characteristics of Data, Clusters, and Clustering Algorithms | |
| |
| |
| |
Prototype-Based Clustering | |
| |
| |
| |
Density-Based Clustering | |
| |
| |
| |
Graph-Based Clustering | |
| |
| |
| |
Scalable Clustering Algorithms | |
| |
| |
| |
Which Clustering Algorithm? | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Anomaly Detection | |
| |
| |
| |
Preliminaries | |
| |
| |
| |
Statistical Approaches | |
| |
| |
| |
Proximity-Based Outlier Detection | |
| |
| |
| |
Density-Based Outlier Detection | |
| |
| |
| |
Clustering-Based Techniques | |
| |
| |
| |
Bibliographic Notes | |
| |
| |
| |
Exercises | |
| |
| |
| |
Linear Algebra | |
| |
| |
| |
Dimensionality Reduction | |
| |
| |
| |
Probability and Statistics | |
| |
| |
| |
Regression | |
| |
| |
| |
Optimization | |
| |
| |
Author Index | |
| |
| |
Subject Index | |