Data Mining - data exploration
Data: Collection of data objects and their attributes
Types of attributes
- Nominal: distinctness
- Ordinal: distinctness & order
- Interval: distinctness, order & addition
- Ratio: distinctness, order, addition & multiplication
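
The hierarchy above can be read as a set of operations that are meaningful for each attribute type, with each level adding to the one before it. The following is a minimal sketch of that reading; the `OPERATIONS` table and the `is_meaningful` helper are hypothetical names introduced here for illustration, not part of any library.

```python
# Hypothetical lookup table: which operations are meaningful per attribute type.
# Each type inherits the operations of the type listed before it.
OPERATIONS = {
    "nominal": {"==", "!="},                                  # distinctness
    "ordinal": {"==", "!=", "<", ">"},                        # + order
    "interval": {"==", "!=", "<", ">", "+", "-"},             # + addition
    "ratio": {"==", "!=", "<", ">", "+", "-", "*", "/"},      # + multiplication
}


def is_meaningful(attribute_type, operation):
    """Return True if the operation is meaningful for the attribute type."""
    return operation in OPERATIONS[attribute_type]
```

For example, `is_meaningful("nominal", "<")` is false because a nominal attribute such as eye color has no order, while `is_meaningful("ratio", "/")` is true because a ratio attribute such as length supports statements like "twice as long".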
Types of data sets
- Record: data matrix, documents, transactions
- Graph: web, chemical structures
- Ordered: spatial/temporal data, sequential data
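
As a rough illustration of the data set types above, each one can be sketched with plain Python containers. All variable names and values below are made up for the example; real data sets would usually live in dedicated structures.

```python
# Record data, data matrix: rows are data objects, columns are numeric attributes.
data_matrix = [
    [5.1, 3.5],
    [4.9, 3.0],
]

# Record data, transactions: each record is a set of items.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer"},
]

# Graph data: pages as nodes, links as edges (adjacency list).
web_graph = {
    "a.html": ["b.html", "c.html"],
    "b.html": ["c.html"],
    "c.html": [],
}

# Ordered data: a time series of (time step, value) pairs, sorted by time.
temperature_series = [(0, 20.5), (1, 21.0), (2, 20.8)]
```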
Data quality problems
- noise
- outliers
- missing values
- duplicate data
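
Three of the problems above can be checked with simple code. The sketch below assumes records are dictionaries with hashable values, that missing values are stored as `None`, and that outliers are flagged with the common 1.5 x IQR rule (one heuristic among many). The function names are hypothetical. Noise is left out because detecting it usually requires domain knowledge.

```python
from statistics import quantiles


def find_missing(records):
    """Return indices of records that contain at least one None value."""
    return [i for i, rec in enumerate(records) if None in rec.values()]


def find_duplicates(records):
    """Return indices of records that exactly repeat an earlier record."""
    seen = set()
    duplicates = []
    for i, rec in enumerate(records):
        key = tuple(sorted(rec.items(), key=lambda kv: kv[0]))
        if key in seen:
            duplicates.append(i)
        else:
            seen.add(key)
    return duplicates


def find_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]


records = [
    {"id": 1, "age": 25},
    {"id": 2, "age": None},  # missing value
    {"id": 1, "age": 25},    # duplicate of the first record
]
ages = [10, 12, 11, 13, 12, 11, 100]  # 100 is far from the rest

missing = find_missing(records)
duplicates = find_duplicates(records)
outliers = find_outliers(ages)
```

Whether a flagged value is removed, corrected, or kept depends on the task: an outlier may be an error in one analysis and the object of interest in another, such as fraud detection.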