Data: Collection of data objects and their attributes

Types of attributes

  • Nominal: distinctness
  • Ordinal: distinctness & order
  • Interval: distinctness, order & addition
  • Ratio: distinctness, order, addition & multiplication
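
To make the four attribute types concrete, the sketch below pairs each one with a summary that its permitted operations allow. The sample values and variable names are hypothetical and chosen only for illustration; this is one way to read the list above, not a definitive taxonomy.

```python
from statistics import mean, mode

# Hypothetical sample values, one list per attribute type.
eye_color = ["brown", "blue", "brown", "green"]            # nominal
grades = ["low", "medium", "high", "medium", "high"]       # ordinal
temps_celsius = [10.0, 20.0, 30.0]                         # interval
weights_kg = [2.0, 4.0, 8.0]                               # ratio

# Nominal: only equality tests are meaningful, so the mode is a valid summary.
most_common_color = mode(eye_color)

# Ordinal: values can be ranked once an explicit order is given,
# so the median (middle element after sorting) is meaningful.
rank = {"low": 0, "medium": 1, "high": 2}
sorted_grades = sorted(grades, key=rank.get)
median_grade = sorted_grades[len(sorted_grades) // 2]

# Interval: differences are meaningful, so the mean makes sense,
# but ratios do not (20 °C is not "twice as hot" as 10 °C).
temp_diff = temps_celsius[1] - temps_celsius[0]
mean_temp = mean(temps_celsius)

# Ratio: a true zero exists, so ratios are meaningful as well.
weight_ratio = weights_kg[1] / weights_kg[0]
```

Each type keeps the operations of the types before it, which is why the mean of the weights would be as valid as the mean of the temperatures.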

Types of data sets

  • Record: data matrix, documents, transactions
  • Graph: web, chemical structures
  • Ordered: spatial/temporal data, sequential data

Data quality problems

  • noise
  • outliers
  • missing values
  • duplicate data
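
Three of these problems can be checked with simple rules, as in the minimal sketch below. The records, the field layout, and the outlier cutoff of 5 are all assumptions made for illustration. Noise is left out because it usually cannot be detected by a rule of this kind.

```python
from statistics import median

# Hypothetical records: (id, age). None marks a missing value.
records = [(1, 34), (2, 29), (3, None), (4, 31), (2, 29), (5, 240)]

# Missing values: rows where any field is None.
missing = [r for r in records if any(v is None for v in r)]

# Duplicate data: rows that repeat an earlier row exactly.
seen = set()
duplicates = []
for r in records:
    if r in seen:
        duplicates.append(r)
    seen.add(r)

# Outliers: values far from the median, measured in units of the
# median absolute deviation (MAD). The cutoff of 5 is an arbitrary assumption.
ages = [age for _, age in records if age is not None]
med = median(ages)
mad = median([abs(a - med) for a in ages])
outliers = [a for a in ages if mad > 0 and abs(a - med) / mad > 5]
```

The median-based rule is used here instead of a mean and standard deviation because a single extreme value distorts the median far less.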

Definition

Classification is the task of learning a target function f that maps each attribute set x to one of the predefined class labels y.
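
As a rough illustration of this definition, the sketch below "learns" f from a handful of labeled examples using a 1-nearest-neighbor rule. The nearest-neighbor rule is only one possible choice of learner, and the attribute names, training values, and the helper learn are hypothetical.

```python
import math

# Hypothetical training set: attribute set x = (size, roundness), class label y.
training = [
    ((1.0, 0.9), "benign"),
    ((1.2, 0.8), "benign"),
    ((3.5, 0.3), "malignant"),
    ((4.0, 0.2), "malignant"),
]

def learn(examples):
    """Return a target function f that maps x to the label of its nearest example."""
    def f(x):
        # Pick the training example with the smallest Euclidean distance to x.
        _, label = min(examples, key=lambda ex: math.dist(ex[0], x))
        return label
    return f

f = learn(training)
```

Calling f((1.1, 0.85)) returns "benign" and f((3.8, 0.25)) returns "malignant", so every attribute set is mapped to exactly one of the two predefined labels.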

Classification Tasks

  • Predicting tumor cells as benign or malignant
  • Classifying credit card transactions as legitimate or fraudulent
  • Categorizing news stories as finance, weather, entertainment, sports, etc.
