Machine learning
Algorithms for converting experience (training data) into expertise/knowledge (prediction function or program). Can be categorized along different axes:
- Supervised learning (privileged information present in training data that's the object of prediction in test data) vs Unsupervised learning (no functional difference between training and test data).
- Active learning vs Passive learning
Formal Model - Statistical Learning Framework
Formally, given a distribution
The generalization error of
The goal of the algorithm is to find the predictor (which depends on the training set
Empirical risk minimization (ERM) is the simple learning paradigm that aims to minimize the training error, but one needs to watch out for overfitting, wherein the model performs excellently on the training data but poorly on the true distribution. A common remedy is to restrict the hypothesis class