A hypothesis class is Probably Approximately Correct (PAC) learnable with respect to a set and a loss function , if there exists a function and a learning algorithm with the following property: For every and for every distribution over , running the learning algorithm on i.i.d. examples generated by returns a hypothesis such that, with probability of at least (over the choice of the training examples),
where . The function determines the sample complexity of learning .
Under a realizability assumption (), we can guarantee an absolute level of error instead of just a relative one w.r.t. some minimizer within the hypothesis class. For example, under the realizability assumption, every finite hypothesis class for a binary classification task is PAC learnable ( with probability ) with sample complexity: