Spam Filtering
Input: Email Output: Spam or not spam classification
Supervised learning refers to algorithms that learn x to y or input to output mappings. The key characteristic is providing the learning algorithm with examples that include the right answers - correct labels y for given inputs x.
By seeing correct pairs of input x and desired output label y, the learning algorithm eventually learns to take just the input alone and give reasonably accurate predictions.
Spam Filtering
Input: Email Output: Spam or not spam classification
Speech Recognition
Input: Audio clip Output: Text transcript
Machine Translation
Input: English text Output: Spanish, Arabic, Hindi, Chinese, Japanese, etc.
Online Advertising
Input: Ad information + user information Output: Probability of clicking the ad Most lucrative form of supervised learning today
The training process follows these steps:
Consider predicting housing prices based on house size:
The algorithm must systematically choose the most appropriate function to fit the data, rather than selecting based on desired outcomes.
The key distinction is that regression predicts from infinite possibilities while classification predicts from a finite set of discrete categories.
Supervised learning maps input x to output y, learning from examples with correct answers. The two major types are regression (predicting numbers) and classification (predicting categories), each suited for different types of prediction problems.
Classification addresses problems where the goal is to predict a small number of possible output categories. Consider breast cancer detection as a diagnostic tool:
Classification can extend beyond binary problems:
Instead of just tumor size, consider both:
With two inputs, data visualization shows:
Real breast cancer detection systems use many additional inputs:
Classification Properties
The learning algorithm determines how to fit a boundary through the data to separate different classes. This boundary helps make predictions for new patients based on their age and tumor characteristics.
Classification algorithms predict categories from a small, finite set of possible outputs. Whether dealing with two categories (benign/malignant) or multiple categories, the goal is to learn decision boundaries that can accurately classify new examples based on their input features. The key difference from regression is the discrete, limited nature of possible outputs rather than continuous numerical predictions.