Google News
Process: Analyzes hundreds of thousands of daily news articles Result: Groups related stories together automatically Example: Articles about “giant panda gives birth to twin cubs” are clustered with other panda-related stories
Unsupervised learning works with data that isn’t associated with any output labels y. Instead of being given “right answers,” the algorithm must find structure, patterns, or interesting insights in the data on its own.
The most common type of unsupervised learning is clustering, which groups similar data points together.
Google News
Process: Analyzes hundreds of thousands of daily news articles Result: Groups related stories together automatically Example: Articles about “giant panda gives birth to twin cubs” are clustered with other panda-related stories
DNA Analysis
Process: Analyzes genetic micro-array data Result: Groups individuals into different genetic types Data: Each column represents one person’s DNA, each row represents a gene
Market Segmentation
Process: Analyzes customer databases Result: Groups customers into different market segments Benefit: More efficient customer service strategies
The algorithm automatically discovers that articles mentioning similar words should be grouped together:
Beyond clustering, this specialization covers:
DeepLearning.AI’s analysis of their community revealed distinct learner groups:
Unsupervised learning algorithms:
Understanding unsupervised learning helps recognize when data contains hidden structures that can be automatically discovered, even without knowing what to look for in advance.
Unsupervised vs Supervised Learning
Supervised Learning: Data comes with both inputs x and output labels y
Unsupervised Learning: Data comes only with inputs x, no output labels y
This specialization covers three types of unsupervised learning:
Here are examples to help distinguish between supervised and unsupervised learning:
Type: Supervised Learning Reason: Has labeled data (spam vs. non-spam emails) Approach: Train model with known examples
Type: Unsupervised Learning Reason: No predefined categories for articles Approach: Algorithm discovers groupings automatically (like Google News example)
Type: Unsupervised Learning Reason: Algorithm discovers customer segments without predefined categories Approach: Find natural groupings in customer data
Type: Supervised Learning Reason: Has labeled data (diabetes vs. no diabetes) Approach: Similar to breast cancer classification problem
Supervised Learning Characteristics:
Unsupervised Learning Characteristics:
Understanding these types helps determine the appropriate approach for different data analysis problems:
The choice between supervised and unsupervised learning depends on whether you have labeled training data and what insights you’re trying to discover from your dataset.