Skip to content
Pablo Rodriguez

Anomaly Vs Supervised

Data characteristics:

  • Very small number of positive examples (0-20 is common)
  • Large number of negative examples
  • Positive examples only used for cross-validation and test sets
  • Parameters learned only from negative examples

Problem characteristics:

  • Many different types of anomalies possible
  • Future anomalies may be unlike anything seen before
  • Hard to learn from small set of positive examples
  • New ways for things to go wrong may emerge

Data characteristics:

  • Larger number of positive and negative examples
  • Even 20 positive examples might be sufficient for supervised approach

Problem characteristics:

  • Future positive examples likely similar to training set positives
  • Enough positive examples to learn patterns
  • Anomaly types are relatively consistent over time
  • Focus: Model what normal examples look like
  • Detection method: Flag anything that deviates significantly from normal
  • Strength: Can detect brand new, never-before-seen anomalies
  • Use case: When anomalies are diverse and unpredictable
  • Focus: Learn patterns from both positive and negative examples
  • Detection method: Recognize patterns similar to known positives
  • Strength: Good at detecting similar types of known problems
  • Use case: When positive examples are representative of future cases

Anomaly Detection:

  • Scenario: Financial fraud with constantly evolving methods
  • Characteristics: New fraud types emerge every few months/years
  • Approach: Detect anything different from normal transactions
  • Advantage: Can catch completely novel fraud schemes

Supervised Learning:

  • Scenario: Email spam detection
  • Characteristics: Spam patterns relatively consistent over time
  • Approach: Learn to recognize known spam patterns
  • Advantage: Good at catching spam similar to previous examples

Anomaly Detection:

  • Scenario: Detecting new, unseen defects
  • Example: Brand new ways for aircraft engines to fail
  • Approach: Flag anything that deviates from normal manufacturing
  • Advantage: Catches unknown defect types without prior examples

Supervised Learning:

  • Scenario: Detecting known, recurring defects
  • Example: Smartphone screen scratches
  • Approach: Train on examples of scratched vs unscratched phones
  • Advantage: Reliable detection of well-understood defect patterns

Anomaly Detection:

  • Scenario: Computer system monitoring for hacking
  • Characteristics: Hackers constantly develop new attack methods
  • Approach: Detect unusual system behavior patterns
  • Common use: Many security applications use anomaly detection

Supervised Learning Examples:

  • Weather prediction: Limited, recurring weather patterns
  • Medical diagnosis: Diagnosing specific, known diseases from symptoms
  • Pattern recognition: When output labels repeat frequently
  • You expect brand new types of positive examples
  • Positive examples may look nothing like previous ones
  • You want to catch novel, unprecedented cases
  • The “ways things can go wrong” are diverse and evolving
  • Future examples likely similar to training examples
  • You have enough positive examples to learn patterns
  • The problem domain has consistent patterns over time
  • You want to recognize specific known types of issues

Understanding this fundamental difference helps you choose the right approach based on your specific problem characteristics and data availability**.**