Anomaly Vs Supervised
Anomaly Detection vs. Supervised Learning
Section titled “Anomaly Detection vs. Supervised Learning”When to Choose Each Approach
Section titled “When to Choose Each Approach”Anomaly Detection Appropriate When:
Section titled “Anomaly Detection Appropriate When:”Data characteristics:
- Very small number of positive examples (0-20 is common)
- Large number of negative examples
- Positive examples only used for cross-validation and test sets
- Parameters learned only from negative examples
Problem characteristics:
- Many different types of anomalies possible
- Future anomalies may be unlike anything seen before
- Hard to learn from small set of positive examples
- New ways for things to go wrong may emerge
Supervised Learning Appropriate When:
Section titled “Supervised Learning Appropriate When:”Data characteristics:
- Larger number of positive and negative examples
- Even 20 positive examples might be sufficient for supervised approach
Problem characteristics:
- Future positive examples likely similar to training set positives
- Enough positive examples to learn patterns
- Anomaly types are relatively consistent over time
Key Philosophical Difference
Section titled “Key Philosophical Difference”Anomaly Detection Approach
Section titled “Anomaly Detection Approach”- Focus: Model what normal examples look like
- Detection method: Flag anything that deviates significantly from normal
- Strength: Can detect brand new, never-before-seen anomalies
- Use case: When anomalies are diverse and unpredictable
Supervised Learning Approach
Section titled “Supervised Learning Approach”- Focus: Learn patterns from both positive and negative examples
- Detection method: Recognize patterns similar to known positives
- Strength: Good at detecting similar types of known problems
- Use case: When positive examples are representative of future cases
Application Examples
Section titled “Application Examples”Fraud Detection
Section titled “Fraud Detection”Anomaly Detection:
- Scenario: Financial fraud with constantly evolving methods
- Characteristics: New fraud types emerge every few months/years
- Approach: Detect anything different from normal transactions
- Advantage: Can catch completely novel fraud schemes
Supervised Learning:
- Scenario: Email spam detection
- Characteristics: Spam patterns relatively consistent over time
- Approach: Learn to recognize known spam patterns
- Advantage: Good at catching spam similar to previous examples
Manufacturing
Section titled “Manufacturing”Anomaly Detection:
- Scenario: Detecting new, unseen defects
- Example: Brand new ways for aircraft engines to fail
- Approach: Flag anything that deviates from normal manufacturing
- Advantage: Catches unknown defect types without prior examples
Supervised Learning:
- Scenario: Detecting known, recurring defects
- Example: Smartphone screen scratches
- Approach: Train on examples of scratched vs unscratched phones
- Advantage: Reliable detection of well-understood defect patterns
Security Applications
Section titled “Security Applications”Anomaly Detection:
- Scenario: Computer system monitoring for hacking
- Characteristics: Hackers constantly develop new attack methods
- Approach: Detect unusual system behavior patterns
- Common use: Many security applications use anomaly detection
Other Applications
Section titled “Other Applications”Supervised Learning Examples:
- Weather prediction: Limited, recurring weather patterns
- Medical diagnosis: Diagnosing specific, known diseases from symptoms
- Pattern recognition: When output labels repeat frequently
Decision Framework
Section titled “Decision Framework”Choose Anomaly Detection When:
Section titled “Choose Anomaly Detection When:”- You expect brand new types of positive examples
- Positive examples may look nothing like previous ones
- You want to catch novel, unprecedented cases
- The “ways things can go wrong” are diverse and evolving
Choose Supervised Learning When:
Section titled “Choose Supervised Learning When:”- Future examples likely similar to training examples
- You have enough positive examples to learn patterns
- The problem domain has consistent patterns over time
- You want to recognize specific known types of issues
Understanding this fundamental difference helps you choose the right approach based on your specific problem characteristics and data availability**.**