Pablo Rodriguez

Iterative ML Development

Developing a machine learning model is an iterative process that rarely works perfectly on the first attempt. The systematic approach involves multiple cycles of improvement guided by diagnostics and evaluation.

  1. Choose Overall Architecture

    • Select machine learning model type
    • Decide what data to use
    • Pick hyperparameters
    • Define system components
  2. Implement and Train Model

    • Build the initial implementation
    • Train using chosen architecture
    • Expect suboptimal initial performance
  3. Run Diagnostics

    • Analyze bias and variance
    • Perform error analysis
    • Evaluate model performance
  4. Make Informed Decisions

    • Increase neural network size
    • Adjust regularization parameter (λ)
    • Add or remove data/features
    • Modify architecture based on insights
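
The four steps above can be sketched as a control loop. Everything here is a toy illustration: `evaluate` is a hypothetical placeholder for a real train-and-diagnose pipeline, and the threshold logic is a drastic simplification of bias/variance analysis.

```python
# Toy sketch of the four-step development loop described above.
# `evaluate` is a stand-in for a real train-and-diagnose pipeline;
# here it is faked so the control flow of the loop is visible end to end.

def development_loop(evaluate, max_rounds=10, target_cv_error=0.10):
    config = {"hidden_units": 4, "reg_lambda": 1.0}  # step 1: choose architecture
    history = []
    for _ in range(max_rounds):
        train_err, cv_err = evaluate(config)          # steps 2-3: train and diagnose
        history.append((dict(config), train_err, cv_err))
        if cv_err <= target_cv_error:                 # good enough: stop iterating
            break
        if train_err > target_cv_error:               # high bias: underfitting
            config["hidden_units"] *= 2               # try a bigger network
        else:                                         # high variance: overfitting
            config["reg_lambda"] *= 2                 # try more regularization
    return config, history

# Fake diagnostics: errors shrink as the network grows (illustration only).
fake_eval = lambda cfg: (1.0 / cfg["hidden_units"], 1.5 / cfg["hidden_units"])
final_config, history = development_loop(fake_eval)
```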

Example: Spam Classifier

Objective: Build a classifier to distinguish spam from legitimate emails

Input Features:

  • Top 10,000 words from English dictionary
  • Feature vector x₁, x₂, …, x₁₀,₀₀₀
  • Binary encoding (1 if word appears, 0 otherwise)

Given email: “Hi Andrew, buy this great deal discount…”

Word Features

  • a: 0 (doesn’t appear)
  • Andrew: 1 (appears)
  • buy: 1 (appears)
  • deal: 1 (appears)
  • discount: 1 (appears)
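
A minimal sketch of this binary encoding, with a five-word toy vocabulary standing in for the top-10,000-word dictionary:

```python
# Sketch of the binary word-feature encoding described above.
# The tiny vocabulary is a stand-in for the full 10,000-word dictionary.

def binary_features(email_text, vocabulary):
    """Return x_j = 1 if vocabulary word j appears in the email, else 0."""
    words = {w.strip(".,!?").lower() for w in email_text.split()}
    return [1 if v in words else 0 for v in vocabulary]

vocab = ["a", "andrew", "buy", "deal", "discount"]
x = binary_features("Hi Andrew, buy this great deal discount", vocab)
# x == [0, 1, 1, 1, 1]
```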

Alternative Approach

Count the frequency of each word's occurrences instead of its binary presence; in practice, the simpler binary encoding works well.
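
The frequency-count variant is a small change from the binary encoding; a sketch with a toy vocabulary:

```python
from collections import Counter

# Frequency-count variant of the word features: x_j counts occurrences
# rather than recording mere presence.

def count_features(email_text, vocabulary):
    """Return x_j = number of times vocabulary word j occurs in the email."""
    counts = Counter(w.strip(".,!?").lower() for w in email_text.split())
    return [counts[v] for v in vocabulary]

x = count_features("Buy now, buy this deal", ["a", "buy", "deal", "discount"])
# x == [0, 2, 1, 0]
```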

When initial model performance is insufficient, multiple approaches seem tempting:

  • Honeypot projects: Create fake email addresses to attract spam and gather more training data
  • Email routing features: Analyze the path an email took through servers (from its headers) for spam indicators
  • Enhanced body text features: Better handling of misspellings and word variants
  • Unified word treatment: Treat “discounting” and “discount” as the same word
  • Misspelling detection: Identify deliberate misspellings like “w4tches”, “med1cine”, “m0rtgage”

Diagnostics indicate which of these is worth the effort:

  • High bias algorithm: Spending months on honeypot data collection is unlikely to help
  • High variance algorithm: Collecting more data could provide significant improvement
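
The word-variant and misspelling ideas can be illustrated with a crude normalizer. This is only a toy: the digit map and suffix list are invented for illustration, and a real system would use a proper stemmer.

```python
# Toy normalizer illustrating two ideas from the list above: mapping common
# digit-for-letter substitutions back ("w4tches" -> "watches") and stripping
# a few suffixes so "discounting" and "discount" collapse to one feature.
# The digit map and suffix list are illustrative assumptions, not standard.

LEET = str.maketrans("014", "oia")  # 0 -> o, 1 -> i, 4 -> a

def normalize(word):
    word = word.lower().translate(LEET)
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 3:
            return word[: -len(suffix)]
    return word

normalize("discounting") == normalize("discount")  # both map to "discount"
```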

Rather than randomly trying improvements:

  1. Run diagnostics to understand current limitations
  2. Analyze results to identify most promising directions
  3. Choose techniques based on evidence rather than intuition
  4. Iterate systematically through the development loop
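
As a sketch of how step 3 feeds step 4, a diagnostic can be reduced to comparing training and cross-validation error against a baseline. The 0.05 gap threshold is an illustrative assumption, not a fixed rule:

```python
# Minimal sketch of turning bias/variance diagnostics into a decision.
# The gap threshold is an illustrative assumption, not a fixed rule.

def suggest_next_step(train_error, cv_error, baseline_error=0.0, gap=0.05):
    if train_error - baseline_error > gap:   # high bias: underfits even the training set
        return "high bias: bigger network, more features, or smaller lambda"
    if cv_error - train_error > gap:         # high variance: fails to generalize
        return "high variance: more data, fewer features, or larger lambda"
    return "acceptable: stop, or refine via error analysis"

suggest_next_step(train_error=0.02, cv_error=0.15)
# -> the high-variance suggestion: collecting more data is likely to help
```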

The key insight is that proper diagnostics (bias/variance analysis, error analysis) provide crucial guidance for architectural choices, preventing wasted effort on low-impact improvements.

Multiple iterations through this loop, guided by systematic evaluation, lead to models that achieve desired performance levels.