5. Machine learning explores the study and construction of algorithms
that can learn from and make predictions on data
A Definition
Face recognition
Link prediction
document classification
6. Raw data Raw data comes from many sources
apache log
email
image
7. Raw data
Feature Extraction
Initial observations can be in arbitrary format
We extract features to represent observations
We can incorporate domain knowledge
We typically want numeric features
Success of entire pipeline often depends on choosing
good descriptions of observations!!
8. Raw data
Feature Extraction
Supervised Learning
Train a supervised model using labeled data,
e.g., Classification or Regression model
weather
sunny
rainy
cloudy
vehicle count
1000
99
5
13. Example: Predicting shoe size from height, gender, and weight
For each observation we have a feature vector, x, and label, y
We assume a linear mapping between features and label:
14. Goal: find the line of best fit
x coordinate: features
y coordinate: labels
1 D Example
15. Linear Least Squares Regression
Assume we have n training points, where x(i) denotes the ith point
Recall two earlier points:
• Linear assumption : y = wTx
• squared loss : ( y - y )2
25. Gradient Descent
Start at a random point
1. Determine a descent direction
2. Choose a step size
3. Update
26. Gradient Descent
Start at a random point
1. Determine a descent direction
2. Choose a step size
3. Update
Repeat
Until stopping criterion is satisfied