290 likes | 429 Views
Generalization in Supervised Machine Learning. BLiNQ MEDIA Praneeth Vepakomma Senior Data Scientist. Hypothetical Knapsack of Coins:. Copper and Gold Coins Total number of coins is fixed and is a large sample. Capture-Recapture What is the proportion of Gold coins?.
E N D
Generalization in Supervised Machine Learning BLiNQ MEDIA Praneeth Vepakomma Senior Data Scientist
Hypothetical Knapsack of Coins: • Copper and Gold Coins • Total number of coins is fixed and is a large sample. • Capture-Recapture • What is the proportion of Gold coins? • Copper and Gold Coins • Total number of coins is variable and is a large sample. • Capture-Recapture • What is the proportion of Gold coins?
190 Years after Gauss, the core problem of prediction remains an active problem : Then: Now:
190 Years after Gauss, the core problem of prediction remains an active problem : Find a mapping♯ from the features: is a list of parameters, required to represent the function #Approximation
What is Supervised Learning? Loss Function Existing Features Known Labels Assumptions Loss Function Unavailable Features UnknownLabels
Evaluating the Learned Function: • Loss Function quantifies the error in the approximation. • Learn a mapping by optimizing the loss. Example:
Generalization and Predictability Empirical Risk Minimization: • Empirical Risk is the average (expected) loss on seen data. • True Risk is the expected risk on the process generating the X,Y pairs. True Risk Minimization:
PARAMETRIC CHARACTERIZATION OF THE MAPPING : • 2d-Linear function: Slope, Intercept • Cubic Spline: Number of knots, Location of Knots • Nearest-Neighbor regression: Number of neighbors • Lasso: L1-L2 Weights • Support Vector Machines: Kernel width, Margin Length • Random Forests: Resampling sample size
Long list of available Supervised Learning Techniques. • Most of the techniques have tuning parameters. • We can minimize out-of-sample performance by tuning the technique with optimal parameters. • Tuning can be performed by cross-validation over a discrete grid of parameter combinations.
CURSE OF DIMENSIONALITY- Flat World-10D World:
CURSE OF DIMENSIONALITY- Flat World-10D World:
CURSE OF DIMENSIONALITY- Flat World-10D World:
CURSE OF DIMENSIONALITY- Let us validate:
Brief Description Technology Overview Hiring (What we’re looking for) http://blinqmedia.com/contact/job-openings/