ML Algorithms and Approaches
Supervised
- General Approach to these algorithms
- How they work
- Focus on when and how to apply
Linear Models
- Many Flavors
- Can be used for both Classification and Regression
- Relatively low complexity model
- Can be made more complex by adding parameters
- … and nonlinear parameters (Splining, etc.)
Nearest Neighbor
- Welcome to the neighborhood!
- Both Classification and Regression
- Based on idea that similar points will cluster together in multi-dimensional space -> neighbors can predict outcomes.
- Important hyperparameter: number of neighbors
- Important hyperparameter: weight/consensus function
Trees
- Classification and Regression
- Can handle staggering amounts of complexity
- Really good with non-linear dynamics
- Some of the highest performing algorithms for tabular data
- Many, many flavors
- Bagging and Boosting
Considerations with Tree Based Models
- Easy to overfit
- Extremely good at handling non-linear complexity
- Top performing non-deep learning algorithms
- Almost always necessitate hyperparameter tuning if not optimization
- Can be explainable, but not intuitive to stakeholders