Neighbors-based methods are known as non-generalizing machine learning methods, since they simply “remember” all of its training data
The accuracy of KNN can be severely degraded with high-dimension data because there is little difference between the nearest and farthest neighbor.
Gaussian Naive Bayes
Advantages
Need less training data tran models like logistic regression
Highly scalable
Not sensitive to irrelevant features
Returns the degree of certanty of the answer
Good when you need something fast and that perfoms well
Disavantages
Can’t learn interactions between features e.g., it can’t learn that although you love movies with Brad Pitt and Tom Cruise, you hate movies where they’re together).
SVM
Advantages
High accuracy
Nice theoretical guarantees regarding overfitting
Especially popular in text classification problems