talk-data.com talk-data.com

Listen 2015-07-24 at 05:51

[MINI] k-Nearest Neighbors

Description

This episode explores the k-nearest neighbors algorithm which is an unsupervised, non-parametric method that can be used for both classification and regression. The basica concept is that it leverages some distance function on your dataset to find the $k$ closests other observations of the dataset and averaging them to impute an unknown value or unlabelled datapoint.