NLP Portfolio

A portfolio containing my implementations of various statistical NLP models.

K-Nearest Neighbors Classifier (2020)

This code was created to classify a subset of the 20 Newsgroups text dataset, as provided by scikit-learn. Posts drawn from the ‘talk.politics.guns’, ‘talk.politics.mideast’, and ‘talk.politics.misc’ newsgroups were converted to a bag-of-words representation and classified using a ‘from scratch’ kNN implementation.
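
The general idea can be illustrated with a minimal sketch. This is not the portfolio code itself: the function names (bag_of_words, cosine_similarity, knn_predict), the whitespace tokenization, the choice of cosine similarity as the nearness measure, and the toy documents are all assumptions made for illustration. The sketch uses only the Python standard library and does not load the actual 20 Newsgroups data.

```python
from collections import Counter
import math


def bag_of_words(text):
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def knn_predict(train_docs, train_labels, query, k=3):
    """Return the majority label among the k most similar training docs."""
    query_vec = bag_of_words(query)
    scored = [
        (cosine_similarity(bag_of_words(doc), query_vec), label)
        for doc, label in zip(train_docs, train_labels)
    ]
    # Highest similarity first; the sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    top_labels = [label for _, label in scored[:k]]
    return Counter(top_labels).most_common(1)[0][0]


# Toy stand-ins for newsgroup posts (invented, not taken from the dataset).
train_docs = [
    "gun control laws and firearm permits",
    "firearm owners and gun rights",
    "israel and palestine peace talks",
    "peace negotiations in the middle east",
    "tax policy and health care reform",
    "congress debates health care",
]
train_labels = [
    "talk.politics.guns",
    "talk.politics.guns",
    "talk.politics.mideast",
    "talk.politics.mideast",
    "talk.politics.misc",
    "talk.politics.misc",
]

prediction = knn_predict(train_docs, train_labels, "new gun permits for firearm owners")
print(prediction)
```

In this sketch each training document is re-vectorized on every query, which keeps the code short but would be slow on the full dataset; precomputing the vectors once would be the natural next step.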