Kane's PhD Journey

I am PhD student in the Applied Knowledge Representation and Natural Language Understanding (AKRANLU) Lab @ Purdue University.

Text preprocessing for English

April 04, 2018 | 7 minute read

Text preprocessing: Tokenization, Case-folding, Stopwords filtering, Stemming, and Lemmatization

April 02, 2018 | 6 minute read

In this series of posts, I will discuss about some widely used text features such as Word N-grams, Character N-grams, TFIDF, and word embedding.

April 01, 2018 | less than 1 minute read

Happy Easter!