Thinking about software, life, the universe and everything.

Menu

Skip to content

Home
About

Post navigation

Business problems →

Video March 30, 2014 Uncategorized Leave a comment

Computing Document Similarity with nltk

by hkelkar

We will explore techniques to determine the amount of similarity between documents. Specifically we will look at the intuition behind tf-idf and cosine similarity. With that as a foundation we will see how to compute these metrics with the natural language tool kit.

Share this:

X
Facebook
LinkedIn
Reddit
Pinterest
Email
Tumblr

Like Loading...

Related

Post navigation

Business problems →

Leave a comment Cancel reply

Δ

RSS - Posts
RSS - Comments

Archives

January 2026
March 2025
September 2024
February 2024
May 2023
February 2023
December 2020
January 2018
May 2017
December 2015
October 2014
June 2014
March 2014
December 2013
October 2013
June 2013
April 2013
January 2013
May 2012
March 2012
August 2011
April 2011
February 2011

Blog at WordPress.com.

Comment
Reblog
Subscribe Subscribed
- Thinking about software, life, the universe and everything.
- Already have a WordPress.com account? Log in now.

%d