Thinking about software, life, the universe and everything.

Video March 30, 2014 Uncategorized

Computing Document Similarity with nltk

by hkelkar

We will explore techniques for measuring how similar documents are to one another. Specifically, we will look at the intuition behind tf-idf and cosine similarity. With that as a foundation, we will see how to compute these metrics with the Natural Language Toolkit (nltk).
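To make the intuition concrete, here is a minimal sketch of both ideas in plain Python. It is not the code from the video and it deliberately avoids nltk so that the arithmetic stays visible. The helper names (`tokenize`, `tf_idf_vectors`, `cosine_similarity`), the whitespace tokenizer, the sample documents, and the particular weighting (relative term frequency times the natural log of N over document frequency) are all assumptions chosen for illustration; other tf-idf variants are common.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for a real tokenizer.
    return text.lower().split()


def tf_idf_vectors(docs):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term appear?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        # tf = share of the document taken up by the term,
        # idf = log(N / df), so terms found in every document get weight 0.
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an all-zero vector has no direction to compare
    return dot / (norm_a * norm_b)


# Hypothetical sample documents, made up for this sketch.
docs = [
    "the cat sat on the mat",
    "the cat lay on the rug",
    "stock prices fell sharply today",
]
vectors = tf_idf_vectors(docs)
sim_related = cosine_similarity(vectors[0], vectors[1])
sim_unrelated = cosine_similarity(vectors[0], vectors[2])
print(sim_related, sim_unrelated)
```

The first two documents share several terms, so their similarity is positive, while the third shares none with the first and scores zero. With only three documents the idf weights are crude; on a larger collection, very common words would be pushed much closer to zero.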
