TF-IDF

Here are some notes of TF-IDF as a measure for identifying keywords in a document. To efficiently indexing documents for future search, it is important to identify the keywords in a document. For a word $latex w$ to be a good keyword, first, it should be relatively rare. This can be measured by the inverse document frequency…