Be aware the denominator is just the whole amount of terms in document d (counting Every single event of a similar time period separately). You will discover a variety of other approaches to determine term frequency:[5]: 128 An idf is continuous per corpus, and accounts for the ratio of documents that come with the phrase "this". With this case