The Inverse Document Frequency (IDF) is calculated as:
IDF=log(dfN)
Where:
- N = total number of documents (sentences) = 2,718
- df = number of documents containing the term = 1,618
Calculation:
IDF=log(16182718)=log(1.68)
Using natural logarithm:
ln(1.68)≈0.52
Therefore, the IDF is closest to 0.52.