MATLAB: Measuring term frequency of words

I have been able to obtain a bag of words from a document. Please, how can I interact with the bag of words array, so I may make calculations on the frequency of terms within each document?

str = extractFileText('file.txt');
paras = split(str,"</P>");
paras(end) = [];                % the split left an empty last entry
paras = extractAfter(paras,">") % Drop the "<P ID=n>" from the beginning
tdoc = tokenizedDocument(lower(paras));
bag = bagOfWords(tdoc)

I have this result:

For clarification, I believe the columns are the terms, while the rows are the documents. Am I right?

I loaded 2 txt files (1 document set, 1 query set) I want to evaluate similarity between each document and each query by Cosine similarity, tf-idf or whatsoever means.

MATLAB: Measuring term frequency of words

Best Answer

Related Question

Best Answer

Related Solutions

MATLAB: How to label data of probability

MATLAB: KeyScheduleCore (word) { Rotate(word); SBoxSubstitution (word); word[0] = word[0] XOR RCON[i]; } please give me the corresponding code in matlab…. please …

Related Question