"creating a Word List"
hi,
I would like to create a word list for a list of documents.
I read fromthe word vector tool tutorial that the following chain of operators can help : TextInput, CorpusBasedWeighting and InteractiveAttributeWeighting.
I tried it but at lost at what to fill in for parameter class_to_characterize for CorpusBasedWeighting. I have no class label specified as there is no class involved. I have document name as ID and the content as attribute for the input.
Appreciate any advice and guidance.
I would like to create a word list for a list of documents.
I read fromthe word vector tool tutorial that the following chain of operators can help : TextInput, CorpusBasedWeighting and InteractiveAttributeWeighting.
I tried it but at lost at what to fill in for parameter class_to_characterize for CorpusBasedWeighting. I have no class label specified as there is no class involved. I have document name as ID and the content as attribute for the input.
Appreciate any advice and guidance.
Tagged:
0
Answers
you cannot use the weighting if you don't have labels. That's because the weighting expresses the importance of words for distinguishing documents of the different labels. If you don't have labels, there's nothing to distinguish and hence no weighting...
What do you need the word list for? Perhaps you can simply generate the standard word list automatically by processing the documents using something like that: Greetings,
Sebastian