Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Input/Output Word List Question
1. What is the meaning of the last 2 numbers after each attribute? Are they significant? I tried loading my own keywords (just a list format) from other sources and did not seem to work.
@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
including,9,9
plurality_memory,6,6
floating_gate,10,10
transistor_floating,5,5
memory_string,5,5
string,7,7
2. What exactly are the uses of the input and output word list?
3. Is RM 4.4 out yet?
thanks,
Matthew
@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
including,9,9
plurality_memory,6,6
floating_gate,10,10
transistor_floating,5,5
memory_string,5,5
string,7,7
2. What exactly are the uses of the input and output word list?
3. Is RM 4.4 out yet?
thanks,
Matthew
Tagged:
0
Answers
ad 1) the numbers in the word list state
- the number of documents in which the word occurred (the first number)
- the number of documents in which the word occurred for each available class (second and following numbers, here you seem to have only one class)
ad 2) the input and output word lists ensure that the TFIDF calculations are comparable for training and application phase which is important for document classificationad 3) RM 4.4 will be release tomorrow during the day
Cheers,
Ingo