Input/Output Word List Question

mdc Member Posts: 58 Maven
edited November 2019 in Help
1. What is the meaning of the last two numbers after each attribute? Are they significant? I tried loading my own keywords (just a plain list) from other sources, and it did not seem to work.

@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
including,9,9
plurality_memory,6,6
floating_gate,10,10
transistor_floating,5,5
memory_string,5,5
string,7,7

2. What exactly are the input and output word lists used for?

3. Is RM 4.4 out yet?

thanks,
Matthew

Answers

  • IngoRM Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hello,

    ad 1) the numbers in the word list state
    • the number of documents in which the word occurred (the first number)
    • the number of documents in which the word occurred for each available class (second and following numbers, here you seem to have only one class)
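
    As an illustration of the layout described above, here is a minimal Python sketch that reads a word list in the format shown in the question. The function name `parse_word_list` and the returned dictionary shape are hypothetical and not part of RapidMiner; the sketch also assumes that words never contain commas.

```python
# Hypothetical helper, not a RapidMiner API: parses the word list text
# shown in the question into a header dict and a per-word dict.
SAMPLE = """@number_of_documents 29
@number_of_classes 1
erasable,5,5
array,10,10
including,9,9
plurality_memory,6,6
floating_gate,10,10
transistor_floating,5,5
memory_string,5,5
string,7,7
"""


def parse_word_list(text):
    header = {}
    words = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("@"):
            # Header lines look like "@number_of_documents 29"
            key, value = line[1:].split(None, 1)
            header[key] = int(value)
        else:
            # Word lines look like "word,total_docs,docs_class_1,..."
            parts = line.split(",")
            counts = [int(p) for p in parts[1:]]
            words[parts[0]] = {"total": counts[0], "per_class": counts[1:]}
    return header, words


header, words = parse_word_list(SAMPLE)
print(header["number_of_documents"], words["array"])
```

    A bare list of keywords has neither the `@` header lines nor the document counts, so a reader written along these lines would fail on it, which may be why the plain list did not load.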
    ad 2) the input and output word lists ensure that the TFIDF calculations are comparable between the training and application phases, which is important for document classification
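
    To make the point about comparability concrete, the following Python sketch computes TF-IDF for a new document using document frequencies stored at training time. It assumes a common variant, tf * log(N / df); the exact formula RapidMiner uses is not stated in this thread. The function name `tfidf_vector` and the example tokens are hypothetical.

```python
import math
from collections import Counter

# Document frequencies and document count taken from a stored word list
# (values copied from the example in the question).
DOC_FREQ = {"array": 10, "string": 7, "erasable": 5}
N_DOCS = 29


def tfidf_vector(tokens, doc_freq, n_docs):
    """Weight a new document with the training-time document frequencies.

    Assumed variant: relative term frequency times log(n_docs / df).
    Words that are not in the word list are ignored, so the vector always
    has the same attributes as the training data.
    """
    counts = Counter(t for t in tokens if t in doc_freq)
    total = sum(counts.values())
    vector = {}
    for word, df in doc_freq.items():
        tf = counts[word] / total if total else 0.0
        vector[word] = tf * math.log(n_docs / df)
    return vector


# Hypothetical new document seen at application time.
new_doc = ["array", "string", "string", "unknown"]
vec = tfidf_vector(new_doc, DOC_FREQ, N_DOCS)
print(vec)
```

    Because the document frequencies come from the saved word list rather than from the new documents, the weights are on the same scale as those the classifier was trained on.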

    ad 3) RM 4.4 will be released tomorrow during the day

    Cheers,
    Ingo