Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
two data set
hi everyone,
i am trying to use two data set (training and testing) for applying a model. as my data is consisting of both text and structured attributes i divided it into two part (text and structured) and i stored both data separately. but when i am applying the model ( i need to use 3 model multiple regression, GBT and Neural Net) and i want to test the model with anothet data set which is test.data, i don't know how i can apply all processes to test data and check the model.
so, i used sub process operator and put all process are used for training data set and just sync it to apply model.
but i'm note sure i'm doing the write thing or not.
however i have to use both data set and i cannot use split operator instead.
can anyone help me with that?
i am trying to use two data set (training and testing) for applying a model. as my data is consisting of both text and structured attributes i divided it into two part (text and structured) and i stored both data separately. but when i am applying the model ( i need to use 3 model multiple regression, GBT and Neural Net) and i want to test the model with anothet data set which is test.data, i don't know how i can apply all processes to test data and check the model.
so, i used sub process operator and put all process are used for training data set and just sync it to apply model.
but i'm note sure i'm doing the write thing or not.
however i have to use both data set and i cannot use split operator instead.
can anyone help me with that?
0
Best Answer
-
Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 UnicornI don't know how large your dataset is, but I would generally recommend using Cross Validation. In your process you are training your model on a single sample.7
Answers
Can you share XML code? To access the code, you need to go to View --> Show Panel --> XML and copy that and paste it here.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
thanks
Are you encountering any error or are you just asking us if this is the right way to do?
Your process looks fine based on my assumption that you already processed train data similar to test data earlier.
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing