Copy Dataset Properties
My use case involves two files: a training file and a validation file. The training file is used to fit the model, and the validation file has the same columns minus the label. I am doing a decent amount of preprocessing and want to reuse that work.
I am hitting a roadblock because when I use Read CSV on the validation set, the guessed data type for a given column differs between the two files (train = polynominal, validation = integer). Even though I can carry the preprocessing steps forward via Apply Model, the Nominal to Numeric operator I am carrying forward does not dummy encode that column. As a result, applying the model to the validation set fails because the dummy-encoded columns the model expects are not present.
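To make the failure mode concrete, here is a minimal sketch in Python with pandas. It is only an analogy, not RapidMiner's implementation. The column names `zone` and `x` and the sample values are made up for illustration. `pd.get_dummies` stands in for Nominal to Numeric because, by default, it also encodes only text-like columns and leaves integer columns alone.

```python
import pandas as pd

# Hypothetical data: "zone" is read as text in training but as integer in validation.
train = pd.DataFrame({"zone": ["1", "2", "A"], "x": [1.0, 2.0, 3.0]})
valid = pd.DataFrame({"zone": [1, 2, 1], "x": [4.0, 5.0, 6.0]})

# By default get_dummies encodes only text/categorical columns.
train_enc = pd.get_dummies(train)
valid_enc = pd.get_dummies(valid)

# Columns a model fitted on train_enc would expect but valid_enc lacks.
missing = sorted(set(train_enc.columns) - set(valid_enc.columns))
print(missing)
```

In this sketch the validation frame keeps a single integer `zone` column, so none of the `zone_*` dummy columns are produced for it.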
I know that I could manually fix the types on load or with an operator, but I am wondering whether there is a way to "copy data types" from one dataset to another when columns share the same name. I would prefer that this type of error not happen during my in-class data competitions, and with a dataset that has 50 columns, my end goal is to spare participants from having to check column types one by one.
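For illustration, the kind of "copy data type" behavior I have in mind could look like the following pandas sketch. This is a hypothetical helper, not an existing RapidMiner operator or pandas function. The name `copy_column_types` and the sample columns are assumptions made for the example.

```python
import pandas as pd
from pandas.api.types import is_object_dtype, is_string_dtype

def copy_column_types(reference, target):
    """Return a copy of target whose shared columns follow reference's types."""
    out = target.copy()
    for col in out.columns.intersection(reference.columns):
        ref_dtype = reference[col].dtype
        if is_object_dtype(ref_dtype) or is_string_dtype(ref_dtype):
            # Text-like in the reference: convert the values themselves to strings.
            out[col] = out[col].astype(str)
        else:
            out[col] = out[col].astype(ref_dtype)
    return out

# Hypothetical data: the label exists only in the training frame.
train = pd.DataFrame({"zone": ["1", "2", "A"], "x": [1.0, 2.0, 3.0], "label": ["y", "n", "y"]})
valid = pd.DataFrame({"zone": [1, 2, 1], "x": [4, 5, 6]})

valid_fixed = copy_column_types(train, valid)
valid_enc = pd.get_dummies(valid_fixed)
```

Only columns present in both frames are touched, so the missing label is not a problem. Two caveats with this sketch: missing values would need separate handling before the string conversion, and categories that never occur in the validation file (here `A`) still produce no dummy column, so the encoded columns would have to be aligned with the training columns afterwards.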