jsdrewjsdrew Member Posts: 9 Learner I
I've used RapidMiner Go to create models for my data. When I apply the model to other datasets, three columns with dates come back as "MISSING" in the prediction results, even though the values were clearly there in the dataset. Any idea what is going on here?

    KristofGasparKristofGaspar Employee, Member Posts: 3 RM Engineering

    jsdrewjsdrew Member Posts: 9 Learner I
    I've attached two files. DataSetforGradient.PNG shows the dataset after it has been uploaded into RapidMiner Go and prepared for the model to be applied to it. You can see that the two fields labeled "Latest..." have date data in them. PredictionResultsforGradient shows the results after "Calculate Predictions" is clicked on the screen shown in DataSetforGradient. For some reason, the Prediction Results label most of the data in the "Latest..." fields as "MISSING".

    MartinLiebigMartinLiebig Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,517 RM Data Scientist
    Hi @jsdrew ,
    it seems the dates in your dataset were wrongly identified as string / categorical values instead of dates. That causes the misbehavior, since Auto Model then does not use the proper preprocessing for dates. What happens is that your model only works on dates which were present in the training set, and misbehaves on dates which were not.
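
To illustrate the effect described above, here is a minimal Python sketch. It is not RapidMiner's actual code; the helper names (`fit_categories`, `apply_as_category`, `apply_as_date`), the `MISSING` marker, and the ISO date format are assumptions made for the example. It shows why a date treated as a category can only be matched if the exact same value appeared in training, while a date parsed as a date becomes a number that works for any day.

```python
from datetime import date

MISSING = "MISSING"  # hypothetical marker for values the model cannot map


def fit_categories(training_values):
    """Collect the category labels seen during training."""
    return set(training_values)


def apply_as_category(value, known):
    """Treat the date as a plain label: unseen labels cannot be mapped."""
    return value if value in known else MISSING


def apply_as_date(value):
    """Treat the date as a date: convert it to a day count (ISO format assumed)."""
    return date.fromisoformat(value).toordinal()


train = ["2020-01-15", "2020-02-01"]
known = fit_categories(train)

new = ["2020-01-15", "2020-03-10"]  # the second date was not in the training set

as_category = [apply_as_category(v, known) for v in new]
as_number = [apply_as_date(v) for v in new]

print(as_category)  # the unseen date shows up as MISSING
print(as_number)    # both dates map to usable numbers
```

If this is what is happening, the fix would be on the data side: make sure the date columns are recognized as dates in both the training data and the data being scored.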

    jsdrewjsdrew Member Posts: 9 Learner I
    So it is dates in the training set that are the problem?
    jsdrewjsdrew Member Posts: 9 Learner I
    I've gone back and verified that the dates in the new dataset were present in the training set, and I am still having issues where some categorical data comes back as MISSING.
