Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Correct multiple record
I am a newby to RapidMiner and this is my first question
I have a hourly time_series dataset in which I need to correct an attribute multiple times
DataSet is Date_time (dd/mm/yy hh:mm), Holiday (polynominal/name of holiday), temperature (kelvin)
The error holiday is only applied to the first record of the day eg 25/12/18 12:00am rather than applied to every hour of the day. I need to correct the data set so that for each hour of the day the record has the holiday associated with it.
eg
25/12/18 12:00am = Christmas
25/12/18 1:00am = None
25/12/18 2:00am = None
I can generate a subset of the holidays but I can't work out how to correct all the incorrect examples
Any guidance on how to approach this would be fantastic'
thanks
Jeff
I have a hourly time_series dataset in which I need to correct an attribute multiple times
DataSet is Date_time (dd/mm/yy hh:mm), Holiday (polynominal/name of holiday), temperature (kelvin)
The error holiday is only applied to the first record of the day eg 25/12/18 12:00am rather than applied to every hour of the day. I need to correct the data set so that for each hour of the day the record has the holiday associated with it.
eg
25/12/18 12:00am = Christmas
25/12/18 1:00am = None
25/12/18 2:00am = None
I can generate a subset of the holidays but I can't work out how to correct all the incorrect examples
Any guidance on how to approach this would be fantastic'
thanks
Jeff
0
Answers
If you have always 24 Examples per day (so no holes in your hourly data). You could use Process Window operator to divide your data into days and then Replace Missing Values (Series) to replace the missings entries with the previous value.
See the example process below.
Hopes this helps
Fabian
see also a very similar discussion:
https://community.rapidminer.com/discussion/55971/loop-through-exampleset-and-identify-a-public-holiday-and-then-set-all-24-hours-to-1-in-variable#latest
Regards,
Balázs